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EDITORIAL 

Introduction  to  a  Forum  on  Individual 
Differences  in  Accounting  Behavior 

Steven  J.  Kachelmeier 

The  University  of  Texas  at  Austin 


This  issue  of  The  Accounting  Review  features  two  related  archival  studies,  both  accepted 
independently  by  different  editors,  on  the  influence  of  individual  managers  on  different 
behaviors  that  are  of  interest  to  accountants.  Bamber  et  al.  (2010)  focus  on  the  effects  of 
individual  managers  on  several  properties  of  management  earnings  forecasts.  Dyreng  et  al.  (2010) 
focus  on  the  effects  of  individual  managers  on  tax-avoidance  strategies.  Although  they  examine 
different  aspects  of  management  behavior,  both  studies  adapt  a  technique  developed  by  Bertrand 
and  Schoar  (2003)  that  tracks  managers  across  different  firms,  thereby  disentangling  manager 
effects  from  firm  effects. 

From  my  perspective,  both  as  an  editor  and  as  a  behavioral  experimentalist,  a  prominent 
contribution  of  these  two  studies  is  that  they  help  to  narrow  the  gap  that  too  often  exists  between 
archival  accounting  research  and  so-called  “behavioral”  accounting  research  that  we  typically 
associate  with  experiments.  For  example,  Bamber  et  al.  (2010)  draw,  in  part,  on  a  review  article  by 
Hirst  et  al.  (2008)  regarding  management  earnings  forecasts,  which  in  turn  integrates  both  archival 
and  experimental  research  perspectives.  Indeed,  I  have  often  wondered  why  we  tend  to  associate 
the  adjective  “behavioral”  with  experimental  research,  although  I  suppose  the  emergence  of  “be¬ 
havioral  finance”  as  a  mainstream  archival  research  area  has  made  inroads  in  reinforcing  that  both 
finance  and  accounting  are  ultimately  involved  with  human  behavior,  reflecting  human  informa¬ 
tion  processing  limitations,  biases,  and  idiosyncrasies  (also  see  Shleifer  2000). 

More  fundamentally,  Bamber  et  al.  (2010)  and  Dyreng  et  al.  (2010)  remind  us  that  firms  do 
not  make  decisions.  Rather,  people  make  decisions,  and  those  decisions  are  shaped  by  the  person¬ 
alities  of  those  involved.  From  a  research  perspective,  one  is  tempted  to  build  on  that  premise  by 
identifying  characteristics  such  as  gender,  education,  ethnicity,  military  service,  and  a  host  of  other 
measures  that  might  influence  manager  styles.  On  this  point,  however,  I  found  it  of  considerable 
interest  that,  notwithstanding  the  spirited  attempt  in  both  studies  to  identify  such  characteristics, 
idiosyncratic  variation  appears  to  dominate  the  variation  that  can  be  nicely  categorized  by  mea¬ 
surable  forces  that  shape  manager  personalities. 

In  the  final  analysis,  it  would  appear  that  managers,  like  other  humans,  are  richly  varied,  such 
that  the  noise  term  in  regressions  involving  “firm”  behavior  will  likely  continue  to  be  larger  than 
researchers  would  like  to  see.  I  applaud  the  authors  of  both  studies  in  this  forum  for  helping  us  to 
peer  inside  the  noise  term  in  what  seems  to  be  the  beginnings  of  a  fruitful  avenue  for  future 
accounting  research  of  archival,  experimental,  and  other  varieties. 
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ABSTRACT:  Financial  economics  has  posited  a  limited  role  for  idiosyncratic  noneco¬ 
nomic  manager-specific  influences,  but  the  strategic  management  literature  suggests 
such  individual  influences  can  affect  corporate  outcomes.  We  investigate  whether  indi¬ 
vidual  managers  play  an  economically  significant  role  in  their  firms’  voluntary  financial 
disclosure  choices.  Tracking  managers  across  firms  over  time,  we  find  top  executives 
exert  unique  and  economically  significant  influence  (manager-specific  fixed  effects)  on 
their  firms’  voluntary  disclosures,  incremental  to  known  economic  determinants  of  dis¬ 
closure,  and  firm-  and  time-specific  effects.  Managers’  unique  disclosure  styles  are 
associated  with  observable  demographic  characteristics  of  their  personal  backgrounds: 
managers  promoted  from  finance,  accounting,  and  legal  career  tracks,  managers  born 
before  World  War  II,  and  those  with  military  experience  develop  disclosure  styles  dis¬ 
playing  certain  conservative  characteristics;  and  managers  from  finance  and  account¬ 
ing  and  those  with  military  experience  favor  more  precise  disclosure  styles.  These 
plausible  associations  confirm  that  our  estimated  manager-specific  fixed  effects  capture 
systematic  long-lived  differences  in  managers’  unique  disclosure  styles. 

Keywords:  voluntary  disclosure;  management  earnings  forecasts;  individual 
differences;  upper  echelons  theory. 

Data  Availability:  Data  are  publicly  available  from  the  sources  identified  in  the  text. 
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1.  INTRODUCTION 

We  investigate  two  related  research  questions:  (1)  Do  top  managers  exhibit  unique  and 
economically  significant  individual-specific  styles  in  voluntary  corporate  financial  dis¬ 
closure?  (2)  If  so,  are  managers’  unique  experiences,  as  evidenced  by  observable  de¬ 
mographic  characteristics  of  their  personal  backgrounds,  associated  with  cross-sectional  differ¬ 
ences  in  their  overall  disclosure  styles? 

Published  theoretical  and  archival  research  in  economics,  finance,  and  accounting  has  largely 
posited  a  limited  role  for  idiosyncratic  noneconomic  manager-specific  influences  in  explaining 
cross-sectional  differences  in  corporate  choices  such  as  voluntary  disclosure.*  Neoclassical  eco¬ 
nomic  theory  assumes  individuals  are  rational  optimizers,  and  is  often  characterized  as  providing 
no  role  through  which  individuals  could  exert  idiosyncratic  influence  on  corporate  outcomes  (e.g., 
Weintraub  2002;  Bertrand  and  Schoar  2003).  Agency  theory  proposes  a  less  extreme,  but  still 
limited  role  through  which  individual  managers  could  idiosyncratically  influence  corporate  deci¬ 
sions.  In  this  perspective,  managers  respond  rationally  to:  (1)  the  firm’s  economic  environment, 
and  (2)  the  firm’s  monitoring  mechanisms  and  managers’  contractual  incentives.  As  a  result, 
research  has  typically  focused  on  “representative”  agents  because  individual  managers  can  be 
induced  to  make  similar  choices  through  appropriate  monitoring  and  contractual  incentives.  In  this 
sense,  neoclassical  and  agency  theories  allow  little  role  for  noneconomic  manager-specific  pref¬ 
erences  to  affect  corporate  outcomes.  Two  streams  of  research  in  management  lead  to  a  similar 
conclusion,  reasoning  that:  (1)  entrenched  norms  and  cultures  in  large  corporations  constrain 
managers’  choices  (e.g.,  Lieberson  and  O’Connor  1972),  and  (2)  similarity  in  managers’  back¬ 
grounds  and  experiences  rising  through  the  corporate  ranks  further  limits  heterogeneity  (e.g.,  Hitt 
and  Tyler  1991). 

In  contrast,  Hambrick  and  Mason’s  (1984)  upper  echelons  theory  predicts  that  managers  are 
not  effectively  interchangeable.  Their  theory  posits  that  idiosyncratic  differences  in  managers’ 
experiences  are  associated  with  differences  in  important  personal  values  and  cognitive  styles  such 
as  honesty  and  tolerance  of  ambiguity,  which  can  lead  managers  to  make  different  choices, 
particularly  in  complex  situations  lacking  clear  and  calculable  solutions.  Hambrick  and  Mason 
(1984)  further  argue  that  such  preferences  can  in  turn  lead  to  different  organizational  outcomes. 
Thus,  we  investigate  whether  individual  top  managers  play  a  significant  incremental  role  in  ex¬ 
plaining  cross-sectional  variation  in  corporate  voluntary  disclosure  choices,  after  controlling  for 
known  economic  determinants  of  disclosure  (and  firm-specific  effects).  In  other  words,  do  man¬ 
agers  exhibit  economically  significant  individual-specific  disclosure  styles? 

To  address  this  question,  we  construct  a  panel  data  set  that  tracks  top  managers  over  the 
period  1995-2005.  Given  that  firms  are  associated  with  both  the  key  independent  variable  (man¬ 
agers)  and  the  dependent  variables  (firms’  disclosure  choices  are  sticky  over  time),  it  is  critical  to 
control  for  firm-specific  effects  to  avoid  a  correlated  omitted  variable  problem,  and  to  cleanly 
isolate  manager- specific  effects.  By  isolating  manager-specific  fixed  effects  on  firms’  disclosure 
characteristics,  after  controlling  for  known  economic  determinants  of  disclosure,  firm  fixed  effects, 
and  time  period  fixed  effects,  we  can  assess  not  only  the  existence,  but  also  the  magnitude,  of 
individual  managers’  incremental  effects  on  their  firms’  voluntary  disclosures.  We  refer  to  this 
incremental  manager- specific  fixed  effect  as  the  manager’s  style. 

Our  results  suggest  top  managers  exert  economically  significant  individual-specific  influence 
over  five  aspects  of  management  forecasts:  forecast  frequency,  forecast  precision,  news  conveyed 


^  Our  study  focuses  on  properties  of  one  of  the  most  important  and  widely  investigated  forms  of  voluntary  financial 
disclosure — management  earnings  forecasts — that  Hirst  et  al.  (2008,  315)  characterize  as  “one  of  the  key  voluntary 
disclosure  mechanisms  by  which  managers  establish  or  alter  market  earnings  expectations,  preempt  litigation  concerns, 
and  influence  their  reputation  for  transparent  and  accurate  reporting.” 
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by  the  forecast,  and  the  bias  in  and  accuracy  of  the  forecast.  We  assess  the  economic  significance 
of  the  manager-specific  fixed  elfects  along  three  dimensions.  First,  adding  manager-specific  fixed 
effects  to  the  base  models  (that  include  the  economic  determinants,  firm-specific  effects,  and 
time-period  specific  effects)  increases  explanatory  power  by  7.6  to  10.3  raw  percentage  points, 
translating  to  increases  of  29.9  percent  to  over  400  percent  relative  to  the  base  models’  adjusted 
R  s.  Second,  manager-specific  fixed  effects  are  pervasive:  between  38  percent  and  51  percent  of 
the  individual  manager-specific  fixed  effects  are  significant  at  the  10  percent  level,  much  greater 
than  expected  by  chance.  Third,  the  distributions  of  managers’  fixed  effects  reveal  economically 
material  differences.  For  example,  replacing  a  manager  at  the  25th  percentile  of  the  distribution  for 
forecast  frequency  with  one  at  the  75th  percentile  more  than  triples  the  number  of  forecasts. 

Having  established  that  individual  managers  exhibit  significant  idiosyncratic  disclosure 
styles  contrary  to  implicit  assumptions  of  traditional  financial  economics  research — our  second 
question  is:  How  do  managers’  personal  experiences  influence  their  own  disclosure  styles? 

Upper  echelons  theory  suggests  that  observable  demographic  characteristics  of  managers’ 
backgrounds  reflect  key  formative  experiences  that  are  associated  with  managers’  unique  cognitive 
styles  and  values.  For  example,  this  literature  argues  that  managers’  functional  career  tracks  (e.g., 
marketing,  accounting,  legal)  affect  their  preferences  (e.g.,  Hambrick  and  Mason  1984;  Jensen  and 
Zajac  2004).  We  find  that,  on  average,  managers  promoted  from  legal  backgrounds  tend  to  guide 
expectations  down  (reflecting  greater  sensitivity  to  litigation  risk)  and  managers  promoted  from 
accounting  and  finance  develop  more  precise  disclosure  styles  that  are  conservative  in  underesti¬ 
mating  upcoming  earnings.  Consistent  with  evidence  that  individuals  who  experienced  the  after- 
math  of  the  Great  Depression  and  associated  stock  market  crash  are  more  conservative  (e.g., 
Malmendier  and  Nagel  2008;  Zemke  et  al.  2000),  we  find  that  managers  bom  before  World  War  II 
are  more  conservative  in  being  reluctant  to  forecast.  We  also  find  that  military  experience  leads 
managers  to  develop  styles  favoring  more  precise  forecasts  that  guide  expectations  down,  consis¬ 
tent  with  lower  tolerance  for  ambiguity,  conservatism,  and  valuing  honesty  and  integrity  over 
self-interest  (e.g.,  Soeters  1997;  Goertzel  and  Hengst  1971;  Franke  2001).  Finally,  recent  critics 
assert  that  M.B.A.  programs  overemphasize  the  pursuit  of  self-interest  (e.g.,  Gintis  and  Khurana 
2008).  We  find  that  M.B.A.s  tend  to  guide  expectations  upward,  but  their  forecasts  are  more 
accurate.  Coupled  with  our  evidence  on  the  magnitude  and  pervasiveness  of  the  manager-specific 
fixed  effects,  finding  that  managers’  unique  disclosure  styles  exhibit  plausible  associations  with 
their  distinctive  permanent  personal  demographic  characteristics  confirms  that  we  are  capturing 
systematic  long-lived  differences  in  managers’  unique  styles  (and  not  Just  random  noise). 

Our  study  makes  several  contributions.  First,  it  responds  to  the  Brochet  et  al.  (2009,  32) 
suggestion  that  “research  may  explore  whether  disclosure  policies  ...  vary  systematically  with 
managerial  style.”  We  provide  evidence  that — in  contrast  to  the  implicit  assumption  in  most  prior 
financial  economics  research — this  hitherto  unexplored  personal  style  dimension  plays  a  signifi¬ 
cant  incremental  role  in  explaining  cross-sectional  differences  in  voluntary  corporate  financial 
disclosure,  even  after  controlling  for  economic  determinants  of  disclosure  and  firm-specific  (and 
time-period-specific)  fixed  effects.  Our  evidence  that  not  only  CEOs,  but  also  CFOs  and  General 
Counsels,  exhibit  distinct  disclosure  styles  responds  to  Finkelstein  and  Hambrick’s  (1996)  call  for 
research  on  the  influence  of  executives  beyond  the  CEO  level. 

Our  second  contribution  is  a  response  to  the  Hirst  et  al.  (2008,  316)  call  for  a  better  under¬ 
standing  of  the  choices  managers  make  once  they  decide  to  issue  an  earnings  forecast  (e.g., 
forecast  precision),  which  Hirst  et  al.  (2008)  suggest  is  an  important  direction  for  both  theory 
development  and  empirical  research.  Our  study  provides  empirical  evidence  that  managers  de¬ 
velop  unique  and  economically  significant  styles  concerning  detailed  attributes  of  voluntary  dis¬ 
closure,  and  these  styles  reflect  their  personal  demographic  characteristics  such  as  functional 
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career  track,  age  cohort,  military  experience,  and  education.  Collectively,  our  evidence  suggests 
that  exploring  the  roles  individual  managers  play  in  other  financial  reporting  choices  is  a  fruitful 
avenue  for  future  research. 

Third,  we  contribute  more  broadly  to  the  nascent  manager-effects  literature.  While  Bertrand 
and  Schoar  (2003)  show  that  managers  matter  in  core  strategic  operational  and  financing  deci¬ 
sions,  the  magnitude  of  managers’  influence  varies  widely:  effects  are  generally  greater  for  high- 
profile  strategic  decisions  like  acquisitions,  and  smaller  for  operational  financing  decisions  like 
cash  holdings.  Our  evidence  that  managers  not  only  matter,  but  matter  significantly  in  seemingly 
second-order  decisions  such  as  the  details  of  firms’  voluntary  disclosure  decisions,  extends  the 
range  of  corporate  decisions  in  which  idiosyncratic  manager  styles  are  known  to  play  an  economi¬ 
cally  significant  role.  More  importantly,  our  study  contributes  beyond  existing  research  by  inte¬ 
grating  a  broad  set  of  literatures  to  provide  conceptual  bases  for  considering:  (1)  when  and  why 
managers  might — or  might  not — matter,  and  (2)  why  and  how  characteristics  of  managers  per¬ 
sonal  backgrounds  likely  affect  their  idiosyncratic  disclosure  styles. 

Finally,  from  a  practical  perspective,  evidence  that  disclosures  reflect  manager-specific  styles 
suggests  that  initiatives  intended  to  remedy  disclosure  problems  such  as  upward-biased  forecasts 
(e.g.,  stock  “hyping”  observed  by  Lang  and  Lundholm  [2000])  may  be  more  effective  if  they 
target  the  individual  manager,  as  the  Sarbanes-Oxley  Act  does  in  holding  managers  responsible  for 
the  firm’s  financial  reports. 

II.  CONTROVERSY  OVER  THE  ROLE  INDIVIDUAL  MANAGERS  PLAY  IN 

CORPORATE  DECISIONS 

Differing  Perspectives  on  the  Role  Individual  Top  Managers  Play  in  Corporate  Decisions 

Neoclassical  economic  theory  is  often  characterized  as  allowing  no  role  through  which  idio¬ 
syncratic  differences  among  individuals  can  affect  corporate  outcomes  (e.g.,  Weintraub  2002).  For 
example,  in  “a  neoclassical  view  of  the  firm  ...  top  managers  are  homogeneous  and  selfless  inputs 
into  the  production  process  . . .  [and]  different  managers  are  regarded  as  perfect  substitutes  for  one 
another”  (Bertrand  and  Schoar  2003,  1173).  Agency  theory  allows  individuals  to  differ  in  attributes 
such  as  effort  aversion,  but  typically  focuses  on  “representative”  agents  because  monitoring  and 
contractual  incentives  can  induce  individuals  to  make  similar  choices  (e.g.,  Christensen  and 
Feltham  2003).  Grounded  in  these  traditions,  archival  research  has  typically  abstracted  from  non¬ 
economic  manager- specific  effects  on  corporate  choices.^ 

Two  streams  of  research  in  strategic  management  similarly  conclude  that  managers  are  largely 
interchangeable.  First,  the  external  control  perspective  suggests  that  environmental  and  organiza¬ 
tional  constraints  such  as  entrenched  norms  and  cultures  limit  managers’  choices  (e.g.,  Lieberson 
and  O’Connor  1972;  Hannan  and  Freeman  1977).  Legitimacy  constraints  are  particularly  limiting, 
as  managers  conform  to  external  expectations  of  rationality  by  imitating  other  managers,  increas¬ 
ing  homogeneity  (e.g..  Spender  1989;  Hambrick  et  al.  1993;  Chalmers  and  Godfrey  2004).  Sec¬ 
ond,  the  socialization  and  selection  processes  top  managers  experience  rising  through  the  corpo¬ 
rate  ranks  further  limits  heterogeneity  (Hitt  and  lyier  1991;  Hambrick  2007),  with  DiMaggio  and 
Powell  (1983,  152)  asserting  “such  mechanisms  create  a  pool  of  almost  interchangeable  individu¬ 
als  who  occupy  similar  positions  across  a  range  of  organizations  and  possess  a  similarity  of 
orientation  and  disposition.”  For  example,  most  CEOs  of  Fortune  1000  firms  are  white  males  with 
college  degrees,  many  from  elite  institutions  (Bhargava  and  Jespersen  1993;  Datz  2000).  A  sig- 


^  A  few  studies  examine  how  manager-specific  economic  incentives  such  as  stock-based  compensation  and  insider  trading 
affect  managers’  propensity  to  issue  a  management  forecast  (e.g.,  Noe  1999;  Nagar  et  al.  2003;  Cheng  and  Lo  2006).  In 
contrast,  we  focus  on  noneconomic  manager-specific  effects  on  attributes  of  management  forecasts. 
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nificant  stream  of  empirical  research  in  strategic  management  concludes  that  idiosyncratic  influ¬ 
ence  of  individual  top  managers  is  limited  (e.g.,  Lieberson  and  O’Connor  1972;  Salanick  and 
Pfeffer  1977;  Hitt  and  Tyler  1991). 

In  contrast,  Hambrick  and  Mason’s  (1984)  upper  echelons  theory  suggests  that  individual 
managers  matter.  A  recipient  of  the  Academy  of  Management  Review's  “Frame-Breaking  Innova¬ 
tive  Theory  Award  in  2006  (Kilduff  2006),  the  study  points  out  that  simple  inventory  and  credit 
policy  decisions  may  have  calculable  solutions.  But  in  complex  ambiguous  situations  plagued  by 
multiple  and  often  incompatible  goals,  managers  operate  within  the  bounds  of  rationality,  and 
within  these  bounds  their  choices  can  be  influenced  by  their  idiosyncratic  experiences  and  values 
(Hambrick  and  Mason  1984;  Finkelstein  and  Hambrick  1996;  Hambrick  2007).  (Voluntary  disclo¬ 
sure  is  such  a  complex,  ambiguous  situation  requiring  tradeoffs  among  multiple  conflicting  goals, 
such  as.  (1)  protecting  proprietary  information  that  competitors  could  use  against  the  firm  versus 
(2)  mitigating  litigation  risk  [e.g..  Dye  1986;  Evans  and  Sridhar  2002].)  Hambrick  and  Mason’s 
(1984)  theory  spurred  a  stream  of  empirical  research  in  management,  much  of  which  concludes 
that  managers  affect  firm  choices  (e.g..  Smith  and  White  1987;  Thomas  et  al.  1991;  Tihanyi  et  al. 
2000,  Naranjo-Gil  and  Hartmann  2006;  Crossland  and  Hambrick  2007).  However,  this  research 
does  not  isolate  manager  effects  after  controlling  for  firm  effects.^  If  unmodeled  firm  characteris¬ 
tics  affect  corporate  choices,  then  by  confounding  managers’  effects  with  firm  effects,  this  litera¬ 
ture  likely  overstates  any  “manager”  effects. 

Archival  research  in  finance  and  economics  is  just  starting  to  explore  whether  individual 
managers  impose  idiosyncratic  influence  on  corporate  decisions.  The  seminal  study  in  this  genre, 
Bertrand  and  Schoar  (2003),  develops  an  innovative  design  that  disentangles  the  manager  effect 
from  the  firm  effect  by  tracking  managers  across  firms  over  time.  They  find  that  managers  develop 
unique  individual-specific  styles  in  operational  and  financing  decisions.  Individual  managers’ 
incremental  effects  are  generally  greater  for  key  strategic  decisions  like  acquisitions,  but  smaller 
for  operational  financing  policies  such  as  cash  holdings.  If  managers  have  any  idiosyncratic 
influence  on  corporate  decisions,  then  it  is  most  likely  to  be  on  high-profile  core  strategic  decisions 
where  top  managers  are  most  interested  and  actively  involved. 

Ex  ante,  it  is  unclear  whether  managers  would  exert  economically  significant  idiosyncratic 
influence  on  their  firms’  voluntary  financial  disclosure  choices.  Individual  top  managers’  effects  on 
voluntary  disclosure  hinge  on:  (1)  the  extent  to  which  managers  act  as  economic  agents  versus 
individuals  with  distinct  communication  styles,  and  (2)  the  discretion  managers  enjoy  over  the 
strategic  choice  (Hambrick  2007).  Top  managers  who  act  primarily  as  economic  agents  are  likely 
more  interested  in  and  involved  in  high-profile  strategic  operational  and  financing  decisions  (e.g., 
acquisitions,  strategic  marketing,  production  expansion),  but  are  less  interested  in  and  less  in¬ 
volved  in  seemingly  secondary  decisions  about  detailed  attributes  of  management  earnings  fore¬ 
casts.  This  suggests  that  top  managers  would  have  limited  (if  any)  effect  on  the  firm’s  voluntary 
disclosure  choices.  On  the  other  hand,  research  in  psychology  concludes  that  humans  develop 
long-lived,  individual-specific  communication  styles  (e.g.,  Pennebaker  and  King  1999).  If  top 
managers  have  distinct  communication  styles,  such  as  valuing  openness  and  transparency,  then 
this  would  likely  affect  the  characteristics  of  the  firm’s  voluntary  disclosures.  With  respect  to 
discretion,  top  managers  can  in  principle  exercise  discretion  over  management  earnings  forecasts, 
but  their  discretion  is  limited  by  the  ex  post  verifiability  of  the  disclosures  and  stickiness  of  firms’ 
disclosure  policies  over  time  (Healy  et  al.  1999;  Lang  1999). 


^  The  only  exception  we  found  that  does  conUol  for  firm  fixed  effects  is  Crossland  and  Hambrick’s  (2007)  study  of  the 
cross-cultural  differences  in  the  effect  of  CEOs  on  measures  of  firm  performance  (e.g.,  return  on  assets). 
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To  summarize,  certain  theories  and  evidence  suggest  managers  will  have  at  most  limited 
effect  on  their  firms’  voluntary  disclosure  choices,  whereas  other  theories  and  evidence  suggest  the 
opposite.  It  is  therefore  an  empirical  question  whether  top  managers  exert  economically  significant 
individual-specific  influence  over  their  firms’  voluntary  disclosures.  Thus,  we  investigate  whether 
individual  managers  develop  unique  disclosure  styles,  paying  special  attention  to  the  magnitude  of 
any  such  manager-specific  effects."^ 

What  Demographic  Characteristics  of  Managers’  Personal  Backgrounds  Might  Influence 
Their  Disclosure  Styles? 

To  the  extent  that  managers’  backgrounds,  socialization,  and  selection  processes  create  similar 
experiences  and  induce  similar  values  and  cognitive  styles,  the  demographic  characteristics  of 
their  personal  backgrounds  are  unlikely  to  explain  any  differences  in  their  disclosure  styles  (e.g., 
DiMaggio  and  Powell  1983;  Hitt  and  Tyler  1991;  Hambrick  2007).  On  the  other  hand,  upper 
echelons  theory  suggests  cross-sectional  differences  in  managers’  demographic  characteristics, 
such  as  their  functional  career  track,  age,  and  education,  differentially  shape  managers’  values  and 
cognitive  bases,  which  in  turn  can  affect  their  managerial  styles  (e.g.,  Hambrick  and  Mason  1984; 
Hitt  and  Tyler  1991).  We  focus  on  the  role  of  observable  demographic  characteristics  of  managers’ 
backgrounds  since  upper  echelons  theory  recommends  that  “An  emphasis  on  background  charac¬ 
teristics,  rather  than  psychological  dimensions,  seems  essential”  (Hambrick  and  Mason  1984,  196; 
see  also  Hambrick  2007,  335). 

Whereas  certain  demographic  background  characteristics  are  observable  and  reliably  measur¬ 
able,  top  managers  (especially  of  large  corporations)  are  reluctant  to  submit  to  batteries  of  psy¬ 
chological  tests,  which  are  “too  general  and  detached  from  executive  issues  to  be  useful  for 
studying  top  managers”  (Finkelstein  and  Hambrick  1996,  46).  That  is,  psychological  test  instru¬ 
ments  intended  for  the  general  population  are  not  tailored  to  identify  finer  psychological  traits  that 
might  distinguish  among  successful  professional  executives.  It  is  therefore  not  surprising  that 
research  using  psychological  measures  has  not  found  significant  effects.  For  example.  Sorter  et  al. 
(1964)  and  Faircloth  and  Ricchiute  (1981)  find  that  personality  characteristics  do  not  explain 
accountants’  preferred  reporting  alternatives.  In  contrast,  Faircloth  and  Ricchiute  (1981,  53)  find 
that  objective  personal  demographic  characteristics  such  as  education  do  play  a  significant  role, 
prompting  them  to  recommend  that  “additional  research  should  pay  more  attention  to  the  effects  of 
demographic  information,  which  may  in  fact  be  more  informative  than  many  previously  tested 
variables  such  as  personality  characteristics.” 

To  guide  our  exploration  of  the  relation  between  managers’  demographic  characteristics  and 
their  unique  disclosure  styles,  we  conducted  a  comprehensive  review  of  a  broad  set  of  literatures 
ranging  from  strategic  management,  to  career  counseling,  to  sociology,  to  psychology,  to  military 
science,  and  to  business  education.  We  searched  for  theories  or  evidence  suggesting  how  demo- 


A  recent  working  paper  by  Brocket  et  al.  (2009)  investigates  the  effect  of  CEO  or  CFO  turnover  on  the  frequency  and 
precision  of  management  forecasts.  When  a  new  CFO  is  appointed,  on  average  firms  that  provided  frequent  guidance 
before  the  turnover  experience  a  temporary  decrease  in  the  quantity  and  precision  of  management  earnings  forecasts. 
When  such  firms  appoint  a  new  CEO,  the  decline  in  the  frequency  of  management  forecasts  tends  to  be  longer-lived 
(documented  up  to  two  years),  but  there  is  no  effect  on  forecast  precision.  Thus,  Brocket  et  al.  (2009)  document  the 
effect  of  a  managerial  transition  per  se  on  the  firm’s  disclosure  policy.  Our  study  is  fundamentally  different.  Rather  than 
focusing  on  the  short-  or  medium-term  effects  of  a  shock  resulting  from  a  managerial  transition  per  se,  we  focus  on 
whether  a  given  individual  manager  exhibits  a  persistent  disclosure  style  measured  over  at  least  six  years  across  multiple 
employers. 

Another  working  paper  by  Ge  et  al.  (2009)  explores  the  effects  of  CFOs  on  choices  that  affect  mandatory  financial 
reporting.  They  conclude  that  CFOs  exert  manager-specific  fixed  effects  on  GAAP  eamings-related  choices  indicative  of 
earnings  management  (e.g.,  abnormal  accruals).  They  do  not  explore  the  effects  of  individual  managers  on  voluntary 
management  earnings  forecasts,  which  is  the  focus  of  our  study. 
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graphic  characteristics  highlighted  in  Hambrick  and  Mason’s  (1984)  upper  echelons  theory  might 
plausibly  be  associated  with  cross-sectional  differences  in  managers’  unique  disclosure  styles.  Our 
review  suggests  four  demographic  characteristics  of  managers’  personal  backgrounds— functional 
career  track,  age  cohort,  other  career  experience  (specifically,  military  experience),  and  M.B.A. 
education — likely  influence  managers’  unique  disclosure  styles.^ 

Functional  Career  Track 

Hambrick  and  Mason’s  (1984,  200)  upper  echelons  theory  suggests  a  manager’s  primary 
functional  career  track  affects  his/her  choices  because  “career  experiences  partially  shape  the 
lenses  through  which  they  view  current  strategic  opportunities  and  problems.”  Socialized  in  their 
functional  area  s  mindset,  managers  tend  to  converge  on  solutions  to  common  business  problems 
(Fmkelstem  and  Hambrick  1996,  93).  Empirical  research  confirms  that  managers  pursue  strategies 
m  line  with  their  own  functional  expertise  (e.g..  Smith  and  White  1987;  Thomas  et  al.  1991; 
Jensen  and  Zajac  2004).  Prior  research  characterizes  individuals  in  technical  financial  functions  as 
conventional,  orderly,  and  inhibited  (Holland  1973;  1997),  suggesting  they  may  adopt  conserva¬ 
tive  disclosure  styles.  (We  consider  fewer  forecasts,  downward  guidance,  or  guidance  that  under¬ 
shoots  upcoming  earnings  as  various  dimensions  of  a  conservative  disclosure  style.)  Managers 
from  technical  functions  are  also  less  tolerant  of  ambiguity  (Holland  1966;  1997,  27),  and  Ham- 
bnck  and  Mason  (1984)  posit  that  managers  from  finance  favor  more  budget  detail  and  thorough¬ 
ness,  which  suggests  managers  from  finance  or  accounting  may  develop  more  precise  communi¬ 
cation  styles.  Executives  with  legal  backgrounds  are  more  sensitive  to  litigation  risk,  and  so  likely 
favor  communication  styles  that  do  not  promise  too  much.^ 

Age  Cohort 

Upper  echelons  theory  suggests  that  the  manager’s  age  cohort  can  affect  his/her  values, 
cognitive  styles,  and  thus  his/her  decisions  (Hambrick  and  Mason  1984).  Schuman  and  Scott 
(1989)  show  that  the  events  an  age  cohort  (individuals  bom  in  the  same  era)  experiences  during  its 
youth  affect  later  attitudes.  Zemke  et  al.  (2000)  classify  managers  bom  before  World  War  II  as  an 
important  cohort,  shaped  by  the  lingering  sacrifices  and  shortages  of  the  Great  Depression,  the 
Dust  Bowl  that  wiped  out  the  Great  Plains  farms,  and  the  war  years,  all  of  which  inculcated 
conservative  values.  Emerging  research  in  financial  economics  finds  that  cohorts  who  experienced 
lower  stock  returns  during  their  investing  lives  are  more  conservative  (Malmendier  and  Nagel 
2008).  Blanchard  (1993)  ascribes  the  decline  in  the  equity  risk  premium  to  receding  memories  of 
the  crash  and  the  Great  Depression,  and  Palsson  (1996)  finds  that  age  is  associated  with  greater 
risk  aversion  in  portfolio  holdings.  Finally,  Bertrand  and  Schoar  (2003)  find  that  managers  from 
older  birth  cohorts  choose  lower  levels  of  corporate  expenditures,  lower  leverage,  and  larger  cash 
holdings,  consistent  with  conservatism.  These  arguments  suggest  executives  bom  before  World 
War  II  may  develop  more  conservative  communication  styles,  such  as  being  less  likely  to  issue 
forward-looking  statements  for  fear  such  disclosures  may  prove  inaccurate,  ex  post. 


Upper  echelons  theory  suggests  that  certain  demographic  characteristics  affect  managers’  cognitive  styles  and  values, 
but  we  had  to  search  elsewhere  to  support  how  these  characteristics  might  plausibly  affect  managers’  disclosure  styles.' 
As  a  result  of  this  search,  we  explored  two  additional  demographic  characteristics  beyond  upper  echelons  theory:  gender 
and  whether  the  manager  is  from  the  firm’s  founding  family.  Unfortunately,  members  of  founding  families  of  S&P  1500 
firms  do  not  move  to  similar  positions  at  other  S&P  1500  firms.  Consistent  with  the  literature  documenting  the 
homogeneity  of  managers’  backgrounds,  our  sample  includes  only  six  female  managers. 

Bagley  (2008,  379)  argues  that  “legally  astute”  top  managers  make  different  choices  because  they  “appreciate  the 
importance  of  meeting  society’s  expectations  of  appropriate  behavior  ...  and  of  treating  stockholders  fairly.” 
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Military  Experience 

Upper  echelons  theory  suggests  that  an  executive’s  other  career  experiences  can  affect  his/her 
decisions.  We  posit  that  military  service  is  a  different  form  of  career  experience  that  likely  affects 
a  manager’s  disclosure  style.  Military  personnel  tend  to  be  less  tolerant  of  ambiguity/uncertainty 
(e.g.,  Goertzel  and  Hengst  1971;  Soeters  1997),  suggesting  they  may  favor  more  precise  commu¬ 
nication  styles.  A  common  theme  is  “service  before  self’  (Franke  2001),  as  the  Army  describes 
military  values  as  “duty,  selfless  service,  courage,  integrity  . . .  and  a  sense  of  justice  (Headquar¬ 
ters,  Department  of  the  Army  1993,  1-2).  Individuals  choosing  military  service  tend  to  be  con¬ 
servative  (e.g.,  Goertzel  and  Hengst  1971;  Franke  2001).  They  are  likely  to  agree  that  “honesty  is 
the  best  policy,”  that  “one  should  take  action  only  when  one  is  sure  it  is  morally  right,”  and  are 
less  likely  to  believe  it  is  acceptable  to  “cut  corners”  (Franke  1998;  2001).  They  place  relatively 
lower  value  on  income  and  promotion  opportunities  (Soeters  1997).  Such  values  suggest  managers 
with  military  experience  may  favor  styles  that  tend  toward  prompt  and  unbiased  disclosure  of 
unfavorable  information  (i.e.,  honesty  about  bad  news). 

M.B.A.  Education 

Upper  echelons  theory  predicts  managers  holding  M.B.A.  degrees  develop  different  styles 
than  those  without  such  educational  backgrounds.  Finkelstein  and  Hambrick  (1996,  104)  suggest 
M.B.A.s  are  part  of  the  social  and  business  elite  who  value  conformity  and  conventionality. 
M.B.A.s  are  also  likely  aware  of  the  penalties  the  market  exacts  when  firms  fail  to  meet  their  own 
earnings  forecasts  (Chen  2004).  These  arguments  suggest  that  M.B.A.s  may  adopt  more  conser¬ 
vative  disclosure  styles.  On  the  other  hand,  post-Enron  era  critics  have  asserted  M.B.A.  programs 
overemphasize  neoclassical  economics,  where  rational  managers  pursue  their  own  self-interest 
(e.g.,  Ghoshal  2005;  Gintis  and  Khurana  2008).^  Self-reported  cheating  levels  are  higher  in 
M.B.A.  than  other  graduate  programs  (McCabe  et  al.  2006),  and  individuals  who  behave  dishon¬ 
estly  as  students  are  more  likely  to  behave  dishonestly  in  the  workplace  (Nonis  and  Swift  2001), 
suggesting  M.B.A.s  might  be  more  likely  to  issue  disclosures  that  spin  the  firm’s  outlook. 

III.  RESEARCH  DESIGN 

The  Existence  and  Magnitude  of  Individual  Top  Managers’  Voluntary  Disclosure  Styles 
Sample  and  Data 

Firms’  disclosure  choices  are  sticky  over  time  (Healy  et  al.  1999;  Lang  1999),  but  as  Hirst  et 
al.  (2008)  point  out,  empirical  models  leave  most  of  the  cross-sectional  variation  in  voluntary 
disclosure  choices  unexplained.  This  suggests  that  unidentified  firm  characteristics  likely  affect 
firms’  disclosure  choices.  Because  firms  are  associated  with  both  the  key  independent  variable 
(managers)  and  the  dependent  variables  (disclosure  choices),  it  is  necessary  to  control  for  firm- 
specific  fixed  effects  to  avoid  misattributing  firm  effects  to  managers. 

We  therefore  use  a  research  design  with  strong  controls  for  firm  effects.  We  adopt  Bertrand 
and  Schoar’s  (2003)  longitudinal  design  that  tracks  managers  over  time,  and  requires  managers  to 
work  for  two  or  more  employing  firms,  with  at  least  three  years  at  each  employer  so  the  manager 
has  time  to  “imprint”  his/her  style.  (If  the  manager  works  at  only  one  firm,  then  his/her  effect  is 
confounded  with  the  firm  effect  and  it  is  not  possible  to  cleanly  separate  the  manager  effect  from 
the  firm  effect.)  Figure  1  illustrates  this  design  for  two  firms.  For  each  firm,  we  control  for 
firm-specific  fixed  effects  measured  over  the  entire  sample  period.  To  isolate  the  effect  of  an 


Bertrand  and  Schoar  (2003)  provide  some  evidence  that  managers  with  M.B.A.s  make  more  aggressive  resource 
allocation  decisions.  Because  M.B.A.  education  focuses  on  operational  and  financing  decisions  and  less  on  disclosure, 
evidence  that  an  M.B.A.  degree  is  associated  with  manager-specific  effects  on  operating  and  financing  decisions  does 
not  necessarily  imply  that  there  will  be  similar  effects  on  managers’  voluntary  disclosure  decisions. 
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FIGURE  1 

Research  Design  for  Two  Illustrative  Firms 
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Manager  As  fixed  effect  is  measured  over  Firms  1  and  2,  after  controlling  for:  (1)  the  fixed  effect  of  Firm  1 
(estimated  over  the  entire  sample  period),  (2)  the  fixed  effect  of  Firm  2  (estimated  over  the  entire  sample  period), 
(3)  fixed  effects  for  each  year,  and  (4)  all  of  the  time-varying  economic  determinants  of  disclosure. 


individual  manager  on  disclosure — incremental  to  the  ejfects  of  the  firms — we  measure  the  effect 
of  Manager  A  across  the  two  employing  firms,  after  controlling  for  Firm  1  ’s  fixed  effect.  Firm  2’s 
fixed  effect,  year  fixed  effects,  and  well-documented  economic  determinants  of  voluntary 
disclosure.^  Manager  A’s  style  is  measured  by  his/her  fixed  effect. 

Manager  A’s  fixed  effect  reflects  how  Firm  I’s  forecast  attributes  in  period  1  deviate  from 
Firm  I’s  average  forecast  attributes,  and  how  Firm  2’s  forecast  attributes  in  period  2  deviate  from 
Firm  2’s  average  forecast  attributes  (after  also  abstracting  from  the  effects  of  known  economic 
determinants  and  year-specific  effects).  Thus,  our  estimated  manager-specific  fixed  effects  capture 
the  incremental  effects  of  a  manager  at  multiple  employers,  after  controlling  for  any  firm-specific 
over  six  (or  more)  years.  This  method  also  reduces  concerns  about  correlated  omitted 
variables,  which  to  explain  our  results  would  have  to  vary  over  time  and  across  firms  in  the  same 
pattern  as  the  movement  over  time  and  across  firms  of  managers  with  unique  disclosure  styles. 

From  Execucomp,  we  record  the  five  highest-paid  managers  for  S&P  1500  firms.  Our  sample 
period  starts  in  1995  when  First  Call  s  Company  Issued  Guideline  (CIG)  database  became  a  more 
complete  source  of  management  forecasts  (Anilowski  et  al.  2007).^  From  1995  to  2005,  we 
identify  303  top  managers  that  work  for  at  least  two  different  firms  and  stay  employed  by  each 
firm  for  at  least  three  years.  We  focus  on  managers  whose  last  position  is  CEO,  CFO,  or  General 
Counsel  because  these  positions  are  more  likely  to  influence  voluntary  disclosure.  We  match  each 
firm-year  with  management  forecasts  reported  in  CIG. 

Model  and  Variable  Definitions 

We  investigate  five  attributes  of  management  earnings  forecasts:  frequency,  precision,  news, 
direction  of  bias  relative  to  actual  earnings,  and  accuracy.  The  frequency  of  managers’  forecasts 
{F_Freq)  is  the  number  of  eamings-related  forecasts  the  firm  issues  during  the  year  (which  is  0  if 


Following  Bertrand  and  Schoar  (2003),  we  retain  all  firm-years  with  data  only  in  the  estimation  of  the  firm  fixed  effects 
(i.e.,  including  Manager  B  observations  for  estimation  of  Firm  I’s  fixed  effect  and  Manager  C  observations  for  estima¬ 
tion  of  Firm  2’s  fixed  effect). 

Ajinkya  et  al.  (2005,  350)  conclude  that  CIG  is  a  comprehensive  source  of  management  forecast  information,  after 
performing  two  small-sample  tests  (in  1997  and  in  2000)  matching  CIG  forecasts  with  those  identified  in  a  keyword 
search  of  Factiva  (formerly  the  Dow  Jones  News  Retrieval  Service).  Choi  and  Ziebart  (2004)  report  similar  results  for 
the  period  1993-1996. 
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the  firm  issues  no  forecasts). We  code  forecast  precision  {F_Precision)  as  3,  2,  1,  and  0,  for 
point,  range,  open-ended,  and  general  impression  forecasts  (e.g.,  Bamber  and  Cheon  1998;  Ba- 
ginski  et  al.  2002).^^  We  measure  forecast  news  as  the  difference  between  the  forecast  EPS  (or 
midpoint  of  the  range  forecast)  and  the  prevailing  median  analyst  forecast,  scaled  by  lagged  stock 
price  {FjGoodnews).  We  measure  forecast  bias  {F_BiasUP)  by  comparing  the  management  EPS 
forecast  (using  point  forecasts  and  the  midpoint  of  range  forecasts)  to  ex  post  realized  EPS. 
Eorecasts  that  exceed  actual  EPS  are  coded  as  -f  1  (i.e.,  optimistic  forecasts),  those  that  equal 
actual  EPS  are  coded  as  0,  and  those  that  fall  short  of  actual  EPS  are  coded  as  - 1  (i.e.,  pessimistic 
forecasts). Finally,  we  measure  the  inverse  of  forecast  accuracy  {F _AbsError)  as  the  absolute 
value  of  the  difference  between  the  management  EPS  forecast  (using  point  forecasts  and  the 
midpoint  of  range  forecasts)  and  actual  EPS,  scaled  by  lagged  stock  price,  following  Ajinkya  et  al. 
(2005).  Exhibit  1  summarizes  our  variable  definitions. 

To  test  whether  individual  managers  play  unique,  individual-specific  roles  in  firms’  voluntary 
financial  disclosure,  we  estimate  the  following  models; 


a  +  Tt^  y/  +  e,y; 

(1) 

^ it  —  OtF  /SXji  -P  ^Managers  T/  ^it’ 

(2) 

Yit  =a  +  pXn  +  XcEOs  +Tt+  7i  + 

(3) 

^it  =  «  +  ^^it  +  ^cFOs  +  T,  -P  7,-  -P  e,-,; 

(4) 

^it~  ®  ^^it  ^Counsels  Ti  ^if 

(5) 

Our  unit  of  analysis  is  a  firm-year.  7,-,  is  the  management  forecast  attribute  for  firm  i  in  year 
t.  When  firms  issue  multiple  forecasts  in  a  year,  we  use  firm-year  averages  for  precision,  news, 
bias,  and  accuracy.*"^  The  vector  X  controls  for  a  comprehensive  set  of  economic  determinants  of 
voluntary  disclosure  documented  in  prior  literature,  as  summarized  in  Figure  2  and  explained  in 
the  Appendix.  ^Managers^  our  main  variables  of  interest,  are  estimated  coefficients  for  indicator 
variables  corresponding  to  each  individual  manager,  and  thus  represent  the  individual  managers’ 
fixed 


F_Freq  is  skewed,  but  our  inferences  are  robust  if  we  redefine  FJFreq  as  the  natural  log  of  1  plus  the  number  of 
forecasts.  Our  inferences  are  also  robust  when  we  conduct  the  analysis  using  a  fixed  effect  count  model  (Poisson  model), 
per  Cameron  and  Trivedi  (2005). 

u  Our  main  analysis  includes  forecasts  of  all  specificity  levels.  However,  we  obtain  similar  inferences  using  an  alternative 
(inverse)  measure  of  specificity  where  we  code  point  forecasts  as  zero  and  range  forecasts  as  the  magnitude  of  the  range, 
scaled  by  lagged  share  price. 

We  compare  management  earnings  forecasts  listed  on  First  Call  to  actual  earnings  as  reported  by  First  Call  to  ensure 
consistency,  following  Ajinkya  et  al.  (2005). 

Our  main  analysis  focuses  on  the  direction  of  the  bias,  because  the  forecast  error  variable  captures  the  distance  of  the 
forecast  from  ex  post  actual  EPS.  Nonetheless,  we  obtain  similar  inferences  if  we  define  F_BiasUP  as  the  difference 
between  the  management  forecast  and  ex  post  actual  EPS,  scaled  by  lagged  share  price. 

We  conduct  our  analysis  at  the  firm-year  level  because  our  goal  is  to  assess  the  extent  to  which  individual  managers 
affect  overall  firm  disclosure  policy.  Untabulated  sensitivity  tests  reveal  that  our  results  continue  to  hold  if  we  use  the 
last  forecast  issued  in  a  year.  Any  dependence  among  multiple  observations  from  the  same  firm  does  not  affect  our 
conclusions,  because  they  continue  to  hold  when  we  allow  clustering  of  the  error  term  at  the  firm  level. 
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Dependent  Variables 


EXHIBIT  1 
Variable  Definitions 


F_Freqi,  - 
F_Precisionji  ~ 


F_Goodnewsif  ~ 


F_BiasUPi,  ~ 


number  of  management  earnings  forecasts  firm  i  issued  in  year  t; 

average  precision  of  management  earnings  forecasts  firm  i  issued  in  year  t,  where 
point  forecasts  are  coded  as  3,  range  forecasts  are  coded  as  2,  open-ended  forecasts 
are  coded  as  1,  and  qualitative  forecasts  are  coded  as  0; 

average  news  conveyed  by  forecasts  firm  i  issued  in  year  t,  measured  as  the  difference 
between  the  forecast  EPS  (or  midpoint  of  the  range  forecast)  and  the  prevailing 
median  analyst  forecast,  scaled  by  lagged  stock  price; 

average  bias  (relative  to  ex  post  actual  earnings)  in  forecasts  firm  i  issued  in  year  t.^ 
Forecasts  that  exceed  actual  earnings  are  coded  as  + 1 ,  forecasts  that  equal  actual 
earnings  are  coded  as  0,  and  forecasts  that  fall  short  of  actual  earnings  are  coded  as 
~1  (for  range  forecasts,  we  code  this  variable  based  on  the  midpoint  of  the  range); 


F^AbsErrori,  -  inverse  of  forecast  accuracy,  measured  as  the  average  absolute  error  in  the  forecasts 
that  firm  i  issued  in  year  t,  relative  to  ex  post  actual  earnings.  Specifically,  we  use  the 
absolute  value  of  the  difference  between  the  management  forecast  (using  point 
forecasts  and  the  midpoint  of  the  range  forecasts)  and  actual  EPS,  scaled  by  lagged 
stock  price,  multiplied  by  100  to  yield  an  error  in  percentage  terms. 


Control  Variables 


I AEES,,!  —  absolute  value  of  the  change  in  firm  i's  earnings  per  share  from  year  t- 1  to  t, 
deflated  by  stock  price  at  the  end  of  year  /—I; 

Dispi,  =  standard  deviation  of  analysts’  forecasts  of  firm  i’s  year  t  earnings,  divided  by  the 
absolute  value  of  the  median  forecast; 

EPSJJPi,  =  indicator  variable  coded  as  1  if  firm  i's  EPS  in  year  t  is  greater  than  or  equal  to  its 
EPS  in  year  t—  1 ,  and  0  otherwise; 

LosSi,  =  indicator  variable  that  equals  1  if  firm  i  reports  a  loss  in  year  t  (Compustat  item  172, 
net  income),  and  0  otherwise; 

R&Dj,  =  finn  i’s  expenditures  on  research  and  development  in  year  t,  scaled  by  its  total  assets; 

Mkt  -  Bkii  =  market  value  of  firm  i’s  common  equity  divided  by  the  book  value  of  its  common 
equity,  at  the  end  of  year  r; 

Outside_Dirii  =  percentage  of  members  of  firm  i’s  board  of  directors  during  year  t,  who  are  not  also 
officers  of  the  firm,  from  the  Investor  Responsibility  Research  Center  (IRRC); 

%Instit  =  percentage  of  firm  i’s  common  stock  held  by  institutions  in  year  t,  from  Thomson 
Financial; 

Lsizei,  =  natural  log  of  the  market  value  of  firm  i’s  common  equity  in  year  t; 

#AnalystSi,  =  number  of  analysts  providing  earnings  forecasts  for  firm  i  in  year  t; 

Litigation^  =  coded  as  1  if  the  firm  is  a  member  of  one  of  the  following  high-litigation-risk 
industries:  SIC  codes  2833-2836  (biotechnology),  3570-3577  and  7370-7374 
(computers),  3600-3674  (electronics),  5200-5961  (retailing),  and  8731-8734  (R&D 
service),  and  suffers  a  20  percent  or  greater  decrease  in  earnings;  and  0  otherwise; 

ClassActioni,  =  1  if  the  firm  is  involved  in  a  securities  lawsuit  in  year  t,  per  the  Stanford  Security 
Class  Action  Clearinghouse; 

Re  structuring  =  1  if  the  firm  was  engaged  in  a  restructuring  during  year  r; 

Restatementii  =  1  if  the  firm  had  a  restatement  in  year  t,  per  the  General  Accounting  Office’s 
Accounting  Restatement  database; 

Acquisitionn  =  1  if  the  firm  has  a  merger  or  acquisition  in  year  t,  per  footnote  1  in  the  annual 
Compustat  file;  and 

F_Horizoni,  =  number  of  days  between  the  forecast  date  and  the  end  of  the  fiscal  period  of  the 
forecasted  earnings  number.  We  scale  this  forecast  horizon  by  360  for  annual 
forecasts  or  by  90  for  quarterly  forecasts,  so  the  variable  measures  the  proportion 
of  the  fiscal  period  remaining  at  the  date  firm  i  issues  the  forecast. 


( continued  on  next  page ) 
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EXHIBIT  1  (continued) 


Additional  Firm  Characteristics  _ 

Mveif  —  market  value  of  firm  /’s  common  equity  at  the  end  of  year  t,  in  millions; 

Sales  If  =  sales  revenue  of  firm  i  in  year  t,  in  millions; 

ROAii  =  finn  fs  net  income  in  year  t  divided  by  lagged  total  assets; 

LeVi,  =  firm  f’s  long-term  debt  (Compustat  item  9  +  item  34)  divided  by  (long-term  debt  plus 
'  the  book  value  of  common  equity  (Compustat  item  60));  and 
Betai,  =  firm  i’s  equity  beta  for  the  fiscal  period,  calculated  using  daily  equity  returns  and  a 
'  single  factor  CAPM. 


Manager  Demographic  Characteristics 


Acct/Fin 

Legal 

PreWWII 

Military 

MBA 


1  if  managers  rise  from  accounting/finance  backgrounds,  and  0  otherwise; 
1  if  managers  rise  from  legal  backgrounds,  and  0  otherwise; 

1  if  managers  were  bom  before  World  War  II,  and  0  otherwise; 

1  if  managers  have  military  experience,  and  0  otherwise;  and 
1  if  managers  hold  an  M.B.A.  degree,  and  0  otherwise. 


All  actual  EPS  numbers  are  before  extraordinary  items. 


FIGURE  2 

Detailed  Model  Specification 


Dependent  Variable 
(Yi.) 

Vector  of  Control  Variables  (Xu) 

Number  of  forecasts 
(F_Freqi,) 

AEPSit ,  Dispit,  EPS  UPit,  Lossu,  R&Du,  Mkl-Bkn, 

Outside  Dirit,  %lnstit,  LSizeit,  #AnalystSit,  Litigationit , 
ClassActionit,  Restructuringit,  Restatemenfit,  Acquisitionit 

Forecast  precision 
(F_  Precisioni,) 

Forecast  news 
(F_  Goodnewsi,) 

Forecast  bias 
(F  BiasUPit) 

Forecast  accuracy 
(F  AbsErrori,) 

AEPSit ,  Dispit,  EPS  UPit,  Lossit,  R&Dit,  Mkt-Bkit, 

Outside  Dipt,  “Alnstu,  LSizeu,  #AnalystSit,  Litigationit, 
ClassActionit,  Restructuringit,  Restatementii,  Acquisitionit, 
FHorizonit 

effects.  Tf  controls  for  year-specific  fixed  effects,  and  controls  for  firm-specific  fixed  effects. 


15 


We  estimate  models  1-5  using  a  firm  fixed  effect  model  (XTREG  in  STATA)  that  estimates  coefficients  for  the 
economic  determinants,  manager  indicator  variables,  and  year  indicator  variables.  The  estimation  eliminates  the  unob¬ 
served  time-invariant  firm  characteristics  in  models  1-5  (i.e.,  the  y,)  by  a  within-transformation  that  demeans  each 
variable  for  each  firm  (Wooldridge  2002,  267).  An  alternative  is  OLS  estimation  with  explicit  firm  indicator  variables. 
That  approach  generates  the  same  coefficient  estimates  on  the  economic  determinants  (the  vector  X)  and  the  manager 
{\Mcmagers)  ^^>4  hmc  pciiod  effccts  (t,),  but  it  is  computationally  intensive,  infeasible  for  models  with  thousands  of  firms 
(as  is  the  case  in  one  of  our  sensitivity  tests),  and  it  also  generates  higher  adjusted  R^.  So  our  reported  adjusted  R^s  are 
more  conservative. 
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Model  1  includes  known  economic  determinants  of  voluntary  disclosure  choices  as  well  as 
time-  and  firm-specific  fixed  effects.  Model  2  adds  the  manager-specific  fixed  effects.  This  enables 
us  to  investigate  not  only  the  existence  of,  but  also  the  magnitude  of,  individual  managers’ 
incremental  effects  on  their  firms  voluntary  disclosures  after  controlling  for  the  known  economic 
determinants,  as  well  as  time-  and  firm-specific  fixed  effects.  Models  3,  4,  and  5  test  for  effects  of 
CEOs,  CFOs,  and  General  Counsels,  respectively. 

Testing  Whether  Demographic  Characteristics  of  Top  Managers’  Personal  Backgrounds  Are 
Associated  with  Their  Unique  Disclosure  Styles 

Data,  Model,  and  Variable  Definitions 

We  hand-collect  demographic  characteristics  of  managers’  backgrounds  from  Marquis  Who’s 
Who,  Zoominfo,  Mergent  Online,  Hoover’s,  NNBD.com,  and  company  websites.  To  test  whether 
these  characteristics  contribute  to  managers’  individual  disclosure  styles,  we  regress  the  fixed- 
effect  coefficient  estimates  from  Model  2  (after  controlling  for  economic  determinants,  time-  and 
firm-specific  fixed  effects),  on  managers’  personal  demographic  characteristics: 

^m,Yi  =  <^0  +  yjAcctlFin  -l-  y2Legal  4-  y^PreWWII  +  y4Military  +  y^MBA  -l-  e.  (6) 

^mji  the  manager-specific  fixed  effects  estimated  in  Model  2,  where  m  represents  the  manager 
and  Y  represents  the  disclosure  attribute.  Acct/Fin  distinguishes  managers  rising  from  accounting 
or  finance  backgrounds,  and  Legal  distinguishes  managers  with  legal  backgrounds.'^  Following 
Zemke  et  al.  (2000),  we  classify  managers  into  two  age  cohorts — managers  bom  before  World 
War  II  (PreWWII)  versus  those  bom  after  World  War  II. Military  indicates  the  manager  has 
military  experience,  and  MBA  indicates  the  manager  earned  an  M.B.A.  degree. 

IV.  RESULTS 

Descriptive  Statistics 

Table  1  compares  our  sample  firm-years  to  the  population  with  at  least  three  consecutive  years 
of  data  in  Execucomp  that  also  have  at  least  one  analyst  following  the  firm,  as  reported  by  First 
Call.  Our  sample  firms  tend  to  be  larger,  because  managers  of  larger  firms  are  more  likely  to  move 
to  another  firm  in  the  S&P  1500  population  covered  by  Execucomp.  This  biases  against  finding  a 
manager-specific  effect,  because  managerial  discretion — and,  hence,  any  manager-specific 
effect — declines  with  company  size  (Finkelstein  and  Hambrick  1996;  Bertrand  and  Schoar  2003). 
Our  sample  firms  are  more  levered,  but  similar  in  profitability  and  risk  (beta). 

On  average,  our  sample  firms  issue  1 .77  forecasts  each  year,  slightly  more  than  the  population 
average  of  1.48  (p  <  0.05).  Our  sample  firms’  forecasts  are  less  biased  (relative  to  ex  post  actual 
earnings)  than  those  of  the  population  (p  <  0.05).  However,  forecasts  issued  by  our  sample  and  the 
population  exhibit  similar  precision,  news  (relative  to  prevailing  analysts’  forecasts),  and  accuracy. 
The  average  forecast  gives  a  range  of  values,  guides  expectations  downward,  and  is  downward- 
biased  relative  to  actual  earnings. 

Turning  to  the  economic  determinants,  our  sample  firms  are  slightly  less  likely  to  enjoy  an 
increase  in  earnings  during  the  forecast  year,  and  have  a  larger  proportion  of  outside  directors. 


We  expect  the  accounting/finance  and  legal  career  tracks  to  be  most  relevant  to  voluntary  eamings-related  disclosure 
choice.  Separate  analysis  of  executives  promoted  from  production  and  marketing  tracks  does  not  affect  our  inferences, 
nor  were  the  coefficients  on  the  marketing  or  production  career  track  variables  significant. 

Zemke  et  al.  (2000)  identify  those  bom  before  1943  as  the  Veterans  cohort,  those  bom  between  1943  to  1960  as  the 
Baby  Boomers,  and  those  born  between  1960  to  1980  as  Generation  Xers.  Those  bom  from  1943  to  1946  are  often 
referred  to  as  the  Sandwich  generation,  but  Zemke  et  al.  (2000)  consider  them  part  of  the  Baby  Boomers  cultural  cohort. 
We  combine  the  Baby  Boomers  and  Generation  Xers  because  our  sample  includes  only  a  handful  of  Generation  Xers, 
and  separating  them  does  not  yield  any  additional  insights. 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1144 


Bamber,  Jiang,  and  Wang 


TABLE  1 

Comparison  of  Sample  to  the  Population  in  the  Intersection  of  Execucomp  and  First  Call** 


Sample  Firms  Execucomp/First  Call  Population 


Variable 

n 

Mean 

Std.  Dev. 

Median 

n 

Mean 

Std.  Dev. 

Median 

Firm  Characteristics 
Mve* 

3,920 

11,066 

30,774 

2,592 

18,987 

6,349 

21,305 

1,282 

Sales^ 

3,920 

7,847 

19,578 

2,217 

18,987 

4,281 

12,474 

1,058 

ROA 

3,920 

0.04 

0.16 

0.04 

18,987 

0.04 

0.37 

0.05 

Lev* 

3,909 

0.40 

0.51 

0.40 

18,915 

0.35 

0.98 

0.35 

Beta 

3,915 

0.98 

0.56 

0.87 

18,937 

0.97 

0.56 

0.87 

Management  Forecast  Characteristics 
F_Freq*  3,920  1.77 

2.27 

1 

18,987 

1.48 

2.11 

0 

F_Precision 

2,169 

1.97 

0.72 

2 

9,321 

1.96 

0.73 

2 

F_Goodnev/s 

1,920 

-0.16 

0.58 

-0.04 

8,193 

-0.18 

0.69 

-0.05 

FJBiasUP* 

1,914 

-0.12 

0.80 

-0.11 

8,151 

-0.20 

0.79 

-0.33 

F_AbsError 

1,914 

0.58 

1.15 

0.21 

8,151 

0.55 

1.04 

0.20 

Economic  Explanatory  Variables 

IAEF5I  3,920  0.07 

0.15 

0.02 

18,987 

0.06 

0.14 

0.02 

Disp 

3,920 

0.11 

0.28 

0.03 

18,360 

0.10 

0.26 

0.03 

EPS_UP* 

3,920 

0.63 

0.48 

1 

18,987 

0.65 

0.48 

1 

Loss 

3,920 

0.18 

0.39 

0 

18,987 

0.17 

0.37 

0 

R&D 

3,920 

0.03 

0.05 

0 

18,987 

0.03 

0.06 

0 

Mkt-Bk 

3,920 

3.30 

3.58 

2.34 

18,987 

3.22 

3.35 

2.30 

Outside_Dir* 

3,920 

70.39 

16.47 

75 

18,888 

66.07 

18.33 

69.23 

%Inst* 

3,920 

63.61 

18.99 

65.58 

18,961 

61.34 

20.81 

63.07 

LSize* 

3,920 

7.92 

1.60 

7.86 

18,987 

7.30 

1.57 

7.16 

It  Analysts* 

3,920 

14.71 

8.94 

13 

18,987 

11.46 

8 

9 

Litigation 

3,920 

0.10 

0.30 

0 

18,987 

0.10 

0.30 

0 

ClassAction* 

3,920 

0.04 

0.20 

0 

18,987 

0.03 

0.17 

0 

Restructuring* 

3,920 

0.35 

0.48 

0 

18,987 

0.28 

0.45 

0 

Restatement* 

3,920 

0.05 

0.21 

0 

18,987 

0.03 

0.18 

0 

Acquisition 

3,920 

0.21 

0.41 

0 

18,987 

0.21 

0.40 

0 

EJBorizon* 

2,169 

0.44 

0.31 

0.47 

9,321 

0.42 

0.33 

0.45 

*  Indicates  that  our  sample  and  the  Execucomp/First  Call  population  differ  significantly  both  on  t-tests  and  nonparametric 
Wilcoxon  rank  sum  tests  (p  <  0.05). 

“  We  compare  our  sample  firms  to  the  population  of  firms  with  at  least  three  consecutive  years  of  data  in  Execucomp,  and 
with  at  least  one  analyst  following  from  First  Call. 

Exhibit  1  defines  all  variables. 


higher  institutional  ownership,  and  larger  analyst  following.  Our  sample  firms  also  have  slightly 
higher  incidences  of  class  action  lawsuits,  restructurings,  and  restatements,  and  their  management 
earnings  forecasts  have  somewhat  longer  horizon.  However,  the  sample  and  population  exhibit 
similar  absolute  changes  in  EPS,  dispersion  in  analysts’  forecasts,  incidence  of  losses,  R&D 
intensity,  market-to-book  ratios,  litigation  risk,  and  acquisition  activity.  In  sum,  our  sample  firms 
are  somewhat  larger  than  the  population,  and  exhibit  other  differences  associated  with  larger  firms, 
but  our  sample  does  not  differ  on  other  fundamental  economic  characteristics  such  as  risk  (beta), 
return  on  assets,  absolute  changes  in  EPS,  and  market-to-book  ratios. 
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The  Existence  and  Magnitude  of  Individual  Top  Managers’  Voluntary  Disclosure  Styles 
Main  Results 

The  results  of  the  manager  fixed-effects  analyses  appear  in  Tables  2—6.  The  primary  inference 
is  that  top  managers  have  statistically  and  economically  significant  fixed  effects  on  their  firms’ 
voluntary  disclosure  choices.  We  discuss  these  results  in  more  detail. 

Table  2  presents  the  results  for  the  frequency  of  forecasts.  The  F-tests  from  Model  2  indicate 


TABLE  2 

Testing  Individual  Top  Managers’  Fixed  Effects  on  the  Number  of  Management  Earnings 

Forecasts 


F  Freq 
(n  =  3,920)“ 

Economic 
Determinants, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  1) 

Economic 
Determinants 
and  Manager 
Fixed  Effects, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  2) 

Testing  Economic  Determinants  = 

=  0 

F-statistics 

6.77 

4.32 

(p-value) 

(<0.001) 

(<0.001) 

Constraints 

15 

15 

Testing  Manager  Fixed  Effects  = 

0 

F-statistics 

7.46 

(p-value) 

(<0.001) 

Constraints 

292 

Adjusted  R^ 

33.5% 

43.5% 

Improvement  in  Adjusted  Relative  to  Model  (1): 
In  Raw  Percentage  Points  10.0 

As  a  %  of  Model  I’s  R^  29.9% 


Economic 
Determinants 
and  CEOs’ 
Fixed  Effects, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  3) 

Economic 
Determinants 
and  CFOs’ 
Fixed  Effects, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  4) 

Economic 
Determinants 
and  General 
Counsels’  Fixed 
Effects,  Year- 
and  Firm- 
Specific  Fixed 
Effects 
(Model  5) 

5.16 

5.38 

7.11 

(<0.001) 

(<0.001) 

(<0.001) 

15 

15 

15 

7.10 

5.92 

5.01 

(<0.001) 

(<0.001) 

(<0.001) 

111 

93 

22 

40.2% 

36.4% 

34.0% 

6.7 

2.9 

0.5 

20.0% 

8.7% 

1.5% 

Distribution  of  Manager  Fixed  Effects'’ 
(n  =  292  Managers) 

Manager 
Fixed  Effect 


%  Significant  at  10%  Level 

51% 

Mean  Effect 

0.00 

Median  Effect 

0.01 

25th  Percentile 

-0.97 

75th  Percentile 

0.96 

Difference  between  75th  and 
25th  Percentile 

1.93 

“  We  report  the  results  of  five  regressions  after  controlling  for  economic  determinants,  and  year-  and  firm-specific  fixed 
effects.  We  control  for  heteroscedasticity  using  White’s  standard  errors.  The  adjusted  is  calculated  based  on  the 
Within  from  XTREG,  FE  in  STATA.  Exhibit  1  defines  the  economic  determinants. 

^  The  manager  fixed  effects  are  the  estimated  coefficients  from  Model  2  that  controls  for  economic  determinants  as  well 
as  year-  and  firm-specific  fixed  effects  in  estimating  the  incremental  manager  fixed  effects. 
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that  the  manager- specific  fixed  effects  are  jointly  different  from  zero,  after  controlling  for  eco¬ 
nomic  determinants  as  well  as  firm-  and  time-specific  fixed  effects  (p  <  0.001).  We  next  assess  the 
economic  significance  of  these  manager-specific  fixed  effects  in  three  ways.  We  show;  (1)  the 
increase  in  adjusted  from  adding  the  manager  fixed  effects,  both  in  terms  of  raw  percentage 
points,  and  in  percentage  changes  (i.e.,  the  increase  in  adjusted  as  a  percentage  of  the  adjusted 
R^  from  Model  1),  (2)  that  the  proportion  of  statistically  significant  manager  fixed  effects  is  much 
higher  than  expected  by  chance,  and  (3)  that  the  interquartile  range  indicates  meaningful  cross- 
sectional  differences  in  managers’  disclosure  styles. 

First,  even  after  controlling  for  known  economic  determinants  of  disclosure  (as  well  as  time- 
and  firm-specific  fixed  effects),  adding  manager  fixed  effects  to  the  base  Model  1  significantly 
improves  explanatory  power.  Adjusted  R^  increases  from  33.5  percent  in  the  base  model  to  43.5 
percent,  an  increase  of  ten  percentage  points  in  absolute  terms  and  29.9  percent  (10/33.5)  in 
relative  terms. Models  3  to  5  show  how  managers  in  the  CEO,  CFO,  and  General  Counsel 
positions  contribute  to  the  increase  in  explanatory  power  observed  in  Model  2. 

The  second  chart  in  Table  2  displays  the  two  additional  indicators  of  the  economic  signifi¬ 
cance  of  the  manager-specific  fixed  effects  estimated  in  Model  2.  Even  after  controlling  for 
economic  determinants,  5 1  percent  of  the  individual  manager-specific  effects  are  significant  at  the 
10  percent  level,  much  higher  than  expected  by  chance.  The  interquartile  range  shows  that  replac¬ 
ing  a  manager  at  the  25th  percentile  with  one  at  the  75th  percentile  increases  the  number  of 
forecasts  issued  each  year  by  1.93,  which  is  material  given  that  the  mean  number  of  forecasts  our 
sample  firms  issue  each  year  is  1.77  (Table  1).  Clearly,  individual  managers’  unique  fixed  effects 
play  an  economically  significant  role  in  firms’  forecasting  frequency. 

Tables  3-6  show  that  the  economic  determinants  explain  much  less  of  the  cross-sectional 
variation  in  forecast  precision,  news,  bias,  and  error  (1.8  percent  to  12.5  percent)  than  of  forecast 
frequency  (33.5  percent.  Table  2),  consistent  with  the  Hirst  et  al.  (2008)  observation  that  we  know 
less  about  the  attributes  of  management  forecasts  than  we  do  about  the  decision  to  issue  a  forecast. 
For  forecast  precision,  adding  the  manager-specific  effects  to  the  base  Model  1  increases  adjusted 
R^  from  1.8  percent  to  10.0  percent,  an  increase  of  8.2  percentage  points  in  absolute  terms  (or  an 
increase  of  456  percent  in  relative  terms).  Similarly,  adding  the  manager- specific  fixed  effects  to 
the  forecast  news,  bias,  and  error  models  increases  adjusted  R^  by  9.5,  7.6,  and  10.3  percentage 
points  (which  translate  to  increases  of  82  percent,  61  percent,  and  82  percent  in  relative  terms), 
relative  to  Model  1.  The  increases  in  adjusted  R^  observed  in  Models  3  to  5  suggest  that  CEOs’ 
and  CFOs’  styles  play  a  significant  role  in  these  forecast  attributes,  but  the  General  Counsels’  role 
is  not  economically  meaningful  except  for  the  forecast  error  attribute. 

The  second  chart  in  Tables  3-6  displays  the  distribution  of  manager-specific  fixed  effects 
estimated  in  Model  2.  Between  38  percent  and  46  percent  of  the  individual  manager  fixed  effects 
are  significant  at  the  10  percent  level,  much  higher  than  expected  by  chance.  The  interquartile 
ranges  further  support  the  materiality  of  the  individual  manager  effects.  Table  3  shows  that  re¬ 
placing  a  manager  at  the  25th  percentile  with  one  at  the  75th  percentile  increases  average  precision 
by  0.75,  which  is  material  relative  to  the  unconditional  mean  of  1.97  (per  Table  1).  Turning  to 


The  increase  in  adjusted  from  adding  manager-specific  fixed  effects  compares  favorably  with  the  increase  in  adjusted 
from  adding  manager-specific  fixed  effects  to  models  of  other  corporate  decisions.  For  example,  Bertrand  and  Schoar 
(2003)  find  that  adding  manager-specific  fixed  effects  to  models  of  investment  and  financial  policies  increases  adjusted 
R^  by  an  average  of  five  percentage  points  across  eight  decisions. 

Following  Bertrand  and  Schoar  (2003),  our  primary  analyses  are  based  on  the  manager’s  position  at  the  last  firm.  When 
we  require  managers  to  retain  the  same  job  title  at  both  Firm  1  and  Firm  2  (i.e.,  CEO  in  both  firms,  CFO  in  both  firms, 
or  General  Counsel  in  both  firms),  we  still  find  that  adding  the  manager-specific  fixed  effects  increases  the  models’ 
adjusted  R^  by  an  average  of  8.3  raw  percentage  points  across  the  five  disclosure  attributes. 
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TABLE  3 

Testing  Individual  Top  Managers’  Fixed  Effects  on  Forecast  Precision 


F  Precision 
(n  =  2,169)“ 

Economic 
Determinants, 
Year-  and  Firm- 
Specific  Fixed 
Effects 
(Model  1) 

Economic 
Determinants 
and  Manager 
Fixed  Effects, 
Year-  and  Firm- 
Specific  Fixed 
Effects 
(Model  2) 

Economic 
Determinants 
and  CEOs’ 
Fixed  Effects, 
Year-  and  Firm- 
Specific  Fixed 
Effects 
(Model  3) 

Economic 
Determinants 
and  CFOs’ 
Fixed  Effects, 
Year-  and  Firm- 
Specific  Fixed 
Effects 
(Model  4) 

Economic 
Determinants 
and  General 
Counsels’  Fixed 
Effects,  Year- 
and  Firm- 
Specific  Fixed 
Effects 
(Model  5) 

Testing  Economic  Determinants  = 

=  0 

F-statistics 

1.33 

1.37 

1.31 

1.40 

1.23 

(p-value) 

(0.172) 

(0.150) 

(0.185) 

(0.132) 

(0.237) 

Constraints 

16 

16 

16 

16 

16 

Testing  Manager  Fixed  Effects  = 

0 

F-statistics 

34.88 

12.20 

21.12 

8.69 

(p-value) 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

Constraints 

235 

138 

84 

17 

Adjusted 

1.8% 

10.0% 

8.3% 

3.8% 

1.7% 

Improvement 

in  Adjusted  R^  Relative  to  Model  (1): 

In  Raw  Percentage  Points 

8.2 

6.5 

2.0 

-0.1 

As  a  %  of  Model  1  ’s  R^ 

456% 

361% 

111% 

-6% 

Distribution  of  Manager  Fixed  Effects** 
(n  =  235  Managers) 

Manager 
Fixed  Effect 


%  Significant  at  10%  Level  39% 

Mean  Effect  0.04 

Median  Effect  0.02 

25th  Percentile  —0.39 

75th  Percentile  0.36 

Difference  between  75th  and  0.75 

25th  Percentile 


^  Exhibit  1  defines  the  economic  determinants.  Footnotes  in  Table’ 2  explain  the  statistical  tests. 

The  manager  fixed  effects  are  the  estimated  coefficients  from  Model  2  that  controls  for  economic  determinants  as  well 
as  year-  and  firm-specific  fixed  effects  in  estimating  the  incremental  manager  fixed  effects. 


Table  4,  replacing  a  manager  at  the  25th  percentile  with  one  at  the  75th  shifts  away  from  a 
tendency  to  issue  forecasts  to  convey  bad  news,  toward  a  tendency  to  issue  forecasts  to  convey 
good  news.  Table  5  shows  that  replacing  a  manager  in  the  25th  percentile  with  one  in  the  75th 
shifts  from  forecasts  that  undershoot  earnings  to  forecasts  that  overshoot  earnings.  Finally,  Table 
6  shows  that  replacing  a  manager  in  the  25th  percentile  with  one  in  the  75th  increases  average 
absolute  forecast  error  by  0.91  percent  of  stock  price,  which  is  material  relative  to  the  sample 
mean  absolute  error  of  0.58  percent  per  Table  1. 

Taken  together.  Tables  2-6  show  that  manager-specific  fixed  effects  play  an  economically 
significant  incremental  role  in  explaining  forecast  frequency  as  well  as  detailed  forecast  attributes 
such  as  precision,  news  conveyed  by  the  forecast,  bias  relative  to  ex  post  actual  EPS,  and  absolute 
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TABLE  6 


Testing  Individual  Top  Managers’  Fixed  Effects  on  Absolute  Forecast  Error 


F  AbsError 
(n  =  1,914)*' 

Economic 
Determinants, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  1) 

Economic 
Determinants 
and  Manager 
Fixed  Effects, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  2) 

Economic 
Determinants 
and  CEOs’ 
Fixed  Effects, 
Year-  and 
Firm-Specific 
Fixed  Effects 
(Model  3) 

Economic 
Determinants 
and  CFOs’  Fixed 
Effects,  Year- 
and  Firm- 
Specific  Fixed 
Effects 
(Model  4) 

Economic 
Determinants 
and  General 
Counsels’  Fixed 
Effects,  Year- 
and  Firm- 
Specific  Fixed 
Effects 
(Model  5) 

Testing  Economic  Determinants  = 

=  0; 

E-statistics 

6.72 

5.82 

6.37 

5.95 

6.84 

(p-value) 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

Constraints 

16 

16 

16 

16 

16 

Testing  Manager  Eixed  Effects  = 

0: 

E-statistics 

51.50 

6.75 

13.78 

3.12 

(p-value) 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

Constraints 

213 

126 

76 

15 

Adjusted-R^ 

12.5% 

22.8% 

19.4% 

14.0% 

14.0% 

Improvement  in 

Adjusted  R^  Relative  to  Model  (1): 

In  Raw  Percentage  Points 

10.3 

6.9 

1.5 

1.5 

As  a  %  of  Model  I’s  R^ 

82% 

55% 

12% 

12% 

Distribution  of  Manager  Fixed  Effects** 
(n  =  213  Managers) 

Manager 
Fixed  Effect 


%  Significant  at  10%  Level  44% 

Mean  Etfect  0.03 

Median  Etfect  —0.00 

25th  Percentile  —0.43 

75th  Percentile  0.48 

Difference  between  75th  and  0.91 

25  th  Percentile 


“  Exhibit  1  defines  the  economic  determinants.  Footnotes  in  Table  2  explain  the  statistical  tests. 

The  manager  fixed  effects  are  the  estimated  coefficients  from  Model  2  that  controls  for  economic  determinants  as  well 
as  year-  and  firm-specific  fixed  effects  in  estimating  the  incremental  manager  fixed  effects. 


deviations.  That  is,  do  managers  take  the  initiative  to  impose  their  styles  on  their  firms,  or  do  firms 
hire  managers  because  the  firm  wants  the  manager  to  bring  his/her  unique  disclosure  style  (which 
we  refer  to  as  the  matching  story)?  Under  either  scenario,  managers  influence  the  firm’s  disclosure 
choices.  The  only  difference  is  the  extent  of  the  firm’s  role  in  proactively  hiring  a  manager  with  a 
particular  disclosure  style. 

We  argue  that  the  impetus  likely  stems  from  top  managers  independently  imposing  their  styles 
on  the  firm.  First,  Bertrand  and  Schoar  (2003)  conclude  that  the  matching  story  does  not  explain 
the  significant  manager-specific  fixed  effects  on  fundamental  strategic  and  operational  choices 
such  as  preferences  for  internal  versus  external  growth  (e.g.,  capital  investments  and  R&D  versus 
acquisitions)  and  financial  aggressiveness  (e.g.,  dividend  and  capital  structure  policies).  If  firms  do 
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not  proactively  hire  managers  for  specific  styles  in  fundamental  investment  and  operational  deci¬ 
sions,  it  stretches  credulity  that  firms  proactively  hire  new  top  executives — particularly  CEOs — 
largely  to  change  attributes  of  voluntary  management  earnings  forecasts.  Second,  respondents  to  a 
recent  National  Investor  Relations  Institute  (NIRI)  survey  say  top  managers  do  impose  their 
unique  disclosure  styles  on  the  firm.^*  Third,  the  matching  story  would  require  the  firm’s  desired 
disclosure  strategy  to  change  over  time  in  a  pattern  directly  mirrored  by  changes  in  the  types  of 
managers  the  firm  hires,  and  the  primary  causes  of  changes  in  desired  disclosure  strategy  would 
have  to  be  new  drivers  of  voluntary  disclosure  that:  (1)  are  not  firm-specific  characteristics  (which 
are  controlled  by  the  firm-specific  fixed  effects),  (2)  are  not  year-specific  effects  (which  are 
controlled  by  the  year  fixed  effects),  and  (3)  have  not  been  identified  by  prior  research,  and  thus 
are  not  included  in  our  vector  of  economic  determinants. 

Testing  Whether  Top  Managers’  Personal  Demographic  Characteristics  are  Associated  with 
Their  Unique  Disclosure  Styles 

Table  7  provides  descriptive  statistics  on  our  sample  managers’  demographic  characteristics. 
Nearly  half  have  an  accounting  or  finance  background,  8  percent  have  a  legal  background,  and  the 
remainder  rose  from  general  management  (including  production  and  marketing).  Of  the  other 
demographic  characteristics  we  measure,  16  percent  of  our  sample  managers  were  bom  before 
World  War  II,  10  percent  have  military  experience,  and  37  percent  hold  an  M.B.A..  We  also 
manually  collected  demographic  characteristics  of  a  random  sample  of  100  managers  from  the 
Execucomp/First  Call  population  of  firms  used  for  comparison  in  Table  1 .  Our  sample  managers 
are  not  significantly  different  from  the  random  sample,  except  that  we  have  a  smaller  proportion 
of  older  managers,  which  would  bias  against  finding  results  on  our  age  cohort  variable  {PreWWII). 
Panel  B  shows  that  managers  rising  from  accounting  and  finance  are  less  likely  to  be  born  before 
World  War  II,  and  more  likely  to  hold  M.B.A.s,  consistent  with  a  trend  toward  choosing  profes¬ 
sional  administrators  who  run  the  company  “by  the  numbers.” 

Table  8  presents  the  results  of  estimating  Model  6  that  examines  the  association  between 
managers’  disclosure  styles  and  their  demographic  characteristics.  The  base  group  impounded  in 
the  intercept  is  managers  bom  after  World  War  II  who  have  neither  military  experience  nor  an 
M.B.A.,  and  who  rose  from  general  management.  Table  8  shows  that  managers  with  accounting  or 
finance  backgrounds  develop  disclosure  styles  that  reflect  lower  tolerance  of  ambiguity  (fewer  but 
more  specific  forecasts,  p  <  0.05)  and  that  are  conservative  in  tending  to  undershoot  actual 
earnings  (p  =  0.04).  Managers  with  legal  backgrounds  favor  disclosure  styles  that  guide  expec¬ 
tations  down  (p  =  0.09),  reflecting  sensitivity  to  litigation  risk.  Those  bom  before  World  War  II 
are  conservative  in  that  they  tend  to  issue  fewer  forecasts  (p  <  0.01).  Managers  with  military 
experience  tend  to  develop  disclosure  styles  that:  (1)  are  more  precise  (p  =  0.04),  reflecting  lower 
tolerance  of  ambiguity,  and  (2)  provide  downward  (not  upward)  guidance  (p  =  0.02).  Even  though 
M.B.A.s’  disclosure  styles  tend  to  guide  expectations  up  (p  =  0.05),  their  disclosures  are  more 
accurate  relative  to  ex  post  actual  EPS  (p  =  0.08). 

These  associations  between  managers’  demographic  characteristics  and  their  fixed  effects  are 
of  interest  in  their  own  right.  Finding  plausible  associations  between  manager-specific  fixed  effects 
estimated  in  the  first  stage  and  distinctive  (permanent)  characteristics  of  their  own  personal  back- 


Among  firms  that  discontinued  guidance,  the  most  popular  response  to  “What  would  make  you  consider  providing 
earnings  guidance  in  the  future?”  was  “change  in  management  philosophy.”  Similarly,  when  asked  “Where  did  the 
decision  to  consider  discontinuing  guidance  come  from?”  69  percent  responded  that  it  came  from  senior  management. 
For  companies  that  are  currently  providing  guidance,  but  are  considering  discontinuing  guidance,  63  percent  of  the 
respondents  indicated  that  this  is  attributable  to  a  “change  in  management  philosophy”  (NIRI  2007). 
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TABLE  7 

Descriptive  Statistics  on  Managers’  Personal  Demographic  Characteristics^ 


Panel  A:  Frequency  of  Managers’  Demographic  Characteristics 


Demographic 

Characteristics 
(n  =  292) 

Frequency 

Percent 

Execucomp/First 
Call  Random 
Sample  Percent 

Functional  Career  Track 

Acct/Fin 

135 

46 

38 

Legal 

22 

8 

12 

General  Management 

135 

46 

50 

Age  Cohort 

PreWWII 

48 

16 

31* 

Boomer/Xer 

244 

84 

69* 

Military  Experience 

Military 

29 

10 

6 

Non-military 

263 

90 

94 

M.B.A.  Education 

M.B.A. 

107 

37 

29 

Non-M.B.A. 

185 

63 

71 

Panel  B:  Age  Cohort,  Military  Experience,  and  M.B.A.  Degree  hy  Functional  Career  Track 

Functional  Career  Track 


General 

Acct/Fin  Legal  Management  Total 


By  Age  Cohort 


PreWWII 

15 

4 

29 

48 

Boomer/Xer 

120 

18 

106 

244 

Total 

135 

22 

135 

292 

Chi-Square  Test  of  Independence: 

(2)  = 

=  5.34,  p 

=  0.07 

By  Military  Experience 

Military 

11 

3 

15 

29 

Non-military 

124 

19 

120 

263 

Total 

135 

22 

135 

292 

Chi-Square  Test  of  Independence: 

(2)  = 

=  1.03,  p 

=  0.60 

By  M.B.A.  Status: 

M.B.A. 

68 

0 

39 

107 

Non-M.B.A. 

67 

22 

96 

185 

Total 

135 

22 

135 

292 

Chi-Square  Test  of  Independence: 

(2)  = 

=  27.18,  I 

)  <  0.001 

*  Indicates  our  sample  differs  significantly  from  the  random  sample  managers  (n  =  100)  selected  from  the  Execucomp/ 
First  Call  population  on  a  two-sample  test  of  differences  in  proportion  (p<  0.05). 

We  collect  managers’  functional  career  track  (i.e.,  employment  history),  age,  military  experience,  and  education  from 
Marquis  Who’s  Who  Online,  Hoover’s,  Mergent  Online,  Zoominfo.com,  NNBD.com,  Execucomp,  and  general  Internet 
searches.  We  classify  a  manager’s  functional  background  into  one  of  three  categories:  Accounting  and  Finance  (Acct/ 
Fin),  Fegal,  or  General  Management  (including  production,  marketing,  and  strategy). 
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TABLE  8 

Testing  Associations  between  Demographic  Characteristics  of  Managers’  Personal 
Backgrounds  and  Their  Own  Unique  Overall  Disclosure  Styles® 

ModehX^  y,-  =  ag  +  yjAcct/Fin  +  y2Legal  +  y^PreWWII  +  y^Military  +  y^MBA  +  e 


F Freq 

F Precision 

FjGoodnews 

F BiasUP 

F AbsError 

Acct/Fin  =  1 

-0.33 

0.22 

0.02 

-0.22 

0.16 

(0.05)** 

(0.01)** 

(0.78) 

(0.04)** 

(0.15) 

Legal  =  1 

-0.16 

0.18 

-0.17 

-0.15 

0.17 

(0.65) 

(0.16) 

(0.09)* 

(0.43) 

(0.50) 

PreWWII  =  1 

-0.70 

-0.06 

-0.10 

-0.03 

0.15 

(0.00)*** 

(0.60) 

(0.30) 

(0.83) 

(0.40) 

Military  =  1 

0.11 

0.29 

-0.22 

0.05 

-0.00 

(0.67) 

(0.04)** 

(0.02)** 

(0.72) 

(0.99) 

MBA  =  1 

0.06 

0.01 

0.13 

-0.13 

-0.19 

(0.72) 

(0.91) 

(0.05)* 

(0.20) 

(0.08)* 

Adjusted  (OLS) 

3% 

5% 

7% 

2% 

1% 

Robust  Adjusted  R^  (LTS)*’ 

12% 

13% 

5% 

9% 

11% 

*,**,***  Significant  at  10  percent,  5  percent,  and  1  percent,  respectively. 

Two-tailed  robust  p-values  adjusting  for  heteroscedasticity  appear  in  parentheses. 

“  y,  are  the  manager-specific  fixed  effects  estimated  in  Model  2.  Acct/Fin,  Legal,  PreWWII,  Military,  and  MBA  are 
dichotomous  variables  characterizing  whether  managers  rise  from  accounting/finance  or  legal  backgrounds,  whether 
they  were  bom  before  World  War  II,  whether  they  have  military  experience,  and  whether  they  hold  an  M.B.A.  degree. 
To  ensure  our  inferences  are  not  an  artifact  of  a  few  extreme  values,  we  report  OLS  results  after  removing  a  small 
number  of  extreme  outliers  with  (absolute)  studentized  residuals  exceeding  2. 

’’  Robust  adjusted  is  generated  from  the  least  trimmed  squares  robust  regression  (Rousseeuw  1984). 


grounds  also  confirms  these  fixed  effects  are  not  simply  random,  but  instead  capture  systematic 
long-lived  cross-sectional  differences  in  managers’  unique  disclosure  styles. 

The  explanatory  power  of  these  demographic  characteristics  is  moderate,  however.  Two 
econometric  issues  dampen  the  adjusted  from  OLS  estimation.  First,  the  dependent  variables  in 
Model  6  (i.e.,  manager-specific  disclosure  styles)  are  estimated  parameters  from  the  fixed-effect 
analysis,  and  so  contain  measurement  error  (although  this  measurement  error  should  be  smaller 
than  the  error  that  would  occur  in  simpler  designs  that  do  not  cleanly  separate  the  manager  effect 
from  the  firm  effect).  While  OLS  estimation  still  generates  unbiased  coefficients  (Wooldridge 


While  moderate,  our  adjusted  R^s  compare  favorably  to  those  of  the  only  other  study  (of  which  we  are  aware)  that 
reports  statistics  on  the  explanatory  power  of  models  attempting  to  explain  cross-sectional  variation  in  managers’  fixed 
effects  (or  styles)  regarding  other  corporate  decisions.  In  an  independent  and  contemporaneously  developed  study, 
Dyreng  et  al.  (2010,  1185)  document  manager-specific  effects  on  corporate  tax  avoidance  (measured  based  on  effective 
tax  rates).  They  conclude  that  “the  results  indicate  that  biographical  information  such  as  educational  background  and  age 
does  not  explain  much  of  the  variation  in  tax  avoidance  across  executives”  because  adjusted  R^s  from  their  regressions 
are  negative.  However,  the  authors  go  on  to  point  out  that  “this  finding  does  not  suggest  that  executives  have  no  effect 
on  tax  avoidance.  Instead,  these  results  are  evidence  that  common,  observable  characteristics  are  not  strongly  associated 
with  executives’  propensities  to  reduce  effective  tax  rates.” 
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2006,  320),  this  larger  error  variance  decreases  R^.  We  ran  a  simulation  to  confirm  that  measure- 

9  23 

ment  error  in  the  dependent  variable  can  significantly  dampen  adjusted  R  . 

Second,  measurement  error  in  the  dependent  variable  can  lead  to  outliers.  We  repeat  our 
analysis  using  robust  regression  via  least  trimmed  squares,  which  generates  more  stable  results  in 
the  presence  of  outliers  (Chen  2002;  Zaman  et  al.  2001).  The  last  row  of  Table  8  shows  that  for 
four  of  our  five  forecast  characteristics,  the  demographic  variables’  explanatory  power  is  quite  a 
bit  higher  using  the  robust  regression  estimation,  with  adjusted  R^s  ranging  from  5  percent  to  13 
percent. 

The  simulation  and  robust  regression  results  give  a  more  appropriate  sense  of  the  economic 
significance  of  the  demographic  characteristics — the  recommended  starting  point  for  exploiing 
differences  in  managers’  unique  styles  (Finkelstein  and  Hambrick  1996;  Hambrick  2007,  335). 
Although  we  conducted  a  comprehensive  review  of  a  broad  range  of  literatures,  theory  as  to  how 
demographic  characteristics  might  be  associated  with  unique  managerial  styles  is  in  its  infancy, 
and  there  are  a  limited  number  of  observable  (permanent)  demographic  characteristics  that  could 
plausibly  affect  disclosure.  Managers’  individual-specific  disclosure  styles  undoubtedly  result 
from  their  many  (unobservable)  experiences,  psychological  traits,  and  values,  for  which  demo¬ 
graphic  characteristics  such  as  functional  career  track,  age  cohort,  military  experience,  and  M.B.A. 
education  provide  a  start,  but  are  admittedly  an  incomplete  and  crude  set  of  proxies. 

Additional  Analyses 

Given  the  modest  explanatory  power  of  the  demographic  variables,  we  also  explore  whether 
two  nondemographic  factors  explain  managers’  disclosure  styles:  managers’  risk-taking  behavior 
as  revealed  by  their  fixed  effects  on  their  firms’  operating  and  financing  decisions,  and  managers’ 
fixed  effects  on  their  firms’  operational  efficiency.^"^  We  identify  firm  decisions  and  outcomes  that 
might  reflect  their  managers’  risk-taking  behavior:  the  frequency  of  acquisitions,  investment  in 
R&D  and  capital  expenditure,  cash  holdings,  and  stock  return  volatility  (Bertrand  and  Schoar 
2003;  Bargeron  et  al.  2010).  We  estimate  managers’  fixed  effects  on  these  decisions  and  outcomes 
using  models  similar  to  Bertrand  and  Schoar  (2003).  We  add  these  risk-related  manager-specific 
fixed  effects  (individually  and  in  a  composite  index)  to  our  original  set  of  demographic  variables 
in  Model  6  that  explains  managers’  overall  disclosure  styles.  Our  original  inferences  remain 
largely  robust,  and  we  find  little  evidence  that  these  risk-related  manager-specific  fixed  effects  are 
associated  with  managers’  styles  in  voluntary  disclosure.^^ 


Even  in  a  simulated  world  where  X  fully  explains  the  true  value  of  Y  (so  is  100  percent  absent  measurement  error), 
adjusted  plummets  from  97  percent  when  the  standard  deviation  of  the  error  term  in  Y  is  10  percent  of  the  mean  of 
Y,  to  just  11  percent  when  the  standard  deviation  of  the  error  term  increases  to  1.5  times  the  mean  of  Y.  (Specifically, 
we  randomly  generate  200  observations  by  drawing  X  from  a  uniform  distribution  [1,100].  Y  is  determined  by  X,  Y  = 
1-1-1  *  X,  so  R^  is  100  percent  when  we  regress  Y  on  X.  We  also  randomly  draw  200  values  of  a  measurement  error 
term,  v,  from  a  normal  distribution  with  mean  zero  and  standard  deviation  that  is  10  percent  of  the  mean  of  Y.  We 
construct  Y*  =  Y  -I-  uand  regress  Y*  on  X.  We  repeat  this  regression  1,000  times  (each  time  randomly  drawing  200 
values  for  the  error  term  ufrom  the  same  distribution),  constructing  Y*  based  on  ueach  time,  to  get  the  mean  adjusted 
R^  and  the  mean  coefficient  estimates  on  X.  We  repeat  this  simulation  for  15  trials,  and  in  each  trial  we  increase  the 
standard  deviation  of  the  error  term  vhy  10  percent  of  the  mean  of  Y  until  the  standard  deviation  is  1.5  times  the  mean 
of  Y.) 

We  are  able  to  identify  one  demographic  characteristic  that  likely  primarily  reflects  risk  aversion.  Research  in  psychol¬ 
ogy  and  economics  suggests  that  women  tend  to  be  more  risk-averse,  both  in  general  (Byrnes  et  al.  1999;  Harris  et  al. 
2006)  and  in  financial  decisions  (Powell  and  Ansic  1997;  Watson  and  McNaughton  2007;  Sapienza  et  al.  2009).  When 
we  include  gender  in  Model  6,  all  of  the  inferences  on  our  original  demographic  variables  remain  robust,  and  we  find 
that  female  managers  tend  to  adopt  disclosure  styles  that  are  less  precise  and  underestimate  upcoming  earnings,  con¬ 
sistent  with  females’  greater  risk  aversion.  However,  recall  that  only  six  of  our  managers  are  female,  so  our  findings  on 
gender  are  necessarily  tentative. 

The  only  exception  is  that  managers  who  are  more  risk-seeking  in  terms  of  engaging  in  more  frequent  acquisitions  tend 
to  forecast  more  frequently  and  tend  to  be  more  precise.  When  we  include  this  fixed  effect  in  our  Model  6,  the 
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There  are  several  explanations  why  this  additional  analysis  did  not  yield  new  insights.  First, 
to  the  extent  our  original  demographic  variables  (partially)  reflect  the  risk  preferences  of  demo¬ 
graphic  groups  (e.g.,  those  who  are  older  and  those  rising  from  accounting  backgrounds  tend  to  be 
more  risk-averse),  the  incremental  effect  picked  up  by  the  new  risk  variables  does  not  reflect  the 
manager’s  total  risk  preference,  but  rather  reflects  how  the  manager’s  risk  preference  deviates 
from  the  average  risk  preferences  of  his/her  demographic  classifications.  Second,  unlike  the  reli¬ 
ably  observable  demographic  variables,  the  new  measures  are  estimated.  Third,  estimates  of 
manager-specific  fixed  effects  on  the  firm’s  operational  and  financing  decisions  may  not  reflect 
managers’  true  risk  preferences  because  firms  often  provide  compensation  (e.g.,  equity-based  pay) 
to  encourage  even  risk-averse  managers  to  take  on  risky  operational  and  financing  projects  to 
maximize  firm  value  (Rajgopal  and  Shevlin  2002;  Coles  et  al.  2006). 

With  respect  to  managers’  operational  efficiency,  we  estimate  managers’  fixed  effects  on  asset 
turnover  using  a  model  similar  to  Bertrand  and  Schoar  (2003).  We  find  only  weak  evidence  that 
managers’  operational  efficiency  is  associated  with  one  aspect  of  their  disclosure  styles — 
managers  who  are  associated  with  higher  asset  turnover  tend  to  be  less  precise  (p  =  0.09),  while 
the  inferences  on  our  original  demographic  variables  remain  robust. 

Recall  that  Tables  2-6  document  the  incremental  effect  of  individual  managers’  total  influ¬ 
ences  on  cross-sectional  variation  in  firms’  voluntary  disclosure  choices  in  order  to  address  our 
first  research  question;  Do  individual  managers  exert  economically  significant  idiosyncratic  non¬ 
economic  influence  on  their  firms’  voluntary  disclosure  choices?  Our  second  research  question 
follows  upper  echelons  theory’s  recommendation  to  focus  on  how  managers’  demographic  char¬ 
acteristics  affect  their  unique  styles,  by  investigating  whether  reliable,  observable  demographic 
characteristics  of  managers’  personal  backgrounds  help  explain  cross-sectional  variation  in  man¬ 
agers’  own  unique  voluntary  disclosure  styles.  Table  8  provides  evidence  of  how  managers’ 
demographic  characteristics  affect  their  overall  disclosure  styles  identified  in  Tables  2-6. 

We  now  provide  ancillary  analysis  to  address  a  different  question — do  the  necessarily  limited 
set  of  demographic  characteristics  for  which  we  can  find  some  conceptual  support  in  the  literature 
play  an  incremental  role  in  directly  explaining  cross-sectional  variation  in  firms  ’  voluntary  disclo¬ 
sure  choices?  We  replace  the  manager-specific  indicator  variables  used  in  Model  2  to  estimate 
managers’  total  disclosure  styles  with  the  set  of  their  demographic  characteristics.  Regressing 
firm-level  disclosure  choices  directly  on  this  admittedly  limited  set  of  demographic  characteristics 
captures  only  the  portion  of  the  manager’s  total  disclosure  style  that  is  associated  with  these 
demographic  characteristics.  Nonetheless,  Table  9  shows  that  seven  of  the  nine  significant  results 

reported  in  Table  8  continue  to  hold  (accounting/finance  backgrounds  are  no  longer  associated 

26 

with  less  frequent  but  more  downward-biased  forecasts). 


coefficient  on  Legal  in  the  analysis  of  FjGoodnews  just  loses  significance  (p  =  0.14). 

Table  9  does  not  provide  evidence  on  managers’  total  impact  on  their  firms’  disclosure  decisions;  instead  it  only  captures 
the  effect  of  a  set  of  observable  demographic  characteristics  that  is  necessarily  limited  by  the  dearth  of  both  extant 
theory  and  data  availability.  Thus,  Table  9  captures  only  a  small  portion  of  managers’  total  idiosyncratic  effects  on 
cross-sectional  variation  in  firms’  voluntary  disclosure  choices.  In  contrast,  the  primary  specifications  in  Tables  2-6 
document  the  full  effect  of  individual  managers  on  firms’  disclosure  choices,  and  then  Table  8  zooms  in  to  explore  how 
their  demographic  characteristics  influence  their  own  unique  overall  disclosure  styles. 
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TABLE  9 

Testing  Direct  Associations  Between  Demographic  Characteristics  of  Managers’  Personal 
Backgrounds  and  Firms’  Earnings  Forecast  Characteristics® 

F_Freq  FJPrecision  FjGoodnews  FJBiasUP  F_AbsError 


Acct/Fin  =  1 

0.02 

0.12 

-0.01 

-0.06 

0.07 

(0.84) 

(0.09)* 

(0.87) 

(0.50) 

(0.39) 

Legal  =  1 

0.35 

0.09 

-0.22 

0.10 

0.33 

(0.14) 

(0.52) 

(0.02)** 

(0.51) 

(0.21) 

PreWWII  =  1 

-0.33 

-0.20 

0.11 

0.17 

0.05 

(0.03)** 

(0.08)* 

(0.21) 

(0.21) 

(0.84) 

Military  =  1 

-0.09 

0.21 

-0.29 

0.03 

0.04 

(0.67) 

(0.08)* 

(0.00)*** 

(0.83) 

(0.85) 

MBA  =  1 

0.20 

-0.09 

0.10 

0.03 

-0.15 

(0.07)* 

(0.27) 

(0.07)* 

(0.77) 

(0.08) 

Full  Model  Adjusted 

33.7% 

2.1% 

12.5% 

12.4% 

12.6% 

Incremental  Adjusted 

0.2% 

0.3% 

0.9% 

-0.1% 

0.1% 

(relative  to  Model  1) 


*,  **,  ***  Significant  at  10  percent,  5  percent,  and  1  percent,  respectively. 

Two-tailed  robust  p-values  adjusting  for  heteroscedasticity  appear  in  parentheses. 

“  All  variables  are  defined  in  Exhibit  1.  This  analysis  replaces  the  manager-specific  indicator  variables  that  we  used  in 
Model  2  to  estimate  managers’  total  disclosure  styles  (as  reported  in  Tables  2  through  6)  with  the  set  of  their  demo¬ 
graphic  characteristics.  Consequently,  this  analysis  captures  only  the  portion  of  the  manager’s  total  disclosure  style  that 
is  associated  with  these  demographic  characteristics.  For  brevity,  we  do  not  report  coefficient  estimates  for  the  economic 
determinants,  firm  fixed  effects,  and  time-period  fixed  effects. 


V.  CONCLUSIONS 

Driven  by  conflicting  views  on  whether  managers  impose  any  idiosyncratic  influence  on 
corporate  decisions  (e.g.,  Weintraub  2002;  Feltham  2005;  DiMaggio  and  Powell  1983;  Hambrick 
and  Mason  1984),  our  study  first  examines  whether  top  managers  exert  an  economically  signifi¬ 
cant  influence  on  their  firms’  voluntary  disclosure  choices.  We  estimate  the  incremental  effect  an 
individual  manager  exerts  on  voluntary  disclosure  over  two  or  more  employing  firms,  after  con¬ 
trolling  for  firm  fixed  effects,  time-specific  fixed  effects,  and  economic  determinants  of  voluntary 
disclosure  identified  in  prior  research.  From  the  increase  in  adjusted  R^,  the  proportion  of  signifi¬ 
cant  manager  fixed  effects,  and  the  interquartile  range  of  the  manager  fixed  effects,  we  conclude 
that  managers  exert  economically  significant  individual-specific  influence  over  five  attributes  of 
management  earnings  forecasts:  the  frequency  of  forecasts,  forecast  precision,  news  conveyed  by 
the  forecast,  and  the  bias  in  and  accuracy  of  the  forecasts. 

We  then  build  on  upper  echelons  theory  (Flambrick  and  Mason  1984)  and  a  broad  set  of 
literatures  ranging  from  strategic  management,  to  career  counseling,  to  sociology,  to  psychology, 
to  military  science,  and  to  business  education  to  guide  our  exploration  of  how  managers’  func¬ 
tional  career  track,  age  cohort,  military  experience,  and  M.B.A.  degree  influence  their  unique 
disclosure  styles.  We  find  that  disclosure  styles  of  managers  promoted  from  accounting  and  fi¬ 
nance  reflect  elements  of  conservatism  and  distaste  for  ambiguity  (fewer  but  more  precise  disclo¬ 
sures  that  tend  to  underestimate  upcoming  actual  earnings).  Disclosure  styles  of  managers  from 
legal  backgrounds  favor  downward  guidance,  consistent  with  sensitivity  to  litigation  risk.  Manag¬ 
ers  born  before  World  War  II  develop  more  conservative  disclosure  styles  in  terms  of  being  more 
reluctant  to  forecast.  Those  with  military  experience  favor  more  precise  disclosure  styles  that  tend 
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toward  prompt  disclosure  of  unfavorable  information.  Finally,  managers  with  M.B.A.s  develop 
styles  that  tend  to  guide  expectations  upward,  but  are  more  accurate,  consistent  with  M.B.A.s 
being  better  at  forecasting  or  more  sensitive  to  the  adverse  consequences  of  inaccurate  forecasts. 

These  plausible  associations  between  the  manager-specific  fixed  effects  and  distinctive  per¬ 
manent  demographic  characteristics  of  managers’  own  personal  backgrounds  confirm  that  the 
estimated  manager-specific  fixed  effects  reflect  systematic  long-lived  differences  in  managers’ 
disclosure  styles,  and  not  simply  random  error.  From  a  practical  perspective,  evidence  on  the 
relation  between  managers’  disclosure  styles  and  their  demographic  characteristics  can  help  users 
evaluate  managers’  disclosure  choices.  Knowing  that  older  managers  are  less  likely  to  issue 
forecasts,  and  that  those  with  military  experience  are  less  likely  to  guide  unless  there  is  bad  news, 
can  help  market  participants  interpret  when  silence  might  not  indicate  poor  performance  (contrary 
to  theory  that  nondisclosure  signals  bad  news  [e.g.,  Verrecchia  2001,  141]). 

Our  evidence  that  the  largely  unexplored  personal  style  dimension  plays  a  significant  incre¬ 
mental  role  in  explaining  cross-sectional  differences  in  corporate  voluntary  disclosure  choices 
responds  to  the  Brochet  et  al.  (2009,  32)  suggestion  for  research  on  whether  firm  disclosure 
policies  vary  systematically  with  managerial  styles.  Our  evidence  also  helps  address  the  puzzle 
noted  by  Hirst  et  al.  (2008,  316):  “Given  that  managers  have  substantially  greater  control  over 
forecast  characteristics  ...  it  is  striking  that  the  decisions  managers  make  about  such  characteris¬ 
tics  are  comparatively  less  understood.”  Consistent  with  their  conjecture  that  managers  have  more 
control  over  the  detailed  forecast  characteristics,  our  evidence  suggests  that  individual  managers’ 
idiosyncratic  styles  play  a  relatively  more  important  incremental  role  in  explaining  detailed  char¬ 
acteristics  of  the  forecasts,  such  as  precision.  Our  evidence  also  contributes  in  a  broader  fashion  to 
the  nascent  manager-effects  literature  that  has  largely  focused  on  manager-specific  effects  on 
high-profile  strategic  operating  and  financing  decisions.  Finding  that  managers  also  matter  even  in 
seemingly  second-order  decisions  such  as  detailed  voluntary  disclosure  choices  extends  the  range 
of  corporate  decisions  in  which  managers’  idiosyncratic  noneconomic  styles  are  known  to  play  an 
economically  significant  role.  We  also  contribute  by  integrating  a  broad  set  of  literatures  to  pro¬ 
vide  conceptual  grounding  as  to  why  and  when  managers  might  (or  might  not)  matter,  and  how 
various  demographic  characteristics  of  managers’  personal  backgrounds  likely  affect  their  idiosyn¬ 
cratic  disclosure  styles. 

We  believe  future  theoretical,  archival,  and  experimental  research  incorporating  richer  roles 
for  individuals’  styles  has  great  potential  for  increasing  our  understanding  of  how  firms  make 
financial  reporting  choices.  For  example,  recent  theory  is  beginning  to  allow  idiosyncratic  stylistic 
differences  across  individual  top  executives,  such  as  managers’  vision  for  the  future  of  their 
companies,  to  affect  corporate  outcomes  (e.g.,  Rotemberg  and  Saloner  2000;  Van  den  Steen  2005). 
We  hope  that  our  evidence  likewise  spurs  development  of  disclosure  theories  that  incorporate 
richer  roles  for  differences  across  managers’  (noneconomic)  characteristics. 

Upper  echelons  theory  suggests  that  managers’  demographic  characteristics  are  the  appropri¬ 
ate  starting  point  for  exploring  reasons  for  differences  in  individual  managers’  styles.  Drawing  on 
a  comprehensive  review  of  a  broad  set  of  literatures,  we  posit  and  find  plausible  associations 
between  managers’  demographic  characteristics  and  their  disclosure  styles,  although  the  explana¬ 
tory  power  of  the  demographic  characteristics  is  moderate.  Consistent  with  upper  echelon  theory’s 
emphasis  on  demographics,  our  attempts  to  identify  nondemographic  determinants  of  managers’ 
styles  met  with  limited  success.  Significant  further  improvements  in  explaining  managers’  styles 
may  therefore  require  finer  demographic  data  (relative  to  the  types  of  demographic  data  that  are 
currently  publicly  available),  theoretically  guided  exploration  of  variables  beyond  demographic 
characteristics,  and/or  very  different  research  methods.  For  example,  surveys  or  behavioral  re¬ 
search  identifying  cognitive  traits  and  values  that  distinguish  differences  among  the  styles  of 
highly  successful  professional  managers  may  be  fruitful  avenues  for  future  research. 
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APPENDIX 

ECONOMIC  DETERMINANTS  OF  VOLUNTARY  DISCLOSURE 

(the  Vector  Xu) 

To  isolate  the  effects  of  individual  managers,  we  control  for  the  following  comprehensive  set 
of  economic  determinants  that  have  been  well  documented  in  the  prior  disclosure  literature.  Prior 
research  has  established  that  firms  are  less  likely  to  issue  forecasts  when  earnings  are  more 
uncertain  and  harder  to  predict  (e.g.,  Ajinkya  and  Gift  1984;  Baginski  et  al.  2002;  Ajinkya  et  al. 
2005).  Similar  to  that  prior  research,  we  control  for  the  absolute  value  of  the  change  in  EPS 
deflated  by  lagged  stock  price  (lAEPSI)  and  the  dispersion  in  analysts’  forecasts  scaled  by  the 
absolute  value  of  the  median  forecast  {Disp)?^  Firms  missing  either  their  prior  period  earnings  or 
zero  profit  benchmarks  are  less  likely  to  issue  forecasts  (e.g..  Miller  2002),  so  we  control  for  the 
sign  of  the  change  in  the  firm’s  EPS:  EPS_UP  equals  1  if  EPS  is  greater  than  or  equal  to  the  prior 
year’s  EPS.  We  also  include  an  indicator  variable  that  equals  1  if  the  firm  reports  a  loss  (Loss). 

Firms  with  higher  proprietary  information  costs  are  less  likely  to  issue  public  earnings  fore¬ 
casts,  so  we  control  for  two  measures  of  proprietary  information  costs:  R&D,  the  firm’s  R&D 
expenditures  scaled  by  total  assets  (Wang  2007),  and  Mkt-Bk,  the  firm’s  market-to-book  ratio 
(Bamber  and  Cheon  1998;  Ajinkya  et  al.  2005).  Ajinkya  et  al.  (2005)  show  that  two  measures  of 
higher  quality  governance — proportion  of  outside  directors  and  institutional  ownership — are  as¬ 
sociated  with  a  greater  propensity  to  issue  forecasts,  so  we  control  for  the  percentage  of  board 
members  who  are  not  also  officers  of  the  firm  {Outside and  the  percentage  of  the  firm’s 
common  stock  held  by  institutions  {%Inst).  To  control  for  the  fact  that  larger,  more  heavily 
followed  firms  issue  more  voluntary  disclosures  (e.g.,  Lang  and  Lundholm  1993;  Ajinkya  et  al. 
2005),  we  include  the  natural  log  of  the  market  value  of  equity  {LSize)  and  the  number  of  analysts 
following  the  firm  {^Analysts).  Litigation  risk  is  associated  with  voluntary  disclosure.  Following 
Wang  (2007),  we  include  an  indicator  variable  that  equals  1  if  the  firm  is  in  a  high  litigation  risk 
industry  and  suffers  an  earnings  decrease  of  20  percent  or  more  {Litigation)?'^  Following  Bro¬ 
cket  et  al.  (2009),  we  control  for  lawsuits  {ClassAction  equals  1  if  the  firm  is  involved  in  a 
securities  lawsuit  in  year  t),  restructurings  {Restructuring  equals  1  if  the  firm  was  engaged  in  a 
restructuring  during  year  t),  restatements  {Restatement  equals  1  if  the  firm  had  a  restatement  in 
year  t),  and  acquisitions  {Acquisition  equals  1  if  the  firm  has  a  merger  or  acquisition  in  year  t). 

As  Hirst  et  al.  (2008)  note,  much  less  is  known  about  the  determinants  of  detailed  attributes 
of  management  forecasts.  To  be  conservative,  we  include  all  of  the  above  variables  plus  forecast 
horizon  (e.g.,  Baginski  and  Hassell  1997;  Bamber  and  Cheon  1998;  Choi  and  Ziebart  2004)  in  the 
models  explaining  forecast  precision,  news  conveyed  by  the  forecast,  and  the  bias  and  error  in  the 


Following  Baginski  et  al.  (2002),  we  control  for  the  difficulty  of  predicting  current  earnings,  since  this  is  most  relevant 
to  the  decision  whether  to  issue  an  earnings  forecast  in  the  current  period.  These  measures  also  avoid  survivorship  bias 
induced  by  historical  measures  of  earnings  predictability. 

Following  Francis  et  al.  (1994)  and  Wang  (2007),  we  classify  the  following  as  high  litigation  risk  industries:  SIC  codes 
2833-2836  (biotechnology),  3570-3577  and  7370-7374  (computers),  3600-3674  (electronics),  5200-5961  (retailin<j) 
and  8731-8734  (R&D  service).  ’ 

Our  model  controls  for  over-time  changes  in  the  regulatory  environment  through  the  year-specific  fixed  effects.  There  is 
limited  evidence  that  two  additional  phenomena  may  be  associated  with  the  propensity  to  issue  management  forecasts 
(although  there  is  no  evidence  these  phenomena  are  related  to  detailed  attributes  of  management  forecasts).  Nagar  et  al. 
(2003)  conclude  that  firms  whose  managers  enjoy  larger  stock-based  incentives  issue  more  earnings  forecasts  (although 
Barth  [2003]  questions  whether  the  results  might  reflect  a  correlated  omitted  variable).  There  is  also  conflicting  evidence 
whether  insider  trading  is  associated  with  the  frequency  of  management  earnings  forecasts.  Noe  (1999)  finds  no 
evidence  that  managers  profit  by  trading  before  public  disclosures,  and  Cheng  and  Lo  (2006)  find  managers  do  not 
adjust  their  forecasting  activity  when  selling  shares  (by  far  the  most  common  type  of  insider  trading),  but  they  find  some 
evidence  that  managers  increase  the  number  of  bad  news  forecasts  prior  to  share  purchases.  Untabulated  results  reveal 
that  adding  controls  for  stock-based  incentives  (i.e.,  the  proportion  of  managers’  compensation  that  is  sensitive  to  share 
price,  and  managerial  shareholdings),  and  total  insider  buying  does  not  affect  our  inferences. 
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forecast  relative  to  ex  post  actual  EPS.  We  define  F_Horizon  as  the  number  of  days  between  the 
forecast  announcement  and  the  end  of  the  fiscal  period,  divided  by  360  days  for  forecasts  of 
annual  earnings,  and  divided  by  90  days  for  forecasts  of  quarterly  earnings.  Thus,  F_Horizon 
measures  the  proportion  of  the  fiscal  period  remaining  at  the  date  of  the  management  forecast. 
Exhibit  1  summarizes  our  variable  definitions. 
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ABSTRACT:  This  study  investigates  whether  individual  top  executives  have  incremen¬ 
tal  effects  on  their  firms’  tax  avoidance  that  cannot  be  explained  by  characteristics  of 
the  firm.  To  identify  executive  effects  on  firms’  effective  tax  rates,  we  construct  a  data 
set  that  tracks  the  movement  of  908  executives  across  firms  over  time.  Results  indicate 
that  individual  executives  play  a  significant  role  in  determining  the  level  of  tax  avoid¬ 
ance  that  firms  undertake.  The  economic  magnitude  of  the  executive  effects  on  tax 
avoidance  is  large.  Moving  between  the  top  and  bottom  quartiles  of  executives  results 
in  approximately  an  11  percent  swing  in  GAAP  effective  tax  rates;  thus,  executive 
effects  appear  to  be  an  important  determinant  in  firms’  tax  avoidance. 

Keywords:  tax  avoidance;  effective  tax  rate;  executive  fixed  effects. 

Data  Availability:  Data  used  in  this  study  are  available  from  public  sources  identified  in 
the  document. 

I.  INTRODUCTION 

This  study  investigates  whether  individual  executives  have  incremental  effects  on  their 
firms’  tax  avoidance  that  cannot  be  explained  by  characteristics  of  the  firm.  Despite  de¬ 
cades  of  empirical  research  in  corporate  taxation,  little  attention  has  been  focused  on 
whether  individual  executives  have  an  effect  on  their  firms’  tax  avoidance  (for  reviews  see  Shack¬ 
elford  and  Shevlin  [2001]  and  Graham  [2003]).  In  this  prior  literature,  executives  were  either 
ignored  or  treated  as  homogenous  inputs  to  the  tax  avoidance  process.  In  contrast,  our  study 
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considers  the  possibility  that  individual  top  executives  are  partially  responsible  for  variation  in  tax 
avoidance  across  firms.  Essentially,  we  posit  that  a  given  firm’s  tax  avoidance  will  differ  depend¬ 
ing  on  whether  it  is  run  by  Executive  A  versus  Executive  B.  We  find  that  not  only  do  executives 
matter  incremental  to  firm  characteristics,  but  also  they  appear  to  matter  in  a  big  way. 

At  first  thought,  it  might  be  hard  to  imagine  a  top  executive  having  an  individual  effect  on  the 
firm’s  tax  avoidance.  The  typical  CEO  is  almost  never  a  tax  expert.  While  the  typical  CEO  is 
unlikely  to  understand  the  ins  and  outs  of  common  tax  strategies,  he/she  likely  understands  the 
competitive  nature  of  his/her  industry  and  the  potential  for  expansion  to  generate  operational 
economies  of  scale.  Thus,  it  is  reasonable  that  a  CEO  could  affect  the  firm’s  operational  and 
financial  strategies,  but  perhaps  less  so  the  firm’s  tax  avoidance  activities. 

However,  a  CEO  can  affect  tax  avoidance  by  setting  the  “tone  at  the  top’’  with  regard  to  the 
firm’s  tax  activities.  For  example,  some  CEOs  may  change  the  relative  emphasis  of  different 
functional  areas  of  the  firm  (e.g.,  marketing,  operations,  treasury,  tax)  and  the  resources  allocated 
to  hiring  different  advisors  both  within  and  outside  of  the  firm.^  Further,  the  tone  at  the  top  could 
extend  to  setting  the  compensation  incentives  of  the  tax  director  (who  has  direct  involvement  for 
the  firm’s  tax  decisions).^  Anecdotal  evidence  is  consistent  with  the  tone  at  the  top  influence  of  top 
executives.  For  example,  in  deposition  testimony,  David  Bullington,  Wal-Mart’s  vice  president  for 
tax  policy,  stated  that  he  began  to  feel  pressure  to  lower  the  company’s  effective  tax  rate  after  the 
current  chief  financial  officer,  Thomas  Schoewe,  was  hired  in  2000.  “Mr.  Schoewe  was  familiar 
with  some  very  sophisticated  and  aggressive  tax  planning  . . .  And  he  rides  herd  on  us  all  the  time 
that  we  have  the  world’s  highest  tax  rate  of  any  major  company,’’  Mr.  Bullington  said,  according 
to  a  transcript  of  the  deposition  (Drucker  2007,  Al). 

It  is  important  to  clarify  at  the  outset  what  we  mean  by  the  term  “tax  avoidance.’’  As  in 
Dyreng  et  al.  (2008),  we  define  tax  avoidance  broadly  to  encompass  anything  that  reduces  the 
firm’s  taxes  relative  to  its  pretax  accounting  income.  We  are  not  attempting  to  measure  tax  ag¬ 
gressiveness,  tax  risk,  tax  evasion,  or  tax  sheltering.  To  keep  our  measures  of  tax  avoidance  broad 
and  easy  to  understand,  we  examine  two  standard  measures.  The  first  is  the  firm’s  effective  tax  rate 
as  defined  under  GAAP  (hereafter,  GAAP  ETR),  which  is  total  tax  expense  (current  plus  deferred 
tax  expense)  divided  by  pre-tax  accounting  income  (adjusted  for  special  items).  The  second 
measure  is  the  firm’s  cash  taxes  paid  divided  by  pre-tax  accounting  income  (adjusted  for  special 
items)  (hereafter,  cash  effective  tax  rate,  or  CASH  ETR).  We  use  both  measures  in  order  to  capture 
the  various  objectives  managers  may  have  when  it  comes  to  tax  avoidance.  To  the  extent  that 
managers  are  concerned  about  reducing  tax  expense  for  financial  accounting  purposes,  the  use  of 
our  first  measure,  GAAP  ETR,  will  capture  the  executive’s  propensity  to  affect  this  metric.  To  the 
extent  that  managers  are  concerned  with  reducing  actual  cash  taxes  paid,  our  second  metric,  CASH 
ETR,  will  capture  the  effects  on  this  cash  tax  rate.  We  refer  to  the  measures  collectively  as 
effective  tax  rates. 


*  Anecdotal  reports  are  that  firms  differ  greatly  in  how  involved  tax  advisors  are  in  structuring  transactions  and  that  this 
variation  likely  contributes  to  differing  levels  of  tax  avoidance.  For  example,  in  a  survey  of  150  firms,  CFO  Research 
Services,  in  collaboration  with  Deloitte  Tax  LLP,  report  that  roughly  50  percent  of  the  firms  surveyed  said  that  the  tax 
department  has  a  substantial  role  in  major  transactions  and  only  ai  ound  25  percent  responded  that  the  tax  department 
played  a  substantial  role  in  making  operating  decisions  (CFO  Research  Services  and  Deloitte  Tax  LLP  2006). 

^  This  idea  is  consistent  with  the  discussion  in  Crocker  and  Slemrod  (2005)  and  the  findings  in  Armstrong  et  al.  (2009) 
discussed  below. 

^  Executives  can  avoid  or  manage  the  expense  without  affecting  cash  taxes  paid  and  can  affect  cash  taxes  paid  without 
affecting  the  total  tax  expense.  For  example,  if  firms  (or  executives)  change  their  valuation  allowance,  tax  contingency 
reserve,  or  the  amount  of  foreign  earnings  designated  as  permanently  reinvested  they  can  change  the  GAAP  ETR  but 
these  actions  would  not  change  the  CASH  ETR.  Conversely,  if  a  firm  accelerates  tax  depreciation  relative  to  book 
depreciation  (which  is  an  expense  in  the  denominator-pre-tax  income)  it  would  reduce  cash  taxes  paid  but  not  the  GAAP 
ETR  (since  the  deferred  tax  expense  would  increase). 
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We  use  a  technique  developed  in  the  economics  literature  by  Bertrand  and  Schoar  (2003)  that 
involves  tracking  individual  executives  who  served  as  top  executives  at  more  than  one  firm.  Using 
this  technique,  we  can  observe  whether  particular  executives  appear  to  have  systematic  effects  on 
their  firms’  tax  avoidance.  We  can  test,  for  example,  whether  Firm  A’s  tax  avoidance  changes  when 
Executive  X  is  hired  and  whether  it  changes  again  when  s/he  leaves  to  run  Firm  B.  The  executives 
we  track  include  the  CEO,  CFO,  and  other  top  executives  listed  in  the  ExecuComp  database  that 
were  employed  by  more  than  one  firm,  which  allows  us  to  isolate  executive  effects  apart  from  firm 
effects.  We  identify  908  executives  who  worked  for  at  least  two  firms  and  were  employed  by  each 
firm  for  at  least  three  years  between  1992  and  2006.  Following  the  approach  in  Bertrand  and 
Schoar  (2003),  we  regress  the  firm’s  effective  tax  rate  on  firm  fixed  effects,  year  fixed  effects,  and 
executive  fixed  effects.  Thus,  all  stationary  characteristics  of  the  firm  are  controlled  in  our  speci¬ 
fication  through  the  firm  fixed  effect,  and  any  time-specific,  cross-sectional  effects  on  effective  tax 
rates  are  controlled  through  the  year  fixed  effects.  In  our  main  tests  we  also  control  for  time- 
varying  characteristics  of  the  firm  through  a  set  of  control  variables  (e.g.,  size,  leverage,  R&D). 

Results  indicate  that  individual  executives  play  a  statistically  and  economically  significant 
role  in  determining  the  level  of  tax  avoidance  that  firms  undertake.  While  the  results  are  slightly 
stronger  for  CEOs  (i.e.,  for  executives  who  were  CEOs  in  the  last  year  of  their  career  in  our  data), 
the  results  also  hold  for  CFOs  and  other  executives  covered  by  ExecuComp.  The  frequency  of 
significant  executives  is  far  greater  than  would  be  observed  under  the  null  hypothesis  that  tax 
avoidance  is  determined  by  the  characteristics  of  the  firm,  but  not  the  individual  executive  per  se. 
We  also  examine  executives  in  event-time  as  they  move  across  firms,  and  we  illustrate  the  effects 
they  have  on  their  firms’  effective  tax  rates  as  they  join  and  later  depart  from  the  firm.  Finally,  we 
conduct  a  number  of  robustness  tests,  which  consistently  show  significant  executive  effects  on 
both  GAAP  ETR  and  CASH  ETR. 

After  establishing  an  executive  effect,  we  investigate  whether  tax  avoidance  is  associated  with 
(1)  other  executive  effects  that  represent  broader  styles  such  cost-cutting  or  growth  strategies 
and/or  (2)  executive  biographical  characteristics.  There  appears  to  be  little  connection  between 
these  factors  and  tax  avoidance.  We  note  that  the  lack  of  a  strong  association  with  other  styles  or 
manager  characteristics  does  not  mitigate  our  main  result  that  the  executive  matters.  It  simply 
means  there  are  not  identifiable,  common  characteristics  that  can  explain  executive-specific  tax 
avoidance — the  executive  effects  on  tax  avoidance  appear  to  be  idiosyncratic. 

To  our  knowledge,  ours  is  the  first  study  to  find  and  quantify  the  effects  that  individual  top 
executives  have  on  tax  avoidance.  As  Bertrand  and  Schoar  (2003)  state,  the  neoclassical  view  of 
the  firm  is  that  the  same  decisions  are  made  regardless  of  the  executive  in  office.  Prior  and 
contemporaneous  research  in  the  tax  avoidance  area  takes  the  neoclassical  view,  whereby  tax 
avoidance  is  viewed  as  driven  by  firm  characteristics  and  executive  compensation  but  the  indi¬ 
vidual  executive  is  assumed  to  not  matter.'^  Our  study  broadens  the  view  of  what  drives  tax 
avoidance  to  include  the  individuals  in  the  top  executive  positions. 

Our  study  is  distinct  from  the  prior  and  contemporaneous  tax  avoidance  literature  in  three 
ways.  First,  we  posit  and  find  effects  from  the  executives  themselves.  We  view  our  consideration 
of  the  role  of  executives  to  be  an  important  step  toward  a  better  understanding  of  the  substantial 
variation  in  tax  avoidance  that  prior  research  has  found  across  firms,  even  among  firms  in  the  same 
industry  (e.g.,  Dyreng  et  al.  2008).  Second,  because  of  our  research  design,  we  are  able  to  quantify 
the  tax  avoidance  effects  of  individual  executives,  something  that  no  prior  study  has  been  able  to 
do,  and  we  find  these  effects  are  economically  significant.  Third,  our  results  indicate  that  the 


Studies  focusing  on  the  role  of  compensation  and  tax  avoidance  include  Phillips  (2003),  Armstrong  et  al.  (2009),  Rego 
and  Wilson  (2009),  and  Robinson  et  al.  (2010). 
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executive  is  a  significant  determinant  of  CASH  ETR,  an  important  finding  given  that  none  of  the 
aforementioned  studies  investigating  firm-level  incentives  document  any  effect  on  CASH  ETR. 

Our  findings  raise  many  questions  that  we  hope  will  be  the  addressed  in  future  research.  What 
happens  to  these  executives  and  the  firms  they  manage  in  the  future?  Tax  avoidance  can  be 
value-enhancing  to  the  firm  but  not  necessarily  so.  Perhaps  the  executives  who  appear  to  empha¬ 
size  tax  avoidance  simply  make  their  firms  take  on  substantial  tax  risk.  In  that  case,  we  might 
expect  to  see  additional  taxes  being  paid  in  future  years  as  the  firms  undergo  audit  by  the  IRS, 
perhaps  on  another  executive’s  watch.  It  would  be  interesting  to  know  what  happens  to  the  career 
prospects  of  executives  who  emphasize  or  de-emphasize  tax  avoidance.  These  are  just  a  few  of  the 
questions  raised  by  this  study  that  we  believe  can  lead  to  further  research. 

In  the  next  section  we  review  the  prior  literature,  specifically  the  prior  research  on  executive 
effects  and  the  literature  on  tax  avoidance.  In  Section  III  we  discuss  our  sample  selection  and 
present  descriptive  statistics.  We  present  the  empirical  analysis  in  Section  IV  and  conclude  in 
Section  V. 


II.  PRIOR  LITERATURE 
Prior  Literature  on  Tax  Avoidance 

Tax  avoidance  has  been  of  interest  to  researchers  for  decades,  yet  surprisingly  little  empirical 
evidence  exists  about  cross-sectional  variation  in  tax  avoidance,  and  until  recently  there  has  been 
little  research  on  executives’  roles  in  tax  avoidance.  We  discuss  both  briefly  here  and  refer  readers 
to  Hanlon  and  Heitzman  (2010)  for  a  broad  review  of  the  literature. 

Firm-level  characteristics  such  as  size,  economies  of  scale  via  foreign  operations,  tax  plan¬ 
ning,  and  other  factors  have  been  examined  as  determinants  of  tax  avoidance,  where  tax  avoidance 
has  been  measured  in  a  variety  of  ways.^  In  a  pioneering  study  that  examines  the  role  of  manager 
incentives,  Phillips  (2003)  uses  private  survey  data  to  investigate  whether  compensating  managers 
based  on  pre-tax  or  after-tax  earnings  affects  GAAP  ETRs.  Phillips  (2003)  finds  that  compensation 
for  managers  of  business  units  based  on  income  measured  after  taxes  (as  compared  to  before  tax) 
appear  to  be  associated  with  a  lower  GAAP  effective  tax  rate;  however,  this  same  association  does 
not  hold  for  CEO  performance  measures.®  Recent  studies  in  this  area  of  incentive  effects  on  tax 
avoidance  include  Desai  and  Dharmapala  (2006),  which  examines  “high-powered”  incentives  and 
tax  avoidance;  Armstrong  et  al.  (2009),  which  is  the  first  to  examine  compensation  of  tax  direc¬ 
tors;  Rego  and  Wilson  (2009),  which  examines  links  between  top  executive  compensation  and  tax 
aggressiveness;  and  Robinson  et  al.  (2010),  which  examines  the  effects  on  ETRs  of  treating  the  tax 
department  as  a  profit  center.  Finally,  in  studies  of  tax  shelter  firms  (an  alternative  measure  of  tax 
avoidance),  Wilson  (2009)  and  Lisowsky  (2009)  examine  firm  characteristics  associated  with  tax 
sheltering. 

There  has  been  much  less  research  on  the  consequences  of  tax  avoidance.  Desai  and  Dhar¬ 
mapala  (2008)  examine  the  effects  of  tax  avoidance  on  firm  value,  and  Hanlon  and  Slemrod 
(2009)  examine  market  reactions  to  tax  shelter  involvement.  Jennings  et  al.  (2009)  find  an  asso¬ 
ciation  between  tax  avoidance  and  implicit  taxes;  Graham  and  Tucker  (2006)  examine  whether  tax 


^  See  Zimmerman  (1983),  Gupta  and  Newberry  (1997),  Mills  et  al.  (1998),  Rego  (2003),  Siegfried  (1974),  Porcano 
(1986),  Stickney  and  McGee  (1982),  and  Shevlin  and  Porter  (1992).  In  addition,  see  Callihan  (1994)  for  a  summ.ary  of 
annual  effective  tax  rate  research. 

®  Phillips  (2003,  869)  interprets  these  results  by  stating  that,  “After-tax  CEO  performance  measures  could  have  an  indirect 
negative  effect  on  ETRs,  however,  because  CEOs  compensated  after-tax  are  more  likely  to  compensate  their  BU 
managers  on  an  after-tax  basis”  and  that  other  incentives  such  as  job  retention  are  perhaps  sufficient  to  motivate  CEOs 
to  focus  on  after-tax  results. 
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shelter  firms  carry  less  debt  (i.e.,  whether  the  shelters  are  non-debt  tax  shields);  and  Wilson  (2009) 
provides  evidence  that  shelter  firms  have  higher  market  returns  as  long  as  the  firm’s  governance  is 
“good”  (measured  by  the  Gompers  et  al.  [2003]  governance  index). 

In  sum,  prior  research  has  investigated  how  to  measure  tax  avoidance  (e.g.,  Dyreng  et  al. 
2008),  the  firm-level  determinants  of  tax  avoidance,  and,  to  some  extent,  on  the  consequences  of 
tax  avoidance.  While  the  effect  of  the  CEO  or  other  top  management  has  recently  been  investi¬ 
gated  with  other  performance  outcomes  of  the  firm  (discussed  in  the  next  section),  little  attention 
has  been  given  to  whether  individual  executives  have  an  impact  on  how  much  tax  the  firm  avoids.^ 
We  address  this  question.  This  is  an  important  step  in  identifying  factors  that  make  some  firms 
more  able  to  avoid  tax  than  others.  We  do  not  investigate  the  consequences  of  tax  avoidance 
resulting  from  executive  effects.  We  look  forward  to  research  on  this  and  related  questions  in  the 
future. 

The  Effects  of  CEOs  and  Other  Top  Executives  on  Other  Aspects  of  Firm  Performance 

How  much  of  an  effect  the  top  executives  and  the  “tone  at  the  top”  these  executives  set  has  on 
various  aspects  of  firm  performance  and  shareholder  value  is  a  question  that  is  attracting  increased 
attention.  For  example,  Hayes  and  Schaefer  (1999)  estimate  how  much  executives  are  worth  to 
shareholders  by  identifying  firm/executive  separations  and  computing  the  abnormal  returns  sur¬ 
rounding  the  separations.^  Bertrand  and  Schoar  (2003)  examine  whether  top  executives  affect  firm 
performance  and  decisions  by  constructing  a  sample  of  executives  who  have  moved  across  at  least 
two  firms  and  investigating  the  executives’  effects  on  these  firms.  By  tracking  executives  who 
change  firms,  we  can  separate  the  firm  effect  from  the  executive  effect  on  corporate  behavior  and 
performance. 

Several  other  studies  examine  the  relation  between  CEO  characteristics  or  CEO  personal 
events  and  company  performance.  For  example,  Chatterjee  and  Hambrick  (2007)  examine  narcis¬ 
sistic  CEOs  and  company  performance,  while  Liu  and  Yermack  (2008)  find  that  future  company 
performance  deteriorates  when  CEOs  acquire  very  large  homes  or  estates.  Malmendier  and  Tate 
(2009)  report  evidence  that  CEOs  who  win  awards  from  the  business  press  (e.g.,  being  named  to 
BusinessWeek's  “Best  Managers”  list)  underperform  relative  to  their  peer  group  after  winning  the 
award  (similar  to  the  Sports  Illustrated  “Jinx”).  Bennedsen  et  al.  (2008)  use  data  from  Denmark 
and  report  that  CEOs’  (but  not  board  members’)  own  and  family  deaths  are  strongly  correlated 
with  declines  in  firm  operating  performance.  The  most  meaningful  deaths  for  the  CEOs,  as  might 
be  expected,  are  the  death  of  a  child  or  a  spouse,  whereas  the  death  of  a  mother-in-law  has  very 
little  impact  upon  performance. 

There  are  also  several  concurrent  studies  that  use  the  Bertrand  and  Schoar  (2003)  methodol¬ 
ogy  to  examine  executive  effects  on  accounting  outcomes.  For  example,  Bamber  et  al.  (2010) 
examine  executive  fixed  effects  on  corporate  financial  disclosure  policies  and  find  that  individual 
managers  have  a  significant  effect  on  voluntary  disclosure  (measured  using  management  fore- 


^  There  are  other  studies  broadly  related  to  the  questions  examined  in  this  study.  One  is  Chen  et  al.  (2010),  which 
examines  whether  family  owned/run  firms  are  more  tax  aggressive  than  non-family  owned/run  firms.  The  authors  find 
that  family  owned/run  firms  are  less  aggressive  because  they  try  to  convince  minority  shareholders  that  family  members 
are  not  rent-extracting  through  complicated  tax  structures.  Thus,  while  Chen  et  al.  (2010)  examine  the  effects  of  certain 
types  of  ownership  it  is  not  a  test  of  whether  executives  matter  to  tax  avoidance  absent  family  ownership  (note  our  study 
is  about  managers  that  change  jobs  so  they  are  not  likely  members  of  the  founding  family).  Another  is  Frank  et  al. 
(2009),  which  examines  whether  firms  that  are  aggressive  for  tax  purposes  are  aggressive  for  financial  accounting 
purposes  and  whether  firms  that  are  aggressive  for  both  have  other  similar  corporate  policies. 

*  See  also  Johnson  et  al.  (1985),  which  examines  the  market  reaction  to  the  sudden  deaths  of  executives  (53  events).  The 
authors  find  the  market  reaction  is  associated  with  founder  status,  decision-making  responsibilities,  and  estimates  of 
transaction  costs  for  negotiating  employment  agreements.  See  also  Pourciau  (1993)  and  Murphy  and  Zimmerman 
(1993). 
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casts).  Ge  et  al.  (2009)  investigate  whether  manager  effects  are  significant  in  financial  accounting 
outcomes  such  as  discretionary  accruals,  off-balance-sheet  activities,  and  conservatism. 

III.  SAMPLE  AND  DESCRIPTIVE  STATISTICS 

The  sample  begins  with  all  executives  listed  in  the  ExecuComp  database  for  the  years  1 992  to 
2006.  We  track  executives  across  firms  and  retain  those  executives  who  were  employed  for  at  least 
three  years  in  each  of  at  least  two  different  firms.  We  impose  minimal  data  requirements,  requiring 
that  the  firm-year  be  listed  on  Compustat  with  total  assets  available.  We  then  inspect  each  execu¬ 
tive  change  to  ensure  that  they  were  actual  changes  of  firms.  The  resulting  sam^ple  includes  12,958 
firm-years  of  data,  corresponding  to  1,138  distinct  firms,  and  908  distinct  executives. 

Table  1  shows  the  breakdown  of  executives  by  their  current  title  (i.e.,  the  last  title  listed  for 
the  executive  on  ExecuComp)  and  their  prior  title.  Overall,  the  sample  contains  351  executives 
whose  last  title  is  CEO,  195  whose  last  title  is  CFO,  and  362  whose  last  title  is  something  else 
(e.g.,  president,  vice-president).  There  are  101  instances  where  an  executive  was  CEO  of  one  firm 
listed  in  ExecuComp  and  moved  to  become  CEO  of  another  firm.  There  are  few  instances  of  a 
CEO  of  one  firm  going  to  take  a  lower  title  at  another  firm — in  only  four  cases  did  a  CEO  later 
take  on  the  title  of  CFO  and  in  58  cases  a  CEO  later  took  on  a  non-CEO,  non-CFO  title.  Being  a 
CFO,  on  the  other  hand,  is  sometimes  a  stepping-stone  to  becoming  CEO  at  another  firm.  This 
transition  occurs  33  times  in  our  sample,  while  160  times  a  CFO  makes  a  move  to  another  firm  but 
retains  the  same  title.  Finally,  there  are  217  instances  of  executives  with  titles  other  than  CEO  and 
CFO  going  on  to  become  CEOs.  In  31  cases  a  non-CEO/non-CFO  moves  to  become  a  CFO  and 
in  281  cases  moves  to  another  non-CEO/non-CFO  position. 

Table  2  presents  descriptive  statistics  for  the  12,958  firm-years  in  our  sample.  The  effective 
tax  rate  variable  {GAAP  ETR)  is  measured  as  total  tax  expense  (current  and  deferred)  divided  by 


TABLE  1 

Frequency  of  Executive  Changes 

Current  Title 


Prior  Title 

CEO 

CFO 

Other 

Total 

CEO 

101 

4 

58 

163 

CFO 

33 

160 

23 

216 

Other 

2\1 

31 

281 

529 

Total 

351 

195 

362 

908 

This  table  describes  title  changes  as  executives  move  from  one  company  to  another.  The  sample  is  a  total  of  908  executives 
(351  CEOs,  195  CFOs,  and  362  other  executives).  Each  executive  is  required  to  be  employed  by  at  least  two  different 
firms,  for  at  least  three  years  at  each  firm.  The  columns  across  the  top  describe  the  executives’  title  in  the  last  firm  in  which 
the  executives  were  observed  (current  title),  while  the  rows  describe  the  title  at  the  firm  in  which  the  executives  were 
employed  before  the  last  firm.  For  example,  the  cell  in  the  first  column  and  first  row  describes  CEOs  that  are  CEO  in  their 
current  firm  (last  firm  listed  in  ExecuComp)  and  were  CEO  at  the  firm  where  they  were  previously  employed.  In  the  case 
of  executives  who  were  employed  by  more  than  two  firms  for  at  least  three  years,  the  table  represents  their  last  change 
only. 
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TABLE  2 

Descriptive  Statistics 


Variable 

n 

Mean 

Std. 

Dev. 

25th 

Percentile 

50th 

Percentile 

75th 

Percentile 

GAAP  ETR 

10,355 

0.309 

0.154 

0.249 

0.337 

0.381 

CASH  ETR 

10,124 

0.263 

0.195 

0.133 

0.249 

0.347 

EBITDA 

12,716 

0.153 

0.119 

0.084 

0.140 

0.210 

R&D 

12,958 

0.045 

0.117 

0.000 

0.000 

0.032 

ADVERTISING 

12,958 

0.011 

0.110 

0.000 

0.000 

0.007 

SG&A 

9,545 

0.297 

0.254 

0.136 

0.241 

0.381 

CAPITAL  EXPENDITURES 

10,109 

0.298 

0.349 

0.120 

0.199 

0.338 

PERCENTAGE  CHANGE  IN  SALES 

12,926 

0.155 

0.350 

0.004 

0.087 

0.215 

LEVERAGE 

12,586 

0.246 

0.186 

0.097 

0.237 

0.357 

CASH  HOLDINGS 

12,578 

0.158 

0.257 

0.019 

0.059 

0.187 

EOREIGN  OPERATIONS 

12,958 

0.470 

0.499 

0.000 

0.000 

1.000 

NET  OPERATING  LOSS 

12,958 

0.273 

0.446 

0.000 

0.000 

1.000 

SIZE 

12,958 

7.699 

1.819 

6.424 

7.587 

8.878 

ESTIMATED  OPTION  EXPENSE 

12,958 

0.028 

0.132 

0.000 

0.000 

0.005 

INTANGIBLE  TO  TOTAL  ASSETS 

12,958 

0.126 

0.192 

0.000 

0.038 

0.182 

GROSS  PP&E  TO  TOTAL  ASSETS 

12,089 

0.643 

0.453 

0.283 

0.548 

0.947 

This  table  describes  the  variables  used  in  the  study.  As  each  variable  is  described,  the  Compustat  pneumonic  is  in 
parentheses. 


Variable  Definitions: 

GAAP  ETR  = 

CASH  ETR  = 

EBITDA  = 

R&D  = 

ADVERTISING  = 
SG&A  = 

CAPITAL  EXPENDITURES  = 

PERCENTAGE  CHANGE  IN 
SALES = 
LEVERAGE  = 

CASH  HOLDINGS  = 
EOREIGN  OPERATIONS  = 


SIZE  = 

NET  OPERATING  LOSS  = 
ESTIMATED  OPTION 
EXPENSE  = 


INTANGIBLE  TO  TOTAL 
ASSETS  = 

GROSS  PP&E  TO  TOTAL 
ASSETS  = 


the  financial  accounting  effective  tax  rate,  defined  as  total  income  tax  expense  {TXT) 
divided  by  pre-tax  book  income  {PI)  before  special  items  {SPI)\ 
the  cash  effective  tax  rate,  defined  as  cash  tax  paid  {TXPD)  divided  by  pre-tax  book 
income  {PI)  before  special  items  {SPI)\ 

earnings  before  interest,  taxes,  depreciation,  and  amortization  {OIBDP)  scaled  by 
lagged  total  assets  (AT); 

research  and  development  expense  {XRD)  divided  by  net  sales  {SALE);  when  missing, 
reset  to  0; 

advertising  expense  {XAD)  divided  by  net  sales  {SALE);  when  missing,  reset  to  0; 
selling,  general,  and  administrative  expense  {XSGA)  divided  by  net  sales  {SALE); 
missing  values  of  SG&A  are  set  to  0  when  used  as  a  control  variable  in  Equation  (1); 
reported  capital  expenditures  {CAPX)  divided  by  gross  property,  plant,  and  equipment 
{PPEGT); 

the  annual  percentage  change  in  net  sales  {{SALE,/ SALE, _i)  -  1); 

the  sum  of  long-term  debt  {DLTT)  and  long-term  debt  in  current  liabilities  {DEC) 

divided  by  total  assets  (AT); 

cash  and  cash  equivalents  {CHE)  divided  by  total  assets  (AT); 

the  firm  has  a  non-missing,  non-zero  value  for  pre-tax  income  from  foreign  operations 
{PIFO); 

the  natural  log  of  total  assets  (AT); 

an  indicator  if  the  firm  has  a  non-missing  value  of  tax  loss  carry-forward  {TLCF); 

calculated  from  ExecuCornp  as  the  average  annual  value  realized  from  exercise  of 
options  for  the  top  executives  grossed  up  by  the  fraction  of  options  owned  by  the  covered 
executives,  scaled  by  average  total  assets; 

the  ratio  of  intangible  assets  {INTANG)  to  total  assets  (AT);  and 

gross  property,  plant,  and  equipment  {PPEGT)  divided  by  total  assets  {AT). 
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pre-tax  income.^  The  mean  GAAP  ETR  in  the  sample  is  30.9  percent,  with  a  median  of  33.7 
percent.  We  also  measure  firms’  cash  effective  tax  rates  (CASH  ETR)  as  the  ratio  of  cash  taxes  paid 
to  pretax  income.  The  sample  firms  have  CASH  ETRs  with  a  mean  of  26.3  percent  and  a  median 
of  24.9  percent.  The  observation  that  CASH  ETR  is  lower  than  GAAP  ETR  is  consistent  with 
firms  on  average  having  lower  taxable  income  than  pretax  accounting  income,  and  is  consistent 
with  prior  research.  In  addition,  we  recognize  that  CASH  ETR  is  more  volatile  than  GAAP  ETR 
and  potentially  noisy  on  an  annual  basis  and,  thus,  results  may  be  more  difficult  to  document 
(Dyreng  et  al.  2008).  Note  that  the  number  of  observations  changes  over  the  measures  because  if 
the  denominator  is  negative  for  either  ratio  or  if  the  cash  taxes  paid  is  not  disclosed  or  negative, 
then  we  code  that  observation  as  missing. 

We  also  present  a  number  of  other  variables  in  Table  2  that  are  used  later  when  we  investigate 
whether  executives’  tax  avoidance  effects  are  associated  with  the  executives’  effects  on  other 
corporate  actions  and  outcomes.  These  other  nontax  variables  are  all  defined  in  Table  2  and 
include  measures  such  as  EBITDA,  research  and  development  expense  (R&D),  advertising  ex¬ 
pense  (ADVERTISING),  and  selling,  general,  and  administrative  expense  (SG&A)  (unless  other¬ 
wise  noted,  variables  are  scaled  by  beginning  total  assets).  Because  we  include  firms  that  have  a 
departing/incoming  executive  at  some  point  in  the  sample  period,  we  compare  our  descriptive 
statistics  to  those  of  the  Compustat  population  to  get  a  sense  of  whether  our  firms  are  somehow 
different,  which  would  limit  generalizability.  We  find  that  our  sample  firms  are  slightly  more 
profitable  and  invest  slightly  more  in  R&D,  but  are  not  substantially  different  from  the  typical  firm 
on  Compustat  (results  untabulated).  We  next  turn  to  the  primary  empirical  analysis. 

IV.  EMPIRICAL  ANALYSIS 

Primary  Specification 

Our  primary  research  question  is  whether  individual  executives  matter  for  firm  tax  avoidance. 
As  discussed  in  Bertrand  and  Schoar  (2003),  much  of  the  prior  literature  has  either  assumed  that 
all  executives  are  homogeneous  and  provide  selfless  inputs  into  the  firm’s  processes  or  that  even 
if  executives  are  heterogeneous,  they  cannot  easily  affect  firm  policies.  An  opposing  view  from 
standard  agency  models  is  that  executives  have  discretion  within  their  own  firms  and  use  this 
discretion  to  affect  corporate  decision  making  and  possibly  to  advance  their  own  objectives. 

The  identification  strategy  is  a  regression  based  on  Bertrand  and  Schoar  (2003)  that  uses  a 
panel  of  data: 


ETRit  =ao  +  ^i^  aj^CONTROLl  +  21,  a, YEAR,  +  2,  aiEIRM,  +  21  oi,„EXEC„  +  e,„  (1) 

where  ETR,,  is  GAAP  ETR  (or  CASH  ETR,  each  tested  in  separate  regressions);  FIRM,  is  a 
separate  indicator  variable  for  each  firm,  i  (firm  fixed  effects);  YEAR,  is  a  separate  indicator 
variable  for  each  year,  t  (year  fixed  effects);  e„  is  the  error  term.  EXEC„,  is  a  separate  indicator  for 
each  executive,  m  (executive  fixed  effects),  and  is  our  main  variable  of  interest.  The  firm  fixed 
effects  in  the  model  control  for  all  stationary  firm  characteristics  such  as  industry  and  stationary 
firm-level  strategies  for  tax  avoidance.  The  vector  of  control  variables,  CONTROL'^,  reflects  firm 


®  Pretax  income  is  measured  as  income  before  discontinued  operations  and  extraordinary  items  and  excludes  special 
items.  ETRs  with  negative  pretax  income  are  set  to  missing.  The  remaining  non-missing  ETRs  are  winsorized  (reset)  so 
that  the  largest  observation  is  1  and  the  smallest  is  0.  While  we  would  like  to  measure  long-run  cash  effective  tax  rates 
as  in  Dyreng  et  al.  (2008),  it  is  not  possible  for  us  to  examine  both  manager  changes  and  estimate  tax  rates  over  a 
ten-year  span. 

The  annual  cash  tax  rates  are  similar  to  the  rates  in  Dyreng  et  al.  (2008),  which  used  a  larger  sample  but  required  firms 
to  have  positive  income  summed  over  a  ten-year  period.  The  mean  annual  cash  tax  rate  (measured  similarly)  in  Dyreng 
et  al.  (2008)  is  27  percent  and  the  median  is  25.6  percent. 
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characteristics  that  can  change  over  time  and  could  plausibly  be  related  to  tax  avoidance,  as  is 
discussed  in  more  detail  below. 

We  retain  firm-years  with  an  executive  change  and  all  available  prior  and  subsequent  years  for 
the  firm.  The  years  outside  the  executive  tenure  allow  a  better  estimation  of  the  firm  effect  versus 
the  executive  effect.  Thus,  we  employ  a  panel  data  set  in  all  tests. 

Within  the  title  EXEC,  we  also  separately  identify  executives  by  title  (e.g.,  CEO,  CFO,  or 
some  non-CEO,  non-CFO  title).  To  do  this,  we  classify  executives  according  to  the  title  last  listed 
on  ExecuComp  for  the  individual.  Coding  each  title  separately  allows  us  to  test  each  group  of 
executives  while  controlling  for  the  effects  of  the  other  executives  (for  example,  using  a  joint 
F-test,  we  can  test  the  effect  of  CEOs  while  controlling  for  the  effects  of  CFOs). 

If  we  find  that  executives  matter,  it  could  be  that  executives  have  different  styles  that  they 
impose  on  the  firm  or  it  could  be  that  the  firm  hired  the  executives  because  of  their  style  (e.g.,  a 
tax  avoider).  Because  we  control  for  the  firm  effect,  however,  only  if  the  firm  changed  objectives 
and  hired  executives  with  different  styles  through  time  could  our  results  be  due  to  matching 
between  the  firm  and  executives;  otherwise  the  new  executive’s  style  would  be  the  same  as  the  old 
executive’s  and  the  effect  would  be  captured  by  the  firm  indicator  variable. 

As  noted  earlier,  CONTROL'^  in  the  above  equation  represents  a  vector  of  time-varying, 
firm-level  controls.  When  deciding  upon  a  research  design,  we  gave  considerable  thought  to  the 
variables  to  include  in  CONTROL'^.  The  more  variables  we  include  in  CONTROL'^,  the  less 
variation  we  leave  for  the  executive  effect  to  “pick  up.”  In  other  words,  if  executive  choices  affect 
the  variables  in  our  vector  of  controls,  then  including  these  controls  could  drive  the  executive  fixed 
effect  coefficients  toward  zero.  For  example,  consider  an  executive’s  choice  to  increase  spending 
on  research  and  development  (R&D).  Assume  that  this  spending  generates  additional  research  and 
development  tax  credit,  which  reduces  the  effective  tax  rates  of  the  firm  (note  that  R&D  spending 
is  expensed  for  financial  reporting  so  the  denominator  of  our  ratios  is  also  reduced,  but  assuming 
the  amount  is  creditable  likely  results  in  reduced  effective  rates).  What  we  cannot  separate  in  our 
tests  is  whether  the  executive  increased  the  R&D  spending  in  order  to  reduce  the  tax  rate  or 
whether  the  executive  increased  the  R&D  spending  for  other  reasons  and  a  byproduct  of  this 
decision  is  a  reduced  tax  rate.  If  we  include  R&D  as  a  control  variable  in  the  regression,  then  the 
implicit  assumption  in  the  research  design  is  that  the  R&D  spending  was  for  a  nontax  reason  that 
we  want  to  control  for  in  our  tests  and  remove  from  the  manager  effect  coefficient.  If  we  do  not 
include  R&D  as  a  control  variable  in  the  regression,  then  the  implicit  assumption  in  the  research 
design  is  that  the  effect  of  the  increased  R&D  spending  on  the  tax  rate  is  intentional  on  the  part  of 
management,  such  that  the  regression  will  place  that  effect  in  the  executive’s  fixed  effect  coeffi¬ 
cient.  Since  we  obviously  do  not  know  the  extent  to  which  each  decision  an  executive  makes  is 
explicitly  for  tax  reasons,  we  have  to  arbitrarily  choose  between  these  two  designs  and  interpret 
the  results  accordingly. 

With  this  caveat  in  mind,  we  have  chosen  to  estimate  our  main  analyses  after  including  a  wide 
range  of  variables  in  CONTROL'^.  We  draw  these  variables  from  prior  effective  tax  rate  literature 
(e.g.,  Mills  et  al.  1998;  Rego  2003).  We  include  ADVERTISING  expense,  CAPITAL  EXPENDI¬ 
TURES,  ESTIMATED  OPTION  EXPENSE,  an  indicator  for  whether  the  firm  has  FOREIGN  OP¬ 
ERATIONS,  the  ratio  of  INTANGIBLE  TO  TOTAL  ASSETS,  LEVERAGE,  SIZE,  an  indicator  for 
whether  the  firm  has  a  NET  OPERATING  LOSS  (NOL),  GROSS  PROPERTY  PLANT  &  EQUIP¬ 
MENT,  SG&A  expense  and  R&D  expense.  We  also  estimate  the  regression  with  only  a  minimal 
set  of  controls  {NOL  and  SIZE)  in  an  untabulated  robustness  check,  finding  very  similar  results. 


Variable  definitions  are  found  in  Table  2.  When  using  SG&A  as  a  control  variable  in  Equation  (1),  missing  values  are 
set  to  0. 
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It  is  again  important  to  note  that  by  including  firm  fixed  effects,  we  control  for  stationary  firm 
characteristics.  For  example,  if  Firm  A  has  lower  tax  rates  than  Firm  B  because  Firm  A  always  has 
more  foreign  operations  than  Firm  B,  then  that  effect  will  be  captured  in  the  fixed  effect  for  Firm 
A.  Similarly,  the  tests  control  for  industry  membership  because  industry  is  a  stationary  firm 
characteristic.  However,  changes  in  Firm  A’s  effective  tax  rates  caused  by  a  new  executive  who 
changes  Firm  A’s  foreign  operations  will  be  reflected  in  the  fixed  effect  for  that  executive.  Thus, 
the  intention  is  for  the  executive  fixed  effect  to  capture  the  impact  on  tax  avoidance  from  changes 
at  the  firm  while  under  his/her  command. 

Our  sample  consists  of  executive  changes.  If  a  firm  had  the  same  managerial  team  over  time, 
then  the  firm  fixed  effects  and  executive  fixed  effects  would  be  perfectly  collinear  and  we  could 
not  separate  the  effects  that  are  specific  to  an  executive  from  the  effects  of  the  firm.  The  variation 
in  executives  over  time  (i.e.,  executive  changes)  is  what  makes  it  possible  to  separate  executive 
effects  from  firm  effects.  Moreover,  by  observing  the  performance  of  the  same  executive  across 
two  firms,  say  A  and  B,  we  reduce  the  likelihood  that  the  results  are  due  to  coincidental  time- 
varying  performance  in  one  of  the  firms.  Accordingly,  we  impose  the  same  requirement  on  our 
sample  as  did  Bertrand  and  Schoar  (2003)  by  requiring  executives  in  our  sample  to  have  been 
employed  by  at  least  two  firms  in  the  sample,  with  at  least  three  years  of  employment  at  each 
firm.*^ 

Main  Results 

Table  3  reports  the  results  from  estimating  Equation  (1)  across  the  sample  of  firm-years  with 
non-missing  tax  rates  (n  =  10,355  for  GAAP  ETR  and  n  =  10,124  for  CASH  ETR).  Panel  A 
presents  the  results  with  GAAP  ETR  as  the  dependent  variable  and  Panel  B  with  CASH  ETR  as  the 
dependent  variable.  The  number  presented  in  the  table  is  the  F-statistic  from  the  joint  test  of  the 
significance  of  that  particular  set  of  coefficients.  (For  example,  10.00  in  Panel  A  is  the  F-statistic 
from  the  joint  test  of  significance  of  the  coefficients  on  the  year  indicator  variables.) 

Model  1  is  a  baseline  regression  that  includes  only  an  intercept  and  the  vector  of  time-varying 
firm  controls  {CONTROL'^)  described  above.  Moving  to  the  right,  we  present  the  results  from  a 
model  that  includes  the  vector  of  time  varying  controls  and  only  year  effects  (Model  2),  only  firm 
effects  (Model  3),  only  CEO  effects  (Model  4),  only  CFO  effects  (Model  5),  and  only  other 
(non-CEO/non-CFO)  executive  effects  (Model  6).  Model  7  presents  results  of  a  regression  that 
only  includes  the  time  varying  controls  {CONTROL'')  and  only  the  executive  fixed  effects.  In  this 
model,  we  include  an  indicator  variable  for  all  executives  in  the  sample  and  test  them  jointly, 
regardless  of  specific  title  (i.e.,  CEO,  CFO,  Other).  In  Model  8,  we  estimate  the  full  model 
specified  in  Equation  (1).  Here  we  include  the  executive  fixed  effect  (indicator  variable)  for  every 
executive  as  well  as  the  firm  fixed  effect  and  year  fixed  effect  and  the  time  varying  controls.  As 
described  above,  when  specified  in  this  manner,  we  can  statistically  test  the  significance  of  any 
subset  of  indicator  variables.  We  report  tests  of  each  subset  of  executives  in  Model  8.  The  differ¬ 
ence  across  models  when  looking  at  the  F-statistic  for  CEOs,  for  example,  is  that  in  Model  8,  the 
F-statistic  tests  the  joint  significance  of  all  the  CEOs  after  controlling  for  time  varying  controls. 


It  is  possible  that  teams  of  executives  move  from  firm  to  firm,  but  not  in  exactly  the  same  years.  We  find  this 
phenomenon  occurs  in  about  10  percent  of  cases.  We  examine  the  influence  of  this  phenomenon  on  the  executive’s 
ability  to  affect  the  firm’s  GAAP  ETR  or  CASH  ETR  in  Table  7.  We  note  we  cannot  test  the  effect  of  a  team  moving  in 
the  exact  years  as  we  cannot  separate  out  the  executive  effect  from  the  team  effect  and  we  cannot  associate  this  team 
effect  with  any  executive  characteristics  such  as  M.B.A.,  age,  etc.,  in  our  later  tests  below.  Thus,  these  executives  where 
the  team  moves  in  the  exact  same  years  are  removed  from  our  sample  (i.e.,  the  perfectly  collinear  executives).  There  are 
24  such  executives. 
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the  firm  effect,  the  year  effect,  and  the  effects  of  executives  with  other  titles,  whereas  in  Model  4, 
the  F-statistic  tests  the  joint  significance  of  all  the  CEOs  after  controlling  for  the  time  varying 
effects  only. 

Results  indicate  that  both  firm  and  year  fixed  effects  are  significant  when  included  on  their 
own  in  both  the  GAAP  ETR  and  the  CASH  ETR  regressions  (Models  2  and  3)  in  Panels  A  and  B. 
All  three  sets  of  executive  coefficients  are  significant  at  the  1  percent  level  when  considered 
individually  (Models  4-6)  and  collectively  (Model  7).  Model  8  presents  the  results  for  the  full 
model  from  Equation  (1)  that  includes  the  vector  of  time  varying  controls  {NOL,  SIZE,  R&D, 
SG&A,  etc.),  fixed  effects  for  the  firm,  the  year,  and  each  executive.  Each  type  of  executive  has  a 
fixed  effect  that  is  significant  in  the  full  model  in  both  the  GAAP  ETR  and  the  CASH  ETR 
regressions  (i.e.,  after  controlling  for  the  effect  of  other  executives  and  firm  and  year  effects,  each 
type  of  executive  still  has  a  significant  fixed  effect,  on  average).  The  F-statistics  are  usually  largest 
for  CEO  fixed  effects,  both  in  the  full  model  and  when  each  executive  group  is  tested  individually, 
which  might  be  expected  if  CEOs  (and  future  CEOs)  have  the  most  influence  on  the  firm’s 
policies.  Flowever,  we  consistently  find  significant  effects  for  non-CEO  executives  as  well. 

These  results  document  that  executives  have  a  significant  effect  on  GAAP  ETRs  and  on  CASH 
ETRs.  This  effect  is  after  controlling  for  many  of  the  firm-level  items  that  have  been  documented 
(or  suspected)  to  affect  these  effective  tax  rates,  such  as  R&D  spending,  foreign  operations,  and 
other  variables.  This  is  an  important  step  in  the  literature,  as  heretofore  it  has  been  difficult  to  find 
factors  that  are  consistent  determinants  of  effective  tax  rates  (even  the  effect  of  SIZE  is  debated  in 
the  literature).  In  this  study,  however,  we  document  an  important  item  not  previously  considered — 
executives  have  a  significant  effect  on  the  level  of  effective  tax  rates  of  the  firm. 

Robustness  Tests 

We  conduct  a  battery  of  robustness  tests  on  our  main  results.  In  the  first  set  of  tests,  we 
examine  the  effect  of  the  time  varying  control  variables  on  the  significance  of  the  executive  fixed 
effects.  In  untabulated  tests,  we  re-estimate  Equation  (1),  but  reduce  the  vector  of  time  varying 
controls  to  include  only  NET  OPERATING  LOSS  and  SIZE.  We  find  results  are  very  similar  to 
those  in  Table  3. 

In  the  second  set  of  robustness  tests,  we  address  the  potential  concern  that  our  regressions 
may  show  significance  for  an  executive  even  if  the  reduction  to  the  effective  tax  rates  was  due  to 
some  random  event  that  occurred  during  his/her  tenure.  In  order  to  mitigate  this  concern,  we 
examine  whether  the  significant  executive  effects  are  driven  by  just  one  of  the  firms  at  which  the 
executive  works  or  whether  executive  effects  persist  across  the  firms  at  which  the  executive  is 
employed.  We  do  this  by  estimating  a  model  similar  to  Model  8  in  Table  3,  but  without  the 
executive  fixed  effects,  thus  leaving  the  executive  effects  in  the  residual.  We  then  regress  each 
significant  (at  the  5  percent  level)  executive’s  mean  residual  corresponding  to  his/her  tenure  from 
his/her  second  firm  on  the  mean  residual  corresponding  to  his/her  tenure  at  his/her  first  firm.  In 
untabulated  results,  we  find  that  the  tax  avoidance  at  the  first  firm  is  positively  associated  with  the 
tax  avoidance  at  the  second  firm;  the  coefficient  is  0.353  (p  =  0.023)  for  the  GAAP  ETR  regres¬ 
sion  and  0.280  (p  =  0.053)  for  the  CASH  ETR  regression.  Thus,  the  results  do  not  appear  to  be 
driven  by  an  effect  at  only  one  of  the  firms  at  which  the  executive  is  employed. 

In  the  third  set  of  tests,  we  consider  alternatives  to  the  tests  based  on  the  F-statistics,  since  it 
is  possible  for  F-statistics  to  be  influenced  by  only  a  few  significant  coefficients.  Accordingly,  we 
examine  the  frequency  of  executives  whose  specific  fixed  effect  is  significant  (i.e.,  the  t-statistic  on 
the  fixed-effect  coefficient  is  significant).  Figure  1  presents  the  number  of  individual  executives 
with  statistically  significant  fixed  executive  effects  from  Model  8  in  Table  3.  Our  sample  contains 
795  (783)  executives  with  sufficient  data  to  estimate  the  GAAP  ETR  {CASH  ETR)  regression. 
Under  the  null  hypothesis  that  executives  have  no  incremental  effect  beyond  firm  fixed  effects  and 
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FIGURE  1 

Frequencies  of  Significant  Manager  Fixed  Effects 

Panel  A:  Frequency  of  Significant  Fixed  Effects  Compared  to  Expected  Frequency  with 
GAAPETRas  Variable  of  Interest 


1%  5%  10% 

Significance  Level 


Panel  B:  Frequency  of  Significant  Fixed  Effects  Compared  to  Expected  Frequency  with 
CASHETR  as  Variable  of  Interest 


1%  5%  10% 

Significance  Level 


Panels  A  and  B  of  the  figure  report  the  frequency  of  significant  executive  fixed  effects  relative  to  what  would  be 
expected  under  the  null  hypothesis  if  the  models  tested  in  Table  3  are  well  specified.  The  calculations  are  per¬ 
formed  at  the  1  percent,  5  percent,  and  10  percent  significance  level,  respectively.  Panel  A  presents  the  calcula¬ 
tion  when  GAAP  ETR  is  the  dependent  variable  (as  in  Table  3  Panel  A),  and  Panel  B  presents  the  calculation 
when  CASH  ETR  is  the  dependent  variable  (as  in  Table  3  Panel  B). 


time  varying  characteristics,  we  would  expect  80  executives  to  be  significant  in  the  GAAP  ETR 
regression  at  the  10  percent  level  (795  X  0.10),  40  to  be  significant  at  the  5  percent  level,  and  8 


The  Accounting  Review 
American  Accounting  Association 


July  2010 


The  Effects  of  Executives  on  Corporate  Tax  Avoidance 


1177 


to  be  significant  at  the  1  percent  level  of  significance.  In  our  regression  results,  when  using  GAAP 
ETR  as  the  dependent  variable  (Panel  A  in  Table  3),  140  executives  are  significant  at  the  10 
percent  level,  97  are  significant  at  the  5  percent  level,  and  46  are  significant  at  the  1  percent  level. 
When  using  CASH  ETR  as  the  dependent  variable  (Panel  B  of  Table  3),  133  executives  are 
significant  at  the  10  percent  level,  96  are  significant  at  the  5  percent  level,  and  33  are  significant 
at  the  1  percent  level.  Thus,  we  find  the  number  of  significant  executives  in  our  data  to  be 
consistently  greater  than  what  we  would  expect  under  the  null.  In  untabulated  data,  the  results  for 
each  separate  level  of  executive  (e.g.,  CEO,  CEO,  OTHER)  are  similar— the  actual  number  of 
significant  executives  is  much  greater  than  one  would  expect  under  the  null  hypothesis. 

In  the  fourth  set  of  robustness  tests,  we  control  for  the  possibility  that  the  standard  errors  on 
the  t-statistics  and  F-statistics  could  be  downwardly  biased  for  any  reason.  Accordingly,  we  re¬ 
conduct  the  tests  in  Table  3,  but  instead  of  actual  data,  we  use  data  that  have  been  randomly 
shuffled.  Specifically,  we  randomize  the  data  to  generate  the  empirical  distribution  of  estimated 
executive  effects  under  the  null  hypothesis  by  shuffling  the  executive-firm  pairings  so  that  each 
executive  is  assigned  to  a  firm  at  which  she/he  did  not  actually  work.  We  repeat  this  shuffling 
1,000  times  and  compute  the  percentage  of  significant  executive  effects  for  each  iteration.  Because 
we  would  not  expect  a  randomly  matched  firm  and  executive  to  generate  significant  results,  this 
procedure  generates  the  empirical  distribution  under  the  null  hypothesis  against  which  our  actual 
results  can  be  compared. 

The  results  in  both  the  GAAP  ETR  and  CASH  ETR  regressions  reveal  that  more  executives  are 
significant  in  the  actual  data  than  any  of  the  1,000  iterations  of  the  randomized  data.  Thus,  the 
results  are  highly  significant. 

In  the  fifth  set  of  robustness  tests,  we  cluster  standard  errors  at  the  firm  level  (Petersen  2009). 
However,  clustering  of  the  standard  errors  does  not  allow  for  the  inclusion  of  all  of  our  currently 
included  fixed  effects.  Thus,  in  order  to  estimate  the  regression  we  must  eliminate  observations 
(specifically  those  in  which  the  executive  is  included  in  a  cluster  of  firms  where  the  number  of 
executives  within  the  cluster  is  equal  to  or  exceeds  the  number  of  firms). The  results  on  this 
reduced  sample  are  robust  to  using  clustered  standard  errors.  In  fact,  in  all  cases,  the  test  statistics 
are  stronger  than  those  reported. 

In  the  sixth  robustness  test,  we  recognize  that  variation  in  the  ratios  we  examine,  GAAP  ETR 
and  CASH  ETR,  could  be  driven  by  the  either  the  numerator  (income  tax  expense  or  cash  tax  paid) 
or  by  the  denominator  (pre-tax  income).  To  test  whether  the  executive  effects  on  the  GAAP  ETR 
and  CASH  ETR  ratios  are  being  driven  by  executives  who  affect  the  denominator  via  earnings 
management,  we  change  the  dependent  variables  by  putting  cash  from  operations  (adjusted  for 
cash  taxes  paid)  in  the  denominator.  This  change  greatly  reduces  the  possibility  that  variation  in 
the  ratio  is  driven  by  earnings  management,  although  the  interpretation  of  the  ratios  becomes  less 
intuitive.  In  an  unreported  analysis  similar  to  that  in  Table  3,  we  find  that  in  all  cases  executive 
fixed  effects  are  significant  when  we  conduct  the  tests  with  cash  flow  from  operations  in  the 
denominator.  Thus,  it  appears  that  executives  are  able  to  influence  the  effective  tax  rate  ratios  via 
management  of  tax  expense  and/or  cash  tax  paid,  and  not  just  by  managing  the  denominator  of  the 


More  specifically,  clustering  standard  errors  at  the  firm  level  restricts  the  effective  degrees  of  freedom  for  the  model  to 
the  number  of  clusters.  Because  we  include  firm  fixed  effects,  we  would  be  estimating  a  model  where  the  number  of 
parameters  would  exceed  the  effective  degrees  of  freedom.  While  it  is  possible  to  generate  standard  errors  for  each  of 
the  variables  in  the  regression,  it  is  not  possible  to  test  more  linear  restrictions  than  degrees  of  freedom.  Thus,  in  our 
particular  setting  it  is  not  possible  generate  the  test  statistics  associated  with  the  overall  model  fit  or  the  joint  significance 
of  the  firm  effects — which  is  our  main  test. 

This  situation  can  arise  if  Executive  X  worked  at  Firm  A  and  Firm  C,  Executive  Y  worked  at  Firm  A  and  Firm  B,  and 
Executive  Z  worked  at  Firm  B  and  Firm  C.  This  results  in  three  executives,  but  only  three  degrees  of  freedom,  because 
there  are  only  three  firms. 
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FIGURE  1 

Frequencies  of  Significant  Manager  Fixed  Effects 

Panel  A:  Frequency  of  Significant  Fixed  Effects  Compared  to  Expected  Frequency  with 
GAAPETRas  Variable  of  Interest 


1%  5%  10% 

Significance  Level 


Panel  B:  Frequency  of  Significant  Fixed  Effects  Compared  to  Expected  Frequency  with 
CAS’FTE'TT?  as  Variable  of  Interest 


Significance  Level 


Panels  A  and  B  of  the  figure  report  the  frequency  of  significant  executive  fixed  effects  relative  to  what  would  be 
expected  under  the  null  hypothesis  if  the  models  tested  in  Table  3  are  well  specified.  The  calculations  are  per¬ 
formed  at  the  1  percent,  5  percent,  and  10  percent  significance  level,  respectively.  Panel  A  presents  the  calcula¬ 
tion  when  GAAP  ETR  is  the  dependent  variable  (as  in  Table  3  Panel  A),  and  Panel  B  presents  the  calculation 
when  CASH  ETR  is  the  dependent  variable  (as  in  Table  3  Panel  B). 


time  varying  characteristics,  we  would  expect  80  executives  to  be  significant  in  the  GAAP  ETR 
regression  at  the  10  percent  level  (795  X  0.10),  40  to  be  significant  at  the  5  percent  level,  and  8 
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to  be  significant  at  the  1  percent  level  of  significance.  In  our  regression  results,  when  using  GAAP 
ETR  as  the  dependent  variable  (Panel  A  in  Table  3),  140  executives  are  significant  at  the  10 
percent  level,  97  are  significant  at  the  5  percent  level,  and  46  are  significant  at  the  1  percent  level. 
When  using  CASH  ETR  as  the  dependent  variable  (Panel  B  of  Table  3),  133  executives  are 
significant  at  the  10  percent  level,  96  are  significant  at  the  5  percent  level,  and  33  are  significant 
at  the  1  percent  level.  Thus,  we  find  the  number  of  significant  executives  in  our  data  to  be 
consistently  greater  than  what  we  would  expect  under  the  null.  In  untabulated  data,  the  results  for 
each  separate  level  of  executive  (e.g.,  CEO,  CEO,  OTHER)  are  similar — the  actual  number  of 
significant  executives  is  much  greater  than  one  would  expect  under  the  null  hypothesis. 

In  the  fourth  set  of  robustness  tests,  we  control  for  the  possibility  that  the  standard  errors  on 
the  t-statistics  and  F-statistics  could  be  downwardly  biased  for  any  reason.  Accordingly,  we  re¬ 
conduct  the  tests  in  Table  3,  but  instead  of  actual  data,  we  use  data  that  have  been  randomly 
shuffled.  Specifically,  we  randomize  the  data  to  generate  the  empirical  distribution  of  estimated 
executive  effects  under  the  null  hypothesis  by  shuffling  the  executive-firm  pairings  so  that  each 
executive  is  assigned  to  a  firm  at  which  she/he  did  not  actually  work.  We  repeat  this  shuffling 
1,000  times  and  compute  the  percentage  of  significant  executive  effects  for  each  iteration.  Because 
we  would  not  expect  a  randomly  matched  firm  and  executive  to  generate  significant  results,  this 
procedure  generates  the  empirical  distribution  under  the  null  hypothesis  against  which  our  actual 
results  can  be  compared. 

The  results  in  both  the  GAAP  ETR  and  CASH  ETR  regressions  reveal  that  more  executives  are 
significant  in  the  actual  data  than  any  of  the  1,000  iterations  of  the  randomized  data.  Thus,  the 
results  are  highly  significant. 

In  the  fifth  set  of  robustness  tests,  we  cluster  standard  errors  at  the  firm  level  (Petersen  2009). 
However,  clustering  of  the  standard  errors  does  not  allow  for  the  inclusion  of  all  of  our  currently 
included  fixed  effects.  Thus,  in  order  to  estimate  the  regression  we  must  eliminate  observations 
(specifically  those  in  which  the  executive  is  included  in  a  cluster  of  firms  where  the  number  of 
executives  within  the  cluster  is  equal  to  or  exceeds  the  number  of  firms). The  results  on  this 
reduced  sample  are  robust  to  using  clustered  standard  errors.  In  fact,  in  all  cases,  the  test  statistics 
are  stronger  than  those  reported. 

In  the  sixth  robustness  test,  we  recognize  that  variation  in  the  ratios  we  examine,  GAAP  ETR 
and  CASH  ETR,  could  be  driven  by  the  either  the  numerator  (income  tax  expense  or  cash  tax  paid) 
or  by  the  denominator  (pre-tax  income).  To  test  whether  the  executive  effects  on  the  GAAP  ETR 
and  CASH  ETR  ratios  are  being  driven  by  executives  who  affect  the  denominator  via  earnings 
management,  we  change  the  dependent  variables  by  putting  cash  from  operations  (adjusted  for 
cash  taxes  paid)  in  the  denominator.  This  change  greatly  reduces  the  possibility  that  variation  in 
the  ratio  is  driven  by  earnings  management,  although  the  interpretation  of  the  ratios  becomes  less 
intuitive.  In  an  unreported  analysis  similar  to  that  in  Table  3,  we  find  that  in  all  cases  executive 
fixed  effects  are  significant  when  we  conduct  the  tests  with  cash  flow  from  operations  in  the 
denominator.  Thus,  it  appears  that  executives  are  able  to  influence  the  effective  tax  rate  ratios  via 
management  of  tax  expense  and/or  cash  tax  paid,  and  not  just  by  managing  the  denominator  of  the 


More  specifically,  clustering  standard  errors  at  the  firm  level  restricts  the  effective  degrees  of  freedom  for  the  model  to 
the  number  of  clusters.  Because  we  include  firm  fixed  effects,  we  would  be  estimating  a  model  where  the  number  of 
parameters  would  exceed  the  effective  degrees  of  freedom.  While  it  is  possible  to  generate  standard  errors  for  each  of 
the  variables  in  the  regression,  it  is  not  possible  to  test  more  linear  restrictions  than  degrees  of  freedom.  Thus,  in  our 
particular  setting  it  is  not  possible  generate  the  test  statistics  associated  with  the  overall  model  fit  or  the  joint  significance 
of  the  firm  effects — which  is  our  main  test. 

This  situation  can  arise  if  Executive  X  worked  at  Firm  A  and  Firm  C,  Executive  Y  worked  at  Firm  A  and  Firm  B,  and 
Executive  Z  worked  at  Firm  B  and  Firm  C.  This  results  in  three  executives,  but  only  three  degrees  of  freedom,  because 
there  are  only  three  firms. 
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ratio.  In  sum,  the  weight  of  the  evidence  throughout  the  tests  indicates  that  executives  are  impor¬ 
tant  factors  in  the  extent  to  which  firms  undertake  tax  avoidance. 

The  remaining  robustness  tests  examine  whether  industry  effects  could  be  driving  our  results. 
First,  we  examine  whether  the  significant  executives  are  clustered  in  a  few  industries  or  are  spread 
out  over  a  variety  of  industries.  We  find  that  they  are  well  spread  across  industries.  For  example, 
in  untabulated  results  using  the  17-industry  Fama-French  grouping,  we  find  significant  managers 
in  15  out  of  17  industries  using  GAAP  ETR  and  17  of  17  industries  using  CASH  ETR.  Moreover, 
the  percentage  of  significant  executives  closely  tracks  the  percentage  of  observations  in  each 
industry.  Similar  findings  obtain  when  we  use  30  industry  groupings. 

Second,  we  separately  examine  the  distribution  of  significant  positive  and  negative  executive 
coefficients  across  industries.  We  again  find  significant  executives  to  be  well  distributed  across 
industries.  Third,  we  examine  results  separately  for  executives  who  switch  industries  (43  percent 
of  executives)  and  do  not  switch  industries  (57  percent  of  executives)  when  they  switch  jobs.  As 
in  the  other  tests,  the  significant  executives  are  well  spread  across  industries  for  both  groups. 
Overall,  the  results  do  not  appear  to  be  driven  by  industry  effects. 

Event  Time  Analysis  of  Effective  Tax  Rates 

We  next  illustrate  the  changes  in  GAAP  ETR  and  CASH  ETR  in  the  periods  before  and  after 
the  executives  join  their  respective  firms  and  the  periods  before  and  after  the  executives  depart 
their  firms.  We  begin  by  classifying  executives  into  one  of  four  categories  based  on  our  regression 
results  above:  GAAP  Erf?-reducing  executives  (those  with  a  significantly  negative  fixed  effect  at 
the  5  percent  level  in  the  Table  3,  Model  8  regression),  GAAP  ETP-increasing  executives  (those 
with  a  significantly  positive  fixed  effect  at  the  5  percent  level),  no  effect  GAAP  ETR  executives 
with  positive  coefficients  (those  with  an  insignificant  positive  fixed  effect),  and  no  effect  GAAP 
ETR  executives  with  negative  coefficients  (those  with  an  insignificant  negative  fixed  effect).  We 
then  normalize  each  firm’s  string  of  GAAP  ETR  data  by  the  mean  GAAP  ETR  in  the  three  years 
leading  up  to  the  executive  joining  the  firm  (t— 3  to  t— 1).  This  procedure  allows  us  to  interpret  the 
figure  as  the  change  in  the  firm’s  GAAP  ETR  incremental  to  what  the  firm’s  average  GAAP  ETR 
was  before  the  executive  arrived.  Because  the  data  are  classified  based  upon  our  regression  results, 
we  do  not  view  these  as  independent  tests,  but  rather  as  a  graphical  illustration  of  what  a  signifi¬ 
cant  executive  fixed  effect  indicates  is  occurring  in  the  data. 

Figure  2,  Panel  A  shows  the  change  in  firms’  GAAP  ETR  around  the  date  of  hiring.  Firms  run 
by  GAAP  PPP-increasing  executives  see  their  GAAP  ETR  increase  by  approximately  15  percent¬ 
age  points  by  three  years  after  the  year  of  hiring.  Firms  run  by  GAAP  EPP-decreasing  executives 
see  their  GAAP  ETR  decrease  steadily  by  slightly  more  than  15  percentage  points  by  year  r+3. 
Firms  run  by  executives  whose  fixed  effects  are  not  significant  in  Table  3  show  changes  in  their 
GAAP  ETRs  of  about  five  percentage  points  in  absolute  value  after  the  executive  joins  the  firm. 

Figure  2,  Panel  B  shows  the  corresponding  results  for  the  date  of  departure.  Once  the  execu¬ 
tives  leave  their  firms,  the  GAAP  ETR  reverts  back  to  levels  they  had  been  prior  to  the  executives’ 
employment.  Together,  Panels  A  and  B  show  that  if  GAAP  ETR  were  changing  due  to  some  factor 
other  than  the  new  executives,  it  would  have  to  perfectly  coincide  with  both  the  executives’  hiring 
dates  and  the  executives’  departure  dates  to  explain  the  data. 

Panels  C  and  D  of  Figure  2  present  the  changes  in  CASH  ETR  surrounding  the  dates  of  hire 
and  departure,  where  the  year  of  departure  is  t  =  0.  The  pattern  is  similar  to  what  we  find  in  the 
GAAP  ETR  data.  Once  the  executives  join  their  firms,  the  CASH  ETR  begins  to  diverge.  After  the 
executives  leave  their  firms,  the  CASH  ETR  reverts  back  to  the  levels  that  existed  prior  to  the 


See  http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html. 
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FIGURE  2 

Illustration  of  the  Change  in  GAAP  ETR  and  CASH  ETR  Surrounding  the  Year  of  Executive 

Hire  and  Departure 


Panel  A:  Change  in  GAAP  ETR  Surrounding  Date  of  Hire 
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Pane!  B:  Change  in  GAAP  ETR  Surrounding  Date  of  Departure 
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Panel  C;  Change  in  CIS//  ETR  Surrounding  Date  of  Hire 
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Panel  D:  Change  in  CASH  ETR  SuiTounding  Date  of  Departure 


0.25 

0.20 

0.15 

0.10 

0.05 

0.00 

-0.05 

-0.10 

-0,15 

-0.20 

-0.25 


Event  Time  Relative  to  Manager  Tenure 
0  =  Year  of  Departure 


♦  CASH  ETR  Increasing  Executives 

'  No  Effect  Increasing  CASH  ETR 
Executives 

B  '  No  Effect  Decreasing  CASH  ETR 
Executives 

-A— CASH  ETR  Decreasing 
Executives 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1180 


Dyreng,  Hanlon,  and  Maydew 


The  figure  shows  change  in  effective  tax  rates  around  the  hiring  (Panels  A  and  C)  and  departure  (Panels  B  and 
D)  of  the  executive.  Managers  are  assigned  to  decreasing,  increasing,  and  no-effect  groups  by  observing  the 
manager’s  fixed  effect  coefficient.  The  no-effect  groups  are  further  split  into  those  with  positive  insignificant 
coefficients  and  those  with  negative  insignificant  coefficients.  Managers  that  have  a  statistically  negative  coef¬ 
ficient  (at  the  5  percent  level)  are  labeled  “decreasing”  managers.  Managers  that  have  a  statistically  positive 
coefficient  (at  the  5  percent  level)  are  labeled  “increasing”  managers.  Managers  whose  coefficient  is  insignifi¬ 
cant  are  labeled  “no  effect”  managers.  Each  firm’s  GAAP  ETR  or  CASH  ETR  is  normalized  using  its  average 
value  of  GAAP  ETR  or  CASH  ETR  from  the  event-time  period  (-3  to  -1). 


executive  joining  the  firm.  Collectively,  the  results  in  Figure  2  illustrate  the  changes  to  GAAP  ETR 
and  CASH  ETR  when  executives  join  firms  and  the  return  of  GAAP  ETR  and  CASH  ETR  to  their 
prior  levels  once  executives  depart. 

Our  tests  up  to  this  point  have  focused  on  determining  whether  executives  have  a  statistically 
significant  incremental  effect  on  their  firms’  reported  effective  tax  rates,  beyond  firm  fixed  effects 
and  time-varying  firm  characteristics.  In  the  next  section  we  examine  whether  the  results  are 
economically  significant. 

The  Economic  Magnitude  of  Executive  Effects  on  Tax  Avoidance 

An  appealing  aspect  of  the  empirical  design  (i.e.,  estimating  a  regression  of  an  ETR  on 
executive  fixed  effects)  is  that  it  generates  a  tax  avoidance  estimate  for  each  executive  in  the 
sample.  Thus,  the  coefficient  on  Executive  A’s  fixed  effect  is  an  estimate  of  Executive  A’s  influ¬ 
ence  on  the  effective  tax  rates  of  the  firms  where  she/he  has  been  employed.  In  Table  4,  we 
present  the  distribution  of  the  executive  fixed  effect  coefficients.  In  the  table  we  include  all 
executive  fixed  effect  coefficients,  be  they  CEO,  CFO  or  other  executives.  The  first  two  rows 
document  the  distribution  of  fixed  effects  for  the  primary  variables  of  interest  in  the  study,  namely 
GAiAP  ETR  and  CASH  ETR.  The  remaining  rows  show  the  distribution  of  estimated  executive 
fixed  effects  for  other  economic  outcomes.  These  additional  effects  are  included  in  the  table  to 
show  the  economic  magnitude  of  executive  effects  on  other  economic  outcomes. 

The  results  indicate  that  executives  can  have  economically  large  effects  on  their  firms’  effec¬ 
tive  tax  rates,  in  both  directions.  Among  the  entire  set  of  executives,  those  at  the  25th  percentile 
showed  a  GAAP  ETR  reduction  of  5.7  percent,  while  those  at  the  75th  percentile  were  associated 
with  a  GAiAP  ETR  increase  of  5.4  percent.  Since  reductions  in  GAAP  effective  tax  rates  translate 
directly  into  increased  reported  earnings,  executives  effective  at  promoting  tax  avoidance  have  a 
significant  effect  on  their  firms’  reported  earnings  that  is  proportionally  larger  than  their  impact  on 
the  GAAP  effective  tax  rate.^^  For  example,  consider  a  firm  that,  given  a  certain  level  of  pre-tax 
earnings  and  absent  an  executive  specific  effect,  would  have  an  effective  tax  rate  of  40  percent.  An 
executive  who  could  reduce  the  effective  tax  rate  by  5  percent  would  increase  after-tax  earnings  by 
about  8.33  percent,  as  given  by  (t  -  F)  /  ( 1  where  t  is  the  effective  tax  rate  with  no  executive 
effect  and  t'  is  the  effective  tax  rate  after  the  executive  effect.  The  effects  on  CASH  ETR  are 


Naturally,  these  are  simply  estimates  and  are  measured  with  error  as  indicated  by  the  standard  errors  of  the  respective 
coefficients. 

We  assume  no  implicit  tax  effect  (cost  to  avoid  the  tax)  in  our  computations  but  consider  a  given  level  of  pre-tax 
earnings.  We  leave  a  detailed  consideration  of  the  consequences  of  executive  determined  tax  avoidance  to  future 
research. 
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TABLE  4 


Magnitude  of  Fixed  Effects 


Fixed  Effect 

n 

Mean 

Std.  Dev. 

P25 

P50 

P75 

GAAP  ETR 

795 

0.001 

0.112 

-0.057 

-0.001 

0.054 

CASH  ETR 

783 

-0.010 

0.144 

-0.080 

-0.003 

0.063 

EBITDA 

896 

0.001 

0.074 

-0.034 

0.003 

0.032 

R&D 

891 

-0.005 

0.041 

-0.009 

-0.001 

0.005 

ADVERTISING 

891 

0.000 

0.013 

-0.003 

-0.000 

0.002 

SG&A 

660 

-0.009 

0.126 

-0.036 

-0.002 

0.029 

CAPITAL  EXPENDITURES 

751 

0.005 

0.194 

-0.060 

0.008 

0.079 

PERCENTAGE  CHANGE  IN  SALES 

908 

0.003 

0.253 

-0.089 

-0.002 

0.098 

LEVERAGE 

891 

0.004 

0.089 

-0.044 

0.004 

0.053 

CASH  HOLDINGS 

891 

0.003 

0.123 

-0.035 

0.006 

0.051 

FOREIGN  OPERATIONS 

908 

0.018 

0.210 

-0.052 

0.002 

0.070 

The  table  presents  the  distribution  of  the  executive  fixed  effects  estimated  in  Table  3,  Panels  A  and  B  (for  GAAP  ETR  and 
CASH  ETR,  respectively).  The  estimation  of  the  executive  fixed  effects  for  other  variables  was  not  reported.  The  first 
column  represents  the  number  of  executives  for  which  the  effect  was  estimated.  The  remaining  columns  describe  the 
distribution. 


similar  in  magnitude  to  the  effects  on  GAAP  ETR.  Executives  at  the  25th  percentile  led  to  a 
reduction  in  their  firms’  CASH  ETR  of  8.0  percent,  while  those  at  the  75th  percentile  led  to  an 
increase  in  their  firms’  CASH  ETR  of  6.3  percent.  Thus,  the  results  indicate  that  the  effects  that 
executives  appear  to  have  on  GAAP  ETR  and  CASH  ETR  can  be  economically  significant. 

The  remaining  rows  in  Table  4  present  the  distributions  of  executive  effects  on  other  out¬ 
comes.  These  results  are  generally  in  line  with  those  in  Bertrand  and  Schoar  (2003).  For  example, 
results  show  that  an  executive  at  the  25th  percentile  of  the  distribution  of  executive  effects  on 
EBITDA  reduces  EBITDA  by  about  3.4  percent  of  assets,  while  an  executive  at  the  75th  percentile 
of  the  distribution  of  executive  effects  on  EBITDA  increases  EBITDA  by  about  3.2  percent  of 
assets.  Below,  we  provide  some  initial  evidence  about  whether  executives  who  are  effective  at 
influencing  tax  avoidance  have  particular  attributes  and/or  also  tend  to  have  predictable  manage¬ 
rial  styles.  We  turn  to  these  issues  in  the  next  two  sections. 

Management  Styles  and  Tax  Avoidance 

Prior  research  has  found  evidence  that  executives  have  different  “styles.”  For  example,  some 
executives  may  emphasize  growth  strategies.  In  this  section,  we  examine  whether  executives’ 
tendencies  toward  tax  avoidance  are  associated  with  other  individual  specific  effects  that  the 
executives  have  on  their  firms’  operational  and  financial  decisions  and  outcomes.  It  is  possible,  for 
example,  that  executives  who  emphasize  cost-cutting  in  general  are  the  same  executives  who 
emphasize  tax  avoidance  (i.e.,  as  one  way  to  cut  costs).  On  the  other  hand,  it  is  possible  that  the 
propensity  for  tax  avoidance  is  independent  of  other  decisions  made  in  the  firm  and  not  associated 
with  any  other  executive  styles. 

We  begin  by  estimating  Equation  (1)  with  a  variety  of  dependent  variables  other  than  GAAP 
ETR  and  CASH  ETR.  Following  prior  research  and  including  a  variable  for  foreign  operations,  we 
use  the  following  operational  and  financial  variables  as  dependent  variables  in  separate  Equation 
(l)-type  regressions:  EBITDA,  R&D,  ADVERTISING,  SG&A,  CAPITAL  EXPENDITURES,  PER¬ 
CENTAGE  CHANGE  IN  SALES,  LEVERAGE,  CASH  HOLDINGS,  and  FOREIGN  OPERATIONS 
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(defined  in  Table  2).  Each  of  these  dependent  variables  is  selected  because  it  is  plausible  ex  ante 
that  it  could  represent  the  type  of  activity  in  which  a  manager  interested  in  tax  avoidance  would 
engage.  For  example,  executives  who  minimize  SG&A  might  be  the  type  who  cut  all  costs, 
including  taxes.  Further,  executives  with  high  intangibles  (R&D  spending)  and/or  more  foreign 
operations  may  have  more  opportunities  to  reduce  effective  tax  rates. 

The  analysis  works  as  follows.  We  regress  each  variable,  for  example  CAPITAL  EXPENDI- 
TURES,  on  a  vector  of  time-varying  controls,  firm,  year,  and  executive  fixed  effects.  We  then  test 
whether  the  executive  fixed  effect  coefficients  that  we  estimate  from  these  regressions  are  asso¬ 
ciated  with  the  executive  fixed  effect  coefficients  from  the  tax  regressions  discussed  earlier.  For 
each  pair  of  coefficients,  we  estimate  a  univariate  regression  of  the  following  form: 

«m  =  ^0  + +  (2) 

where  dr^  is  the  estimated  fixed  effect  coefficient  for  executive  m  estimated  in  Equation  (1)  when 
Y  (which  represents  either  GAAP  ETR  or  CASH  ETR)  is  the  dependent  variable  in  Equation  (1), 
and  is  the  estimated  fixed  effect  coefficient  for  executive  m  in  Equation  (1)  when  Z  (which 
represents  the  nontax  variables  listed  above)  is  the  dependent  variable  in  Equation  (1). 

It  is  important  to  note  that  these  are  inherently  low-power  tests,  as  the  dependent  variables  are 
themselves  coefficients  from  the  earlier  executive  effects  regressions  and,  thus,  are  noisy  esti¬ 
mates.  The  noise,  in  a  univariate  regression,  will  lead  to  downward  bias  in  an  OLS  estimation  of 
yS.  We  attempt  to  mitigate  this  problem  by  using  a  GLS  estimation  technique — weighting  the 
observation  by  the  inverse  of  the  standard  error  of  the  independent  variable  (Bertrand  and  Schoar 
2003). 

Table  5  presents  the  results.  Note  that  each  entry  in  the  table  represents  the  coefficient  from  a 
separate  univariate  weighted  least-squares  regression.  In  general,  the  data  are  consistent  with 
executive  effects  on  GAAP  ETR  being  positively  associated  with  executive  effects  on  accounting 
income,  as  evidenced  by  the  positive  coefficient  on  EBITDA.  This  suggests  the  possibility  of  an 
implicit  tax  effect  whereby  activities  that  are  tax-favored  also  have  low  pretax  rates  of  return 
(Scholes  et  al.  2009,  Jennings  et  al.  2009),  or  it  could  be  that  executives  who  are  not  good  at 
increasing  operating  income  make  up  for  their  deficiency  by  being  good  at  tax  avoidance. 

We  find  very  little  evidence  of  executive  fixed  effects  on  effective  tax  rates  being  correlated 
with  executive  fixed  effects  on  other  variables.*^  While  this  may  seem  surprising  at  first,  the 
results  are  consistent  with  prior  research  that  has  documented  a  high  degree  of  variability  in 
effective  tax  rates  across  firms,  even  within  the  same  industry  (Dyreng  et  al.  2008). 

Executive  Characteristics  and  Tax  Avoidance 

Prior  research  has  found  some  evidence  that  executives  with  different  backgrounds  have 
predictable  differences  in  managerial  style  (e.g.,  Bertrand  and  Schoar  2003).  Accordingly,  we 
attempt  to  obtain  biographical  information  about  each  of  the  executives  (e.g.,  age,  educational 
background,  gender)  in  our  sample.  Frequently  these  data  are  missing  from  ExecuComp.  For  each 
executive  with  missing  information,  we  hand-collect  biographical  information  through  Internet 


18 

We  do  not  report  the  regression  results  from  each  of  these  separate  regressions  in  the  interest  of  space.  However,  we  note 
that,  as  one  might  expect,  the  effect  of  an  executive  on  such  items  as  R&D  spending  and  leverage  is  greater  than  the 
executives’  effect  on  the  effective  tax  rate  measures.  For  example,  F-statistics  on  the  executive  fixed  effects  in  the  R&D 
regressions  range  (over  executive  ranks)  from  1. 0-5.0  and  for  leverage  the  range  is  2.35-3.74.  In  comparison,  the  range 
for  the  tax  regressions  (from  Table  3)  is  1.36-2.15. 

In  untabulated  robustness  tests,  for  each  test  in  this  table  we  exclude  the  one  specific  time  varying  control  (e.g.,  R&D) 
from  the  right-hand  side  of  Equation  (1),  where  we  have  GAAP  ETR  or  CASH  ETR  as  the  dependent  variable,  before  we 
use  that  executive  fixed  effect  in  the  Equation  (2)  tests  for  that  specific  variable  (e.g.,  R&D).  These  robustness  tests  have 
little  effect  on  the  results.  Across  the  18  test  statistics  in  the  table,  only  three  are  significant  at  the  5  percent  level. 
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TABLE  5 

Correlation  of  GAAP  ETR  and  CASH  ETR  Fixed  Effects  with  Other  Fixed  Effects 


Independent  Variable  Dependent  Variable 


Estimated  Fixed  Effect  from 

Equation  (1) 

for  Manager  m  When  Z  is  the 
Dependent 

Variable  and  Z  is: 

Estimated  Fixed  Effect  from 
Equation  (1)  for  Manager  m 
When  GAAP  ETR  is  the 
Dependent  Variable 

Estimated  Fixed  Effect  from 
Equation  (1)  for  Manager  m 
When  CASH  ETR  is  the 
Dependent  Variable 

EBITDA 

0.321*** 

-0.028 

(0.056) 

(0.072) 

R&D 

-0.158 

-0.170 

(0.110) 

(0.148) 

ADVERTISING 

0.633 

0.504 

(0.355) 

(0.460) 

SG&A 

-0.015 

0.047 

(0.043) 

(0.058) 

CAPITAL  EXPENDITURES 

0.048** 

0.048 

(0.024) 

(0.030) 

PERCENTAGE  CHANGE  IN  SALES 

0.029 

-0.035 

(0.017) 

(0.022) 

LEVERAGE 

-0.047 

0.038 

(0.047) 

(0.059) 

CASH  HOLDINGS 

-0.017 

0.063 

(0.034) 

(0.045) 

FOREIGN  OPERATIONS 

0.009 

-0.043* 

(0.018) 

(0.023) 

Represent  statistical  significance  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

This  table  presents  tests  of  association  between  managers’  “style”  and  their  proclivity  to  avoid  taxes.  Each  entry  in  the  mble 
represents  the  coefficient  from  the  following  model  run  separately  (univariately)  for  each  independent  variable  listed 
aboveia*'  =  /Sq  +  +  u^.  In  this  model,  is  the  estimated  fixed  effect  for  manager  m  when:  Y  (GAAP  ETR  or  CASH 

ETR)  is"the  dependent  variable  in  Equation  (1);  dl  is  the  estimated  fixed  effect  for  manager  m  when  Z  is  the  dependent 
variable  in  Equation  (1);  and  Z  is  one  of  the  following:  EBITDA,  R&D,  ADVERTISING,  SG&A,  CAPITAL  EXPENDI¬ 
TURES,  PERCENTAGE  CHANGE  IN  SALES,  LEVERAGE,  CASH  HOLDINGS,  EOREIGN  OPERATIONS  (all  of  which 
are  defined  in  Table  2).  The  estimate  of  /Sj  for  each  regression  is  presented  in  the  table,  with  the  respective  standard  error 
below  in  parentheses.  Descriptive  statistics  for  the  dependent  and  independent  variables  are  presented  in  Table  4. 


searches  for  those  executives.  Often,  we  could  obtain  additional  information  about  the  executives 
using  this  approach.  In  addition,  we  gather  information  about  the  executives  wealth  sensitivity  to 
changes  in  the  stock  price  of  their  firm,  proxies  for  their  degree  of  optimism  and/or  overconfi¬ 
dence,  and  whether  they  move  between  firms  as  part  of  a  larger  executive  team. 

Table  6  presents  descriptive  statistics  on  the  biographical  information  for  the  executives  in  the 
sample  having  available  data.  Panel  A  presents  the  data  for  the  subset  of  executives  who  have 
coefficients  estimated  when  GAAP  ETR  is  the  dependent  variable  in  Equation  (1).  Panel  B  pre¬ 
sents  the  data  for  the  subset  of  executives  who  have  coefficients  estimated  when  CASH  ETR  is  the 
dependent  variable  in  Equation  (1).  We  find  that  about  20  percent  of  the  sample  has  an  M.B.A. 
degree  from  a  top  10  institution  (as  defined  by  BusinessWeek).'^^  Another  18  percent  of  executives 


Rankings  can  be  found  online  at:  http:/^wnt.businessweek.com/interactive_reports/mba_domestic_2008/index.asp. 
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TABLE  6 

Descriptive  Statistics  of  Executive  Characteristics 

Panel  A:  Descriptive  Statistics  for  Regression  of  Executive  Fixed  GAAP  ETR  Effects  on  Executive 
Characteristics 


Variable 

n 

Mean 

Std.  Dev. 

P25 

P50 

P75 

TOPIO  MBA 

521 

0.200 

0.400 

0.000 

0.000 

0.000 

OTHER  MBA 

521 

0.182 

0.387 

0.000 

0.000 

0.000 

JURIS  DOCTORATE 

521 

0.119 

0.324 

0.000 

0.000 

0.000 

ACCOUNTING  DEGREE 

521 

0.106 

0.308 

0.000 

0.000 

0.000 

NO  DEGREE  RECORDED 

521 

0.075 

0.263 

0.000 

0.000 

0.000 

CFO  AT  PRIOR  FIRM 

521 

0.225 

0.418 

0.000 

0.000 

0.000 

FEMALE 

521 

0.044 

0.206 

0.000 

0.000 

0.000 

AVERAGE  TENURE 

521 

5.036 

1.463 

4.000 

5.000 

6.000 

AVERAGE  AGE 

521 

50.880 

6.761 

46.000 

50.769 

55.359 

SAME  INDUSTRY  SWITCH 

521 

0.438 

0.497 

0.000 

0.000 

1.000 

MOVES  AS  A  TEAM 

521 

0.148 

0.355 

0.000 

0.000 

0.000 

OVERCONFIDENCE  IN  FORECASTS 

521 

0.399 

0.269 

0.200 

0.400 

0.600 

LOG  OF  WEALTH  SENSITIVITY  TO  PRICE 

521 

9.283 

1.917 

8.099 

9.291 

10.644 

OPTIMISM  IN  lOK 

521 

0.000 

0.031 

-0.018 

0.000 

0.016 

Panel  B:  Descriptive  Statistics  for  Regression  of  Executive  Fixed  CASH  ETR  Effects 
Characteristics 

on  Executive 

Variable 

n 

Mean 

Std.  Dev. 

P25 

P50 

P75 

TOPIO  MBA 

513 

0.203 

0.402 

0.000 

0.000 

0.000 

OTHER  MBA 

513 

0.181 

0.386 

0.000 

0.000 

0.000 

JURIS  DOCTORATE 

513 

0.119 

0.324 

0.000 

0.000 

0.000 

ACCOUNTING  DEGREE 

513 

0.107 

0.310 

0.000 

0.000 

0.000 

NO  DEGREE  RECORDED 

513 

0.076 

0.265 

0.000 

0.000 

0.000 

CFO  AT  PRIOR  FIRM 

513 

0.224 

0.417 

0.000 

0.000 

0.000 

FEMALE 

513 

0.045 

0.207 

0.000 

0.000 

0.000 

AVERAGE  TENURE 

513 

5.043 

1.464 

4.000 

5.000 

6.000 

AVERAGE  AGE 

513 

50.904 

6.785 

46.000 

50.833 

55.442 

SAME  INDUSTRY  SWITCH 

513 

0.435 

0.496 

0.000 

0.000 

1.000 

MOVES  AS  A  TEAM 

513 

0.150 

0.358 

0.000 

0.000 

0.000 

OVERCONFIDENCE  IN  FORECASTS 

513 

0.399 

0.270 

0.200 

0.403 

0.600 

LOG  OF  WEALTH  SENSITIVITY  TO  PRICE 

513 

9.287 

1.925 

8.099 

9.294 

10.647 

OPTIMISM  IN  lOK 

513 

0.000 

0.031 

-0.018 

0.000 

0.016 

have  an  M.B.A.  from  other  than  top  10  institutions.  About  12  percent  of  the  executives  have  law 
degrees,  1 1  percent  have  accounting  degrees,  and  about  7  percent  have  no  degree  recorded.  We 
focus  on  M.B.A.,  J.D.,  and  accounting  degrees  because  it  is  possible  that  executives  with  financial 
or  legal  training  might  be  more  likely  to  emphasize  tax  avoidance.  About  23  percent  of  the  sample 
was  CFO  at  his/her  prior  firm  (unconditional  on  current  title),  and  about  5  percent  of  the  sample 
is  female.  The  average  executive’s  tenure  is  about  five  years  per  firm.  Executives  average  about  51 
years  of  age  during  their  tenure.  About  44  percent  of  our  sample  executives  switch  from  one  firm 
to  another  within  the  same  two-digit  SIC  code,  and  about  15  percent  eventually  end  up  working 
with  the  same  executives  at  both  firms  (although  the  years  do  not  overlap  perfectly  so  as  to  allow 
for  statistical  estimation  in  the  first-stage  model). 
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We  also  include  estimates  of  the  executive’s  overconfidence,  wealth  sensitivity,  and  optimism. 
We  measure  executive  overconfidence  as  the  average  frequency  the  firm  missed  its  own  manage¬ 
rial  forecast  during  the  years  the  executive  was  employed  (Hribar  and  Yang  2007).  We  calculate 
the  executive’s  wealth  sensitivity  to  a  percentage  change  in  stock  price  following  Core  and  Guay 
(1999).  To  measure  optimism,  we  download  all  10-K  filings  for  all  firm-years  in  our  sample  from 
http ;//w ww.sec.gov.  We  then  use  the  software  “General  Inquirer”  to  count  the  number  of  positive 
and  negative  words  in  the  10-K  (see  Kothari  et  al.  [2009]  for  a  recent  study  that  uses  the  General 
Inquirer  software  in  an  accounting  disclosure  context).  We  use  the  ratio  of  positive  words  to 
negative  words  as  a  measure  of  optimism.  We  subtract  each  firm-year  value  of  optimism  from  the 
firm’s  average  optimism  value.  For  each  executive,  we  take  the  average  of  the  optimism  scores  for 
the  firm-years  for  which  s/he  has  sufficient  data  in  our  sample.  Thus,  the  executive’s  optimism 
score  is  his/her  average  optimism  at  the  firms  at  which  she/he  was  employed  during  the  years 
she/he  was  employed,  adjusting  for  each  firm’s  mean  optimism. 

Table  7  presents  the  results  of  regressions  of  executive  fixed  effect  coefficients  obtained  from 
the  empirical  estimation  of  Equation  (1)  on  variables  that  eapture  the  biographical  characteristics 
from  Table  6.  The  first  two  columns  present  the  results  using  executive  fixed  effect  coefficients 
from  the  GAAP  ETR  model  of  Equation  (1)  as  the  dependent  variable.  The  second  two  columns 
present  the  results  using  the  exeeutive  fixed  effect  coefficients  from  the  CASH  ETR  model  of 
Equation  (1)  as  the  dependent  variable. 

Executive  backgrounds  do  not  explain  much  of  the  variation  in  tax  avoidance  across  execu¬ 
tives.  We  find  no  relation  between  the  educational  background  of  the  executives  in  our  sample  and 
their  propensities  to  reduce  effective  tax  rates.  The  lack  of  connection  with  financial  educational 
background  suggests  that  executives’  effects  on  tax  avoidance  is  likely  due  more  to  the  “tone  at  the 
top”  than  it  is  to  direct  involvement  by  the  executives  in  the  tax  avoidance. 

We  also  test  whether  the  executive  effects  depend  on  whether  the  executive  moved  to  a  firm 
in  a  different  industry  or  stayed  in  the  same  industry,  and  whether  the  executive  moved  alone  or  as 
part  of  a  team.  There  is  weak  evidence  that  staying  in  the  same  industry  is  associated  with  less 
executive-specific  tax  avoidance  in  the  sense  of  higher  CASH  ETR,  but  there  is  no  significant 
difference  for  GAAP  ETR.  The  results  indicate  no  difference  in  the  executive  effects  between 
moving  alone  and  moving  as  a  team.  We  find  weak  evidence  of  a  link  between  optimism  in  the 
firm’s  10-K  and  lower  cash  effective  tax  rates.  That  is,  it  appears  that  more  optimistic  executives 
reduce  CASH  ETRs  to  a  greater  extent.  There  is  no  evidence  that  the  executive’s  level  of  over- 
confidence  influences  his/her  ability  to  affect  effective  tax  rates.  Einally,  we  examine  the  execu¬ 
tive’s  compensation  structure  and  find  no  significant  association  with  the  executive  fixed  effects. 

Overall,  the  results  indicate  that  biographical  information  such  as  educational  background  and 
age  does  not  explain  much  of  the  variation  in  tax  avoidance  across  executives.  We  note  that  this 
finding  does  not  suggest  that  executives  have  no  effect  on  tax  avoidance.  Instead,  these  results  are 
evidence  that  common,  observable  characteristics  are  not  strongly  associated  with  executives 
propensities  to  reduce  effective  tax  rates. 


V.  CONCLUSIONS 

We  track  the  movement  of  908  CEOs,  CFOs,  and  other  executives  across  firms  during  the 
period  1992  to  2006  in  order  to  examine  whether  individual  executives  have  an  effect  on  their 
firms’  tax  avoidance.  By  examining  executives  who  switch  firms,  we  attempt  to  control  for  firm 
fixed  effects  and  identify  executive-specific  effects.  Results  indicate  that  individual  executives 
play  a  significant  role  in  determining  the  level  of  tax  avoidance  that  firms  undertake,  incremental 
to  characteristics  of  the  firm.  Analyzing  executive  effects  in  event  time  shows  results  consistent 
with  executive-specific  tax  avoidance  that  arises  shortly  after  the  executive  arrives  and  stops  after 
the  executive  leaves.  In  addition,  an  executive’s  effect  on  tax  avoidance  at  his/her  first  firm  tends 
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TABLE  7 


Tax  Fixed  Effects  on  Manager  Characteristics 


Parameter 

GAAP  ETR  Fixed  Effects 

CASH  ETR  Fixed  Effects 

Estimate 

t-stat 

Estimate 

t-stat 

Intercept 

-0.022 

-0.53 

-0.044 

-0.99 

TOPIO  MBA 

-0.014 

-1.10 

0.005 

0.30 

OTHER  MBA 

0.000 

0.00 

-0.003 

-0.22 

JURIS  DOCTORATE 

0.007 

0.45 

0.012 

0.69 

ACCOUNTING  DEGREE 

-0.007 

-0.45 

0.011 

0.43 

NO  DEGREE  RECORDED 

-0.009 

-0.48 

-0.006 

-0.39 

CEO  AT  PRIOR  EIRM 

-0.013 

-1.12 

0.016 

1.05 

EEMALE 

0.009 

0.39 

0.035 

1.34 

AVERAGE  TENURE 

-0.002 

-0.64 

0.003 

0.93 

AVERAGE  AGE 

0.000 

0.43 

0.001 

1.33 

SAME  INDUSTRY  SWITCH 

0.007 

0.72 

0.021* 

1.89 

MOVES  AS  TEAM 

-0.009 

-0.63 

0.021 

1.23 

OVERCONEIDENCE  IN  EORECASTS 

0.004 

0.25 

-0.012 

-0.54 

LOG  OE  WEALTH  SENSITIVITY 

0.003 

0.96 

-0.004 

-1.44 

OPTIMISM  IN  lOK 

-0.107 

-0.71 

-0.402** 

-2.14 

n 

521 

513 

r2  0.012  0.025 

ADJRSQ  -0.015  -0.002 


*,  **,  ***  Represent  statistical  significance  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

This  table  presents  results  from  testing  the  association  of  manager  characteristics  with  manager  fixed  effects  from  Equation 
(1).  The  model  is: 

=  (Ug  +  (Oj^CHARACTERISTICf^  +  g„. 


where:  <5^  is  the  estimated  fixed  effect  for  manager  m  when  Y  {GAAP  ETR  in  Panel  A  or  CASH  ETR  in  Panel  B)  is  the 
dependent  variable  in  Equation  (1);  and  CHARACTERISTIC^^  is  the  fcth  characteristic  of  manager  m.  We  include  the 
following  characteristics:  TOPIO  MBA  indicates  the  manager  earned  his  M.B.A.  from  a  business  school  ranked  in  the  top 
10  in  the  2008  issue  of  BusinessWeek  (available  at:  http://www.businessweek.com/bschools/ranldngs/). 


Variable  Definitions: 

OTHER  MBA  = 

JURIS  DOCTORATE  = 
ACCOUNTING  DEGREE  = 
NO  DEGREE  RECORDED  = 
CEO  AT  PRIOR  EIRM  = 
AVERAGE  TENURE  = 
AVERAGE  AGE  = 
SAME  INDUSTRY  SWITCH  = 
OPTIMISM  IN  lOK  = 


OVERCONFIDENCE  IN 
FORECASTS  = 

LOG  OF  WEALTH 
SENSITIVITY  = 


the  manager  earned  his  M.B.A.  from  a  business  school  other  than  one  of  the  top  10 
business  schools; 

the  manager  earned  a  Juris  Doctor; 

at  least  one  of  the  manager’s  degrees  was  in  accounting  or  had  an  accounting  emphasis; 
no  information  was  available  about  the  manager’s  education; 
the  manager  was  a  CFO  at  his  prior  place  of  employment; 

the  average  number  of  years  the  manager  spent  at  each  company  recorded  in  ExecuComp; 

the  average  age  of  the  manager  during  his  tenure  as  an  executive; 

the  manager  switched  firms  within  the  same  two-digit  SIC  code; 

the  average  level  of  optimistic  words  to  pessimistic  words  in  the  annual  10-K  reports 

for  the  firm-years  during  which  the  manager  worked  at  the  firm  (adjusted  for  the  firm 

mean).  We  used  the  software  General  Inquirer  to  classify  words  as  optimistic  or 

pessimistic.  Documentation  is  available  at:  http://www.wjh.harvard.edu/~inquirer/; 

the  number  of  times  the  firm  missed  its  own  earnings  forecast  relative  to  the  number  of 
forecasts  issued;  management  earnings  forecasts  were  gathered  from  First  Call;  and 

the  log  of  the  manager’s  wealth  sensitivity  to  a  1  percent  change  in  stock  price, 
calculated  as  in  Core  and  Guay  (1999)  and  averaged  over  the  executive’s  tenure  at  both 
firms;  we  use  iteratively  reweighted  robust  regression  to  control  for  extreme 
observations. 
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to  be  positively  associated  with  his/her  effect  at  the  second  firm,  mitigating  potential  concerns  that 
our  results  are  driven  by  random  effective  tax  rate  changes  at  only  one  firm  during  the  executive’s 
tenure. 

Overall,  our  study  documents  that  individual  executives  exhibit  different  proclivities  toward 
tax  avoidance.  This  variation  in  tax  avoidance  proclivity  across  executives  affects  their  firms’  tax 
avoidance  in  ways  that  cannot  be  explained  by  firm  characteristics.  Moreover,  the  evidence  indi¬ 
cates  that  these  executive  effects  can  be  economically  large.  Given  the  difficulty  of  explaining 
effective  tax  rates  or  other  measures  of  tax  avoidance,  documenting  that  executives  matter  to  their 
firms’  effective  tax  rates  is  an  important  first  step  and  contribution. 

These  findings  raise  many  questions  that  we  hope  will  be  addressed  by  future  research.  It 
would  be  interesting  to  know  what  happens  to  these  executives  and  the  firms  they  manage  in  the 
future.  Perhaps  the  executives  who  appear  to  emphasize  tax  avoidance  simply  lead  their  firms  to 
take  on  substantial  tax  risk.  In  that  case,  we  might  expect  to  see  additional  taxes  being  paid  in 
future  years  as  the  firms  undergo  auditing  by  the  IRS,  perhaps  on  another  executive’s  watch.  What 
happens  to  the  career  prospects  of  executives  who  emphasize  or  de-emphasize  tax  avoidance?  In 
addition,  future  research  could  examine  whether  tax  avoidance  indicates  anything  about  the  ex¬ 
ecutives’  propensity  to  emphasize  other  behavior,  including  behavior  that  may  not  be  in  the  best 
interests  of  the  firm.  These  are  just  a  few  of  the  questions  raised  by  this  study  that  we  believe  can 
lead  to  further  insights. 
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ABSTRACT:  We  examine  how  biased  financial  reports  (managed  earnings)  affect 
product  market  competition  and  how  product  market  competition  affects  incentives  to 
bias  financial  reports  in  a  model  with  fully  rational  firms.  We  find  that  Cournot  competi¬ 
tors  bias  their  reports  to  create  the  impression  that  their  costs  are  lower  than  they 
actually  are.  This  bias  leads  to  lower  total  production  and  a  higher  product  price,  even 
though  each  firm  fully  understands  its  rival’s  incentives  to  bias  its  financial  reports.  The 
magnitude  of  the  bias  is  larger  when  firms  compete  in  more  profitable  product  markets 
and  smaller  when  the  firm  can  extract  more  information  about  its  rival’s  costs  from  its 
own.  When  the  costs  of  misreporting  are  asymmetric,  the  lower-cost  firm  engages  in 
more  earnings  management  than  its  rival,  produces  more  than  it  would  in  a  full- 
information  environment,  and  earns  greater  profits.  Our  analysis  leads  to  new,  testable 
relations  among  earnings  management,  reported  and  actual  earnings,  and  industry 
structure. 

Keywords:  earnings  management;  reporting  bias;  product  market  competition;  infor¬ 


mation  asymmetry. 

JEL  Classifications:  G1;  G14;  M41. 


I.  INTRODUCTION 


e  develop  a  theoretical  model  to  examine  how  earnings  management  affects  product 
market  competition  and  how  competition  affects  earnings  management  decisions.  A 


T  y  key  feature  of  our  analysis  is  that,  in  addition  to  an  information  extraction  problem, 
there  is  an  important  component  of  rivalry  in  the  firms’  earnings  management  and  production 
decisions.  The  motivation  for  our  study  arises,  in  part,  from  claims  made  by  C.  Michael  Armstrong 
who  was  CEO  of  AT&T  from  1997  to  2002,  one  of  the  most  turbulent  times  for  the  company.  He 
has  recently  argued  that  accounting  fraud  at  WorldCom  was  the  cause  of  AT&T  s  perceived 
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Strategic  failures,  its  inability  to  compete  with  WorldCom  and,  in  the  end,  the  decision  to  break  up 
the  company.  He  specifically  suggests  that  WorldCom’s  fraudulently  reported  revenues,  margins 
and  costs  drove  AT&T’s  layoffs,  cost-cutting,  and  a  very  unprofitable  price  war  that  left  AT&T 
unable  to  service  the  debt  he  incurred  to  revive  the  company.  This  view  is  supported  by  William 
Estry  who  was  Sprint’s  CEO  during  that  same  time.* 

To  address  our  research  questions,  we  employ  an  incomplete  information  Cournot  duopoly 
model  in  which  each  firm  knows  its  own  production  costs,  but  not  its  rival’s.  In  our  model,  each 
firm  provides  a  disclosure  through,  for  example,  its  income  statement.  A  firm’s  rival  can  use  the 
disclosure  to  update  its  beliefs  about  the  disclosing  firm’s  production  costs  prior  to  competing  in 
the  product  market.  Our  model  differs  from  prior  work  on  disclosure  in  incomplete  information 
Cournot  models  because  we  assume  that  firms  can  provide  biased  reports  and  that  all  firms 
optimally  update  their  beliefs  using  all  available  information.^  If  firms  bias  their  reports,  they  incur 
a  cost  of  misreporting.  We  show  that  when  the  firms  know  each  other’s  cost  of  misreporting,  there 
are  no  linear  equilibria.  However,  when  these  costs  are  also  private  information,  there  are  linear 
equilibria  in  which  the  firms  bias  their  disclosures  to  gain  a  competitive  advantage  in  the  product 
market.  Thus,  our  study  complements  and  extends  the  results  in  Fischer  and  Verrecchia  (2004).  In 
their  model,  only  one  firm  has  private  information,  and  biased  reporting  in  equilibrium  requires 
that  some  firms  use  heuristics  when  adjusting  their  beliefs  (i.e.,  some  firms  are  not  Bayesian).  Our 
model  differs  from  theirs  in  two  important  ways.  First,  we  allow  for  rivalry  to  impact  earnings 
management  decisions  (that  is,  more  than  one  firm  has  private  information)  and  second,  because 
all  firms  optimally  update  their  beliefs  (i.e.,  all  firms  are  Bayesian),  we  extend  the  literature  by 
showing  that  biased  reporting  can  occur  in  equilibrium  in  a  model  with  fully  rational  firms. 

Intuitively,  in  our  model,  each  firm’s  disclosure  is  designed  to  create  the  impression  that  its 
production  costs  are  lower  than  they  actually  are.  Further,  because  each  firm  uses  all  available 
information  efficiently  and  fully  understands  its  rival’s  incentives  to  bias  its  report,  each  adjusts  its 
beliefs  about  its  rival’s  production  costs  upward  relative  to  the  disclosure,  but  still  underestimates 
those  costs  in  equilibrium.  Consequently,  each  firm  cuts  its  production  below  the  full-information 
level,  causing  the  market  price  to  rise  and  each  firm’s  product  market  profits  to  increase,  in  turn 
providing  just  the  incentive  needed  for  each  firm  to  bias  its  disclosure.^  Our  analysis  indicates  that 
these  effects  are  smaller  when  firms  compete  in  more  profitable  product  markets  or  when  they  use 
reasonably  similar  technologies.  Interestingly,  while  the  effects  are  smaller  in  both  cases,  they 


’  See,  for  example,  the  book  written  in  2004  by  Dick  Martin,  the  former  head  of  public  relations  at  AT&T  (Martin  2004), 
or  the  articles  by  Searcy  (2005),  Blumenstein  and  Grant  (2004),  and  McConnell  et  al.  (2002).  While  many  are  familiar 
with  the  most  famous  aspects  of  WorldCom’s  accounting  fraud  (i.e.,  capitalizing  line  costs  and  booking  non-existent 
revenues),  there  appears  to  be  no  bright  line  between  earnings  management  and  fraud  (Mulford  and  Comiskey  2002,  41; 
Ronen  and  Yaari,  2008,  25-31;  Schilit  1993,  1;  or  the  survey  by  Beneish  2001).  This  is  probably  due,  in  part,  to  the 
inherent  difficulties  in  adequately  defining  earnings  management  and  distinguishing  it  from  fraud.  In  WorldCom’s  case, 
its  manipulation  of  earnings  began  slowly  and  became  more  egregious,  culminating  in  actions  that  virtually  all  agree  are 
fraud.  Some  of  the  early  activities  that  may  or  may  not  be  fraudulent  include  recording  and  releasing  reserves  and 
capitalizing  internal  labor  costs  related  to  line  installation  (Thornburgh  2002). 

^  The  prior  work  using  incomplete  information  Cournot  models  includes  Fried  (1984),  Shapiro  (1986),  Gal -Or  (1986, 
1988),  Darrough  (1993),  Raith  (1996),  and  Vives  (2002).  Also  see  the  surveys  by  'Verrecchia  (2001)  and  Vives  (2006). 
When  these  authors  focus  on  disclosure,  they  assume  that  the  firms’  disclosures  are  unbiased  but  that  the  firms  can 
determine  the  amount  of  noise  in  the  disclosure.  Thus,  if  the  firm  wishes  to  disclose,  then  it  chooses  not  to  add  noise  and 
if  it  wishes  not  to  disclose,  it  chooses  to  add  an  infinite  amount  of  noise. 

As  a  result,  the  transmission  mechanism  that  produces  value  to  biasing  reports  is  different  in  our  setting  than  in  Fischer 
and  Verrecchia  (2004).  In  their  setting,  the  Bayesian  firms  recognize  that  their  rivals  employ  heuristics  and  thus 
“overproduce”  relative  to  the  full-information  setting.  Realizing  this,  the  Bayesian  firms  accommodate  the  increased 
production  by  reducing  theirs.  In  our  model,  a  firm’s  biased  report  leaves  its  rivals  rationally  believing  that  the  reporting 
firm  s  costs  are  lower  than  they  actually  are,  which  leads  the  rival  to  believe  that  the  reporting  firm  will  produce  more 
than  it  actually  does.  As  a  result,  in  a  symmetric  equilibrium,  total  output  is  less  than  the  full-information  level  and  so 
both  firms  benefit. 
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differ  in  that  firms  engage  in  more  (less)  misreporting  in  the  first  (second)  situation.  Finally,  when 
firms  have  sufficiently  different  costs  of  misreporting,  the  firm  with  lower  costs  introduces  greater 
bias  into  its  reports,  produces  more  output  than  it  would  in  the  full-information  outcome,  and  earns 
greater  product  market  profits.  Thus,  our  analysis  supports  Armstrong’s  view  of  the  competitive 
impact  of  WorldCom’s  fraudulent  accounting  and  suggests  that  it  may  be  worth  examining  similar 
competitive  environments  for  similar  activity. 

By  focusing  on  equilibrium  incentives  to  bias  reports,  our  analysis  suggests  new  empirical 
implications  associated  with  the  use  of  different  earnings  management  techniques.  First,  only 
some  such  techniques  effectively  bias  inferences  about  the  firm’s  production  costs.  Thus,  our 
model  predicts  that  the  use  of  these  techniques  is  positively  correlated  with  standard  measures  of 
profitability  such  as  ROE  calculated  using  original  (unadjusted)  financial  statement  information 
but  negatively  correlated  with  these  measures  calculated  using  restated  (adjusted)  financial  state¬ 
ment  information,  if  available.^  In  addition,  if  incentives  to  use  other  earnings  management  tech¬ 
niques  are  orthogonal  to  the  product  market  incentives  that  are  the  focus  of  our  analysis,  then 
increases  in  the  cost  of  these  latter  techniques  (e.g.,  as  the  result  of  the  passage  of  the  Sarbanes- 
Oxley  Act  of  2002)  should  reduce  the  observed  proportions  of  their  use  among  SEC  enforcement 
actions.  Third,  we  show  that  firms  with  similar  production  technologies  will  be  less  likely  to 
employ  these  types  of  earnings  management  techniques.  Thus,  firms  whose  production  is  governed 
by  physical  or  chemical  processes  or  firms  in  mature  industries  are  less  likely  to  engage  in  this 
type  of  earnings  management  than  firms  in  service  industries  or  firms  that  have  significantly  larger 
portfolios  of  products.  Finally,  our  analysis  suggests  that  the  increased  use  of  deferred  prosecution 
agreements  (DPAs)  is  likely  to  lead  to  unexpected  consequences.  In  particular,  because  such 
agreements  dramatically  increase  the  cost  of  misreporting,  competitors  of  firms  operating  under 
DPAs  are  likely  to  respond  by  increasing  the  amount  of  misreporting  in  order  to  gain/increase  their 
competitive  advantage  in  the  product  market. 

In  addition  to  contributing  to  the  literature  on  the  interaction  between  disclosure  and  product 
market  competition,  our  study  also  highlights  and  extends  a  key  insight  of  Fischer  and  Verrecchia 
(2000)  into  biased  disclosures.  They  introduce  uncertainty  about  the  manager’s  objective  function 
into  a  standard  voluntary  disclosure  model  (e.g.,  Verrecchia  1983)  and  show  that  the  manager 
optimally  introduces  bias  into  the  disclosure.  Essentially,  the  agency  problem  combines  with 
incomplete  contracting  to  limit  the  market’s  ability  to  fully  adjust  its  expectation  of  firm  value 
leaving  the  manager  with  a  personal  benefit  to  offering  a  biased  voluntary  disclosure.  In  our 
model,  the  additional  uncertainty  associated  with  the  market  being  unsure  of  the  firm’s  reporting 
costs  opens  a  similar  opportunity  for  the  firm  to  bias  its  disclosure,  but  does  not  rely  on  the 
combination  of  an  agency  problem  and  incomplete  contracting.  Thus,  our  model  enriches  the 
understanding  of  biased  disclosures  by  showing  that  they  may  arise  even  if  all  agency  problems 

and  incomplete  contracting  issues  are  eliminated. 

The  literature  on  the  interaction  between  disclosures  and  product  market  competition  is  ex¬ 
tended  and  synthesized  in  Darrough  (1993),  generalized  in  Raith  (1996),  and  summarized  in 


Examples  of  this  type  of  eamings  management  include  fraudulent  or  aggressive  revenue  recognition,  aggressive  cost 
capitalization,  inclusion  of  operating  costs  in  restructuring  costs,  inappropriately  low  estimates  of  the  allowance  for 
doubtful  accounts  or  warranty  expense,  and  certain  types  of  “real”  earnings  management  such  as  delaying  expenditures. 
Examples  of  eamings  management  techniques  that  would  not  be  expected  to  lead  to  biased  inferences  about  production 

costs  include  channel  stuffing,  delayed  write-downs  of  assets,  and  the  “timely”  sale  of  assets. 

^  Since  adjusted  financial  statement  information  is  generally  only  available  for  firms  that  have  restated  eamings,  this  result 
is  likely  to  be  most  easily  tested  by  focusing  on  the  subset  of  firms  that  have  restated  eamings  and  admitted  to 
employing  eamings  management  techniques  that  effectively  bias  inferences  about  production  costs. 
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Vives  (2006).^  The  key  difference  between  this  literature  and  our  model  is  that  prior  work  required 
firms  to  make  unbiased  disclosures— firms  could  only  alter  the  amount  of  noise  in  their 
disclosure — whereas  we  permit  firms  to  bias  their  disclosures  at  a  cost.  This  difference  leads  to 
very  different  predictions.  For  Cournot  competitors  with  private  information  about  production 
costs,  Darrough  (1993)  and  others  have  shown  that  firms  prefer  to  precommit  to  disclosing  their 
private  information  without  adding  any  noise.  Intuitively,  adding  noise  results  in  the  firm’s  rival 
choosing  to  produce  more  than  it  would  if  fully  informed,  which  lowers  price  and  product  market 
profits.^  If  the  disclosure  decision  is  made  after  firms  learn  their  production  costs,  then  because 

o 

they  are  required  to  disclose  truthfully,  the  standard  unraveling  story  holds.  Our  results  comple¬ 
ment  this  literature  by  showing  that  when  fully  rational  firms  are  permitted  to  bias  disclosures 
(even  at  a  cost),  they  choose  to  do  so  and  that  this  decision  is  affected  by  and  affects  competition 
in  the  product  market.^ 

Our  analysis  is  also  related  to  recent  work  by  Sadka  (2006)  and  Kedia  and  Philippon  (2009). 
Sadka  (2006)  examines  accounting  fraud  in  a  competitive  (price-taking)  model  in  which  fraud  is 
detectable  unless  the  firm  alters  its  output  decisions  to  mimic  a  low-cost  producer.  He  shows  that 
accounting  fraud  lowers  social  welfare  because  the  fraudulent  firms  produce  too  much  output  at 
too  high  a  cost  and  the  low-cost  firms  produce  too  little  relative  to  the  Pareto  Efficient  outcome. 
Kedia  and  Philippon  (2009)  examine  accounting  fraud  in  a  signaling  framework  in  which  the 
fraudulent  firm  must  overinvest  and  overhire  in  order  to  minfic  the  more  efficient  producers.  They 
exploit  this  structure  to  suggest  why  the  detection  of  accounting  fraud  can  impact  not  only  the 
firm’s  rivals,  but  can  have  economy-wide  effects,  too.  Thus,  while  the  models  differ,  the  central 
idea  is  that  fraud  can  be  detected  by  examining  a  firm’s  real  (as  opposed  to  financial)  decisions  and 
that  this  is  the  transmission  mechanism  that  links  reporting  activity  to  real  effects.  Our  analysis 
complements  this  approach  by  showing  that  there  is  an  additional  linkage  by  which  firms  can 
influence  their  competitive  position  in  the  product  market  by  introducing  bias  into  their  financial 
reports. 

Section  II  contains  a  description  of  our  model  of  disclosure  and  product  market  competition. 
In  Section  III,  we  discuss  equilibrium  in  restricted  versions  of  our  model.  In  Section  IV,  we  focus 
on  symmetric  equilibria  and  analyze  asymmetric  equilibria  in  Section  V.  We  conclude  in  Section 
VI. 


II.  THE  MODEL  AND  EQUILIBRIUM 

In  this  section,  we  develop  a  model  designed  to  provide  insight  into  how  earnings  manage¬ 
ment  affects  product  market  competition  and  how  competition  affects  earnings  management  de¬ 
cisions.  To  address  these  questions,  we  need  to  fully  capture  the  idea  that  each  firm  understands 
that  its  rival  will  try  to  extract  an  information  advantage  from  the  firm’s  financial  reports  while 


There  is  also  an  important  literature  on  product  market  competition  as  a  source  of  the  cost  of  voluntary  disclosures.  This 
literature  frequently  focuses  on  the  effects  on  a  rival’s  entry  decision  from  an  incumbent  firm’s  voluntary  disclosure, 
thus  endogenizing  the  cost  of  voluntary  disclosure.  The  foundational  studies  in  this  literature  include  Darrough  and 
Stoughton  (1990),  Wagenhofer  (1990),  and  Feltham  and  Xie  (1992). 

In  a  recent  study,  Sadka  (2004)  explores  how  disclosure  of  cost  information,  especially  by  the  low-cost  producer,  can 
affect  a  firm’s  competitive  advantage  in  a  product  market  and  exploits  this  model  to  show  that  the  key  results  in 
^  Darrough  (1993)  are  reinforced  by  the  existence  of  his  endogenized  costs  of  disclosure. 

See  Christensen  and  Feltham  (2002)  for  a  nice  summary  of  this  literature.  Interestingly,  Arya  et  al.  (2010)  show  that  if 
firms  compete  in  multiple  markets,  then  the  unraveling  result  may  fail.  In  particular,  they  show  that  there  can  be  a  partial 
pooling  equilibrium  in  which  firm  types  that  have  high  costs  in  one  market  and  low  costs  in  the  second  pool  with  firm 
types  with  low  costs  in  the  first  and  high  costs  in  the  second. 

The  issue  of  whether  the  firms  precommit  to  disclose  does  not  arise  substantively  in  our  model  because  firms  can  select 
a  disclosure  that  is  independent  of  their  private  information.  Thus,  even  if  firms  precommit  to  disclose,  they  have  an 
available  strategy  that  allows  them  to  reveal  none  of  their  private  information. 
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simultaneously  understanding  that  its  rival  realizes  that  the  firm  is  trying  to  do  the  same  thing — 
extract  an  information  advantage  from  the  rival’s  financial  reports.  Thus,  a  key  feature  is  that  not 
only  is  there  an  information  extraction  problem,  but  there  is  also  an  important  component  of 
rivalry  in  determining  how  each  firm  manages  its  earnings  (in  addition  to  the  rivalry  associated 
with  the  firms’  competition  in  the  product  market). 

To  adequately  capture  these  forces,  both  firms  must  be  privately  informed  about  their  own 
costs  and  these  costs  must  be  correlated.  Both  firms  having  private  information  is  required  in  order 
to  analyze  the  competitive  component  of  their  earnings  management  decisions.  If  there  is  only  one 
privately  informed  firm,  then  it  is  the  only  firm  that  can  engage  in  earnings  management  designed 
to  affect  competition  in  the  product  market  and  so  competition  to  manage  earnings  would  be 
eliminated  by  assumption.  The  reason  the  firms’  costs  need  to  be  correlated  is  somewhat  more 
subtle,  but  can  be  most  clearly  seen  by  considering  the  impact  of  assuming  the  opposite — that  their 
costs  are  not  correlated.  If  this  is  case,  then  a  firm’s  private  information  is  not  helpful  in  estimating 
its  rival’s  costs,  and  thus  the  firm’s  financial  reports  only  provide  the  rival  with  information  about 
the  firm’s  costs  of  production.  Consequently,  the  firm’s  report  can  only  impact  competition  in  the 
product  market — there  is  no  competitive  effect  in  the  firms’  earnings  management  decisions. 
However,  when  the  firms’  costs  are  correlated,  the  information  one  firm  extracts  about  its  rival  s 
costs  not  only  informs  it  about  those  costs,  but  also  provides  additional  information  about  the 
rival’s  ability  to  extract  information  from  the  firm’s  own  report. 

To  address  our  research  questions,  we  examine  a  two-stage  game  that  captures  the  interactions 
between  firm  disclosures,  earnings  management  decisions,  and  competitive  pressures  in  the  prod¬ 
uct  market.  In  the  first  stage,  each  of  two  firms  chooses  a  disclosure  without  knowing  its  rival’s 
choice  and  incurs  a  disclosure  cost.  Each  firm  makes  this  disclosure  knowing  both  its  own  dis¬ 
closure  and  production  costs.  In  the  second  stage,  the  firms  observe  both  disclosures  from  the  first 
stage  and  compete  in  the  product  market  by  choosing  an  amount  of  the  homogeneous  product  to 
offer  for  sale  without  observing  the  output  choice  of  its  rival.  We  simplify  the  analysis  by  assum¬ 
ing  that  each  firm  has  constant  marginal  costs  of  production  and  fixed  costs  are  zero.  Together, 
these  imply  that  the  firm’s  average  costs  of  production  equal  its  marginal  costs. 

In  the  disclosure  stage,  each  firm  chooses  a  disclosure  that  we  interpret  as  a  disclosure  about 
its  average  cost  of  production.  Our  objective  is  to  capture  the  idea  that  a  firm  s  financial  statements 
(e.g.,  an  income  statement)  can  be  used  to  infer  the  firm’s  reported  marginaEaverage  costs.  Fur¬ 
ther,  since  firms  can  make  reporting  decisions  that  either  enhance  or  diminish  reported  earnings 
(both  within  the  discretion  afforded  by  GAAP  or  through  fraud),  we  do  not  require  truthful 
disclosure.  Instead,  we  assume  that  a  firm  that  does  not  disclose  truthfully  incurs  a  disclosure  cost 
h^{si,Ci),  i  =  1,2,  where  Si  is  firm  /’s  disclosure  and  C;  is  its  cost  of  production.  To  keep  the 
analysis  tractable,  we  assume  that  h^isi ,  c,)  =  ^,(5,  -  c,)  -1-  ( 1  /  2)e,(5i  -  c,)^,  with  e,-  >  0.  Notice 
that  with  this  specification,  firms  that  disclose  truthfully  incur  no  disclosure  costs,  but  firms  that  do 
not  incur  two  types  of  costs.  The  first  type  (represented  by  the  second  term)  imposes  a  quadratic 
cost  of  misreporting  c— if  the  firm  misreports  (si  A  c^),  it  incurs  a  cost  that  is  proportional  to  the 


As  an  example  consider  the  case  when  each  firms’  costs  of  production  are  made  up  of  two  components,  a  common  and 
an  idiosyncratic  component.  If  the  firms’  costs  are  not  correlated,  then  the  common  component  must  be  common 
knowledge  and  all  each  firm  needs  to  do  is  extract  its  rival’s  idiosyncratic  component  from  its  rival’s  financial  report. 
However  when  the  common  component  is  not  known  with  certainty,  then  the  firm  itself  will  be  unsure  of  the  split 
between  the  common  and  its  idiosyncratic  components  and  will  use  its  rival’s  financial  report  to  help  it  understand  its 
own  split.  This  will,  in  turn,  impact  its  inference  about  its  rival’s  production  costs  and  the  rival  will  need  to  factor  this 
effect  in  when  determining  whether  (or  how  much)  to  manage  its  earnings. 

"  Throughout,  we  assume  that  these  costs  are  not  prohibitively  large. 
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squared  difference  between  its  disclosure  and  its  actual  marginal  cost  of  production.  As  a  result, 
it  is  symmetric  in  deviations  from  disclosing  truthfully.  The  second  type  of  disclosure  cost  (rep¬ 
resented  by  the  first  term)  is  not  symmetric  and  depends  on  the  sign  of  A:,-.  Intuitively,  if  ki  >  0, 
then  the  firm’s  disclosure  costs  increase  when  its  disclosure  exceeds  its  actual  marginal  cost  of 
production  and  may  represent  an  added  disadvantage  from  the  market  inferring  that  it  is  less 
competitive.  Similarly,  if  /c,  <  0,  then  the  firm’s  disclosure  costs  increase  when  its  disclosure  is 
smaller  than  its  actual  marginal  costs  and  may  represent  an  added  disadvantage  associated  with  a 
(perceived)  weakened  position  in  bargaining  with  employees  or  in  dealing  with  regulators  (Watts 
and  Zimmerman  1986,  1990). 

In  the  production  stage  of  the  game,  the  firms  observe  the  first-stage  disclosures  (^i ,  ^2)^  draw 
inferences  from  these  disclosures  optimally  and  compete  in  the  product  market.  We  model  the 
product  market  using  a  fairly  standard  incomplete  information,  Cournot  duopoly  model  where 
firms  know  their  own  costs  of  production  but  do  not  know  their  rival’s.  More  specifically,  we 
assume  that  market  demand  for  the  homogeneous  product  sold  by  the  two  firms  is  P  =  a-qi 
-  qj  and  units  of  output  are  normalized  so  that  the  slope  coefficient  is  1 .  All  information  about 
market  demand  is  common  knowledge. 

An  important  component  of  our  model  is  the  information  structure.  We  assume  that  firm  i 
knows  its  own  disclosure  costs  (k,)  and  its  costs  of  production  (c,)  but  not  its  rival’s  disclosure 
costs  {kj)  or  its  rival’s  costs  of  production  (cj).  All  other  parameters  are  common  knowledge. 
Thus,  each  firm  has  two  pieces  of  private  information,  and  c„  and  does  not  know  kj  and  Cj}^  We 
assume  that  each  firm’s  priors  are  that  its  rival’s  disclosure  costs  are  independent  and  normally 
distributed  and  that  its  rival’s  marginal  costs  of  production  are  also  normally  distributed.  A  key 
feature  of  our  model  is  that  the  covariance  between  the  two  firms’  marginal  costs  is  non-negative. 
Thus,  each  firm  can  learn  about  its  rival’s  marginal  costs  when  it  learns  its  own  marginal  costs. 
More  formally,  and  ^2  are  independent,  normally  distributed  random  variables  with  zero  means 
and  variances  Further,  c,-  ~  ME[c,] ,  cr,-,)  for  i  =  1,2  and  Cov[ci ,  C2]  =  —  0- 

A  key  difference  between  our  model  and  most  others  in  the  literature  is  that  each  firm  has  two 
sources  of  private  information:  its  disclosure  costs  and  its  production  costs. Thus,  firm  /’s 
disclosure  in  the  first  stage  is  likely  to  depend  on  both  components  of  its  private  information  and, 
therefore,  may  provide  its  rival  with  information  about  c,.  If  so,  then  the  disclosure  will  affect  how 
the  firms  compete  in  the  product  market. 

As  is  standard,  we  focus  on  linear  equilibria.  In  the  first  stage  of  the  game,  each  firm  knows 
its  own  disclosure  and  production  costs.  That  is,  firm  /’s  first  period  information  set  is  y| 
=  (c,- ,  k,),  so  we  conjecture  that  each  firm  uses  a  strategy  that  is  linear  in  the  elements  of  its  first 
stage  information  set.  In  the  second  stage,  each  firm  has  observed  the  disclosures  made  in  the  first 
stage  and,  as  a  result,  firm  /’s  information  set  is  y,-  =  (c,- ,  5, ,  Sj).  While  firm  f’s  own  disclosure  does 
not  provide  it  with  additional  information,  we  include  5,  because  its  rival  uses  5,  to  infer  c,;  we  do 


Quadratic  cost  functions  have  become  relatively  common  in  the  disclosure  literature  because  they  support  linear  equi¬ 
libria  in  Coumot-like  models.  See,  for  example,  Crawford  and  Sobel  (1982),  Fischer  and  Stocken  (2001),  and 
Chakraborty  and  Harbaugh  (2007)  who  employ  a  quadratic  cost  function  in  their  models  of  cheap  talk,  as  well  as  the 
models  in  Stocken  and  Verrecchia  (2004),  Fischer  and  Stocken  (2004),  and  Guttman  et  al.  (2006). 

Prior  work  using  a  Cournot  model  with  incomplete  information  about  costs  include  Fried  (1984),  Shapiro  (1986), 
Gal-Or  (1986,  1988),  Darrough  (1993),  and  Vives  (2002).  Also,  see  the  survey  by  Verrecchia  (2001). 

Formally,  we  are  assuming  that  each  firm’s  type  is  two-dimensional,  t,  =  (k,- ,  c,)  G  7^^ 

The  main  exceptions  are  Fischer  and  Verrecchia  (2000)  and  studies  that  build  on  their  model  such  as  Har  et  al.  (2009). 
In  these  models,  incomplete  contracting  limits  the  market’s  ability  to  understand  how  managerial  compensation  is  linked 
to  stock  price.  Since  the  manager  does  know  how  his/her  compensation  is  linked  to  stock  price  and  has  private 
information  about  the  firm  s  future  performance,  the  manager  has  two  sources  of  private  information  in  Fischer  and 
Verrecchia-type  models,  although  the  sources  differ  from  those  studied  here. 
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not  include  ki  because  its  rival  does  not  observe  it  and  it  is  independent  of  all  other  unknowns. 
Thus,  firm  i  anticipates  that  qj  will  depend  on  5;  and  therefore  conditions  its  own  output  choice  on 
5;.  Neither  firm’s  output  choice  depends  on  k,  because  neither  uses  its  own  disclosure  to  infer  its 
own  private  information.  Consequently,  we  conjecture  that  in  equilibrium; 

5  j  —  Z9q  +  jC  j  +  Z)2^i 

•^2  =  ^0  +  ^1^2  +  P  2^2 


-  A^O  +  +  •^2‘^1  +  ^3'^2 


q2  —  M Q  +  M 1C2  1^2'^ I ^3^2'  (Cl) 

To  solve  for  a  perfect  Bayes  equilibrium  of  the  game,  we  begin  by  analyzing  the  second-stage 
game  in  which  firms  compete  in  the  product  market  given  their  prior  disclosures  and  their  infer¬ 
ences  about  their  rival’s  private  information.  Given  our  normality  assumptions  and  our  focus  on 
linear  equilibria,  we  make  the  additional  conjecture  that:^^ 

E[ci|y2]  =  «0  +  “1^2  +  «2^1 


E[c2lyi]  =  ySo  + Aci  +  /32-y2- 

Given  these  conjectures,  in  the  second  stage,  firm  i  solves: 

max  E[(a  -  Q)qi  -  J  =  max  -  ^[qj\yi]qi  -  c^qi.  (1) 

qi  ‘li 

Note  that  the  first-stage  disclosure  costs  are  ignored  because,  in  the  second  stage,  they  are  a  sunk 
cost  and  thus  do  not  affect  the  optimal  output  decision.  The  first-order  condition  for  this  maximi¬ 
zation  problem  is  0  =  a  -  Iqi  -  E[^;  |  yj  -  c„  which  forms  the  basis  for  Proposition  1:^^ 

Proposition  1:  If  (Cl)  and  (C2)  hold,  then  there  exists  a  linear  equilibrium  {q^,q2)  to  i\\& 
production  game  with; 

<?!  +  +N2S\  +NzS2 

qj  =  Mq  +  M1C2  +  +  ^3^2,  ’ 


where: 


yVo  =  (l/3) 


(2-/3|)q;o  ^ 
4-  ai^i 


2(2-ai);3o\ 

d-criA  / 


Mo  =  (1/3) 


(2-q;i)^o  ^ 


2(2-/3i)a;o\ 
A-axPi  ! 


Nx  =  - 


2-A 

A-axPx 


M,  =  - 


2-  ax 

A-axPx 


A/2  =  -(l/3)a2 


2-A  \ 
A-axPxl 


M2  =  (2/3)  0-2 


2-A  \ 

A-ax/3x/ 


In  a  linear  equilibrium,  we  will  have  to  compute  all  of  the  conjecmred  coefficients  in  Equations  (Cl)  and  (C2). 
The  second-order  condition  is  satisfied  since  the  coefficient  on  is  negative. 
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^3  =  (2/3)A 


2-0^1  \ 
4-aiA/ 


M,  =  - {1/3)132 


2-ai  \ 

4-a,A/ 


There  are  two  important  features  of  this  linear  equilibrium  that  we  should  highlight.  First, 
each  firm’s  equilibrium  production  strategy  depends  on  its  rival’s  disclosure,  its  actual  production 
costs  and  its  own  disclosure.  The  (potentially)  unintuitive  feature  is  that  the  firm’s  production 
strategy  depends  on  its  own  disclosure  despite  the  fact  that  the  firm  knows  its  own  cost  of 
production  and  uses  it  rather  than  its  own  disclosure  to  estimate  its  rival’s  marginal  cost  of 
production.  As  suggested  above,  the  reason  for  this  dependence  is  that  firm  i  knows  that  its  rival 
is  using  i’s  disclosure  to  make  inferences  about  i’s  production  costs  (see  Equations  (C2))  and  thus 
i  can  infer  that  its  rival’s  production  strategy  will  depend  on  5,-.  Since  firm  /’s  production  strategy 
depends  on  its  inference  about  firm  j’s  production  decision  that  firm  i  knows  depends  on  Sj,  firm 
i’s  production  strategy  “indirectly”  depends  on  its  own  disclosure  because  of  the  information  its 
rival  can  extract  from  that  disclosure.  Second,  the  parameters  describing  the  equilibrium  produc¬ 
tion  strategies  (the  M’s  and  N’s)  depend  on  the  coefficients  in  the  conditional  expectations  de¬ 
scribed  by  (C2).  Intuitively,  both  firms  use  all  of  their  private  information  and  their  public  disclo¬ 
sures  from  the  disclosure  stage  to  infer  as  much  as  they  possibly  can  about  their  rival’s  production 
costs. 

Proposition  1  can  also  be  used  to  derive  each  firm’s  equilibrium  expected  second  stage  profits. 
Substituting  q-  into  Equation  (1)  shows  that,  as  is  usual  in  Cournot  games,  firm  i’s  equilibrium 
expected  second  stage  profits  are: 


ELnf  |y,]  =  (a  -  q  -  q*  -  E[q*\yi])q*  =  {l/2){q*)^. 

It  will  also  be  useful  to  have  simplified  expressions  for  E[qj  \  y,].  This  is  most  readily  obtained 
from  Proposition  1  by  taking  expectations: 

E[^2bi]  =  4^0  +  M]E[^C'^yi\  +  M2S\  -l- M352 


E[^i  1^2]  -Nq  +  -^iE[ci|y2]  +  A^2‘yi  +  ^3‘^2- 

Substituting  for  E[c,- 1  yj]  from  (C2)  shows  that  each  equation  has  the  form  implied  by  (Cl): 
E[^2l>’l]  +  ^1^1  +  ^2^1  +  A352  (2) 

E[^l  |y2]  = -Sq  +  +  ^3‘^2-  (3) 

Folding  back  to  the  disclosure  stage,  each  firm  chooses  a  disclosure,  5,-,  which  we  interpret  as 
a  disclosure  about  the  firm’s  average  (equivalently,  in  our  model,  marginal)  cost  of  production. 
Recall  that  our  interpretation  is  that  the  firm’s  income  statement  can  be  used  to  infer  the  firm’s 
reported  average  costs.  Given  our  distributional  and  information  assumptions,  in  the  first  stage  of 
the  game,  firm  i  chooses  5,-  to  maximize  its  expected  profits  in  the  two-stage  game  overall:^* 


18 

The  maximand  also  highlights  the  importance  and  implications  of  our  assuming  quadratic  costs  of  disclosure.  In 
particular,  if  S;  —  0  so  that  disclosure  costs  are  linear,  then  we  will  only  find  comer  solutions  to  the  firm’s  maximization 
problem.  If,  on  the  other  hand,  ei  0,  then  disclosure  costs  are  quadratic,  the  firm’s  maximization  problem  has  an 
interior  solution,  and  we  can  identify  the  linear  equilibrium  we  seek.  Further,  because  is  the  coefficient  on  the  linear 
term  of  the  disclosure  costs,  its  value  only  affects  the  intercept  of  the  firm’s  reaction  function  and,  thus,  the  particular 
value  it  takes  on  does  not  impact  our  ability  to  find  interior  solutions  to  the  firm’s  maximization  problem. 
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max(l/2)E[(^*)21};;  ]  -  kiis^  -  c,)  -  e,(s/  -  c,)^. 
■Si 


Substituting  equilibrium  outputs  from  Proposition  1,  and  maximizing  with  respect  to  Si  (^2)  yields: 


= 


£1  —  Ao 


(- A2(a  -  Aq)  +  (fii  +  A2(1  +  Ai))ci  -  ki  -  A2A3E[s2ki,A:i]) 


(4) 


^2  = 


1 

S2-B\ 


(—  —  Bq)  +  (£2  +  53(1  +  Bi))c2  —  ^2  “  ^2-®3E['5'i|c2,^2])- 


(5) 


Since  and  ^2  ^re  not  correlated  with  each  other  or  the  firms’  marginal  costs  of  production, 

E[sj  I  ki ,  c,-]  =  E[sj  I  c,].  Equations  (4)  and  (5)  suggest  that  5,-  is  linear  in  c, ,  ki,  as  long  as  the 
conditional  expectations  E[s,  I  c,]  are  linear  in  those  variables,  too.  Proposition  2  provides  condi- 
tions  under  which  the  5,  s  are,  in  fact,  linear. 

Proposition  2:  In  every  linear  equilibrium,  ,  ^2)  defined  as  in  Proposition  1  and 
,  ^2)  satisfy: 

■^1  ~  B>q  +  D\Ci  +  D2k\ 


S2  =  Fq  +  F1C2  +  F2k2, 

where: 

E[c2lci]  =  /Jio  +  ptiCx  E[ci|c2]=  <%+  ^1<^2 


Dn  =  — -  ^0  +  ^3^0  +  ^3Ei/U-o)]  Fq  =  ;;5[-  B^ia  -Bq  +  B2DQ  -  ^2^1  ^0)] 


r  -A^' 
Cl  /I2 


^2~B\ 


Dx  =  — ^^[A2(1  +Ai)  -A2A3F1/21  +  £1]  Fi  =  eBx)-B2B2,Dx8x  +  £2] 


£1  —  An 


S2~B\ 


Dn  =  - 


,  £1  -  A 


F2  =  - 


1  “^2^ 


F2  “  ^3 


We  also  observe  that  in  any  linear  equilibrium  as  described  in  Proposition  2: 

E[c,k.,.J  =  E[cJ  +  (  . 

\  0-1 1  ( 0-22^1  +  iTk  kF2)  -<^2^1/ 


+ 


,  O'!  1  (<^22^1  +  (^k2k2^2)  ~  ^2^1 


|(52-Fo-FiE[c2]) 


E[c,k„.,]  =  E[.,]  +  ( ^  j (C2 -  Efo]) 


(^11^22^1  -  (^2^1 


(5i  -Do-F>iE[ci]), 


\  (o'n-Oi  +  CrA:jit,E^2)‘^22  <^2^1, 

and  a  linear  equilibrium  requires  that  these  coefficients  match  the  a  s  and  /3  s  in  Equations  (C2). 
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Unfortunately,  there  is  no  general  result  on  the  existence  of  a  perfect  Bayes  equilibrium  in  this 
type  of  linear- normal  model. Further,  finding  a  linear  equilibrium  in  closed  form  for  our  model 
is  very  difficult  because  it  requires  solving  a  system  of  24  nonlinear  equations  for  the  unknown 
coefficients.  Both  numerical  solutions  and  closed-form  solutions  for  restricted  versions  that  elimi¬ 
nate  some  of  the  key  forces  in  the  model  are  analyzed  below.  However,  before  examining  the 
properties  of  the  solutions,  we  explain  the  importance  of  our  assumption  that  there  are  two  sources 
of  private  information. 

The  easiest  way  to  see  the  importance  of  assuming  that  there  are  two  sources  of  private 
information  rather  than  just  one  is  to  consider  the  effect  of  assuming  that  both  firms  know  the 
realizations  of  and  ^2  (equivalently  that  =  0)-  ^o,  then  each  firm  has  one  source 

of  private  information:  their  own  costs  of  production.  As  a  result,  any  disclosure  strategy  that  is 
monotone  in  the  firm’s  private  information  can  be  inverted,  which  allows  all  to  infer  the  firm’s 
private  information  exactly  and  we  are  able  to  prove  the  following  result: 

Proposition  3:  If  kj  and  /c2  are  common  knowledge,  then  there  are  no  monotone  equilibria  in 
pure  strategies. 

The  key  implication  of  assuming  that  kj  and  ^2  are  common  knowledge  is  that  neither  firm 
can  employ  a  monotone  disclosure  strategy  without  revealing  its  actual  production  costs.  That  is, 
if  the  conjectured  disclosures  are  monotone  in  the  firm’s  private  information,  even  if  biased,  then 
they  allow  each  firm  to  infer  the  other’s  costs  and  thus  be  fully  informed  when  they  compete  in  the 
product  market.  As  a  result,  if  a  firm  deviates  from  the  conjectured  disclosure  strategy  and  biases 
its  reported  costs  downward,  then  the  rival  would  infer  that  the  deviating  firm’s  costs  are  lower 
and  accommodate  the  firm  by  producing  less.  This  allows  the  deviating  firm  to  produce  more  and 
shift  product  market  profits  from  its  rival  to  itself. 

In  the  following  section,  we  examine  restricted  versions  of  our  model  that  allow  for  closed- 
form  solutions  at  the  cost  of  eliminating  key  components  of  the  information  extraction  problem 
and  how  rivalry  impacts  the  firms’  decisions  to  bias  their  reported  earnings.  In  Section  IV,  we 
examine  numerical  solutions  of  the  full  model  and  highlight  both  of  these  features. 

III.  RESTRICTED  VERSIONS  OF  THE  MODEL 

In  this  section,  we  examine  two  special  cases  of  the  model  described  above  that  simplify  the 
computation  of  a  linear  equilibrium  at  the  cost  of  eliminating  important  forces  in  the  model. 

We  begin  by  restricting  the  analysis  to  the  case  when  the  firms’  private  information  is  uncor¬ 
related.  As  we  noted  in  the  previous  section,  this  restriction  eliminates  a  key  force  in  our  model — 
the  impact  of  learning  about  what  the  rival  can  learn  from  the  firm’s  own  financial  reports.  This 
assumption,  that  0-12  =  0,  implies  that,  in  Proposition  2,  a,  and  fiy  are  both  zero.  While  it  is  true 
that  eliminating  this  dimension  of  each  firm  learning  about  the  information  its  rival  can  extract 
from  its  financial  reports  does  simplify  the  system  of  equations  mentioned  above,  it  does  not 
simplify  them  enough  to  offer  useful  closed-form  solutions. 

To  obtain  a  manageable  linear  equilibrium,  we  introduce  a  second  restriction:  only  one  firm 
has  private  information.  The  cost  of  this  restriction  is  that  it  eliminates  the  impact  of  rivalry  on  the 


With  minor  modifications  to  our  distributional  assumptions,  however,  the  existence  of  a  trembling  hand  perfect  Nash 
equilibrium  ensures  that  there  is  a  perfect  Bayes  equilibrium  (see  Fudenberg  and  Tirole  1993).  In  particular,  if  each 
firm  s  private  information  is  drawn  from  a  finite  set  (so  that  the  set  of  types  in  our  game  is  finite),  standard  existence 
theorems  ensure  that  there  is  a  trembling  hand  perfect  Nash  equilibrium  and  therefore  a  perfect  Bayes  equilibrium  in  the 
modified  game. 

20  '  ^  ' 

Algebraic  manipulation,  available  from  the  authors,  produces  a  quadratic  equation  with  one  real  root.  Unfortunately,  the 
root  is  a  very  complex  function  of  the  exogenous  variables  and  requires  that  we  again  revert  to  numerical  solutions. 
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earnings  management  decision  as  well  as  a  key  component  of  each  firm’s  inference  problem. 
Without  loss  of  generality,  assume  that  C2  is  common  knowledge  and  since  firm  2’s  costs  are 
known,  set  ^2  =  0- 

Corollary:  If  C2  is  common  knowledge,  /c2  =  0  and  cr[2  =  0,  then  there  is  a  linear  equilibrium 
{(^1  ,  (^1 ,  ^2)) ,  (^2  .  .  “^2))}  defined  by: 

Qx  =  (1/6)(2((3  +  /3o)  -  3ci  -  5]  +  2/32S2) 

q*2  =  (l/6)(2(a  -  (l/2)^o)  -  3c2  +  2^1  -  (32S2) 

)[2fl  -  (l/2)y8o  +  (3  +  sx)cx  +  (l/2)(3  +  ^2)02  -  9/ci] 

9si  -  1  / 

^2  =  C2, 

with: 

E[Cx\c2,Sx]  =  /3q  +  (32Si. 


E[c2\cx,S2]  =  C2 


y8o  = 


(3  +  ei)fe\E[  ] 
981-1  I 


(  ^  y 

(9ei  -  1)(3  +  8i)crii 

\9sx-ll 

_(3  +  8i)Vn  +  81(r^^. 

Despite  the  significant  restrictions  that  eliminate  many  key  forces,  the  equilibrium  described 
in  the  Corollary  does  have  a  couple  of  interesting  features.  First,  because  firm  2’s  costs  are 
common  knowledge,  only  firm  1  has  the  opportunity  to  impact  product  market  competition  by  its 
disclosure  policy — and  it  chooses  to  do  so  in  equilibrium.  The  reason  it  chooses  to  do  so  is  that 
firm  2  cannot  “look  through”  firm  I’s  disclosure  and  correctly  infer  firm  I’s  costs  of  production. 
The  two  sources  of  private  information  mean  that  firm  2  optimally  attributes  part  of  the  difference 
between  firm  I’s  reported  cost  (5*)  and  its  expected  cost  (E[ci])  to  the  realized  value  of  k^  and  the 
rest  to  firm  I’s  cost  of  production  (cj.  Thus,  firm  1  has  an  incentive  to  underreport  its  costs  (when 
ki  =  0)  and,  despite  firm  2  optimally  updating  its  expectation  of  firm  I’s  costs,  firm  2  is  left  with 
the  belief  that  firm  I’s  costs  are  lower  than  they  actually  are.  Firm  2,  therefore,  optimally  responds 
to  its  belief  about  firm  1  ’s  costs  by  producing  fewer  units  than  it  would  if  it  knew  firm  1  ’s  costs. 
Firm  1  anticipates  this  reaction  to  its  disclosure  and  produces  more  than  it  would  had  firm  2 
known  firm  I’s  costs.  The  end  result  is  that  profits  are  shifted  from  firm  2  to  firm  1  providing  the 
motivation  for  firm  1  to  bias  its  disclosure.  Put  differently,  competition  in  the  product  market 
induces  firm  1  to  bias  its  report  and  the  biased  report  alters  the  competitive  outcome  in  the  product 
market. 

A  disadvantage  of  this  restricted  model  is  that  there  are  few  interesting  comparative  statics.  As 
one  would  expect,  as  firm  I’s  cost  of  providing  a  misleading  disclosure  increases,  it  offers  a  less 
biased  report  and  earns  smaller  profits.  For  essentially  the  same  reasons,  if  the  size  of  the  product 
market  is  smaller  (a  declines),  then  firm  1  offers  a  less  biased  report  and  earns  smaller  profits.  In 
the  first  case,  the  cost  of  biasing  its  disclosure  increases  and  so  it  does  less.  In  the  latter  case,  the 
benefits  of  biasing  its  disclosure  decrease  and  so  it  does  less. 
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Interestingly,  despite  the  limitations  of  this  version  of  the  model,  it  still  contributes  to  the 
existing  literature  on  biased  reporting.  For  example,  Fischer  and  Verrecchia  (2004)  study  trans¬ 
parent  disclosure  bias  in  a  Cournot  oligopoly  model  with  firms  that  employ  heuristics  to  interpret 
new  information.  In  their  model,  some  firms  over-react  to  information  and,  in  equilibrium,  the 
informed  firm  biases  its  disclosure.  In  essence,  firms  that  employ  heuristics  “believe”  their 
disclosure/interpretation  and  produce  more  output  than  they  would  if  they  were  Bayesian.  Baye¬ 
sian  firms  realize  this  and  accommodate  the  added  output,  thereby  producing  a  benefit  to  the 
heuristic  firms.  Our  restricted  model  complements  Fischer  and  Verrecchia’s  (2004)  analysis  by 
showing  that  biased  reporting  arises  even  when  every  firm  optimally  employs  all  available  infor¬ 
mation  (every  firm  is  Bayesian)  but  still  produces  a  similar  shifting  of  product  market  profits.  In 
contrast  to  Fischer  and  Verrecchia  (2004),  the  bias  is  not  fully  transparent  and  so  our  analysis 
suggests  that  fully  transparent  biased  reporting  may  require  the  non-optimal  use  of  available 
information  (heuristics). 

Our  model  also  complements  Fischer  and  Verrecchia’s  (2000)  analysis  of  reporting  bias.  In 
their  model,  a  manager  is  permitted  to  offer  biased  earnings  reports.  They  show  that,  when 
investors  are  uncertain  about  how  the  manager’s  compensation  depends  on  stock  price,  in  equi¬ 
librium,  the  manager  offers  a  biased  voluntary  disclosure  that  is  not  transparent.  In  their  model,  the 
market’s  uncertainty  about  the  manager’s  payoff  is  required  for  equilibrium  to  exhibit  biased 
reports.  Our  model  complements  their  analysis  by  showing  that  competition  in  the  product  market 
can  substitute  for  agency  problems  combined  with  incomplete  contracting  as  an  explanation  for 
biased  reporting. 

Our  results  for  the  restricted  model  differ  markedly  from  those  reported  in  the  next  section 
when  both  key  features  of  our  model — that  both  firms  face  an  information-extraction  problem  and 
that  there  is  an  important  component  of  rivalry  in  determining  how  each  firm  manages  its  earnings 
(in  addition  to  the  rivalry  associated  with  the  firms’  competition  in  the  product  market)— are 
present.  In  the  full  model,  both  firms  bias  their  reports  and  this  allows  both  to  earn  greater  profits 
than  they  would  in  the  full-information  environment.  Further,  the  restrictions  do  not  allow  us  to 
explore  how  earnings  management  is  affected  by  how  much  information  a  firm  can  infer  about  its 
rival  or  how  asymmetries  in  the  costs  of  misleading  disclosures  impacts  equilibrium  outcomes. 

IV.  SYMMETRIC  EQUILIBRIA 

In  this  section,  we  focus  on  symmetric  equilibria,  and  in  the  following  section,  we  explore  the 
nature  of  asymmetric  equilibria.  In  a  symmetric  equilibrium,  both  firms  employ  the  same  strate¬ 
gies  (the  same  map  from  their  private  information  to  their  actions).  Thus,  equilibrium  disclosures 
are  determined  by  the  same  function  of  the  firm’s  private  information,  and  equilibrium  production 
decisions  are  determined  by  the  same  function  of  each  firm’s  information  set  in  the  second  stage. 
That  is,  symmetry  requires  that  A  =  F,  and  a,  =  /3,  for  i  =  1,2,3;  V,  =  M,  for  i  =  1,2,3, 4  and  .u, 
=  Si  for  i  =  1,2.  Further,  we  must  assume  that  both  firms  are  in  the  same  “competitive  position:” 
ki  and  ^2  must  be  drawn  from  the  same  distribution,  Ci  and  C2  must  be  drawn  from  the  same 
distribution  and  the  firms  must  incur  the  same  disclosure  costs,  8,  =  £2  =  e.  Finally,  without  loss 
of  generality,  we  assume  that  all  of  the  normally  distributed  random  variables  are  standard  normal 
with  zero  means  and  variances  equal  to  1.  Given  this  structure,  we  have: 

Proposition  4:  If  s  is  not  too  small,  then  there  is  a  symmetric  linear  perfect  Bayes  equilib¬ 
rium  in  pure  strategies. 

Intuitively,  if  the  cost  of  misreporting  is  sufficiently  small,  there  are  no  linear  equilibria 
because  the  incentive  to  misreport  is  too  great  and  cannot  depend  linearly  on  the  firm’s  private 
information.  However,  if  the  cost  of  misreporting  is  larger,  then  there  is  a  linear  perfect  Bayes 
equilibrium  in  which  each  firm  provides  a  disclosure.  The  disclosure  allows  the  rival  to  update  its 
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beliefs  about  the  disclosing  firm’s  production  costs  but  does  not  allow  it  to  infer  exactly  what  those 
costs  are  (i.e.,  the  biased  reports  are  not  transparent).  The  update  does,  however,  affect  how  the 
rival  competes  with  the  disclosing  firm  in  the  product  market. 

To  better  understand  the  properties  of  the  disclosure  and  its  effects  on  product  market  com¬ 
petition,  we  will  generally  do  numerical  comparative  static  analysis  because  of  the  complexity  of 
the  equations  defining  the  symmetric  equilibrium.  However,  comparative  static  results  on  the  cost 
of  misreporting  can  be  done  analytically.  In  particular,  as  the  cost  of  misreporting  increases,  the 
amount  of  misreporting  declines,  each  firm’s  estimate  of  its  rival’s  cost  of  production  becomes 
more  accurate,  and  each  firm’s  output  approaches  the  output  it  would  make  in  a  full-information 
environment.^' 

Intuitively,  this  comparative  static  tells  us  that,  as  the  cost  of  misreporting  gets  large,  equi¬ 
librium  in  the  product  market  converges  to  the  complete  information  solution.  The  reason  is  that, 
as  the  cost  of  misreporting  gets  large,  the  firms  do  less  which  makes  their  disclosures  more 
informative.  Thus,  the  uncertainty  about  the  rival’s  cost  dissipates  and  competition  in  the  product 
market  approaches  the  full-information  outcome.  Interestingly,  this  result  does  not  require  that 
firms  precommit  to  a  disclosure  or  be  required  to  disclose  truthfully.  Instead,  as  the  cost  of 
misreporting  rises,  firm  behavior  converges  to  the  full-information  solution  where  each  truthfully 
discloses  its  private  information  about  its  production  costs. 

To  clarify  how  equilibrium  outcomes  vary  with  the  cost  of  misreporting,  the  correlation 
between  the  firm’s  costs  of  production  and  the  size  of  the  product  market,  we  turn  to  numerical 
comparative  static  analyses.  Since  hi{si ,  c,)  =  k^isi  -  c,)  -i-  ( 1  /  2)g,(5;  -  c,)^,  a  positive  (negative)  k, 
produces  marginal  incentives  for  the  firm  to  report  Si<  Ci  {si>  c,).  Thus,  we  do  all  numerical 
analyses  assuming  that  the  realized  values  of  and  ^2  are  zero.  We  begin  by  analyzing  the 
equilibrium  described  above  in  more  detail  and  the  effects  of  an  increase  in  the  cost  of  misreport¬ 
ing. 

Result  1:  In  equilibrium,  each  firm’s  reported  cost  of  production  is  smaller  than  its  actual 
cost  of  production,  each  firm’s  output  is  smaller  than  the  full- information  quantity 

and  each  firm’s  estimate  of  its  rival’s  costs  is  smaller  than  its  rival’s  actual  cost  of 
22 

production. 

This  result  tells  us  that  firms  bias  their  reported  costs  downward  in  an  attempt  to  create  the 
impression  that  their  costs  are  lower  than  they  actually  are.^^  If  successful,  their  rival  will  opti¬ 
mally  reduce  the  amount  it  sells  in  the  product  market,  causing  the  price  to  be  higher  and  resulting 
in  greater  profits  for  the  misreporting  firm.  Each  firm,  in  equilibrium,  is  successful  because  neither 
can  perfectly  extract  its  rival’s  production  costs  from  reported  costs.  Each  understands  the  equi¬ 
librium  and  adjusts  its  expectation  of  the  rival’s  cost  taking  into  account  the  rival’s  incentive  to 
report  lower  costs  but,  despite  this,  the  adjustment  is  only  partial.  We  emphasize  that  neither  firm 
is  “fooled.”  Each  uses  all  available  information  (its  private  information  about  its  own  costs  and  the 


See  the  Appendix  for  a  proof  of  these  assertions. 

Further,  combining  Result  1  with  our  comparative  static  results  on  changes  in  e,  when  the  cost  of  misreporting  increases, 
all  differences  shrink.  Reported  costs  converge  to  actual  costs,  output  choices  converge  to  the  full-information  quanti¬ 
ties  and  each  firm’s  estimate  of  its  rival’s  costs  converge  to  its  rival’s  actual  costs. 

We  should  clarify  that  we  are  not  using  biased  in  the  classical  statistical  sense  but  rather,  to  use  Fischer  and  Verrecchia  s 
(2000)  term  in  the  “realized”  sense.  This  distinction  is  standard  in  the  biased  disclosure  literature  and  is  important  to 
understanding  our  results  because  both  firms  are  Bayesian,  which  implies  that  there  is  no  bias  in  the  classical  statistical 
sense.  There  is,  however,  bias  in  the  “realized”  sense  that  can  be  seen  as  follows.  Since  i,-  is  linear  in  c,  and  k,-,  it  is 
normally  distributed  and  so,  after  suppressing  conditioning  on  Cj,  we  can  write  £[0,- 1  i,]  =  E[c,]  -i-  /c(s,  -  E[5,]).  Recalling 
that  k  =  0,  substituting  for  i,  and  £[«(]  and  rearranging,  we  obtain  Xc,- -  E[c,  |  ij  = -(1  -  X.)E[Ci].  Evaluating  at  £[0,] 
=  0  and  noting  that  0  <  X  <  1,  we  have  that  c,-  >  E[c;  |  ij.  That  is,  each  firm’s  estimate  of  its  rival  s  costs  is  smaller  than 
its  rival’s  actual  cost  of  production. 
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disclosures)  to  make  the  best  possible  estimate  of  its  rival’s  production  costs.  Each  underestimates 
its  rival’s  costs  because  there  are  two  reasons  why  they  can  be  small — either  the  costs  of  produc¬ 
tion  actually  are  small;  or  the  costs  of  misreporting  are  small,  making  it  less  expensive  to 
misreport — and  the  firm  cannot  completely  sort  out  the  two  effects.^'^ 

These  results  differ  markedly  from  the  results  for  Cournot  models  in  the  prior  literature 
(Darrough  1993;  Raith  1996).  Darrough  (1993)  proves  that,  in  a  Cournot  model,  if  the  firm’s 
disclosure  is  required  to  be  a  noisy  but  unbiased  reflection  of  the  firm’s  private  information  about 
its  cost  of  production,  then  in  equilibrium,  both  firms  would  precommit  to  minimize  the  noise  in 
the  disclosure.  That  is,  they  report  their  private  information  without  bias.  Other  models  permit 
firms  to  choose  whether  to  disclose  after  learning  their  production  costs  but  require  truthful 
disclosures.  This  literature,  summarized  in  Christensen  and  Feltham  (2002),  shows  that  the  stan¬ 
dard  unraveling  result  applies — in  equilibrium,  every  firm-type  discloses  because  if  more  than  one 
type  is  pooled  together,  then  the  type  with  the  lowest  production  costs  can  increase  profits  by 
disclosing  its  costs  and  inducing  its  rivals  to  reduce  the  quantities  they  offer  for  sale.  In  our  model, 
firms  are  not  required  to  disclose  truthfully,  but  if  they  report  with  a  bias,  they  incur  a  cost 
associated  with  misreporting.  Our  analysis  shows  that  when  firms  have  the  option  to  bias  their 
disclosure  (even  at  a  cost),  they  do  so  and  that  the  effects  are  only  partially  accounted  for  by  the 
rival. 

'yf\ 

Result  1  also  highlights  the  importance  of  rivalry  in  the  earnings  management  decision.  In 
particular,  even  though  both  firms  are  fully  rational  and  optimally  use  all  available  information, 
biased  reporting  occurs  in  equilibrium  and  leads  to  greater  industry  profits.  This  differs  from  both 
the  results  in  Fischer  and  Verrecchia  (2004)  and  our  restricted  model.  In  particular,  in  Fischer  and 
Verrecchia  (2004),  profits  are  shifted  from  firms  that  use  Bayes’  Rule  to  those  using  a  particular 
heuristic  and  leads  to  lower  overall  industry  profits.  In  our  restricted  model,  profits  are  shifted 
from  the  firm  without  private  information  to  the  firm  with  private  information  (via  the  latter’s 
biased  disclosure)  and  again  leads  to  lower  industry  profits.  In  contrast  to  the  aforementioned 
models  without  rivalry  in  the  disclosure  stage,  when  there  is  rivalry,  both  firms  earn  greater  profits 
than  the  full-information  outcome  because  both  reduce  output  after  optimally  using  the  rival’s 
disclosure  to  estimate  their  rival’s  cost  of  production.  The  impact  on  firm  profits  is  described  in  the 
following  result: 

Result  2:  In  equilibrium,  misreporting  allows  both  firms  to  earn  greater  profits  in  the  product 
market  than  they  would  if  each  had  complete  information  about  its  rival’s  cost  of 
production.  Further,  the  profits  the  firms  earn  in  the  product  market  decline  as  the 
cost  of  misreporting  increases. 

Intuitively,  the  type  of  earnings  management  we  examine  allows  each  firm  to  bias  its  disclo¬ 
sure  in  order  to  affect  its  rival’s  inference  about  its  production  costs.  Result  1  indicates  that  the 


^  The  direction  of  the  preferred  bias  (to  report  production  costs  as  being  smaller  than  they  actually  are)  is  the  result  of 
assuming  that  the  firms  are  Cournot  competitors.  Had  we  assumed  that  the  firms  sold  heterogeneous  products  and  were 
Bertrand  competitors,  the  direction  of  bias  would  be  reversed.  Formally,  the  reason  for  this  difference  is  that  quantities 
are  strategic  substitutes  whereas  prices  are  strategic  complements.  Intuitively,  the  reason  is  that  under  Bertrand  compe¬ 
tition,  if  firm  i  reports  higher  costs  of  production,  it  anticipates  that  its  rival  will  increase  its  price,  allowing  i  to  increase 
its  price  thereby  increasing  i’s  profits.  The  higher  price  increases  firm  profits  because  competition  has  driven  the 
equilibrium  prices  below  their  monopoly  levels. 

Formally,  if  the  firm’s  private  information  is  t//j,  then  it  is  required  to  disclose  +  n,  where  Vj  is  a  normally 

distributed  random  variable  with  mean  0  and  variance  chosen  by  the  disclosing  firm.  If  the  firm  wishes  to  disclose  its 
private  information,  then  it  selects  a  variance  of  zero  and  if  it  wishes  not  to  disclose,  it  selects  a  variance  of  infinity.  In 
neither  case  can  the  disclosing  firm  intentionally  bias  its  disclosure. 

Recall  that  rivalry  requires  two  assumptions:  (1)  that  at  least  two  firms  have  private  information,  and  (2)  that  their 
information  is  correlated.  Thus,  the  impact  of  rivalry  in  our  model  is  seen  by  contrasting  equilibrium  outcomes  in  the 
restricted  model  (Corollary)  and  in  our  full  model. 
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firms  choose  to  misreport  costs  to  create  the  impression  that  they  are  lower  than  they  actually  are. 
As  a  result,  the  rival  infers  that  the  firm  will  produce  more  than  it  actually  does  and  responds  by 
reducing  the  quantity  it  produces.  Consequently,  earnings  management  results  in  both  firms  selling 
less  than  they  would,  had  they  known  their  rival’s  costs,  but  the  associated  price  increase  (from 
selling  less)  results  in  each  firm  earning  greater  profits  than  it  would  with  complete  information 
about  its  rival’s  costs.  Further,  as  the  cost  of  misreporting  rises,  the  magnitude  of  the  bias  in  the 
firms’  reports  declines.  Since  both  firms’  profits  in  the  product  market  are  increasing  in  the  amount 
of  earnings  management,  increases  in  the  cost  of  misreporting  lower  the  equilibrium  amount  of 
earnings  management  and  therefore  lower  the  profits  each  firm  earns  in  the  product  market. 
Further,  if  the  costs  of  misreporting  are  nonpecuniary  or  are  incurred  by  management  rather  than 
the  firm.  Result  2  implies  that  earnings  management  actually  increases  the  liquidation  value  of  the 
firm. 

The  clearest  empirical  implications  follow  from  associating  changes  in  the  cost  of  misreport¬ 
ing  with  changes  in  reporting  regulations.  For  example.  Section  302  of  the  Sarbanes-Oxley  Act  of 
2002  (SOX)  requires,  among  other  things,  that  CEOs  and  CFOs  (or  persons  performing  equivalent 
functions)  personally  certify  in  each  quarterly  and  annual  report,  including  transition  reports,  that: 

he  or  she  has  reviewed  the  report;  based  on  his  or  her  knowledge,  the  report  does  not  contain  any 
untrue  statement  of  a  material  fact  or  omit  to  state  a  material  fact  necessary  in  order  to  make  the 
statements  made,  in  light  of  the  circumstances  under  which  such  statements  were  made,  not  mis¬ 
leading  with  respect  to  the  period  covered  by  the  report;  based  on  his  or  her  knowledge,  the 
financial  statements,  and  other  financial  information  included  in  the  report,  fairly  present  in  all 

material  respects  the  financial  condition,  results  of  operations  and  cash  flows  of  the  issuer  as  of,  and 
^  27 

for,  the  periods  presented  in  the  report. 

Such  certification  increases  the  costs  of  misreporting  and  would  be  represented  by  an  increase 
in  s  in  our  model.  Thus,  the  results  on  increases  in  s  and  Result  1  combine  to  suggest  the  intuitive 
result  that  the  impact  of  Section  302  is  to  reduce  the  amount  of  misreporting  and  the  surprising 
results  that:  (1)  each  firm’s  estimate  of  its  rival’s  costs  increases  and  becomes  more  accurate,  and 
(2)  each  firm  produces  additional  output.  However,  to  the  extent  that  misreporting  of  the  type  we 
consider  is  important  to  the  economy,  our  analysis  suggests  that  Section  302  of  SOX  likely 
expanded  gross  domestic  product  but  reduced  taxable  profits  and  tax  revenues. 

More  importantly,  our  analysis  suggests  that  only  certain  types  of  misreporting  are  effective 
means  of  creating  a  competitive  advantage  in  the  product  market.  In  particular,  aggressive  cost 
capitalization,  including  operating  costs  in  restructuring  costs,  or  selling  previously  written  off 
inventory  all  produce  financial  statements  that  lead  the  firm’s  rival  to  infer  that  the  firm’s  costs  are 
lower  than  they  actually  are.  Changes  in  certain  estimates  can  also  produce  the  same  result.  For 
example,  a  firm  that  reduces  the  allowance  for  doubtful  accounts  (to  increase  earnings  rather  than 
because  of  changes  in  its  customers’  credit-worthiness)  increases  reported  revenue  relative  to 
operating  expenses,  and  thus  gives  its  rival  the  impression  that  the  reporting  firm’s  costs  are  lower 
than  they  actually  are.  Another  example  is  reducing  warranty  expense  estimates  (without  an 
associated  change  in  the  reliability  of  the  product)  that  directly  lowers  reported  operating  expenses 
and  misleads  the  firm’s  rival  in  a  similar  manner.  Finally,  certain  methods  of  “real”  earnings 
management  such  as  delaying  expenditures  have  the  same  effect. 

There  are,  however,  a  variety  of  “standard”  earnings  management  techniques  that  do  not 
create  a  competitive  advantage  of  the  type  we  examine.  For  example,  aggressive  revenue  recog- 


See  “Certification  of  Disclosure  in  Companies’  Quarterly  and  Annual  Reports,  Securities  and  Exchange  Commission 
17  CFR  PARTS  228,  229,  232,  240,  249,  270  and  274  [RELEASE  NOS.  33-8124,  34-46427,  IC-25722;  File  No. 
S7-21-02]  RIN  3235- AI54,  http://www.sec.gov/rules/final/33-8124.htm. 
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nition  or  granting  lenient  credit  terms  (or,  more  aggressively,  channel  stuffing)  all  produce  greater 
earnings  (assuming  gross  margins  are  positive),  but  do  not  necessarily  affect  a  rival’s  ability  to  use 
the  firm’s  financial  statements  to  infer  its  costs.  Similarly,  delays  in  writing  down  assets,  over¬ 
reserving  for  contingencies  or  “timely”  selling  of  assets  are  all  means  of  managing  earnings  that 
do  not  alter  the  rival’s  ability  to  infer  costs.  Thus,  assuming  that  firms  generally  face  similar 
incentives  to  engage  in  this  second  class  of  earnings  management  techniques,  our  analysis  sug¬ 
gests  that  SOX  will  reduce  the  use  of  the  first  class  of  earnings  management  techniques  relative  to 
the  second.  As  a  result,  our  model  predicts  that  the  observed  proportion  of  the  first  class  of 
earnings  management  techniques  among  SEC  enforcement  actions  will  be  smaller  post-SOX. 

Result  2  also  offers  an  alternative  explanation  for  the  “contagion”  effect  associated  with 
restatements  studied  recently  by  Gleason  et  al.  (2008),  Xu  et  al.  (2006),  Da  Dalt  and  Margetis 
(2004),  Margetis  (2004),  and  Gonen  (2003).  These  studies  all  examine  how  a  restatement  by  one 
firm  affects  the  stock  price  of  its  peers.  They  generally  find  that  the  negative  effect  on  the  restating 
firm’s  stock  price  is  contagious  in  the  sense  that  the  restatement  is  associated  with  negative  effects 
on  the  stock  price  of  its  peers.  These  studies  generally  focus  on  the  likely  transmission  mechanism 
being  the  market’s  concern  that  the  restatement  by  one  firm  increases  the  likelihood  of  a  restate¬ 
ment  by  its  peers  (hence  the  name  “contagion  effect”).  Our  analysis  offers  a  complementary 
transmission  mechanism  for  a  subset  of  the  financial  restatements  examined:  Result  2  tells  us  that 
if  the  restatement  by  one  firm  increases  the  cost  of  misreporting,  then  each  firm  will  respond  by 
reducing  the  amount  of  misreporting,  thereby  lowering  the  profits  each  earns  in  the  product 
market.  This  complementary  transmission  mechanism  can  be  examined  empirically  by  focusing  on 
the  future  operating  performance  of  the  restating  firm’s  peers.  A  deterioration  in  this  performance 
would  be  consistent  with  a  loss  of  profitability  in  the  firms’  product  market  and  would  suggest  that 

the  observed  contagion  effect  may  partly  reflect  the  market’s  anticipation  of  this  future  deteriora- 
28 

tion  in  performance. 

Having  analyzed  the  impact  of  earnings  management  on  competition  in  the  product  market, 
we  can  expand  our  understanding  of  the  incentives  to  manage  earnings  by  examining  how  changes 
in  the  product  market  affect  each  firm’s  willingness  to  misreport  costs. 

Result  3:  When  firms  compete  in  more  profitable  product  markets,  the  magnitude  of  misre¬ 
porting  is  greater  but  the  bias  in  the  rival’s  estimate  of  a  firm’s  cost  of  production 
is  smaller,  which,  in  turn,  results  in  a  smaller  gain  in  equilibrium  product  market 
profits  relative  to  the  full-information  level  of  profit. 

Intuitively,  Result  3  shows  how  the  competitive  environment  in  the  product  market  affects 
incentives  to  engage  in  the  type  of  earnings  management  we  analyze.  In  particular,  firms  that 
compete  in  more  profitable  product  markets  (markets  with  larger  demand  intercepts)  bias  their 
reported  earnings  more  because  the  benefits  from  inducing  a  rival  to  reduce  output  are  greater. 
What  may  be  less  obvious  is  that  the  induced  error  in  the  rival’s  expectation  of  the  firm’s  cost  is 
smaller.  The  reason  is  that  the  per-unit  cost  of  misreporting  is  constant  so  that  as  the  profitability 
of  the  product  market  increases,  inducing  firms  to  report  smaller  costs,  the  total  cost  of  misreport- 


Recently,  Dumev  and  Mangen  (2009)  and  Kedia  and  Philippon  (2009)  have  offered  alternative  but  similar  explanations 
for  the  contagion  effect.  Dumev  and  Mangen  (2009)  suggest  a  transmission  mechanism  that  relies  on  prior  misstate¬ 
ments  causing  inefficient  investments  by  a  firm’s  peers.  Based  on  this  idea,  they  offer  empirical  evidence  suggesting  that 
a  restatement  by  one  firm  in  the  industry  is  associated  with  more  inefficient  investment  decisions  by  its  rivals  as 
measured  by  Tobin’s  marginal  q.  Kedia  and  Philippon  (2009)  provide  a  signaling  explanation  for  why  fraudulent 
accounting  can  affect  not  only  the  firm’s  peers  but  also  have  economy-wide  effects  when  the  equilibrium  is  character¬ 
ized  by  a  pooling  equilibrium.  Intuitively,  in  their  model,  mimicking  behavior  by  low-productivity  firms  requires  that 
they  overinvest  and  overhire  and  both  effects  are  reduced  following  the  market’s  recognition  that  there  has  been 
significant  accounting  fraud  by  some  firms  in  the  economy. 
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ing  rises.  Thus,  the  increase  in  the  amount  of  misreporting  is  tempered  by  the  increasing  total  costs 
of  misreporting,  which,  in  net,  allows  its  rival  to  better  estimate  the  firm’s  true  cost  of  production. 
Since  the  estimate  is  better,  the  impact  of  misreporting  on  the  rival’s  output  choice  declines  and 
the  equilibrium  outcome  converges  to  the  full-information  solution. 

An  immediate  implication  of  Result  3  is  that  reported  (actual)  earnings  are  positively  (nega¬ 
tively)  correlated  with  the  magnitude  of  misreporting.  As  a  result,  our  analysis  predicts  that  most 
measures  of  profitability  (e.g..  Return  on  Equity,  Return  on  Invested  Capital,  Return  on  Sales) 
calculated  using  the  firm’s  unadjusted  (adjusted)  financial  statements  should  be  positively  (nega¬ 
tively)  correlated  with  the  use  of  the  types  of  earnings  management  techniques  we  analyze.  In 
many  cases,  this  result  may  be  difficult  to  test  because  adjusted  financial  statements  are  not 
available.  However,  this  difficulty  might  be  avoided  for  the  subset  of  firms  that  voluntarily  or 
involuntarily  provide  a  restatement  of  past  financial  information.  For  these  firms,  unadjusted  and 
adjusted  measures  of  profitability  are  available  as  are  the  reasons  for  the  restatement.  Together, 
this  information  would  allow  for  an  empirical  test  of  our  result.  Alternatively,  Result  3  implies  that 
firms  in  highly  profitable  markets  should  be  those  firms  that  are  more  often  engaged  in  (or  engage 
in  more)  misreporting  than  firms  in  less  profitable  markets.  As  a  result,  our  analysis  suggests  that 
firms  in  more  profitable  markets  are  more  likely  to  be  represented  in  the  set  of  firms  identified  by 
SEC  enforcement  actions  as  having  engaged  in  the  types  of  earnings  management  that  result  in 
biasing  expectations  about  the  reporting  firm’s  costs  of  production. 

Result  4:  The  more  information  a  firm  can  extract  about  its  rival’s  cost  from  knowing  its  own 
cost,  the  smaller  is  the  magnitude  of  the  misreporting,  the  smaller  is  the  induced 
bias  in  the  rival’s  estimate  of  the  firm’s  cost,  and  the  smaller  is  the  deviation  from 
the  full-information  level  of  output. 

The  amount  of  information  a  firm  can  extract  about  its  rival’s  costs  from  knowing  its  own 
costs  is  “measured”  by  the  covariance  between  the  costs,  cr,2.  In  particular,  the  greater  the  cova¬ 
riance,  the  greater  is  the  reduction  in  the  firm’s  remaining  uncertainty  about  its  rival’s  cost  after 
observing  its  own  costs.  The  reduction  in  this  uncertainty  reduces  the  effectiveness  of  misreporting 
by  offering  a  biased  report,  and  the  result  is  that  the  product  market  equilibrium  is  closer  to  the 
full-information  outcome  than  it  would  be,  had  the  firms’  costs  been  less  highly  correlated. 

Result  4  suggests  that  firms  with  similar  technologies,  such  as  those  controlled  by  physical  or 
chemical  processes,  should  be  expected  to  be  less  likely  to  engage  in  misreporting,  and  if  they  do 
so,  to  do  so  in  smaller  amounts  than  firms  whose  technologies  are  more  likely  to  be  very  different 
(e.g.,  service  industries).  We  also  expect  that  firms  in  mature  industries  such  as  the  auto  or  steel 
industries  are  less  likely  to  engage  in  (or  to  do  relatively  less)  misreporting  because  they  are  likely 
to  have  more  information  about  their  rival  s  production  costs.  Similarly,  firms  with  large  portfolios 
of  products  are  more  likely  to  engage  in  misreporting  because  their  rivals  are  likely  to  be  less  able 
to  infer  the  firm’s  costs  of  producing  each  product.  This  result  could  be  empirically  tested  by 
relating  misreporting  to  the  number  of  business  segments  in  the  firm. 

In  summary.  Results  I,  2,  3,  and  4  show  the  full  effect  of  firms  exercising  the  option  to  bias 
their  reported  costs.  Results  1  and  2  show  how  a  firm  s  decision  to  bias  its  disclosure  impacts 
competition  in  its  product  market,  and  Results  3  and  4  show  the  feedback  effect  how  the  com¬ 
petitive  environment  in  the  product  market  affects  incentives  to  manage  earnings. 

V.  ASYMMETRIC  EQUILIBRIA 

The  results  in  the  previous  section  provide  new  insights  into  the  relation  between  product 
market  competition  and  misreporting  designed  to  bias  a  rival’s  inferences  about  the  reporting 
firm’s  cost  of  production.  The  main  feature  that  cannot  be  explored  when  the  focus  is  on  symmet¬ 
ric  equilibria  is  the  impact  of  differential  costs  of  misreporting. 
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Previously,  the  cost  of  misreporting  was  treated  as  either  non-random  or  as  the  expected  cost 
of  misreporting.^^  The  advantage  of  interpreting  it  as  an  expected  cost  is  that  it  is  natural  to 
assume  that  firms  face  different  expected  costs  of  misreporting.  Not  only  might  firms  differ  in  the 
likelihood  that  they  are  examined  for  misreporting,  but  it  is  also  possible  that  firms  are  run  by 
managers  who  differ  in  their  concern  about  the  impact  of  being  detected  misreporting.  This 
situation  appears  to  apply  in  the  AT&T-WorldCom  example  cited  in  the  introduction. 

To  explore  the  effect  of  different  costs  of  misreporting,  we  examine  asymmetric  equilibria. 
We  again  apply  numerical  techniques  because  of  the  difficulties  solving  the  large  system  of 
nonlinear  equations  that  describe  the  equilibrium.  The  key  difference  in  this  section  is  that  we 
relax  the  assumption  that  8^  =  Sj  by  assuming,  without  loss  of  generality,  that  62  >  Ci-  As  in  the 
prior  section,  all  numerical  analyses  assume  that  the  realized  values  of  kj  and  ^2  are  zero. 

Result  5:  When  Cj  and  £2  are  sufficiently  different,  as  £2  rises  relative  to  £1: 

(i)  the  amount  of  misreporting  by  the  low-cost  misreporter  increases  while  the  amount  of 
misreporting  by  the  high-cost  misreporter  declines; 

(ii)  the  bias  in  the  estimate  of  the  low-cost  misreporter  is  greater  but  both  converge  to  zero 
as  £2  rises; 

(iii)  the  low-cost  misreporter  produces  more  output  than  the  full-information  output,  while 
the  high-cost  misreporter  produces  less;  and 

(iv)  the  low-cost  misreporter  earns  greater  product  market  profits  than  the  high-cost  mis¬ 
reporter  does  and  the  difference  increases  as  £2  increases. 

Result  5  describes  how  differences  in  the  costs  of  misreporting  affect  competition.  In  particu¬ 
lar,  the  greater  the  difference  in  the  costs  of  misreporting,  the  larger  (smaller)  is  the  amount  done 
by  the  low-cost  (high-cost)  misreporter.  Thus,  holding  the  lower  cost  of  misreporting  constant. 
Result  5  suggests  that  increases  in  the  high-cost  firm’s  costs  motivate  its  rival  to  engage  in  more 
misreporting.  In  fact.  Result  5  says  even  more:  If  the  low  cost  rises  slower  than  the  high  cost,  then 
the  direct  effect  of  the  cost  increase  on  the  low-cost  misreporter  (to  reduce  the  amount  of  misre¬ 
porting)  is  overwhelmed  by  the  indirect,  competitive  effect  of  the  increase  in  its  rival’s  cost  of 
misreporting.  Further,  the  lower  cost  of  misreporting  offers  the  firm  a  competitive  advantage  in  the 
product  market,  which  it  exploits  by  selling  more  output  than  its  rival  and  earning  greater  profits. 

Result  5  also  suggests  some  empirically  testable  implications.  First,  if  firms  “forced”  to 
replace  upper  management  in  the  wake  of  a  misreporting  scandal  incur  higher  costs  of  misreport¬ 
ing,  then  rivals  are  likely  to  engage  in  more  misreporting  and  increase  production,  thereby  shifting 
profits  away  from  the  firm  that  changed  upper  level  management.  Second,  it  suggests  a  potential 
risk  from  using  deferred  prosecution  agreements  in  misreporting  cases. Such  agreements  in¬ 
crease  (potentially  dramatically)  the  cost  of  misreporting,  and  our  Result  5  suggests  that  such 
agreements  will  produce  unwanted  competitive  responses  from  the  firm’s  rivals.  In  particular,  the 
agreements  will  motivate  the  firm’s  rivals  to  increase  their  level  of  misreporting. 

VI.  CONCLUSION 

We  examine  how  earnings  management  affects  product  market  competition  and  how  rivalry 
impacts  both  production  decisions  and  disclosure  (earnings  management)  decisions.  To  do  so,  we 


If  we  as.sume  that  the  cost  of  misreporting  is  uncorrelated  with  the  firm’s  private  information,  then  because  the  firm’s 
payoffs  are  linear  in  that  cost,  we  could  take  expectations  with  respect  to  it  and  simply  replace  the  random  variable  with 
its  expectation  in  the  expressions  derived  above. 

Examples  of  the  use  of  such  agreements  regarding  misreporting  of  the  type  we  examine  include  the  Bristol-Myers 
Squibb  case  (http://www.usdoj.gov/usao/nj/press/files/pdffiles/deferredpros.pdf);  the  PNC  Financial  case  (http:// 
www.usdoj.gOv/opa/pr/2003/June/03_crm_329.htm);  the  Computer  Associates  case  (http://www.sec.gov/litigation/ 
litreleases/lrl8891.htm);  and  the  AOL  case  (http://www.usdoj.gov/opa/pr/2004/December/04_crm_790.htm). 
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consider  an  incomplete  information  Cournot  duopoly  model  in  which  firms  know  their  own  pro¬ 
duction  costs  but  not  their  rival’s.  In  our  model,  fully  rational  firms  provide  a  disclosure  (e.g.,  an 
income  statement)  that  its  rival  can  use  to  update  its  beliefs  about  the  disclosing  firm’s  production 
costs  prior  to  competing  in  the  product  market.  Our  model  differs  from  those  in  the  prior  literature 
in  that  we  allow  firms  to  provide  biased  reports,  but  if  they  do  so,  they  incur  a  cost  of  misreport- 
ing.  We  show  that  biased  reporting  is  impossible  in  monotone  equilibria  when  these  costs  are 
common  knowledge.  However,  when  costs  are  private  information,  we  find  a  linear  equilibrium  in 
which  both  firms  bias  their  reports  even  though  both  are  fully  rational  and  optimally  use  all 
information  when  estimating  their  rival’s  costs  of  production.  Thus,  our  model  complements  and 
extends  Fischer  and  Verrecchia  (2004)  by  showing  that  biased  reporting  does  not  require  firms  to 
employ  heuristics  rather  than  optimally  update  their  beliefs.  Further,  our  model  complements  and 
extends  Fischer  and  Verrecchia  (2000)  by  showing  that  agency  problems  and  incomplete  contract¬ 
ing  are  not  required  for  equilibrium  to  exhibit  biased  disclosures.  Finally,  we  extend  the  literature 
on  disclosure  in  oligopoly  by  allowing  biased  disclosure  and  showing  that,  when  it  is  available, 

3 1 

firms  choose  to  bias  their  disclosures. 

Intuitively,  in  our  model,  firms  have  an  incentive  to  bias  their  reported  costs  downward  and, 
even  though  all  firms  use  all  available  information  efficiently,  fully  understand  each  other’s  in¬ 
centives,  and  adjust  their  beliefs  about  their  rival’s  costs  upward,  they  still  underestimate  those 
costs.  As  a  result,  each  cuts  production  relative  to  the  full-information  level  and  each  earns  greater 
product  market  profits.  Interestingly,  these  effects  are  smaller  in  more  profitable  product  markets 
even  though  the  magnitude  of  misreporting  increases.  They  are  also  smaller  when  firms  use  more 
similar  production  technologies.  Finally,  when  firms  have  sufficiently  different  (expected)  costs  of 
misreporting,  the  bias  in  the  low-cost  misreporter’s  disclosure  is  greater  than  its  rival’s  and  it 
produces  more  output  than  the  full-information  quantity. 

By  focusing  on  competitive  effects  and  fully  rational  firms,  our  analysis  suggests  some  em¬ 
pirical  implications.  To  see  why,  note  that  aggressive  cost  capitalization,  fraudulent  revenue  rec¬ 
ognition,  inappropriate  estimates  of  the  allowance  for  doubtful  accounts,  or  warranty  expense  are 
all  examples  of  (one  class  of)  earnings  management  techniques  that  lead  to  under-estimates  of  the 
firm’s  production  costs,  whereas  other  standard  earnings  management  techniques  (the  second 
class)  such  as  channel  stuffing,  delayed  write-downs  of  assets,  or  “timely”  sales  of  assets  do  not. 
Since  incentives  to  use  the  second  class  of  earnings  management  techniques  are  likely  to  be 
independent  of  the  incentives  to  employ  the  first  class,  our  model  predicts  that  regulatory  increases 
in  the  cost  of  misreporting  (e.g..  Section  302  of  the  Sarbanes-Oxley  Act  of  2002)  will  result  in  a 
change  in  the  distribution  of  observed  earnings  management  techniques  (as,  for  example,  among 
SEC  enforcement  actions).  In  particular,  we  expect  a  reduction  in  the  use  of  the  first  class  of 
earnings  management  techniques  relative  to  the  second.  Second,  our  analysis  suggests  that  stan¬ 
dard  measures  of  profitability  will  be  positively  correlated  with  the  use  of  the  first  class  of  earnings 
management  techniques.  Third,  firms  with  more  similar  technologies  (e.g.,  those  for  whom  pro¬ 
duction  is  governed  by  physical  or  chemical  processes  or  those  used  in  mature  industries)  will  be 
less  likely  to  employ  this  type  of  earnings  management  than  firms  with  less  similar  technologies 
(e.g.,  service  industries  or  firms  that  produce  a  large  variety  of  different  products).  In  addition,  our 
analysis  also  suggests  that  the  more  frequent  use  of  deferred  prosecution  agreements  may  lead  to 
unexpected  consequences — providing  the  firm  s  competitors  with  incentives  to  misreport  so  as  to 
bias  estimates  of  their  production  costs  downward. 

Our  analysis  also  appears  to  support  the  claims  made  by  C.  Michael  Armstrong,  former  CEO 
of  AT&T,  that  AT&T’s  perceived  strategic  failures,  its  inability  to  compete  with  WorldCom,  and 


See  Darrough  (1993)  or  Raith  (1996)  among  others  or  the  excellent  survey  by  Verrecchia  (2001). 
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the  decision  to  break  up  the  company  were  the  result  of  accounting  fraud  at  WorldCom.  He 
suggests  that  WorldCom’s  “revenues  were  false,  margins  were  false,  their  costs  were  false”  (Mc¬ 
Connell  et  al.  2002,  1)  and  that  this  resulted  in  layoffs,  cost  cutting,  and,  finally,  the  decision  to 
break  up  AT&T  in  order  to  service  its  debt.  Former  Sprint  CEO  William  Estry  suggests  that  his 
company  also  struggled  with  its  inability  to  match  WorldCom’s  performance  and  noted  that:  “It 
never  dawned  on  us  the  base  of  their  pricing  was  fraud”  (Searcy  2005,  Cl).  Our  analyses,  par¬ 
ticularly  when  one  firm’s  costs  of  misreporting  are  greater  than  its  rival’s,  are  consistent  with 
Armstrong’s  assertions  about  the  impact  of  WorldCom’s  accounting  fraud  on  its  competitors. 

Finally,  we  should  comment  on  the  impact  of  our  assuming  that  firms  are  Cournot  competi¬ 
tors.  It  is  well  known  (e.g.,  Darrough  1993;  Raith  1996)  that  results  may  be  sensitive  to  the  form 
of  competition  assumed  because  quantity  choices  are  strategic  substitutes  but  price  choices  are 
strategic  complements.  We  believe  that  the  only  significant  impact  of  assuming  Bertrand  (price¬ 
choosing)  rather  than  Cournot  (quantity-choosing)  competition  in  our  setting  is  on  the  direction  of 
the  equilibrium  bias  in  reports.  Because  prices  are  strategic  complements  under  Bertrand  compe¬ 
tition,  if  the  firms  choose  prices  and  sell  substitute  products,  then  we  expect  that  each  would  have 
an  incentive  to  bias  reported  costs  of  production  up  rather  than  down.  Other  than  the  direction  of 
the  bias,  we  believe  that  the  remaining  results  would  be  qualitatively  similar  to  those  obtained 
under  Cournot  competition. 


APPENDIX 

Proposition  1 

If  (Cl)  holds,  then  there  exists  a  linear  equilibrium  {q^  ,  ^2)  to  the  production  game  with: 

(Jl  —  Nq  H"  N\Ci  +  N2S  j  +  A^3‘5'2 


(J2  ~  ^0  ^ \^2  '^3*^2’ 
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Proof 

To  find  a  linear  equilibrium,  compute  using  the  conjectured  equilibrium  strategies: 
E[<?2bi]  =  4^0  +  3^iE[c2|yi]  +  M2S\  +  M2S2 


E[^l|y2]  -  ^0  +  ^lE[Ci|y2]  +  ■^2'^l  +  ^3^2- 

Substituting  Equations  (C2): 
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E[<?2|>’i]  =  -^0  +  +  ^2^2 

-  (Mq  +  MiPx)  +  M]/3iCi  +  M2S\  +  (Mi/32  +  ^2)^2 
=  Aq  +  AiCi  +A25J  +A2S2', 


(Al) 


£[^1^2]  =  ^0  +  A^l(«0  +  Ot\C2  +  «2-yi)  +  A^2^  1  +  ^3^2 

=  (A^o  + A’jOfo)  +  AiaiC2  +  (A^ia;2  + A^2)‘^i  +^3^2 
=  Bq  +  B1C2  +  52-^1  +  ^2^2- 


(A2) 


Substituting  into  the  expressions  for  q,  for  i  =  1,2  derived  in  the  text: 


^1  -  (l/2)(a  -  Cl)  -  (l/2)(Ao  +  AiCi  +  A251  +  A352) 


-  (l/2)(a  -  Aq-  (1  +  Ai)ci  -  A2S1  — A352); 


(A3) 


?2  =  (l/2)(a  -  C2)  -  (l/2)(fio  +  fliC2  +  B2S1  +  B2S2) 

=  {l/2){a  -  Bq  —  (I  +  5i)c2  -  B2S1  —  BjS2)- 


(A4) 


In  equilibrium,  the  expectation  of  (A3)  (resp.  (A4))  must  coincide  with  our  conjectures.  So, 
taking  expectations  of  (A3)  with  respect  to  firm  2’s  information  set  (of  (A4)  with  respect  to  firm 
I’s  information  set): 

E[^iI>'2]  =  (l/2)(a  -  Aq  -  (1  +Ai)fiQ  -  (1  +  Ai)^iC2  -  [(1  +  Ai)/C2  “  A2]5i  -  A352) 

E[<?2l>’i]  =  (l/2)(a  -  5o  -  (1  +5i)^o~(l  -B2S\  -[(1  +  ^1)^2  +  ^3]'^2  • 

The  proof  is  completed  by  matching  the  coefficients  in  Equations  (Al)  and  (A3)  and  Equa¬ 
tions  (A2)  and  (A4),  solving  the  resulting  system  of  equations  for  the  A’s  and  5’s  and  then  solving 
for  the  M’s  and  A^s.  H 

Proposition  2  ^  ^ 

In  every  linear  equilibrium,  {q\  ,  q\)  are  defined  as  in  Proposition  1  and  (s^  ,  ^2)  satisfy: 

5i  =  Dq  +  DiCi  +  D2k\ 

S2=Fq  +  F1C2  +  F 2/^2, 


where: 


E[c2|ci]  =  /CQ-t-  /U-iCi 


E[ci|c2]=  ^0+  ^1^^2 


[-  A2(a  -  Ao  +  A3F0  -I-  A3F1//-0)]  Fq  =  3[-  53(a  -Bq  +  B2DQ  -  ^2^1 4)1 

89  -  Dt, 


S2-E2 


Di  = - 5[A2(1  +Ai)  -  A2A3F1/C1  +  81] 

81  -A2 


^[^3(1  +  5i)  -  B2B2D1S1  +  82] 
E2-B] 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1212 


Bagnoli  and  Watts 


Proof 

To  find  a  linear  equilibrium,  compute  using  the  conjectured  equilibrium  strategies: 
E[52|ci]  =  Fo  +  FiE[c2|ci] 


E[5i|c2]  =  Do  +  £)iE[C]|c2], 

where  E[kj  \  ,  cj  =  E[kj  \  c,]  =  E[^^]  =  0  by  independence  and  our  assumption  that  E[fe,]  =  0  for  i 

=  1,2.  Further,  since  Cj  ,C2  are  joint  normally  distributed,  E[c2  |  Cj]  = /xq  +  and  E[ci  |  C2] 
=  <5o+  <5|C2.  Substituting  into  Equations  (4)  and  (5)  and  using  the  definitions  of  the  A’s  and  5’s: 


5,= 


[- A2(a  -  Aq  +  A3F0  +  A3F1/X1)  +  (A2(1  +  Aj)  -  A2A3F  1/X.1  +  ei)ci  - 


^2 


1 

82-53 


[-  -Bq  +  B2DQ  +  B2DiSq)  +  (53(1  +  Bi)  -  B2B2D1S1  +  82)82  -  k2]. 


Since  both  are  of  the  form  assumed,  the  linear  equilibrium  is  obtained  by  setting  the  coefficients 
in  these  equations  equal  to  the  F’s  and  D’s,  respectively.  ■ 


Proposition  3 

If  k^  and  k2  are  common  knowledge,  then  there  is  no  monotone  equilibria  in  pure  strategies. 


Proof 

Suppose  not.  Then  in  such  an  equilibrium,  firm  i  can  invert  the  strategy  defining  its  rival’s 
choice  of  Sj  and  infer  Cj.  Thus,  the  second-stage  game  is  now  a  game  of  complete  information 
whose  solution  is  well-laiown:  q-  =  ( 1  /  3)(a  -  2c,  +  cf)  and  equilibrium  profits  are  {q- Y,  for  i  = 
1,2.  Next,  consider  a  deviation  in  which  firm  i  reports  f,-,  which  leads  firm  j  to  infer  that  firm  fs 
cost  of  production  is  c,  ^  c,.  Thus,  ^,  =  ( 1  /  3)(a  -  c,- -  c,- -l-  Cj)  and  qj  =  {\  /  3){a  -  Icj  -  c,).  Since 
firm  f  s  profits  are  q],  they  are  decreasing  in  c,.  As  a  result,  if  firm  i  deviates  to  c,-  <  c,-,  then  its 
profits  in  the  product  market  increase,  which,  for  small  deviations,  exceed  the  costs  of  misreport- 
ing  making  firm  /’s  deviation  profitable  and  showing  that  there  is  no  linear  perfect  Bayes  equilib¬ 
rium  in  pure  strategies  when  kj  and  k2  are  common  knowledge.  | 

Proposition  4 

If  8  is  not  too  small,  then  there  is  a  symmetric  linear  perfect  Bayes  equilibrium  in  pure 
strategies. 


Proof 

In  a  symmetric  equilibrium,  D,  =  F,-  and  a,  =  yS,-  for  i  =  1,2,3;  and  A,-  =  M,-  for  i  =  1, 2,3,4  and 
yu,  =  Sj  for  i  =  1,2.  After  matching  coefficients,  we  have  12  nonlineai'  equations  in  12  unknowns 
describing  the  coefficients  of  the  firms’  strategies.  By  iterative  substitution,  one  obtains  a  system 
of  two  equations  in  two  unknowns,  Dj ,  D2  (resp.  Fj ,  F2).  One  equation  is  quadratic  in  Dj  and  has 
real  roots  when  8j(=82)  is  not  too  small.  Solving  for  the  roots  and  substituting  into  the  other 
equation  produces  an  18th  order  polynomial  in  D2  with  87  sign  changes.  Thus,  by  Descartes’  Rule 
of  Signs  (see,  for  example.  Levin  [2002]  and  the  references  therein),  there  is  at  least  one  real  root 
and,  thus,  at  least  one  symmetric  linear  equilibrium  in  pure  strategies.  | 

Proof  of  the  Comparative  Static  Results  for  Changes  in  e 

Suppose  not  Then  in  equilibrium,  5,-c,>^,>0  for  i  ==  1,2  for  all  8.  Since 
E[7r,- 1  k,c,]  =  E[(^,- )^  I  k,- ,  8,]  - /z,(s'y ,  8,)  and  E[(9,  )^]  is  bounded  from  above  by  the  firm’s  mo- 


The  Accounting  Review 
American  Accounting  Association 


July  2010 


Oligopoly,  Disclosure,  and  Earnings  Management 


1213 


nopoly  profits,  [(a-c,)/2p,  if  then  there  is  a  e  large  enough  so  that  [(a 

-  c,)  /  2f  -  hiisf ,  c,)  <  0.  Thus,  if  5,-  -  c,-  >  >  0,  there  exists  a  large  enough  e  so  that  in  the 

conjectured  equilibrium,  firm  i’s  expected  profits  are  negative.  Since  firm  i  could  deviate  to  5, 
=  c,  and  earn  non-negative  profits,  such  a  deviation  is  profitable  and  we  have  shown  that  in  every 
linear  perfect  Bayes  equilibrium,  5,  0  for  all  8  must  be  false.  Given  this,  the  difference 

Si  -  Ci  must  become  arbitrarily  small  as  e  gets  large,  which  implies  that  E[c,  |  Cj ,  5,- ,  Sj]  -  C;  also 
becomes  arbitrarily  small  as  e  becomes  arbitrarily  large.  Finally,  since  is  a  linear  function  of  c,- 
and  E[c^  |  c, ,  s, ,  Sj]  in  equilibrium,  it  too  converges  to  the  complete  information  Cournot  output  as 
8  becomes  arbitrarily  large.  I 
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ABSTRACT:  A  flourishing  stream  of  research  suggests  that  liquidity-constrained  firms 
with  low  accounting  quality  have  limited  access  to  capital  for  investments.  We  extend 
this  research  by  investigating  whether  these  firms  are  more  likely  to  lease  their  assets. 
Lessors’  superior  control  rights  allow  them  to  provide  capital  to  constrained  firms  with 
low-quality  accounting  reports.  Consistent  with  this  conjecture,  we  find  that  low  ac¬ 
counting  quality  firms  have  a  higher  propensity  to  lease  than  purchase  assets.  To  verify 
that  leasing  does  not  merely  reflect  these  firms’  desire  for  off-balance-sheet  accounting, 
we  investigate  whether  banks’  access  to  private  information  and  monitoring  affect  the 
relation  between  accounting  quality  and  leasing.  We  find  the  association  between  ac¬ 
counting  quality  and  leasing  decreases  when  banks  have  higher  monitoring  incentives 
and  when  loans  contain  capital  expenditure  provisions.  These  results  suggest  that 
other  mechanisms  can  substitute  for  the  role  of  accounting  quality  in  reducing  informa¬ 
tion  problems. 
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I.  INTRODUCTION 

Agrowing  body  of  accounting  research  concludes  that  higher  financial  reporting  quality 
reduces  the  negative  effects  of  financing  constraints  on  investment  by  mitigating  infor¬ 
mation  asymmetry  (e.g.,  Biddle  et  al.  2009;  Lara  et  al.  2009;  Biddle  and  Hilary  2006; 
Verdi  2006;  Bushman  et  al.  2008;  Hope  et  al.  2009).^  The  research  examining  whether  accounting 
quality  reduces  financing  frictions  and  underinvestment  focuses  on  asset  purchases  and  R&D 
without  considering  alternative  investment  mechanisms,  like  leases.  In  another  stream  of  literature, 
both  Sharpe  and  Nguyen  (1995)  and  Eisfeldt  and  Rampini  (2009)  argue  that  leases,  which  account 
for  about  one-third  of  new  equipment  investment  for  U.S.  firms,  provide  creditors  with  more 
security,  higher  priority  in  bankruptcy,  and  an  effective  way  of  reducing  adverse  selection  and 
moral  hazard  problems  that  arise  from  information  asymmetries.  Although  these  studies  suggest 
that  firms  facing  greater  financing  constraints  have  a  higher  propensity  to  engage  in  off-balance- 
sheet  lease  investments,  they  do  not  consider  other  mechanisms  that  reduce  financing  constraints, 
like  accounting  quality.  We  extend  these  two  research  streams  by  considering  the  role  of  account¬ 
ing  quality  in  the  firm’s  propensity  to  lease  assets  (i.e.,  the  proportion  of  assets  that  the  firm 
leases). 

We  focus  on  the  determinants  of  firms’  propensities  to  acquire  assets  using  operating  leases 
(measured  by  the  ratio  of  the  capitalized  minimum  operating  lease  payment  to  the  sum  of  that 
amount  and  the  amount  of  property,  plant,  and  equipment  [PP&E]).  By  capitalizing  operating 
leases,  this  measure  is  unlikely  to  be  affected  by  the  differences  in  accounting  treatment  for  leased 
versus  purchased  assets.  We  conduct  our  analyses  using  a  comprehensive  sample  of  3,033  manu¬ 
facturing  firms  on  the  Compustat  database.  Our  focus  on  manufacturing  firms  is  consistent  with 
previous  research  examining  how  financial  constraints  affect  asset  purchases.  In  addition,  focusing 
on  a  single  industry  reduces  concerns  that  our  results  are  driven  by  industry-induced  correlations 
between  leasing,  financing,  and  accounting.  To  measure  our  primary  variable  of  interest,  financial 
reporting  quality,  we  follow  Bharath  et  al.  (2008)  by  using  the  first  principal  component  of  three 
accrual-based  accounting  quality  metrics.^  We  conduct  cross-sectional  regressions,  examining  the 
relationship  between  leasing  propensity  and  accounting  quality,  controlling  for  size,  tax  incentives, 
financing  constraints,  and  other  factors  expected  to  affect  leasing  decisions.  We  average  all  vari¬ 
ables  across  the  sample  period,  thereby  treating  each  firm  as  one  observation  rather  than  using 
panel  data."^ 

We  find  that  firms  with  worse  financial  reporting  quality  have  a  greater  propensity  to  lease 
their  assets.  This  result  highlights  the  importance  of  considering  leasing  when  studying  the  effects 
of  financial  reporting  quality  on  investments.  Prior  research  concludes  that  firms  with  poor  ac¬ 
counting  quality  have  financial  constraints,  inhibiting  their  ability  to  invest  in  the  periods  in  which 
they  are  liquidity-constrained.  Our  results  extend  this  line  of  research,  suggesting  that  although 
low-quality  accounting  firms  are  likely  to  forgo  purchasing  assets,  they  are  more  likely  to  lease 
assets  in  the  periods  in  which  they  are  liquidity-constrained. 

One  interpretation  of  our  finding  that  poor  accounting  quality  firms  lease  more  of  their  assets 
is  that  these  firms  substitute  lease  financing  for  other  forms  of  secured  borrowing.  An  alternative 


See  Hope  et  al.  (2009)  for  a  discussion  of  whether  accounting  quality  (transparency)  affects  firms’  ability  to  gain  access 
to  financing. 

^  The  statistical  significances  of  the  coefficients  on  our  variables  of  interest  are  not  sensitive  to  this  choice. 

As  discussed  in  the  Appendix,  the  three  measures  are  Teoh  et  al.’s  (1998)  absolute  value  of  current  discretionary 
accruals,  Dechow  et  al.’s  (1995)  absolute  value  of  total  discretionary  accruals,  and  the  cross-sectional  version  of 
^  Dechow  and  Dichev’s  (2002)  absolute  value  of  current  discretionary  accruals. 

Difficulties  measuring  inter-temporal  variation  in  accounting  quality  and  lease  propensity  make  cross-sectional  regres¬ 
sions  more  appropriate  than  time-series  regressions  in  our  setting.  Biddle  et  al.  (2009)  use  a  panel  data  set  to  investigate 
the  relationship  between  investment,  the  business  cycle,  and  accounting  quality. 


The  Accounting  Review 
American  Accounting  Association 


July  2010 


Financial  Reporting  Quality,  Private  Information 


1217 


explanation  of  our  finding  is  that  the  negative  association  between  the  proportion  of  assets  leased 
and  accounting  quality  merely  reflects  the  desire  of  low  accounting  quality  firms  to  engage  in 
off-balance-sheet  financing.  To  distinguish  between  these  alternatives,  we  investigate  how  other 
mechanisms  that  reduce  information  asymmetry  and  agency  problems  impact  the  relationship 
between  accounting  quality  and  leasing  propensity.  Specifically,  we  investigate  how  lenders’  moni¬ 
toring  and  loan  covenants  affect  the  association  between  accounting  quality  and  leasing  propen¬ 
sity.  If  the  association  between  accounting  quality  and  leasing  reflects  the  role  of  accounting 
quality  in  reducing  information  asymmetry,  then  we  would  expect  the  negative  relation  between 
accounting  quality  and  leasing  to  be  mitigated  by  other  information  asymmetry  and  agency-cost- 
reducing  mechanisms,  suggesting  that  accounting  quality  and  leasing  are  alternative  ways  to 
address  financial  constraints.  Alternatively,  if  off-balance-sheet  accounting  motivates  firms  with 
poor  accounting  quality  to  lease,  then  the  relationship  between  leasing  and  accounting  quality 
should  not  be  affected  by  other  information-asymmetry-reducing  mechanisms. 

This  analysis  also  allows  us  to  extend  Biddle  and  Hilary’s  (2006)  investigation  of  whether 
cross-sectional  differences  in  access  and  use  of  private  information  and  monitoring  affect  the  role 
of  accounting  quality  in  investment  decisions.  Specifically,  Biddle  and  Hilary  (2006)  suggest  that 
the  role  of  accounting  quality  in  capital  investment  decisions  depends  on  the  firm’s  use  of  other 
monitoring  mechanisms  that  reduce  information  asymmetry.  We  expand  this  reasoning,  suggesting 
that  both  the  quality  of  banks’  monitoring  and  the  covenants  banks  use  to  restrict  investment  will 
affect  the  importance  of  accounting  quality  in  the  lease-versus-buy  decision. 

To  test  these  hypotheses,  we  use  a  subsample  of  manufacturing  firms  with  syndicated  bank 
loan  data  on  either  the  Loan  Pricing  Corporation  (LPC)  or  Security  Data  Corporation  (SDC) 
databases.  There  are  two  primary  reasons  why  we  focus  on  this  important  subsample  of  the  debt 
market.^  First,  the  variation  in  bank  loan  ownership  provides  cross-sectional  variation  in  lenders’ 
monitoring  incentives.  Lead  lenders  with  proper  financial  incentives  to  conduct  due  diligence 
reviews  prior  to  loan  initiation  and  to  provide  loan  monitoring  over  the  life  of  the  contract  have 
incentives  to  reduce  the  agency  problems  induced  by  information  asymmetries  between  lenders 
and  opaque  borrowers.  Therefore,  we  hypothesize  that  when  lead  lenders  have  greater  due  dili¬ 
gence  and  monitoring  incentives,  the  importance  of  the  quality  of  accounting  information  in  the 
lease-versus-buy  decision  will  be  mitigated. 

The  second  advantage  of  focusing  on  syndicated  loans  is  that,  unlike  public  debt,  there  is 
cross-sectional  variation  in  the  use  of  contractual  limitations  on  asset  purchases  in  these  loans. 
When  a  firm’s  debt  contract  contains  an  investment-restriction  covenant  designed  to  mitigate  the 
agency  problems  of  debt,  we  expect  the  quality  of  accounting  information  will  be  less  important 
in  controlling  information  problems.  Accordingly,  we  hypothesize  that  the  effect  of  accounting 
quality  in  the  lease-versus-buy  decision  decreases  in  the  face  of  investment-restriction  covenants. 

We  identify  a  subsample  of  997  manufacturing  firms  with  loan  ownership  data  in  the  LPC 
database  to  test  our  predictions  about  the  effects  of  banks’  direct  and  indirect  monitoring  on  the 
role  of  accounting  quality  in  the  lease-versus-buy  decision.  Based  on  Sufi  (2007)  and  Ball  et  al. 
(2008),  we  use  lead  lender  ownership  to  proxy  for  lenders’  due  diligence  and  monitoring 
incentives.^  Consistent  with  bank  monitoring  substituting  for  accounting  information,  we  find  that 
accounting  quality  is  less  important  in  the  lease-versus-buy  decision  when  lead  lenders’  ownership 
is  high. 


^  Sufi  (2007)  reports  that  syndicated  loans  represent  51  percent  of  U.S.  corporate  finance  originations,  and  generate  more 
underwriting  fees  than  both  equity  and  debt  underwriting.  He  also  notes  that  almost  90  percent  of  the  largest  500 
nonfinancial  Compustat  firms  obtained  a  syndicated  loan  between  1994  and  2002. 

®  Nini  et  al.  (2009)  find  that  32  percent  of  U.S.  bank  loans  contain  this  covenant,  while  we  find  that  less  than  5  percent 
of  public  debt  uses  this  covenant  to  directly  restrict  firms’  opportunistic  investments. 

^  In  sensitivity  tests,  Sufi  (2007)  documents  that  the  lead  lender  ownership  measure  is  very  robust. 
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We  then  examine  the  effects  of  contractual  restrictions  on  asset  purchases,  using  a  subsample 
of  865  manufacturing  firms  with  information  on  the  investment-restriction  covenant  in  the  SDC 
database.  We  use  an  indicator  variable  to  capture  the  effects  of  covenants  on  leasing  decisions. 
Consistent  with  our  previous  analysis,  we  find  that  the  magnitude  of  the  correlation  between  our 
accounting  quality  measure  and  proportion  of  assets  leased  is  lower  in  the  presence  of  this  cov¬ 
enant,  suggesting  that  this  covenant  diminishes  the  importance  of  accounting  information  in  the 
lease-versus-buy  decision. 

We  examine  the  sensitivity  of  our  results  to  a  variety  of  different  research  design  choices. 
Specifically,  we  consider  the  effect  of  limiting  the  sample  to  manufacturing  firms  and  the  choice  of 
measures  for  leasing  propensity,  accounting  quality,  and  lead  lender  measures.  We  also  investigate 
the  sensitivity  of  our  results  to  the  methods  we  use  to  address  endogeneity  and  the  methods  we  use 
to  address  missing  lease  data.  Finally,  we  investigate  whether  our  results  are  affected  when  we  use 
an  alternative  research  design  to  address  cross-sectional  variation  in  financing  constraints.  Each  of 
these  research  design  modifications  produces  robust  results. 

Our  study  contributes  to  the  accounting  literature  examining  the  role  of  accounting  quality  in 
investment  decisions,  and  to  the  finance  literature  examining  the  determinants  of  the  decision  to 
lease-versus-buy  assets,  by  showing  that  firms  with  information  asymmetries  due  to  poor  account¬ 
ing  quality  are  more  likely  to  lease  their  assets.  Results  also  suggest  that  accounting  quality  is 
important  in  mitigating  the  financing  constraints  that  lead  firms  to  lease  rather  than  buy  assets. 

Our  setting  also  allows  us  to  test  the  conjecture  made  in  previous  research  that  banks’  use  of 
covenants,  access  to  private  information,  and  monitoring  are  substitutes  for  accounting  informa¬ 
tion  in  reducing  information  asymmetries.  Results  suggest  that  bank  monitoring  and  covenant 
restrictions  are  substitutes  for  the  influence  of  accounting  quality  on  investment  decisions. 

Section  II  provides  the  background  to  our  study.  Section  III  contains  the  hypothesis  develop¬ 
ment,  Section  IV  discusses  the  sample  selection  and  database.  Section  V  displays  the  model 
specification.  Section  VI  presents  the  results.  Section  VII  reports  robustness  checks,  and  Section 
VIII  concludes. 


11.  BACKGROUND 

Information  Asymmetry  and  Investment 

Fazzari  et  al.  (1988)  examine  the  importance  of  capital  market  imperfections  caused  by 
information  asymmetry  between  managers  and  outside  capital  providers  on  firms’  optimal  invest¬ 
ment  decisions.  They  find  that  financially  constrained  firms  rely  on  internally  generated  cash  flows 
to  fund  their  investments,  increasing  the  sensitivity  of  their  investments  to  cash  flows. ^ 

Many  studies,  such  as  Bushman  et  al.  (2005),  Biddle  and  Hilary  (2006),  Verdi  (2006),  Biddle 
et  al.  (2009),  and  Lara  et  al.  (2009),  suggest  that  aspects  of  the  firms’  accounting  reports  may 
affect  the  extent  to  which  firms’  information  asymmetry  problems  affect  their  investments.  Despite 
the  fact  that  these  studies  have  used  a  variety  of  measures  of  both  investment  efficiency  and 
accounting  quality,  the  results  of  these  studies  have  all  been  consistent.  When  accounting  quality 
is  high,  the  reduction  in  information  asymmetry  leads  to  an  improvement  in  investments. 

Bushman  et  al.  (2005)  investigate  cross-country  variation  in  the  timeliness  of  loss  recognition 
and  the  timeliness  of  change  in  investment  policy.  They  find  that  firms  in  countries  with  more 
timely  loss  recognition  respond  more  quickly  to  declining  investment  opportunities  by  reducing 
net  inflows  of  capital  to  new  investments.  Their  study  implies  that  better  accounting  quality  leads 
to  increased  investment  efficiency. 


8 

Although  many  studies  conclude  that  information  asymmetry  and  financing  constraints  decrease  investment  efficiency, 
using  investment-cash  flow  sensitivity  to  proxy  for  investment  efficiency  has  been  controversial  (e.g.,  Kaplan  and 
Zingales  1997,  2000;  Bushman  et  al.  2008). 
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Biddle  and  Hilary  (2006)  examine  how  accounting  quality  affects  firms’  investment-cash  flow 
sensitivity.^  They  examine  the  effect  of  accounting  quality  on  investment  shocks,  finding  that 
higher  accounting  quality  is  associated  with  lower  investment-cash  flow  sensitivity.  They  also  find 
that  accounting  quality  plays  a  relatively  more  important  role  in  economies  where  the  capital  is 
primarily  provided  through  arm’s-length  transactions  and  investors  do  not  have  access  to  private 
information.  At  the  extreme,  Biddle  and  Hilary  (2006)  find  that  in  Japan,  where  capital  is  typically 
provided  by  keiretsu  and  banks,  accounting  quality  does  not  affect  the  association  between  invest¬ 
ment  and  cash  flows.  One  of  the  implications  of  Biddle  and  Hilary’s  (2006)  results  is  that  firms 
with  poor  accounting  quality  are  more  likely  to  rely  on  internally  generated  cash  flows  to  fund 
their  investments,  and  thus  have  more  difficulty  obtaining  external  financing  to  fund  their  invest¬ 
ments.  Their  results  also  imply  that  the  role  of  accounting  quality  in  capital  investment  decisions 
depends  on  whether  the  firm  uses  other  monitoring  mechanisms  that  reduce  information  asymme- 

try. 

Verdi  (2006)  and  Biddle  et  al.  (2009)  are  concerned  with  both  whether  firms  overinvest  in 
losing  projects  and  whether  they  underinvest  in  positive  net-present-value  projects.  Both  Verdi 
(2006)  and  Biddle  et  al.  (2009)  find  that  financial  reporting  quality  is  positively  correlated  with 
investment  when  firms  are  prone  to  underinvest  and  is  negatively  correlated  with  investment  when 
firms  are  prone  to  overinvest.  Biddle  et  al.  (2009)  conclude  that,  “higher  reporting  quality  is 
associated  with  both  lower  over-  and  underinvestment.”  Lara  et  al.  (2009),  draw  similar  conclu¬ 
sions  from  a  related  study  using  an  alternative  measure  of  accounting  quality. 

These  studies  focus  on  the  role  of  accounting  quality  on  firms’  decisions  to  purchase  assets 
(like  PP&E).  A  related  literature  examines  the  role  of  accounting  quality  on  firms’  ability  to  obtain 
traditional  financing  (like  debt  or  equity).”  These  studies  find  that  firms  with  poor  accounting 
quality  have  difficulty  raising  capital  from  either  debt  or  equity  markets,  and  thus  invest  subopti- 
mally.  For  example,  Bharath  et  al.  (2008)  find  that  firms  with  poor  accounting  quality  have  to  pay 
higher  interest  rates.  Further,  Francis  et  al.  (2004)  find  that  firms  with  better  accounting  quality 
bear  lower  cost  of  equity.'^  Most  recently,  Hope  et  al.  (2009)  investigate  the  relationship  between 
financial  reporting  transparency  and  firms’  ability  to  access  capital.  They  identify  a  sample  of 
privately  held  firms  with  cross-sectional  variation  in  whether  an  audit  has  been  conducted,  and 
find  that  audited  firms  with  more  transparent  financial  statements  have  better  access  to  external 
financing.  Their  results  imply  that  financial  transparency  influences  information  asymmetries,  and 
thus  influence  firms’  access  to  capital  from  traditional  debt  markets.  Thus,  firms  with  lower  quality 
accounting  are  inefficient  (have  lower  investment  efficiency).  We  extend  this  research  by  investi¬ 
gating  whether  firms  with  lower  quality  accounting  are  precluded  from  investment,  or  whether 
they  can  use  an  alternative  form  of  financing  (an  operating  lease). 

Leasing  and  Financing  Constraints 

Eisfeldt  and  Rampini  (2009)  and  Sharpe  and  Nguyen  (1995)  find  that  firms  with  larger 
information  asymmetries  (greater  financing  constraints)  are  more  likely  to  obtain  their  assets 
through  leases.  Sharpe  and  Nguyen  (1995)  suggest  that  institutional  features  specific  to  operating 


®  Biddle  and  Hilary  (2006)  employ  Dechow  and  Dichev’s  (2002)  measure  to  capture  accounting  quality. 

Verdi  (2006)  and  Biddle  et  al.  (2009)  augment  the  Dechow  and  Dichev  (2002)  measure  of  accounting  quality  according 
to  McNichols  (2002).  Lara  et  al.  (2009)  use  conditional  conservatism  to  capture  accounting  quality. 

"  The  examination  of  the  effects  of  financing  constraints  on  investment  in  balance  sheet  assets  (like  PP&E)  is  not  unique 
to  these  studies.  Most  of  the  related  finance  research  follows  this  same  approach. 

Francis  et  al.  (2004)  use  multiple  variables  to  measure  accounting  quality,  including  earnings  persistence,  predictability, 
smoothness,  relevance,  timeliness,  accounting  conservatism,  and  accrual  quality  measured  using  the  Dechow  and 
Dichev  (2002)  model. 
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leases  provide  lessors  with  important  advantages  in  bankruptcy.  If  the  lessee  goes  bankrupt,  then 
the  lessee  can  either  accept  or  breach  all  lease  obligations.  If  the  lessee  accepts  the  obligations, 
then  the  lessor  is  entitled  to  continue  receiving  payments  based  on  the  original  lease  agreement. 
The  rental  expenses,  along  with  lessors’  other  claims,  are  classified  as  administrative  expenses  in 
the  bankruptcy  code,  which  must  be  paid  immediately  or  “within  a  reasonable  period.”*"^  In  this 
case,  the  bankruptcy  filing  does  not  change  the  status  of  the  lessor’s  claims.  Alternatively,  if  the 
lessee  breaches  the  lease  obligations,  then  the  lessor  can  immediately  repossess  the  property.  Any 
economic  losses  accompanied  with  this  repossession,  including  unpaid  rents,  late  fees,  or  the  loss 
regarding  the  re-lease  or  sale  of  the  property,  is  an  unsecured  claim  against  the  lessee.  In  contrast, 
other  creditors  are  left  with  no  assurance  that  their  claims  will  be  recovered.’^ 

Sharpe  and  Nguyen  (1995)  argue  that  the  preferential  treatment  of  leases  in  bankruptcy  causes 
financially  constrained  firms  to  prefer  to  lease  assets  rather  than  borrow  to  buy  assets.  Consistent 
with  this  prediction,  they  find  firms  that  are  non-dividend-paying,  lower-rated,  or  cash-poor  are 
more  inclined  to  lease  than  nonfinancially  constrained  firms.  This  suggests  that  firms  with  infor¬ 
mation  asymmetries  due  to  poor  accounting  quality  may  prefer  other  alternatives,  like  leases. 

Banks’  Private  Information  and  Monitoring 

Biddle  and  Hilary  (2006)  suggest  that  there  are  other  mechanisms  that  may  substitute  for 
accounting  quality  in  reducing  information  asymmetry,  thereby  alleviating  financing  restrictions 
and  increasing  investment  efficiency.  In  contrast  to  their  findings  for  the  U.S.,  Biddle  and  Hilary 
(2006)  find  that  investment-cash  flow  sensitivities  do  not  vary  with  accounting  quality  in  Japan. 
They  conjecture  that,  unlike  U.S.  markets,  Japanese  capital  markets  resolve  information  asymme¬ 
try  problems  through  relationship  banking,  such  that  public  information  loses  its  importance  in 
affecting  firm  operations.  Biddle  and  Hilary  (2006)  argue  that  banks’  superior  private  information 
should  serve  as  a  substitute  for  accounting  quality  in  determining  the  sensitivity  of  investment  to 
internally  generated  cash  flows. 

In  addition  to  having  access  to  private  information,  Nini  et  al.  (2009)  suggest  that  banks  often 
include  loan  covenants  setting  an  upper  limit  on  capital  expenditures  and  find  that  this  capital 
expenditure  restriction  provision  successfully  reduces  firms’  capital  expenditures.  They  argue  that 
the  investment-restriction  covenant  mitigates  the  asset  substitution  and  overinvestment  problems. 
Typically,  these  covenants  define  capital  expenditures  as  assets  included  in  the  property,  plant,  and 
equipment  accounts  under  GAAP.  Therefore,  the  covenants  would  restrict  asset  purchases  and 
capital  leases,  but  not  operating  leases,  and  are  thus  likely  to  affect  both  the  use  of  operating  leases 
and  the  importance  of  accounting  information  in  the  leasing  decision. 

The  following  excerpt  from  Georgia  Pacific’s  December  2005  syndicated  loan  agreement  is  an 
illustration  of  a  typical  capital  expenditure  restriction: 


SFAS  No.  13  requires  leases  to  be  classified  as  either  operating  or  capital.  A  lease  meeting  any  of  the  following  four 
criteria  is  classified  as  a  capital  lease:  (1)  ownership  of  the  leased  asset  is  transferred  to  the  lessee  at  lease  maturity,  (2) 
a  bargain  purchase  option  is  available,  (3)  the  lease  term  is  equal  to  75  percent  or  more  of  the  remaining  economic  life 
of  the  leased  asset,  and  (4)  the  present  value  of  the  minimum  lease  payments  equals  or  exceeds  90  percent  of  the  asset’s 
market  value. 

Lessors’  other  claims  might  include  delinquencies,  late  fees,  and  other  damages  suffered. 

This  provision  applies  to  operating  leases,  but  not  capital  leases.  If  the  original  lease  contract  stipulates  that  the  lessor 
does  not  retain  property  ownership  at  lease  maturity,  then  the  lease  is  treated  as  secured  debt. 

This  prediction  is  consistent  with  theoretical  models  such  as  Holthau.sen  and  Verrecchia  (1988). 
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FINANCIAL  COVENANTS 

Maximum  Capital  Expenditures  of  up  to: 

Fiscal  Year  Capital  Expenditures 

2006  $900,000,000 

2007  $1,100,000,000 

2008  $1,100,000,000 

2009  $900,000,000 

2010  and  thereafter  $700,000,000 

The  lesser  of  (a)  100  percent  of  the  unused  amounts  allocated  to  any  year  and  (b)  50 
percent  of  the  maximum  Capital  Expenditures  permitted  in  such  year  shall  be  available  for 
Capital  Expenditures  in  the  following  fiscal  year  only.  In  addition,  the  Capital  Expenditure 
covenant  will  permit  the  amount  permitted,  but  not  used,  for  Restricted  Payments  and 
investments  in  joint  ventures  under  clause  (12)  of  the  definition  of  “Permitted  Investments” 
to  be  used  to  make  additional  Capital  Expenditures.  Capital  Expenditures  shall  exclude, 
whether  such  a  designation  would  be  in  conformity  with  GAAP,  expenditures  made  in 
connection  with  the  replacement  or  restoration  of  property,  to  the  extent  reimbursed  or 
financed  from  insurance  or  condemnation  proceeds. 

CERTAIN  DEFINITIONS 

'"Capital  Expenditures'’’  means,  for  any  period  and  with  respect  to  any  Person,  any  and  all 
expenditures  made  by  the  Parent  or  any  of  its  Subsidiaries  in  such  period  for  assets  added 
to  or  reflected  in  its  property,  plant,  and  equipment  accounts  or  other  similar  capital  asset 
accounts  or  comparable  items  or  any  other  capital  expenditures  that  are,  or  should  be,  set 
forth  as  “additions  to  plant,  property,  and  equipment”  on  the  consolidated  financial  state¬ 
ments  of  the  Parent  prepared  in  accordance  with  GAAP,  whether  such  asset  is  purchased 
for  cash  or  financed  as  an  account  payable  or  by  the  incurrence  of  Indebtedness,  accrued  as 
a  liability  or  otherwise. 


III.  HYPOTHESIS  DEVELOPMENT 

Two  streams  of  research  consider  how  firms  mitigate  the  effects  of  financing  constraints  on 
investment.  The  first  suggests  that  firms  can  reduce  information  asymmetry  and  financing  con¬ 
straints  by  committing  to  better  accounting  quality.  The  second  finds  that  firms  with  larger  financ¬ 
ing  constraints  lease  more  of  their  assets.  Together,  these  two  streams  of  research  lead  us  to  predict 
that  firms  with  worse  accounting  quality  are  more  likely  to  obtain  their  assets  through  operating 
leases. 

HI:  A  firm’s  propensity  to  lease- versus-buy  assets  is  declining  in  its  accounting  quality. 

An  alternative  explanation  for  a  negative  association  between  a  firm’s  accounting  quality  and 
its  propensity  to  enter  into  operating  leases  is  that  firms  with  poor  accounting  quality  prefer 
off-balance-sheet  financing.  Under  this  alternative,  we  would  expect  the  relationship  between 
leasing  and  accounting  quality  to  be  unaffected  by  other  mechanisms  that  reduce  information 
asymmetry. 

One  mechanism  that  is  likely  to  affect  the  extent  of  the  firm’s  information  asymmetry  problem 
is  whether  the  outside  suppliers  of  capital,  such  as  banks,  have  incentives  to  obtain  information 
through  private  channels  and  thus  reduce  information  asymmetry.  If  better  accounting  quality 
reduces  information  asymmetry  and  thus  reduces  a  firm’s  leasing  propensity,  then  other  mecha- 
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nisms  that  reduce  information  asymmetry  like  the  lenders’  incentives  to  access  private  information 
and  monitor  the  borrower  are  likely  to  affect  the  importance  of  accounting  quality  on  the  lease- 
versus-buy  decision,  motivating  our  second  hypothesis. 

H2:  The  importance  of  accounting  quality  in  the  lease-versus-buy  decision  decreases  with 
banks’  incentives  to  perform  due  diligence. 

Capital  expenditure  covenants  are  another  mechanism  that  lenders  use  to  reduce  agency 

costs.  If  these  covenants  are  effective  at  reducing  agency  costs  and  financing  constraints,  then 

the  importance  of  accounting  quality  in  the  lease-versus-buy  decision  will  also  be  reduced  by 

capital  expenditure  covenants.  In  other  words,  accounting  quality  affects  the  lease-versus-buy 

decision  (i.e.,  low-quality  accounting  limits  or  makes  it  more  costly  for  firms  to  engage  in  capital 

spending  so  they  turn  to  leasing).  However,  capital  expenditure  covenants,  by  their  very  nature, 

also  limit  the  ability  of  firms  to  engage  in  capital  spending.  Therefore,  when  a  covenant  is  already 

in  place,  the  incremental  impact  of  accounting  quality  on  the  lease-versus-buy  decision  is  reduced. 

For  example,  a  firm  with  a  covenant  in  place  could  have  to  opt  for  leasing,  even  though  it  has 

18 

high-quality  accounting. 

Similar  to  our  monitoring  hypothesis,  the  negative  association  between  a  firm’s  accounting 
quality  and  their  propensity  to  enter  into  operating  leases  could  be  due  to  firms  with  poor  account¬ 
ing  quality  preferring  off-balance-sheet  financing.  Under  this  alternative  hypothesis,  we  would 
expect  the  relationship  between  leasing  and  accounting  quality  to  be  unaffected  by  the  use  of 
capital  investment  restrictions.  We  test  this  reasoning  in  our  third  hypothesis. 

H3;  The  importance  of  accounting  quality  in  the  lease-versus-buy  decision  is  lower  in  the 
presence  of  capital  expenditure  restriction  provisions. 

IV.  SAMPLE  SELECTION 

We  use  three  different  samples  to  test  our  hypotheses.  The  first  is  a  comprehensive  sample  of 
manufacturing  firms  included  on  Compustat.  Our  second  and  third  samples  are  subsamples  of  the 
first  sample.  The  first  subsample  is  restricted  to  firms  with  data  on  lead  lenders  contained  on  the 
LPC  database.  The  second  subsample  requires  data  on  capital  expenditure  covenants  on  the  SDC 
database. 

Comprehensive  Sample 

We  test  our  first  hypothesis  on  the  role  of  accounting  quality  in  the  leasing  decision  using  a 
comprehensive  sample  of  Compustat  manufacturing  firms,  identifying  3,033  firms  in  SIC  codes 
(2000-3999)  during  the  period  1995-2006  with  the  necessary  data. 

Subsamples  with  Lead  Lender  and  Capital  Expenditure  Covenant  Data 

We  use  a  subsample  of  firms  that  borrow  in  the  syndicated  loan  market  to  probe  the  cross- 
sectional  variation  in  the  effect  of  accounting  quality  on  leasing  activities  and  to  ensure  that  our 
results  are  not  an  artifact  of  a  desire  for  off-balance-sheet  accounting.  We  focus  on  the  syndicated 
loan  market  for  two  reasons.  First,  the  variation  in  bank  loan  ownership  provides  cross-sectional 
variation  in  lenders’  private  information  and  monitoring  incentives.  As  we  discuss  above,  the 


We  focus  on  this  covenant  rather  than  other  financial  covenants  for  two  reasons.  First,  this  covenant  restricting  asset 
purchases  impacts  the  lease-versus-buy  decision  on  which  we  focus.  Second,  this  covenant  is  not  based  on  any  financial 
numbers,  so  it  is  unaffected  by  the  method  of  accounting  for  operating  leases. 

This  argument  might  suggest  that  this  covenant  should  not  be  required  for  high-accounting  quality  firms.  In  our 
investment  covenant  model  (see  Table  7)  we  find  that  beyond  accounting  quality  there  are  many  other  important 
determinants  of  these  covenants  suggesting  high  accounting  quality  firms  might  have  these  covenants. 
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syndicate  structure  provides  loan  owners  varying  incentives  to  incur  costs  to  collect  private  infor¬ 
mation  and  to  monitor  management  (Sufi  2007).  Therefore,  lead  arrangers’  ownership  is  an  estab¬ 
lished  proxy  for  the  private  information  acquired  and  the  quality  of  monitoring  imposed  by  banks. 

The  second  reason  we  focus  on  this  market  rests  on  the  use  of  debt  covenants  in  this  market. 
Since  private  lenders  have  informational  and  monitoring  advantage  over  public  debtholders  in 
writing  and  enforcing  debt  covenants  that  mitigate  the  agency  problems  of  debt  (e.g.,  Fama  1985; 
Diamond  1984),  bank  loans  provide  cross-sectional  variation  in  the  use  of  debt  covenants  that  is 
rare  in  corporate  bonds.  We  exploit  the  variation  of  the  use  of  the  covenant  restricting  capital 
expenditures  to  investigate  the  effect  of  direct  monitoring  by  banks  on  the  lease-versus-buy  deci¬ 
sion. 

To  test  the  relation  between  accounting  quality  and  banks’  private  information  and  monitoring 
quality,  we  use  a  sample  of  997  manufacturing  firms  that  are  covered  by  the  LPC  database  and 
have  loan  ownership  data.  Consistent  with  prior  research,  to  test  our  third  hypothesis  we  use  a 
sample  of  firms  listed  in  the  SDC  syndicated  loan  database  to  identify  firms  that  have  investment- 
restriction  covenants.  We  identified  865  firms  that  are  listed  in  the  SDC  database. 


V.  RESEARCH  DESIGN  AND  PROXIES 


The  basic  model  that  we  use  to  test  all  three  of  our  hypotheses  is  the  same,  although  the 
samples  studied  and  the  estimation  techniques  used  to  estimate  the  models  differ  across  our  tests. 
To  test  our  hypotheses,  we  estimate  the  following  model: 

Lease  =  /3o  +  /3i  *  AccQual  +  ^2  *  Nodiv  +  ^3  *  Unrated  +  ;S4  *  Rating  +  ^5  *  Size 


(1) 


+  /3(,  *  Loss  +  yS?  *  MTR  +  s 


where: 


Lease  =  the  average  of  the  capitalized  lease  expenditure  (lagged  Compustat  #96  multiplied 
by  10)  over  the  sum  of  PP&E  and  capitalized  lease  expenditure  (Compustat  #8 
+  lagged  Compustat  #96  X  10); 

AccQual  =  the  first  principal  component  of  three  accrual-based  accounting  quality  metrics 
based  on  Bharath  et  al.  (2008;  described  in  the  Appendix); 

Nodiv  =  an  indicator  variable  equal  to  1  if  the  firm  never  pays  dividends  (Compustat  #21) 
during  the  sample  period,  and  0  otherwise; 

Unrated  =  an  indicator  variable  equal  to  1  if  the  firm  is  never  rated  by  the  S&P  (Compustat 
#280)  during  the  sample  period,  and  0  otherwise; 

Rating  =  the  average  of  S&P  ratings  when  the  finn  is  rated  by  the  S&P;  following  Sharpe 
and  Nguyen  (1995),  the  lating  is  coded  1  if  the  S&P  rating  is  between  AAA  and  A- 
,  2  if  the  rating  is  between  A+  and  A-,  3  if  the  rating  is  between  BBB+  and 
BBB-,  and  4  if  the  rating  is  between  BB+  and  D; 

Size  =  the  decile  ranking  of  average  of  the  natural  log  of  sales  (Compustat  #12); 

Loss  =  the  average  of  the  indicator  variable  for  loss  year,  where  the  indicator  variable 
equals  1  when  the  Compustat  #123  is  less  than  0;  and 
MTR  =  the  average  of  the  marginal  tax  rates,  based  on  John  Graham’s  simulation 
technique,  as  reported  on  his  website.^° 


All  OLS  t-statistics  are  based  on  standard  errors  clustered  at  the  three-digit  industry  level. 

To  see  a  summary  of  the  approach  he  uses  to  obtain  marginal  tax  rates,  and  request  access  to  the  underlying  data,  please 
visit  the  following  website:  http://faculty.fuqua.duke.edu/~jgraham/. 
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We  capitalize  the  minimum  lease  payment  assuming  a  cost  of  capital  of  10  percent  and  that 

the  lease  payments  are  a  perpetuity.^'  As  such,  the  measure  will  be  unaffected  by  the  differences 

22 

m  accounting  treatment  for  these  two  investment  strategies. 

Accounting  quality  has  numerous  dimensions  (Dechow  et  al.  2009).  The  literature  arguing 
that  accounting  quality  reduces  investment  frictions  focuses  on  accounting  quality’s  ability  to 
reduce  information  asymmetries  (Biddle  et  al.  2009).  This  literature  is  not  concerned  with  whether 
accounting  quality  reflects  lower  earnings  management,  less  business  uncertainty,  more  stable  cash 
flows,  or  some  other  factor  that  reduces  information  asymmetry.  Similar  to  the  prior  research,  we 
do  not  distinguish  among  these  sources  of  accounting  quality.  Instead  we  follow  Bharath  et  al. 
(2008),  using  a  principal-component  analysis  to  isolate  the  common  component  of  three  common 
firm  level  measures  of  accounting  quality  {AccQual).  The  accruals  measures  are  based  on  Teoh  et 
al.  (1998),  Dechow  et  al.  (1995),  and  Dechow  and  Dichev  (2002).  The  Appendix  includes  a 
description  of  our  implementation  of  the  Bharath  et  al.  (2008)  measure. 

Our  definition  of  variables  differs  from  theirs  slightly  because  we  modify  the  deflator  used  in 
constructing  these  variables  to  accommodate  differences  in  accounting  for  operating  leases  versus 
asset  purchase.  The  total  asset  deflator  used  in  prior  research  would  be  affected  by  the  off-balance- 
sheet  accounting  treatment  for  operating  leases.  Instead,  we  use  the  sum  of  PP&E  and  capitalized 
minimum  lease  payment,  which  is  the  same  deflator  used  in  calculating  our  dependent  variable 
(see  Christie  1987).^^ 

The  lease-versus-buy  decision  is  likely  to  be  affected  by  a  variety  of  other  factors  (see  Smith 
and  Wakeman  1985).  Taxes  are  one  of  the  primary  motivations  for  leasing.  Firms  that  are  not 
paying  taxes  (or  have  low  marginal  tax  rates)  prefer  to  lease  as  opposed  to  buy,  as  leasing  allows 
firms  in  a  higher  tax  bracket  to  take  more  advantage  of  the  depreciation  deduction. 

Leasing  is  also  likely  to  be  related  to  whether  the  firm  faces  financing  constraints.  Following 
Sharpe  and  Nguyen  (1995)  and  Fisfeldt  and  Rampini  (2009),  we  include  three  variables  to  capture 
the  financing  constraints:  Nodiv,  Unrated,  and  Rating.  Non-dividend-paying  firms  are  expected  to 
have  more  financing  constraints  and,  therefore,  are  more  likely  to  lease.  Unrated  firms  are  also 
expected  to  have  higher  leasing  propensity.  We  expect  that  worse-rated  firms  have  greater  diffi¬ 
culties  in  raising  capital,  so  this  variable  should  be  positively  correlated  with  Lease. 

We  also  include  Size  and  Loss  as  control  variables.  Larger  firms  are  likely  to  be  less  finan¬ 
cially  constrained  and,  therefore.  Size  is  expected  to  have  a  negative  coefficient.  Finally,  we 
assume  loss  firms  have  a  lower  marginal  tax  rate  and  are  thus  unable  to  take  full  advantage  of  the 
tax  benefits  of  ownership  of  equipment.  Therefore,  we  expect  Loss  to  be  positively  correlated  with 
lease  activity. 

It  is  important  to  note  that  we  average  all  variables  across  the  sample  period,  thereby  treating 
each  firm  as  one  observation  rather  than  using  panel  data.  There  are  several  justifications  for  this 
research  design  choice.  First,  our  primary  variable  of  interest  is  a  stock  measure  of  the  leasing 
propensity.  This  measure,  as  well  as  our  financial  reporting  quality  measure,  is  highly  correlated 
through  time.  Averaging  our  variables  reduces  the  independence  problems  that  arise  from  the 


While  parsimonious,  we  recognize  that  this  approach  has  drawbacks  because  the  exact  discount  rate  and  payment  timing 
is  not  known.  To  address  these  concerns,  we  also  construct  the  variable  discounting  all  future  lease  payments  disclosed 
in  the  lease  footnotes  (five  years  of  payments  and  the  thereafter  piece),  and  varying  the  discount  rate  (between  8  and  12 
percent)  and  the  results  of  our  main  tests  are  consistent  with  those  reported.  Thus,  our  capitalization  assumptions  do  not 
appear  to  be  driving  our  results. 

In  our  study,  leases  represent  operating  leases  and  therefore  use  of  Compustat  #96  to  measure  lease  activities  should  be 
appropriate.  Since  we  classify  capital  leases  as  borrow-and-buy  and  since  Compustat  #8  includes  capital  lease  expen- 
22  ditures,  our  choice  of  this  variable  should  be  comprehensive. 

We  also  use  total  assets,  total  assets  excluding  property,  plant,  and  equipment  (Compustat  #8)  and  sales  (Compustat  #12) 
as  deflators,  the  results  in  the  main  analysis  continue  to  hold. 
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serial  correlation.  Although  this  approach  only  captures  the  between-firms  effect,  it  eliminates 
within-firm  correlation  issues.  We  investigate  the  sensitivity  of  our  results  to  this  research  design 
choice  using  a  variety  of  analyses  that  are  discussed  in  depth  in  the  sensitivity  analysis  section.  In 
particular,  we  replace  the  stock  measure  of  leasing  with  a  flow  measure,  and  replace  our  account¬ 
ing  quality  measure  with  a  measure  using  three  years  of  data  prior  to  the  leasing  decision,  finding 
similar  results. 

Comprehensive  Sample 

To  test  HI,  we  estimate  Equation  (1)  using  OLS  estimation  for  our  comprehensive  Compustat 
sample  of  manufacturing  firms  averaged  over  1995-2006.  If  accounting  quality  and  leasing  are 
two  alternative  mechanisms  for  dealing  with  financing  constraints,  then  we  expect  better  account¬ 
ing  quality  to  be  associated  with  a  lower  propensity  to  lease  (i.e.,  /3i  should  be  negative). 

Analysis  of  Due  Diligence  Efforts 

To  test  H2,  we  follow  Bharath  et  al.  (2008)  and  estimate  an  endogenous  switching  model  (Lee 
1978)  for  our  subsample  of  997  manufacturing  firms  that  are  covered  by  the  LPC  database  and 
have  loan  ownership  data.  The  endogenous  switching  model  allows  us  to  control  for  the  endoge¬ 
neity  of  the  syndicate  participants’  choice  to  require  a  larger  lead  arranger  ownership  position  in 
the  loan  to  enhance  their  due  diligence  and  monitoring  efforts  when  the  borrower  has  more  serious 
information  asymmetries  and  higher  credit  risk  (Sufi  2007). 

More  specifically,  the  endogenous  switching  model  has  a  “selection”  equation  modeling  the 
syndication’s  decision  to  require  lead  lenders  to  hold  a  significant  ownership  position  and  two 
second-stage  “regime”  equations  modeling  the  determinants  of  the  firm’s  leasing  propensity  (one 
for  firms  where  the  lead  lender  holds  a  significant  position,  and  another  for  firms  where  the  lead 
lender  holds  a  relatively  small  amount  of  the  loan).  The  three  regressions  are  estimated  simulta¬ 
neously  using  full  information  maximum  likelihood.  As  discussed  in  Bharath  et  al.  (2008),  this 
approach  controls  for  the  self-selection  associated  with  the  decision  by  the  syndicate  to  require  the 
lead  arranger  to  hold  a  relatively  larger  proportion  of  the  loan  and  allows  us  to  compare  the  effects 
of  accounting  quality  on  the  firm’s  leasing  propensity  across  the  two  “regimes.”  It  also  has  the 
benefit  of  not  requiring  the  coefficients  on  the  determinants  to  lease  to  be  the  same  across  the  two 
different  regimes. 

The  first-stage  probit  model  of  the  proportion  of  the  loan  held  by  the  lead  bank  is  as  follows; 

Lead  Rank  =  /Sq  +  ySi  *  AccQual  +  1^2*  Nodiv  +  /3^*  Unrated  -i-  /34  *  Rating  +  Size 

+  fte*  F  fS-j*  MTR  +  (3^*  Loan  -H  yS9  *  Maturity  -i-  e .  (2) 

Most  of  the  variables  in  Equation  (2)  are  also  included  in  Equation  (1)  and  are  defined  above. 
To  capture  lead  banks’  incentives  to  acquire  private  information  and  monitor  management  we 
develop  the  measure  Lead  Rank.  This  variable  is  an  indicator  variable  equal  to  1  if  the  average  of 
the  lead  arrangers’  ownership  of  the  firm’s  bank  loans  exceeds  the  median  of  all  sample  firms,  and 
0  otherwise. We  also  include  two  exogenous  variables  (exogenous  in  the  sense  that  they  are 
likely  to  be  associated  with  the  syndicate’s  decision  to  require  the  lead  lender  to  hold  a  significant 
proportion  of  the  loan,  but  not  associated  with  the  firm’s  leasing  propensity).  The  first  exogenous 
variable  is  the  size  of  the  deal  {Loan)  and  the  second  exogenous  variable  is  the  maturity  of  the  debt 


Since  firms  may  have  more  than  one  loan  during  the  sample  period,  lead  arranger  ownership  and  other  loan  character¬ 
istics  are  weighted  (by  deal  amount)  and  averaged  over  all  bank  loans  issued  during  the  sample  period.  Using  equally 
weighted  values  does  not  change  the  tenor  of  the  results.  We  examine  the  sensitivity  of  our  results  to  the  median  cut-off 
in  the  robustness  section. 
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{Maturity).  The  choice  of  these  two  instruments  is  motivated  by  Sufi  (2007).  Larger  loans  and 
loans  with  longer  maturities  are  likely  to  be  made  to  borrowers  with  better  credit  quality,  reducing 
the  importance  of  the  monitoring  provided  by  the  lead  lender.  This  suggests  a  negative  relationship 
between  loan  size,  maturity,  and  the  extent  of  the  lead  lender’s  holdings.  Alternatively,  larger  loans 
and  loans  with  longer  maturities  are  likely  to  require  more  monitoring,  suggesting  that  syndicate 
members  might  require  the  lead  lender  to  hold  larger  shares  of  these  loans,  suggesting  a  positive 
relation.  Thus,  we  do  not  predict  a  signed  relationship  between  the  lead  lender’s  share  of  the  loan, 
maturity,  and  the  size  of  the  loan. 

We  use  maximum  likelihood  estimation  for  our  two  “regime”  regressions.  If  debtholder  moni¬ 
toring  is  a  potential  substitute  for  accounting  quality  in  solving  information  problems,  then,  in  the 
regime  regressions,  we  expect  the  effect  of  accounting  quality  {AccQual)  on  the  leasing  propensity 
to  be  relatively  larger  for  firms  where  the  lead  lender  holds  a  relatively  smaller  proportion  of  the 
loan.  We  test  this  using  a  Chi-squared  test  of  the  equality  of  the  coefficient  on  accounting  quality 
across  the  two  regime  regressions. 

Analysis  of  Capital  Expenditure  Covenant 

Similar  to  our  tests  of  H2,  we  again  employ  an  endogenous  switching  model  to  test  H3,  which 
predicts  that  the  effects  of  investment  restrictions  on  the  importance  of  accounting  quality  on  the 
firm’s  leasing  propensity.  The  selection  model  we  employ  in  this  analysis  is  as  follows; 

Inv_Cov  =  Pq  +  /3i*  AccQual  +  P2*  Nodiv  +  /3^*  Unrated  +  Pn*  Rating  +  *  Size 

+  Loss  +  ^67  *  MTR  -t  /Sg  *  Loan  -t-  /Jg  *  Maturity  +  e .  (3) 

Most  of  the  variables  in  Equation  (3)  are  also  included  in  Equation  (1)  and  are  defined  above. 
InvjCov  is  an  indicator  variable  that  equals  1  if  the  firm  ever  includes  a  capital  expenditure 
restriction  provision  in  its  bank  loans  during  the  sample  period.  The  two  “regime”  regressions  are 
the  same  as  Equation  (1),  except  that  the  partition  is  based  on  whether  the  firm’s  debt  has  an 
investment  restriction.  If  this  covenant  is  useful  in  reducing  the  asset  substitution  or  overinvest¬ 
ment  problem,  then  we  expect  the  effect  of  accounting  quality  (AccQual)  on  the  lease- versus-buy 
decision  to  be  reduced  in  the  presence  of  this  restrictive  covenant. 

VI.  RESULTS 

Descriptive  Statistics 

Table  1  provides  descriptive  statistics  for  our  comprehensive  sample  partitioned  by  whether 
firms  are  relatively  more  likely  to  lease  assets  (the  high  lease  group)  or  less  likely  to  lease  assets 
(the  low  lease  group).  The  partition  is  based  on  the  median  lease  propensity  in  the  sample.  By 
construction,  the  high  lease  group  has  a  higher  leasing  propensity  than  the  low  lease  group.  The 
high  lease  group  also  has  lower  accounting  quality  than  the  low  lease  group.  They  are  also  less 
likely  to  pay  dividends,  less  likely  to  be  rated,  and  if  they  are  rated,  then  they  are  likely  to  have  a 
poor  credit  rating,  suggesting  that  firms  that  lease  are  more  likely  to  face  financing  constraints. 
Finally,  firms  with  a  higher  propensity  to  lease  have  more  losses,  and  have  lower  marginal  tax 
rates. 

Pearson  correlations  are  reported  in  Table  2.  The  variables  in  this  table  are  highly  correlated, 
in  a  statistical  sense,  with  p-values  less  than  1  percent.  Consistent  with  HI,  AccQual  and  Lease  are 
negatively  correlated.  Also  consistent  with  prior  discussions,  firms  with  financing  constraints  are 
more  likely  to  lease  than  to  purchase  assets.  In  addition.  Size  and  most  variables  (Nodiv,  Unrated, 
Rating,  and  Loss)  are  highly  correlated  (over  50  percent).  Loss  and  MTR  are  also  highly  correlated. 

Table  3  reports  descriptive  statistics  for  the  subsample  of  firms  with  syndicated  loan  data.  In 
the  second  and  third  columns  of  Table  3,  we  partition  the  sample  firms  by  the  level  of  lead 
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TABLE  1 

Descriptive  Statistics  (Means  and  Medians)  for  Sample  Firms  Partitioned  by  Their 
Leasing  Propensity  for  the  Period  1995-2006 


High  Lease  Low  Lease 


Mean 

Median 

(t-stat  for  High  Lease- 

(Wilcoxon  z-stat  for  High 

Mean 

Median 

Low  Lease) 

Lease-Low  Lease) 

Lease 

0.650 

0.652 

0.223 

0.223 

(94.54)*** 

(47.69)*** 

AccQual 

-0.147 

0.079 

0.217 

0.348 

(-18.11)*** 

(-22.63)*** 

Nodiv 

0.822 

1.000 

0.464 

0.000 

(22.15)*** 

(20.55)*** 

Unrated 

0.880 

1.000 

0.584 

1.000 

(19.50)*** 

(18.38)*** 

Rating^ 

3.604 

4.000 

3.134 

3.250 

(6.53)*** 

(6.67)*** 

Size 

3.194 

3.000 

5.807 

6.000 

(-28.14)*** 

(-25.06)*** 

Loss 

0.580 

0.600 

0.282 

0.200 

(25.73)*** 

(23.23)*** 

MTR 

0.108 

0.054 

0.198 

0.214 

(-20.58)*** 

(-20.20)*** 

n 

1,517 

1,517 

1,516 

1,516 

*,  **,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively.  High  Lease  (Low  Lease)  include 

firms  with  Lease  above  (below)  the  median. 

“  The  mean  (median)  values  of  this  variable  only  apply  to  firms  rated  by  the  S&P. 

Variable  Definitions; 

Lease  =  average  of  the  capitalized  lease  expenditure  (lagged  Compustat  #96  X  10)  over  the  sum  of  PP&E  and 
capitalized  lease  expenditure  (Compustat  #8  +  lagged  Compustat  #96  X  10); 

AccQuaF  average  of  the  first  principal  component  of  three  accrual-based  accounting  quality  metrics  based  on  Bharath 
et  al.  (2008,  described  in  the  Appendix); 

Nodiv  =  indicator  variable  equal  to  1  if  the  firm  never  pays  dividend  (Compustat  #21)  during  the  sample  period,  and 
0  otherwise; 

Unrated=  indicator  variable  equal  to  1  if  the  firm  is  never  rated  by  the  S&P  (Compustat  #280)  during  the  sample 
period,  and  0  otherwise; 

Rating  =  average  of  S&P  ratings  when  the  firm  is  rated  by  the  S&P.  Following  Sharpe  and  Nguyen  (1995),  the  rating 
is  coded  1  if  the  S&P  rating  is  between  AAA  and  A-,  2  if  the  rating  is  between  A-l-  and  A-,  3  if  the 
rating  is  between  BBB-I-  and  BBB-,  and  4  if  the  rating  is  between  BB-f  and  D; 

Size  =  decile  ranking  of  average  of  the  natural  log  of  sales  (Compustat  #12); 

Loss  =  average  of  the  indicator  variable  for  loss  year,  where  the  indicator  variable  equals  1  when  the  Compustat 
#123  is  less  than  0;  and 

MTR  =  average  of  the  marginal  tax  rates,  based  on  John  Graham’s  simulation  technique,  as  reported  on  his  website. 


arranger  ownership  for  the  LPC  subsample.  Firms  with  better  private  information  and  monitoring 
quality  tend  to  be  smaller,  have  worse  accounting  quality,  and  face  more  financial  constraints. 
These  firms  are  also  characterized  by  a  smaller  debt  size  and  shorter  maturities.  The  fact  that  these 
firms  have  a  greater  propensity  to  lease  is  consistent  with  Sufi’s  (2007)  finding  that  a  high  lead 
arranger  ownership  is  associated  with  high  information  asymmetries  and  financing  constraints. 
This  observation  suggests  that  it  is  important  to  control  for  endogeneity  before  drawing  any 
inferences. 
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TABLE  2 

Pearson  Correlation  Coefficients  (p-values)  for  Lease  Model  Variables  during  the  Period 

1995-2006 


Variables  Lease  AccQual 

Lease  1  —0.351 

(0.001) 

AccQual  1 

Nodiv 

Unrated 

Rating 

Size 

Loss 

MTR 


Nodiv 

Unrated 

Rating 

0.399 

0.369 

0.298 

(0.001) 

(0.001) 

(0.001) 

-0.301 

-0.233 

-0.191 

(0.001) 

(0.001) 

(0.001) 

1 

0.363 

0.482 

(0.001) 

(0.001) 

1 

NA 

1 


Size 

Loss 

MTR 

-0.532 

0.471 

-0.387 

(0.001) 

(0.001) 

(0.001) 

0.398 

-0.398 

0.318 

(0.001) 

(0.001) 

(0.001) 

-0.552 

0.501 

-0.417 

(0.001) 

(0.001) 

(0.001) 

-0.658 

0.312 

-0.192 

(0.001) 

(0.001) 

(0.001) 

-0.568 

0.495 

-0.309 

(0.001) 

(0.001) 

(0.001) 

1 

-0.644 

0.492 

(0.001) 

(0.001) 

1 

-0.695 

(0.001) 

1 


Correlations  with  Rating  only  apply  to  firms  that  are  rated  by  the  S&P. 
See  Table  1  for  variable  definitions. 


The  fourth  and  fifth  columns  in  Table  3  partition  the  SDC  subsample  firms  by  whether  the 
firm’s  bank  loans  contain  the  capital  expenditure  restriction.  Firms  that  use  this  restrictive  cov¬ 
enant  tend  to  be  smaller,  lease  less,  face  more  financing  constraints,  and  have  lower  performance. 
These  firms  also  have  smaller  loans  with  longer  maturities. 

Multivariate  Results 

Panel  A  of  Table  4  provides  the  results  of  the  OLS  estimation  of  Equation  (1)  for  the  com¬ 
prehensive  sample  of  manufacturing  firms.  Consistent  with  HI,  we  find  that  firms  with  worse 
accounting  quality  (i.e.,  low  AccQual)  have  a  greater  propensity  to  lease  assets.  This  result  is  both 
statistically  and  economically  significant.  For  example,  a  standard  deviation  change  in  AccQual 
(standard  deviation  =  0.583)  is  associated  with  a  7.1  percent  change  in  the  mean  Lease  (mean  = 
0.436).^^ 

As  for  control  variables,  results  on  Nodiv,  Unrated,  Size,  Loss,  and  MTR  are  consistent  with 
the  reasoning  that  firms  with  more  financial  constraints  have  a  greater  propensity  to  lease  assets. 
For  comparative  purposes,  we  repeat  the  analysis  for  the  comprehensive  sample  on  our  lead  lender 
subsample  and  report  the  results  for  this  subsample  in  Panel  B  of  Table  4.  The  results  on  both  our 
test  and  control  variables  are  quite  similar  across  these  two  samples,  suggesting  that  any  differ¬ 
ences  between  our  comprehensive  sample  of  Compustat  manufacturing  firms  and  the  subsample  of 
these  firms  with  syndicated  loans  do  not  affect  our  lease-versus-buy  findings. 


While  not  reported,  we  find  that  the  VIF  of  our  accounting  quality  measure  is  1.25,  so  our  finding  that  financial  reporting 
quality  influences  leasing  activities  is  not  driven  by  multi-collinearity. 
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TABLE  3 

Descriptive  Statistics  (means)  for  Sample  Firms  Partitioned  on  Whether  the  Firm  Uses  a 
Capital  Expenditure  Restriction  in  Bank  Loans  and  the  Lead  Lenders’  Ownership  of 

Bank  Loans  for  the  Period  1995-2006 


High  Lead 
Arrangers’ 
Ownership 

Low  Ownership 
(t-stat  for 
difference) 

With  Capital 
Expenditure 
Restrictions 

Without 
Restrictions 
(t-stat  for 
difference) 

Lease 

0.458 

0.316 

(10.58)*** 

0.371 

0.325 

(3.40)*** 

AccQual 

0.071 

0.302 

(_9  94)  *** 

0.223 

0.249 

(-1.02) 

Nodiv 

0.728 

0.315 

(14.29)*** 

0.556 

0.391 

(4.82)*** 

Unrated 

0.888 

0.359 

(20.52)  *** 

0.497 

0.493 

(0.13) 

Rating^ 

3.802 

3.097 

(6.19)*** 

3.792 

3.002 

(12.17)*** 

Size 

3.814 

7.424 

(-32.14)*** 

6.158 

6.632 

(-3.75)*** 

Loss 

0.423 

0.183 

(14.00)*** 

0.319 

0.226 

(5.16)*** 

MTR 

0.157 

0.213 

(-7.45)*** 

0.185 

0.208 

(-2.90)*** 

Lead  Own 

89.624 

24.218 
(69.44)  *** 

NA 

NA 

Loan 

16.364 

19.433 

(-41.35)*** 

18.091 

18.326 

(-2.13)** 

Maturity 

3.367 

3.869 

(-12.80)*** 

3.770 

3.591 

(5.16)*** 

n 

499 

498 

374 

491 

**,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

^  The  mean  values  of  this  variable  only  apply  to  firms  that  are  rated  by  the  S&P. 

Variable  Definitions; 

Lead  Own  =  weighted  average  of  the  lead  arrangers’  ownership  of  the  bank  loan  issued  in  the  sample  period; 
Loan  =  natural  of  log  of  the  amount  of  bank  loan  issued  in  the  sample  period;  and 
Maturity  =  natural  of  log  of  the  maturity  (in  months)  of  bank  loan  issued  in  the  sample  period. 

See  Table  1  for  definitions  of  other  variables. 


We  present  the  results  on  our  tests  of  the  effect  of  banks’  due  diligence  incentives  on  the 
lease-versus-buy  decision  in  Table  5.  In  the  first  column  of  results,  we  report  the  determinants  of 
the  leasing  propensity  for  firms  where  the  lead  lender  holds  a  relatively  larger  proportion  of  the 
loan  (i.e.,  Lead  Rank  =  1).  In  the  second  column  of  results,  we  report  the  determinants  of  the 
leasing  propensity  for  firms  where  the  lead  lender  holds  a  relatively  smaller  proportion  of  the  loan 
(i.e..  Lead  Rank  =  0). 

Focusing  on  the  determinants  to  lease  for  firms  where  the  lead  lender  has  a  stronger  incentive 
to  monitor,  we  find  that  smaller  firms,  firms  that  suffer  a  loss,  and  firms  that  are  not  paying 
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TABLE  4 


Coefficients  (and  t-statistics)  from  OLS  Estimations  of  the  Determinants  of  the  Extent  of 

Firms’  Expenditures  on  Operating  Leases 


Panel  A:  Complete  Sample 
Variables 

Predicted  Sign 

Coefficients 

t-stats 

Intercept 

+/- 

0.442 

10.18*** 

AccQual 

- 

-0.053 

—4.00*** 

Nodiv 

-1- 

0.048 

3.35*** 

Unrated 

+ 

0.057 

1.67* 

Rating 

-H 

0.003 

0.34 

Size 

- 

-0.022 

-7.26*** 

Loss 

-b 

0.094 

3.75*** 

MTR 

- 

-0.143 

—2  88*** 

n 

3,033 

R2 

0.3382 

Panel  B:  Syndicated  Loan  Sample 

Variables 

Predicted  Sign 

Coefficients 

t-stats 

Intercept 

+/- 

0.467 

7.37*** 

AccQual 

- 

-0.074 

-3.32*** 

Nodiv 

-t- 

0.036 

2.21** 

Unrated 

+ 

0.028 

0.73 

Rating 

•f 

0.003 

0.32 

Size 

- 

-0.021 

—4  32*** 

Loss 

+ 

0.085 

2.66*** 

MTR 

- 

-0.102 

-1.86* 

n 

1,314 

R2 

0.2414 

*,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

All  t-statistics  are  based  on  clustering  at  the  three-digit  industry  level. 
See  Table  1  for  variable  definitions. 


dividends  are  more  likely  to  lease.  These  results  suggest  that  firms  facing  financing  constraints  are 
more  likely  to  lease,  consistent  with  Eisfeldt  and  Rampini  (2009)  and  Sharpe  and  Nguyen  (1995). 
We  also  find  that  AccQual  has  a  significant  negative  coefficient,  indicating  that  worse  accounting 
quality  is  associated  with  a  greater  leasing  propensity  for  firms  with  a  high  lead  arranger  owner¬ 
ship  in  bank  loans. 

For  firms  where  the  lead  lender  has  less  monitoring  incentives,  we  again  find  that  AccQual 
has  a  significant  negative  coefficient.  Consistent  with  the  results  reported  in  the  first  column,  lower 
accounting  quality  is  associated  with  a  greater  leasing  propensity.  We  conduct  a  Chi-squared  test 
on  the  coefficient  on  accounting  quality  across  these  two  regressions  to  test  H2.  We  find  that  the 
coefficient  on  accounting  quality  is  statistically  larger  for  firms  that  have  lenders  with  fewer 
monitoring  incentives.  Thus,  when  information  asymmetry  problems  are  likely  to  be  larger,  ac¬ 
counting  quality  plays  a  relatively  more  important  role  in  the  leasing  decision. 
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TABLE  5 


Coefficients  (and  t-statistics)  from  Endogenous  Switching  Models  Estimating  How 
Monitoring  by  Banks,  Measured  by  Lead  Lenders’  Ownership,  Affects  the  Effect  of 
Accounting  Quality  on  Lease  Expenditures 


Variables 

Predicted  Sign 

Lead  Rank  =  1 

Lead  Rank  =  0 

Coefficients 

(z-stats) 

Coefficients 

(z-stats) 

Intercept 

+/- 

0.455 

0.402 

(1.75)* 

(4.78)*** 

AccQual 

- 

-0.053 

-0.211 

(-2.22)** 

(-6.48)*** 

Nodiv 

•f 

0.073 

0.043 

(3.11)*** 

(2.26)** 

Unrated 

-b 

0.044 

0.063 

(0.17) 

(1.29) 

Rating 

-b 

0.006 

0.011 

(0.10) 

(0.83) 

Size 

- 

-0.020 

-0.011 

(-2.73)*** 

(-1.45) 

Loss 

-b 

0.090 

0.053 

(1.98)** 

(1.24) 

MTR 

— 

-0.253 

-0.015 

(-2.26)** 

(-0.21) 

n 

472 

492 

Differences  in  the  impact  of  accounting  quality  on  the  lease-versus-buy  decision 
HO.  AccQua^i^ud  Rank=l)  ~  I^^^Q^^l(Lead  Rank-0) 

A^(l)  =  15.42 
p-value  =  0.0001 


*,  **,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 
See  Table  1  for  variable  definitions. 


We  report  the  results  of  our  tests  of  H3  on  the  effects  of  direct  monitoring  via  the  capital 
expenditure  covenant  on  the  leasing  propensity  in  Table  6.  The  first  column  reports  the  results  on 
the  determinants  of  the  leasing  propensity  for  firms  with  investment-restriction  covenants.  The 
second  results  column  reports  the  determinants  of  the  leasing  propensity  for  firms  without 
investment-restriction  covenants . 

We  find  that  for  the  subset  of  firms  with  investment  covenants,  the  firms  that  are  unrated,  or 
are  rated  and  have  relatively  poor  ratings  have  a  greater  propensity  to  lease  assets.  Similarly,  for 
the  subset  of  firms  without  investment  covenants,  we  find  that  smaller  firms  and  firms  that  have 
suffered  losses  have  a  greater  propensity  to  lease  assets.  Similar  to  the  results  discussed  above, 
these  findings  support  the  findings  in  Eisfeldt  and  Rampini  (2009)  and  Sharpe  and  Nguyen  (1995) 
that  firms  that  are  more  likely  to  have  financing  constraints  are  more  likely  to  lease. 

We  find  that  accounting  quality  is  an  important  determinant  of  leasing  propensity  for  firms 
without  investment  covenants  and  is  not  a  significant  determinant  of  the  leasing  propensity  for 
firms  with  these  covenants.  A  Chi-squared  test  confirms  that  the  effect  of  accounting  quality  on 
leasing  propensity  is  significantly  different  across  these  regressions.  These  results  support  our 
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TABLE  6 


Coefficients  (and  t-statistics)  from  Endogenous  Switching  Model  Estimation  of  the  Effect 
of  Capital  Expenditure  Restrictions  on  Firms’  Leasing  Propensity 


Variables 

Predicted 

Sign 

InvjCov  =  1 

InvjCov  =  0 

Coefficients 

(z-stats) 

Coefficients 

(z-stats) 

Intercept 

+/- 

-0.602 

0.364 

(-2.66)*** 

(5.18)*** 

AccQual 

- 

-0.005 

-0.104 

(-0.15) 

(-4.51)*** 

Nodiv 

-f 

0.038 

0.133 

(1.31) 

(0.66) 

Unrated 

-b 

0.841 

0.084 

(4.61)*** 

(1.24) 

Rating 

-h 

0.234 

0.018 

(4.77)  *** 

(0.93) 

Size 

- 

-0.018 

-0.013 

(-1.89)* 

(-2.40)** 

Loss 

-b 

0.053 

0.114 

(0.89) 

(2.75)*** 

MTR 

- 

-0.184 

-0.027 

(-1.38) 

(-0.35) 

n 

373 

483 

Differences  in  the  impact  of  accounting  quality  on  the  lease- versus-buy  decision 
HO.  AccQualQf^^  f^gjj—^'^  —  AccQualQfig  Qgg^Q'j 
xH\)  =  5.66 
p-value  =  0.017 


*,  **,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

Variable  Definition; 

Inv_Cov  =  indicator  variable  that  equals  1  if  the  firm  ever  includes  a  capital  expenditure  restriction  provision  in 
its  bank  loans  during  the  sample  period. 


See  Table  1  for  definition  of  other  variables. 


prediction  in  H3  that  accounting  quality  is  less  important  in  leasing  decisions  when  other  mecha¬ 
nisms,  like  capital  expenditure  covenants,  limit  the  ability  of  firms  to  engage  in  capital  spending. 

For  completeness,  we  report  the  results  of  our  “selection”  regressions  in  Table  7.  Although  we 
do  not  develop  hypotheses  about  the  determinants  of  lead  arranger’s  monitoring  incentives  or  of 
investment  covenants,  we  find  that  many  of  the  explanatory  variables  included  in  the  models  are 
statistically  significant.  Of  particular  importance  are  the  coefficients  on  the  variables  that  serve  as 
exogenous  instruments.  We  find  that  large  lead  arranger  ownership  decreases  with  loan  amounts 
and  maturities,  while  capital  expenditure  provisions  increases  with  loan  amount  and  maturities. 
While  not  a  direct  test  of  “exogeneity,”  these  results  suggest  that  our  exogenous  instruments 
explain  variation  in  investment  covenants,  and  lead-ownership  holdings. 
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TABLE  7 

Coefficients  (and  z-statistics)  from  Probit  Models  Examining  the  Determinants  of  the  Lead 
Lenders’  Ownership  and  Use  of  Capital  Expenditure  Provisions  for  the  Period  1995-2006 

Dependent  Variable:  Dependent  Variable: 

Lead  Rank  InvjOov 


Variables 

Prediction 

Coefficients 

(z-stats) 

Coefficients 

(z-stats) 

Intercept 

+/- 

23.211 

-6.531 

(10.98)*** 

(-5.82)*** 

AccQual 

— 

0.055 

0.104 

(0.25) 

(0.82) 

Nodiv 

-b 

-0.041 

0.077 

(-0.27) 

(0.71) 

Unrated 

-t- 

-1.228 

3.396 

(-1.73)* 

(6.15)*** 

Rating 

-b 

-0.250 

0.984 

(-1.28) 

(6.52)*** 

Size 

- 

-0.146 

-0.104 

(-2.76)*** 

(-2.25)** 

Loss 

+ 

0.391 

0.486 

(1.19) 

(2.35)** 

MTR 

+/- 

1.111 

0.051 

(1.96)** 

(0.11) 

Loan 

+/- 

-1.132 

0.010 

(-10.81)*** 

(1.78)* 

Maturity 

+/- 

-0.314 

0.440 

(-2.43)** 

(4.28)*** 

n 

964 

856 

Pseudo 

0.6732 

0.1442 

*,  **,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

All  z-statistics  are  based  on  clustering  at  the  three-digit  industry  level. 

Variable  Definitions: 

Lead  Rank  =  indicator  variable  equals  1  if  the  average  of  the  lead  arrangers’  ownership  of  the  firm’s  bank  loans 
exceeds  the  median  of  all  sample  firms,  and  0  otherwise;  and 
InvjCov  =  indicator  variable  that  equals  1  if  the  firm  ever  includes  a  capital  expenditure  restriction  provision  in 
its  bank  loans  during  the  sample  period. 

See  Table  1  for  variable  definitions. 


VII.  ROBUSTNESS  CHECKS 

Industry  Composition 

We  focus  on  the  manufacturing  industry  to  avoid  the  heterogeneity  in  financing,  leasing,  and 
financial  reporting  choices  across  various  industries.  Untabulated  results  show  that  our  findings 
continue  to  hold  in  analyses  that  include  all  industries  and  control  for  fixed  industry  effects. 

Lease  Measure 

The  main  tabulated  results  are  based  on  an  approach  that  averages  all  variables  across  the 
sample  period,  where  Lease  is  a  stock  measure.  To  show  that  our  results  are  not  driven  by  this 
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specific  design,  we  also  conduct  a  panel  data  analysis,  where  we  measure  accounting  quality  by 
averaging  the  absolute  value  of  discretionary  accruals  over  the  three  years  prior  to  the  measure¬ 
ment  year  of  our  leasing  propensity  variable. 

Consistent  with  Sharpe  and  Nguyen  (1995)  and  Eisfeldt  and  Rampini  (2009),  we  also  define 
Lease  using  a  flow  measure  in  the  panel  data  framework.  The  flow  measure  of  Lease  is  defined  as 
the  minimum  lease  payment  (lagged  Compustat  #96)  divided  by  the  sum  of  lease  and  capital 
expenditures  (Compustat  #128).  Table  8  shows  that  this  alternative  measure  in  our  panel  data 
analysis  generates  results  consistent  with  our  main  approach,  firms  with  higher  accounting  quality 
are  less  likely  to  lease  than  to  buy. 

We  also  replace  the  scalar  in  our  model  with  the  sum  of  total  assets  and  capitalized  lease 
obligations  (Compustat  #  6  +  lagged  Compustat  #96  X  10).  We  continue  to  observe  a  signifi¬ 
cantly  negative  association  between  accounting  quality  and  leasing.  Finally,  Tobit  estimation  in¬ 
cluding  firms  with  missing  lease  data  (about  8  percent  of  firms)  produces  similar  results. 

Accounting  Quality  Measure 

In  addition  to  using  the  principal-component  approach  to  measuring  accruals  quality,  we  also 
derive  measures  based  on  the  underlying  variables,  and  include  those  measures  in  the  model.  Our 


TABLE  8 


Coefficients  (and  t-statistics)  from  OLS  Estimations  of  the  Determinants  of  the  Extent  of 
Firms’  Expenditures  on  Operating  Leases  for  the  Period  1995-2006  Estimated  Using  an 
Annual  Regression  and  Flow  Measures  of  Lease  Proclivity 


Variables 

Predicted  Sign 

Coefficients 

t-stats 

Intercept 

+!- 

0.522 

19.68*** 

AccQual 

- 

-0.042 

-5.98*** 

Nodiv 

+ 

0.070 

6.54*** 

Unrated 

-t- 

0.056 

2.27** 

Rating 

-f 

0.005 

0.69 

Size 

- 

-0.029 

— 14  69**^ 

Loss 

-F 

0.030 

4 

MTR 

- 

-0.135 

-5.50*** 

n 

15,055 

R2 

0.3021 

*,  **,  ***  Significant  at  the  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

All  t-statistics  are  based  on  clustering  at  the  firm  level.  Results  on  time  fixed  effects  are  omitted. 

Variable  Definitions; 

Lease  =  lease  expenditure  (lagged  Compustat  #96)  divided  by  the  sum  of  capital  and  lease  expenditures  (Compustat 
#128  -f-  lagged  Compustat  #96); 

AccQuah  average  of  first  principal  component  of  three  accrual-based  accounting  quality  metrics  based  on  Bharath  et  al. 
(2008)  over  three  years  prior  to  the  year  Lease  is  measured  (described  in  the  Appendix); 

Nodiv  =  indicator  variable  equal  to  1  if  the  firm  does  not  pay  a  dividend  (Compustat  #21),  and  0  otherwise; 

Unrated=  indicator  variable  equal  to  1  if  the  firm  is  not  rated  by  the  S&P  (Compustat  #280),  and  0  otherwise; 

Rating  =  S&P  ratings  when  the  firm  is  rated  by  the  S&P.  Following  Sharpe  and  Nguyen  (1995),  the  rating  is  coded  1 
if  the  S&P  rating  is  between  AAA  and  A—,  2  if  the  rating  is  between  A -I-  and  A—,  3  if  the  rating  is 
between  BBB-I-  and  BBB  — ,  and  4  if  the  rating  is  between  BB-I-  and  D; 

Size  =  decile  ranking  of  average  of  the  natural  log  of  sales  (Compustat  #12); 

Loss  =  indicator  variable  for  loss  year,  where  the  indicator  variable  equals  1  when  the  Compustat  #123  is  less  than 
0;  and 

MTR  =  marginal  tax  rates  based  on  John  Graham’s  simulation  technique,  as  reported  on  his  website. 
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results  continue  to  hold  when  we  use  these  underlying  variables.  To  ensure  that  the  decision  to  use 
cross-sectional  estimation  is  not  affecting  our  results,  we  also  use  a  time-series  measure  of  the 
extent  of  current  accruals  mapping  into  future,  current,  and  future  operating  cash  flows  (Dechow 
and  Dichev  2002)  as  a  metric  for  accounting  quality.  The  results  of  using  this  time-series  measure 
are  consistent  with  the  results  of  using  cross-sectional  models. 

Similarly,  we  change  the  scalar  used  in  the  accounting  quality  measures  to  be  consistent  with 
the  scalar  used  on  the  independent  variable,  and  to  ensure  that  differences  in  the  accounting 
treatment  for  leases  are  not  driving  our  results.  If  we  use  total  assets  as  the  scalar,  we  find  results 
similar  to  those  that  we  tabulate. 

Lead  Rank  Measure 

To  test  the  sensitivity  of  our  results  to  our  measure  of  lead  ranks,  we  examined  two  alternative 
cutoffs.  Setting  this  variable  equal  to  1  if  the  average  of  the  lead  arrangers’  ownership  of  the  firm’s 
bank  loans  exceeds  either  the  33rd  or  the  66th  percentile  of  all  sample  firms,  and  0  otherwise, 
produces  similar  results.^^  The  results  of  these  alternative  specifications  suggest  that  our  results  are 
not  driven  by  our  median  cutoff. 

Financial  Constraints 

Finally,  when  the  firm  has  lower  free  cash  flow,  information  problems  are  more  likely  to  affect 
the  firm’s  investment  behavior  through  their  effect  on  external  financing.  We  interact  accounting 
quality  proxies  with  an  indicator  variable  for  high  free  cash  flow.  We  find  that  the  importance  of 
accounting  quality  affecting  leasing  activities  decreases  with  free  cash  flow,  consistent  with  our 
prediction. 


VIII.  CONCLUSION 

This  study  extends  prior  research  on  the  relation  between  financial  reporting  quality  and 
firms’  investment  and  financing  choices  by  investigating  the  effect  of  financial  reporting  quality  on 
the  lease-versus-buy  decision.  Our  research  design  complements  prior  studies  that  focus  exclu¬ 
sively  on  asset  purchases  when  examining  the  effect  of  accounting  quality  on  firm’s  investment 
decisions.  We  find  that  the  leasing  propensity  is  declining  in  accounting  quality.  This  finding 
suggests  that  although  previous  research  shows  that  firms  with  poor  accounting  quality  buy  fewer 
assets,  they  lease  more  of  their  assets,  indicating  that  poor  accounting  quality  may  not  result  in 
lower  total  investment. 

To  distinguish  between  two  alternative  explanations  of  this  finding,  we  also  examine  the 
mechanisms  that  lenders  use  to  mitigate  the  agency  problems  arising  from  information  asymme¬ 
tries.  Only  if  better  accounting  quality  and  leasing  are  two  alternative  mechanisms  for  addressing 
financial  constraints  would  we  expect  the  negative  relation  between  accounting  quality  and  leasing 
to  be  mitigated  by  other  mechanism  that  reduce  information  asymmetry  and  agency  costs.  We  find 
that  accounting  information  is  less  important  in  the  decision  to  lease  when  lenders’  incentives  to 
conduct  due  diligence  are  higher.  The  importance  of  financial  reporting  quality  in  the  lease-versus- 
buy  decision  is  also  decreasing  in  the  existence  of  capital  expenditure  covenants.  Although  the 
relation  between  leasing  and  accounting  quality  might  be  indicative  of  a  propensity  of  firms  with 
otherwise  poor  accounting  quality  to  engage  in  off-balance-sheet  financing,  our  results  are  more 
consistent  with  accounting  quality  and  leasing  serving  as  substitute  mechanisms  for  reducing 
financing  constraints. 


Similar  to  Sufi  (2007),  we  find  consistent  results  using  a  Herfindahl  index  of  bank  loan  ownership. 

We  also  interacted  a  continuous  lead-ownership  measure  with  our  accounting  quality  measure.  The  AccQual  coefficient 
magnitude  decreases  with  lead-ownership,  consistent  with  the  dichotomize  approach. 
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Overall,  our  results  highlight  the  importance  of  considering  leasing  when  studying  the  effects 
of  financial  reporting  quality  on  investments.  Prior  research  suggests  that  firms  with  poor  account¬ 
ing  quality  have  financial  constraints,  inhibiting  their  ability  to  invest  in  the  periods  in  which  they 
are  liquidity  constrained.  Our  results  extend  this  line  of  research,  suggesting  that  while  low-quality 
accounting  firms  are  likely  to  forgo  purchasing  assets,  they  are  more  likely  to  lease  assets  when 
they  are  liquidity  constrained.  Our  analyses  also  contribute  to  the  literature  examining  the  impor¬ 
tance  of  accounting  information  in  firms’  investment  decisions  by  considering  how  banks’  private 
information  and  monitoring  can  substitute  for  accounting  information. 


APPENDIX 

Construction  of  AccQual 

ABACCl:  The  absolute  value  of  current  discretionary  accruals  calculated  based  on  Teoh  et 
al.  (1998).  The  model  is  estimated  annually  for  each  Fama  and  French  (1997)  industry  group 
and  each  industry-year  regression  requires  at  least  20  observations.  Based  on  Teoh  et  al. 
(1998),  we  first  estimate  the  following  regression  to  get  the  estimated  coefficients  (variables 
are  defined  below): 

Current_Accc  1  A  Re  u 

- =  7,  — - -I-  72 - - —  +  77. 

Total_Inv  Total_lnv  Total_Inv 

The  second  step  calculates  the  absolute  value  of  discretionary  accruals  as: 

Current_Acc  ^  1  ^  (A  Re  u  -  AAR) 

Total_Inv  Total_Inu  Total_Inv 

ABACC2:  The  absolute  value  of  total  discretionary  accruals  calculated  based  on  Dechow  et 
al.  (1995).  The  model  is  estimated  annually  for  each  Fama  and  French  (1997)  industry  group 
and  each  industry-year  regression  requires  at  least  20  observations.  We  first  estimate  the 
following  regression  to  get  the  estimated  coefficients  (variables  are  defined  below): 

Total_Acc  1  A  Re  u  PPE 

- =  a\ - -f-  aj- - -f-  a-i - +  e. 

Total_Inv  Total_Inv  Total_Inv  Total_Inv 

The  second  step  calculates  the  absolute  value  of  discretionary  accruals  as: 

Total_Acc  ^  1  ^  (A  Re  n  -  AAR)  ^  PPE 

- - -a, - -  dy - -  ao -  . 

Total_Inv  Total_Inv  Total_lnv  Total_lnv 

ABACC3:  The  absolute  value  of  total  current  accruals  calculated  based  on  Dechow  and 
Dichev  (2002).  The  model  is  estimated  annually  for  each  Fama  and  French  (1997)  industry 
group  and  each  industry-year  regression  requires  at  least  20  observations.  ABACC3  is  the 
absolute  value  of  the  estimated  residual  from  the  following  model. 

Current_Acc  /  CEO  \  {  CEO  \  (  CEO  \ 

Totaljnv  \TotalJnvjt_^^ATotalJnvjt^ATotalJnv],^'^^^' 

where: 
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Current_Acc  earnings  before  extraordinary  items— cash  flow  from  operating  activities— 
depreciation  (Compustat  #123  -  #308  +  #14);^^ 

Total_Acc  =  earnings  before  extraordinary  items-cash  flow  from  operating  activities  (Compustat 
#123  -  #308); 


Totaljnv  —  the  sum  of  PP&E  and  capitalized  lease  expenditure  (Compustat  #8  +  lagged 
Compustat  #96  multiplied  by  10); 

CFO  =  cash  flow  from  operating  activities  (Compustat  #308); 

ARev  =  change  in  sales  (Compustat  #12); 

AAR  =  change  in  accounts  receivables  (Compustat  #2). 

PPE  =  property,  plant,  and  equipment  (Compustat  #8). 


We  standardize  each  of  the  three  measures  by  subtracting  the  mean  and  dividing  by  the  standard 
deviation,  and  then  extract  the  first  principal  component  from  the  three  proxies. Our  accounting 
quality  variable  {AccQual)  is  thus  defined  as  fhe  average  of  the  following  metric  over  the  sample 
period  1995-2006: 

-1  *  [standardizedABACCl  *  0.38538 +  standardizedABACC2  *  0.38371 
+  standardizedABACC3  *  0.32568].^° 
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ABSTRACT:  We  document  the  first  systematic  evidence  on  the  characteristics  and 
economic  consequences  of  firms  subject  to  employee  allegations  of  corporate  financial 
misdeeds.  First,  compared  to  a  control  group  that  avoided  public  whistle-blowing  alle¬ 
gations,  firms  subject  to  whistle-blowing  allegations  were  characterized  by  unique  firm- 
specific  factors  that  led  employees  to  expose  alleged  financial  misdeeds.  Second,  on 
average,  whistle-blowing  announcements  were  associated  with  a  negative  2.8  percent 
market-adjusted  five-day  stock  price  reaction;  this  reaction  was  especially  negative  for 
allegations  involving  earnings  management  (-7.3  percent).  Third,  compared  to  a  con¬ 
trol  group  that  exhibits  similar  characteristics,  firms  subject  to  whistle-blowing  allega¬ 
tions  were  associated  with  further  negative  consequences  including  earnings  restate¬ 
ments,  shareholder  lawsuits,  and  negative  future  operating  and  stock  return 
performance.  Finally,  whistle-blowing  targets  exposed  by  the  press  were  more  likely  to 
make  subsequent  improvements  in  corporate  governance.  Our  results  suggest  whistle¬ 
blowing  is  far  from  a  trivial  nuisance  for  targeted  /irms,  and  on  average,  appears  to  be 
a  useful  mechanism  for  uncovering  agency  issues. 
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Data  Availability:  All  data  are  available  from  public  sources  identified  in  the  paper. 

I.  INTRODUCTION 

The  purpose  of  our  study  is  to  identify  characteristics  of  firms  subject  to  whistle-blowing 
allegations,  examine  the  economic  consequences  of  such  allegations,  and  thus  shed  light  on 
the  role  of  employee  whistle-blowers  in  uncovering  corporate  financial  misdeeds.  Notable 
anecdotal  evidence  suggests  that  whistle-blowers  can  make  a  difference.  For  example,  two 
whistle-blowers,  Cynthia  Cooper  and  Sherron  Watkins,  played  significant  roles  in  exposing  ac¬ 
counting  frauds  at  WorldCom  and  Enron,  respectively,  and  were  named  as  the  2002  “persons  of 
the  year”  by  TIME  magazine.  In  response  to  Enron,  WorldCom,  and  other  scandals.  Congress 
passed  the  Sarbanes-Oxley  Act  (SOX)  in  July  2002,  which,  in  part,  made  it  unlawful  for  compa¬ 
nies  to  take  punitive  action  against  employees  who  disclose  “questionable  accounting  or  auditing 
matters”  (SOX  Section  806,  codified  as  title  15  U.S.C.,  §  78f(m)(4)).  Under  the  whistle-blower 
provisions  of  SOX,  employees  who  disclose  improper  financial  practices  receive  greater  protection 
from  discrimination  (Title  18  U.S.C.,  §  1514A(a)(l)).  SOX  also  required  firms  listed  on  a  U.S. 
stock  exchange  to  set  up  a  hotline  enabling  whistle-blowers  to  communicate  anonymously  to  the 
board  of  directors  {Economist  2006  ).  Further,  the  Securities  and  Exchange  Commission  is  con¬ 
sidering  a  whistle-blower  program  to  encourage  individuals  to  identify  potential  violations  of 
federal  securities  laws  (SEC  2009). 

In  contrast  to  these  anecdotes  suggesting  that  whistle-blowers  help  uncover  financial  impro¬ 
prieties,  skeptics  argue  that  (1)  whistle-blowers  often  misjudge  the  situation  and  indulge  in  trivial 
or  frivolous  complaints  (Miceli  and  Near  1992),  (2)  “Machiavellian”  whistle-blowers  who  have  an 
axe  to  grind  lodge  baseless  allegations  (Gobert  and  Punch  2000),  and  (3)  ineffective  workers 
misuse  their  protected  “whistle-blower”  status  to  avoid  discharges  or  disciplinary  proceedings 
(Schmidt  2003). 

Despite  the  recent  prominence  of  whistle-blowing  events,  we  are  not  aware  of  prior  work  in 
the  accounting  and  finance  literatures  that  attempts  to  resolve  the  conflict  between  these  perspec¬ 
tives.  We  examine  the  characteristics  of  firms  subject  to  publicly  available  whistle-blowing  alle¬ 
gations  and  the  economic  consequences  of  whistle-blowing  events  for  target  firms.  We  analyze  81 
employee  whistle-blowing  allegations  related  to  financial  misconduct  between  1989  and  2004 
reported  in  the  financial  press  (Press  sample)  and  137  instances  of  employee  whistle-blowing 
between  2002  and  2004  obtained  from  records  of  the  U.S.  government’s  Occupational  Safety  and 
Health  Administration  office  (OSHA),  the  agency  responsible  for  handling  complaints  of  discrimi¬ 
nation  from  whistle-blowers  who  report  SOX-related  allegations  (OSHA  sample).  We  label  these 
wide-ranging  allegations  as  “external  financial”  whistle-blowing  events.  They  are  “external”  be¬ 
cause  they  are  made  known  outside  of  the  organization;  we  generally  cannot  observe  cases  of 
whistle-blowing  that  are  resolved  internally.  They  are  “financial”  because  they  include  alleged 
earnings  management,  improper  disclosure,  insider  trading,  price-fixing  in  product  markets,  tax 
fraud,  or  violations  of  securities  regulation.  We  investigate  four  related  research  questions: 

Ql:  What  operating,  financial,  governance,  industry,  and  regulatory  characteristics  differen¬ 
tiate  firms  subject  to  external  financial  whistle-blowing  from  other  firms  that  avoid 
publicly  disclosed  whistle-blowing  allegations? 

Q2:  What  is  the  immediate  stock  market  reaction  to  an  external  financial  whistle-blowing 
announcement?  Do  these  stock  price  reactions  vary  with  the  nature  of  the  allegation  ? 

Q3:  What  are  the  subsequent  economic  consequences  for  firms  subject  to  external  financial 
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whistle-blowing  events?  Do  these  firms  experience  more  earnings  restatements,  more 
shareholder  lawsuits,  and/or  poorer  future  operating  and  stock  return  performance? 

Q4:  Do  firms  respond  to  whistle-blowing  allegations  by  improving  their  governance 
structure? 

To  investigate  our  first  research  question  (Ql),  we  identify  variables  that  potentially  differen¬ 
tiate  firms  subject  to  public  whistle-blowing  actions.  We  argue  that  certain  firm  characteristics  are 
associated  with  the  probability  that  an  employee  will  voice  an  allegation  of  financial  impropriety 
to  someone  outside  the  organization.  We  compare  whistle-blowing  samples  to  two  control  groups: 
(1)  a  broad  set  of  firms  not  subject  to  whistle-blowing  allegations,  and  (2)  firms  “tarnished”  by 
either  a  shareholder  lawsuit  or  an  earnings  restatement  (suggesting  some  level  of  financial  mis¬ 
deeds)  that  were  able  to  avoid  an  external  whistle-blowing  allegation. 

Compared  to  other  tarnished  firms,  both  Press  and  OSHA  firms  were  more  likely  to  be  large, 
growing  firms  with  strong  recent  stock  return  performance  that  had  recently  suffered  employee 
layoffs.  In  addition.  Press  firms  had  relatively  weak  external  monitoring  and  OSHA  firms  were 
relatively  well  regarded,  had  weak  internal  monitoring,  and  were  more  likely  to  be  firms  where  the 
whistle-blowing  employee  had  the  opportunity  to  share  in  the  proceeds  of  any  fraud-related  settle¬ 
ment  with  the  U.S.  government.  Overall,  we  interpret  our  evidence  as  consistent  with  whistle¬ 
blowing  targets  being  systematically  different  from  other  tarnished  firms  that  avoided  public 
whistle-blowing  allegations  of  financial  wrongdoing. 

Our  second  research  question  (Q2)  addresses  the  immediate  stock  market  reaction  to  whistle¬ 
blowing  allegations  that  are  reported  in  the  press.  We  find  the  average  market- adjusted  five-day 
stock  return  surrounding  the  announcement  in  the  financial  press  was  -2.8  percent.  This  reaction 
was  especially  negative  if  the  allegation  involved  earnings  management  (-7.3  percent). 

Our  third  research  question  (Q3)  investigates  subsequent  economic  consequences  for  firms 
targeted  by  whistle-blowing  allegations,  including  restatements,  lawsuits,  and  poor  operating  and 
stock  return  performance.  Our  research  design  uses  a  propensity-score  matched  control  sample 
that  includes  firms  with  similar  characteristics  as  the  whistle-blowing  targets  to  control  for  the  fact 
that  whistle-blowing  is  not  an  exogenous  event  and  that  negative  consequences  following  whistle¬ 
blowing  allegations  could  arise  from  many  of  the  same  underlying  economic  forces  that  contribute 
to  the  allegations  themselves. 

Compared  to  a  propensity-score  matched  control  sample,  we  find  that  whistle-blowing  targets 
identified  by  the  press  were  more  likely  to  restate  their  earnings  and  be  subject  to  shareholder 
litigation  in  the  three  years  following  the  whistle-blowipg  allegation.  In  addition,  we  find  evidence 
suggesting  whistle-blowing  targets  (both  Press  and  OSHA)  suffered  lower  operating  and  stock 
return  performance  than  did  their  control  firm  counterparts.  Thus,  whistle-blowers  appeared  to 
expose  firms  that  (1)  engaged  in  financial  misconduct  and  (2)  had  previously  unknown  agency 
problems — both  of  which  resulted  in  negative  future  consequences  for  the  firm.^  Evidence  of  such 
negative  ex  post  consequences  is  consistent  with  the  implicit  premise  in  SOX  that  whistle-blowers 
can  disclose  valuable  new  information  about  firms’  agency  problems. 

Our  fourth  research  question  (Q4)  examines  firms’  responses  to  whistle-blowing  allegations. 
We  find  that,  compared  to  a  matched  control  sample,  firms  subject  to  public  whistle-blowing 
allegations  (Press  sample)  were  more  likely  to  reduce  the  size  of  the  board,  reduce  the  proportion 
of  insiders  on  the  board,  have  less  busy  directors,  and  replace  their  CEO,  whereas  firms  subject  to 
less  publicized  whistle-blowing  allegations  (OSHA  firms)  did  not  respond  to  these  allegations  by 
making  governance  changes. 


'  Agency  problems  include  corporate  financial  misdeeds  that  result  from  conflicts  of  interest  between  management  and 
stakeholders  such  as  shareholders  and  creditors. 
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Overall,  our  evidence  suggests  that  whistle-blowing  is  an  important  mechanism  for  discover¬ 
ing  information  about  agency  problems  at  firms.  On  balance,  our  findings  are  inconsistent  with 
claims  by  critics  that  whistle-blowing  complaints  are  frivolous,  misleading,  or  unreliable.  Our 
research  also  provides  indirect  evidence  on  the  efficacy  of  SOX  Section  806  as  a  mechanism  to 
uncover  agency  issues  at  firms.  Interestingly,  public  whistle-blowing  events  are  relatively  infre¬ 
quent,  and  a  majority  of  the  whistle-blowing  allegations  we  identify  are  not  directly  related  to  the 
types  of  fraudulent  behavior  that  precipitated  the  SOX  legislation. 

From  a  broader  perspective,  we  contribute  to  an  emerging  stream  of  literature  in  corporate 
governance  related  to  how  information  about  financial  wrongdoing  reaches  the  stock  market.  In 
related  work,  Dyck  et  al.  (2008)  investigate  who  reveals  information  about  corporate  fraud.  Using 
a  sample  of  firms  subject  to  shareholder  class  action  litigation,  they  find  that  the  most  important 
sources  of  information  are  employees  (19  percent  of  the  cases),  media  (16  percent),  and  analysts 
(15  percent).  Unlike  Dyck  et  al.  (2008),  our  focus  is  on  (1)  characteristics  of  firms  targeted  by 
employee  whistle-blowers,  (2)  the  economic  consequences  of  whistle-blowing  revelations,  and  (3) 
firms’  responses  via  governance  changes  subsequent  to  such  whistle-blowing  events.  In  addition. 
Miller  (2006)  investigates  the  role  of  the  press  in  revealing  fraud,  but  does  not  indicate  that 
employees  are  often  named  as  the  source  of  the  articles.  Furthermore,  Miller  (2006)  identifies 
firms  that  are,  ex  post,  known  to  have  committed  financial  wrongdoing,  and  tracks  these  firms  to 
determine  the  role  of  the  press  in  uncovering  these  known  cases  of  wrongdoing.  In  contrast,  we 
identify  firms  alleged  to  have  engaged  in  financial  wrongdoing,  and  examine  whether  these  firms 
ultimately  suffered  negative  consequences. 

Section  II  discusses  financial  whistle-blowing  and  introduces  predictions  about  firm-specific 
factors  likely  to  be  associated  with  whistle-blowing  events.  Section  III  describes  the  sample  and 
Section  IV  presents  evidence  on  the  predictions  discussed  in  Section  II.  Section  V  presents  evi¬ 
dence  on  stock  price  reactions  surrounding  whistle-blowing  allegations  and  also  provides  evidence 
on  longer-term  consequences  of  whistle-blowing  actions.  Section  VI  reports  evidence  on  whether 
firms  improve  governance  after  whistle-blowing  allegations.  Section  VII  provides  a  summary  of 
findings  and  contributions. 


II.  BACKGROUND  AND  CHARACTERISTICS  OF  FIRMS  SUBJECT  TO 
WHISTLE-BLOWING  ALLEGATIONS 

Financial  Whistle-Blowing — Internal  versus  External 

Miceli  and  Near  (1985)  define  whistle-blowing  as  the  disclosure  by  either  former  or  current 
employees  of  alleged  illegal,  immoral,  or  illegitimate  practices  that  are  under  employer  control. 
Figure  1  summarizes  outcomes  of  potential  employee  whistle-blowing  allegations  and  related 
consequences  for  the  targeted  firm.  Whistle-blowing  can  be  either  internal  or  external.  Internal 
whistle-blowing  involves  informing  relevant  members  of  the  firm  (e.g.,  management,  internal 
audit)  about  potential  wrongdoing.  External  whistle-blowing,  on  the  other  hand,  involves  either  an 
internal  allegation  becoming  public  or  the  whistle-blower  directly  disclosing  an  allegation  outside 
the  organization  (e.g.,  to  the  media  or  a  regulatory  agency  such  as  OSHA).  We  focus  on  external 
whistle-blowing  because  instances  of  internal  whistle-blowing  generally  are  not  reported  in  the 
media  and  public  data  on  such  cases  generally  are  not  available.  Moreover,  we  concentrate  only  on 
whistle-blowing  related  to  financial  practices. 
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FIGURE  1 

Employee  Whistle-Blowing  Allegations:  Sequence  of  Possible  Actions  and  Consequences 


INTERNAL  WB _  EXTERNAL  WB  CONSEQUENCES 


Congressional  Action  to  Protect  Whistle-Blowers 

Largely  in  response  to  the  Enron  and  WorldCom  scandals,  Congress  passed  the  Sarbanes- 
Oxley  Act  (SOX)  in  July  2002.  SOX,  in  part,  sought  to  provide  whistle-blowers  greater  legal 
protection.  In  addition  to  creating  civil  remedies  for  retaliation  against  employees  in  fraud  cases, 
SOX  made  it  a  felony  to  take  any  action  that  is  harmful  to  any  person  in  retaliation  for  providing 
information  about  a  federal  crime  to  law  enforcement  officials.  Thus,  a  retaliatory  firing  or  demo¬ 
tion  of  a  whistle-blowing  employee  would  constitute  a  crime. 

Whistle-Blowing  is  Frivolous,  Misused,  Misleading,  and/or  Unreliable 

Some  survey  evidence  (Brickey  2003;  Zingales  2004)  suggests  that  the  personal  cost  to  the 
whistle-blower  can  be  high.^  Given  that  whistle-blowers  likely  anticipate  these  costs,  their  allega- 


^  Enron  and  WorldCom  represent  high-profile  examples  of  internal  whistle-blowing  that  became  public.  Our  general 
conclusions  are  not  affected  when  we  delete  Enron  and  WorldCom  from  the  sample,  i.e.,  our  results  are  not  driven  by 
these  prominent  examples. 

^  Zingales  (2004)  summarizes  evidence  on  the  consequences  of  whistle-blowing  for  individual  employees.  In  a  1992 
survey  of  1,500  federal  workers,  25  percent  of  employees  reported  that  they  experienced  verbal  harassment  and  intimi¬ 
dation;  20  percent  were  shunned  by  co-workers  and  managers;  18  percent  were  assigned  to  less  desirable  duties;  and  11 
percent  were  denied  a  promotion.  A  1998  survey  of  448  emergency  physicians  paints  an  even  worse  picture:  23  percent 
of  those  who  complained  about  an  issue  reported  having  been  fired  or  threatened  with  termination.  Brickey  (2003) 
reports  that  a  random  review  of  200  whistle-blower  complaints  filed  with  the  National  Whistleblower  Center  in  2002 
found  that  about  half  of  the  complainants  said  they  were  fired  after  they  reported  misconduct.  The  remaining  complain- 
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lions  should  be  credible.  However,  some  critics  suggest  that  whistle-blowers  often  misjudge  the 
situation  and  lodge  frivolous  complaints  to  garner  attention  and  publicity  (Near  and  Miceli  1996). 
Indeed,  in  June  2005,  the  French  government’s  Data  Protection  Authority  refused  to  allow  the 
installation  of  anonymous  whistle-blower  hotlines,  saying  that  such  lines  “were  disproportionate  to 
the  objectives  sought  with  the  risks  of  slanderous  denunciations”  {Economist  2006).  Furthermore, 
troublemakers  could  potentially  misuse  their  whistle-blower  status  to  avoid  being  suspended  or 
dismissed  from  employment.  For  example,  Anechiarico  and  Jacobs  (1996)  argue  that  several 
municipal  employees  in  New  York  City  chose  to  blow  the  whistle  in  order  to  fall  under  state 
protection  as  a  pre-emptive  measure  to  being  fired.  Hence,  given  the  enormous  trouble  that  can 
result  from  disciplining  a  whistle-blower,  supervisors  and  managers  might  find  it  easier  to  tolerate 
an  unproductive  and  ineffective  employee  rather  than  dismiss  or  demote  the  employee.  Gobert  and 
Punch  (2000)  argue  that  an  employee  might  even  be  excluded  from  necessary  salary  cuts  to  avoid 
the  perception  that  such  a  cut  is  retaliation  against  the  whistle-blower.  On  average,  if  whistle¬ 
blower  complaints  are  frivolous,  misused,  misleading,  and/or  unreliable,  then  we  should  observe 
no  significant  economic  consequences  to  a  whistle-blowing  action.  Having  said  that,  we  acknowl¬ 
edge  that  even  the  taint  of  an  allegation  could  potentially  lead  to  some  negative  consequences  for 
the  firm. 

Characteristics  of  Firms  Subject  to  Whistle-Blowing  Allegations:  Opportunities  and  Incen¬ 
tives  for  Committing  Financial  Misdeeds 

In  this  section,  we  argue  that  whistle-blowing  allegations  are  related  to  management’s  oppor¬ 
tunity  and  incentives  to  engage  in  financial  wrongdoing  and  are  a  function  of  employees’  incen¬ 
tives  to  voice  their  allegations  outside  the  organization.  At  the  outset,  we  acknowledge  that  some 
of  these  hypothesized  factors  are  exploratory  and  descriptive.  Given  how  little  published  research 
exists  in  the  area  of  whistle-blowing,  a  solid  theory  is  not  available  on  why  whistle-blowing  occurs 
and  what  its  consequences  may  be.  Hence,  we  believe  our  descriptive  approach  is  appropriate  at 
this  early  stage  of  the  literature’s  development.  Factors  we  expect  to  explain  external  whistle¬ 
blowing  follow. 

Capital  Market  Pressure 

In  recent  years,  firms  have  faced  heightened  capital  market  pressure  to  deliver  sustained 
earnings  growth  (Graham  et  al.  2005).  Such  pressures  have  likely  created  incentives  for  some 
managers  to  aggressively  boost  earnings,  either  via  earnings  management  or  via  other  questionable 
practices  such  as  overbilling  customers  or  improper  disclosure  of  material  financial  events.  We 
predict  that  firms  subject  to  capital  market  pressure  are  more  likely  to  experience  a  whistle¬ 
blowing  event.  To  capture  capital  market  pressure,  we  rely  on  three  variables  (described  in  detail 
in  the  Appendix)  to  proxy  for  (1)  the  need  to  access  capital  {FIN_NEED),  (2)  merger  and  acqui¬ 
sition  activity  (M&A),  and  (3)  the  amount  of  potential  earnings  management  (ABSDACC). 
We  compute  the  averaged  scaled  rank  of  these  three  proxies  and  label  the  averaged  rank  as 
CM_PRESSURE. 

Growth 

Firms  that  are  rapidly  growing  are  more  likely  to  outgrow  their  controls.  Responsibility  for 
overall  decision-making  is  typically  spread  across  many  individuals,  and  as  a  firm  grows  and 
responsibility  is  spread  over  more  individuals,  each  individual  has  less  information  and  authority 


ants  had  been  subjected  to  other  retaliatory  action  such  as  on-the-job  harassment  or  discipline.  A  survey  by  another 
watchdog  group,  the  Government  Accountability  Project,  found  that  about  90  percent  of  whistle-blowers  are  subjected 
to  reprisals  or  threats. 
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to  stop  wrongdoing  (Baucus  and  Near  1991).  We  predict  that  growth  firms  are  more  likely  to  be 
subject  to  a  whistle-blowing  event.  We  proxy  for  high-growth  environments  by  using  the  scaled 
rank  of  sales  growth  (GROWTH)  over  the  three  years  prior  to  the  whistle-blowing  event. 

Past  Performance 

We  predict  that  firms  with  strong  past  stock  market  performance  are  more  likely  to  be  a  target 
of  whistle-blowing  for  two  reasons.  First,  these  firms  are  more  newsworthy  to  the  media.  Second, 
an  employee  might  be  more  likely  to  expose  financial  misdeeds  perceived  to  enable  the  firm  to 
artificially  achieve  strong  performance.  We  measure  the  prior  performance  of  the  firm 
(PAST_PERF)  as  the  scaled  rank  of  stock  returns  over  the  year  before  the  whistle-blowing  event. 

Firm  Reputation 

We  predict  that  a  highly  regarded  firm  is  more  likely  to  attract  the  ire  of  a  disgruntled 
employee  and  the  resulting  allegation  is  more  likely  to  be  newsworthy.  The  external  whistle¬ 
blower  or  the  media  outlet  is  likely  to  get  more  attention  from  exposing  wrongdoing  at  a  highly 
regarded  firm."^  We  proxy  for  firm  reputation  based  on  whether  the  firm  is  listed  either  as  a  “Most 
Admired”  firm  or  a  “Best  Place  to  Work  For”  by  Fortune  magazine  (REPUTATION)  in  any  of  the 
five  years  prior  to  the  whistle-blowing  event.^ 

Unclear  Internal  Communication  Channels 

We  hypothesize  that  the  employee’s  motivation  to  resort  to  external  whistle-blowing  is  higher 
when  channels  for  raising  concerns  within  the  company  are  unclear.  This  situation  likely  occurs  in 
at  least  two  cases.  First,  Rothschild  and  Miethe  (1999)  suggest  that  bureaucratic  and  undemocratic 
work  environments  are  likely  to  experience  higher  levels  of  external  whistle-blowing  because  of 
significant  barriers  to  effective  internal  whistle-blowing.  Miceli  and  Near  (1994)  state  that  it  is 
common  for  employees  to  first  report  problematic  activities  to  management  and,  if  the  issue  is 
ignored,  then  resort  to  reporting  the  problem  to  external  audiences  such  as  the  media.  We  assume 
older  firms  (AGE)  are  likely  to  be  more  bureaucratic  and,  hence,  more  likely  to  be  subject  to  a 
whistle-blowing  action. 

Second,  King  (1999)  argues  that  geographical  distance  and  multiple  industrial  segments  in  a 
firm  make  communication  channels  less  clear.  In  particular,  an  employee  who  has  to  report  a 
misdeed  to  a  boss  he  or  she  hardly  knows  and  who  is  geographically  distant  might  feel  uncom¬ 
fortable  raising  sensitive  issues  (Miceli  and  Near  1994;  Beresford  et  al.  2003).  Such  unclear 
internal  communication  channels  increase  the  likelihood  the  employee  will  resort  to  external 
whistle-blowing.  We  measure  industrial  and  geographical  concentration  as  the  Herfindahl- 
Hirschman  index  in  the  year  prior  to  the  whistle-blowing  event  based  on  revenues  for  each  of  the 
firm’s  industrial  and  geographic  segments,  respectively  (IND_CONC  and  GEO_CONC).  We  ex¬ 
pect  the  probability  of  external  whistle-blowing  to  decrease  with  the  industrial  and  geographic 
concentration  of  the  firm’s  business  segments.  Our  empirical  measure  for  unclear  communication 
channels  (COMMUNICATION)  is  the  averaged  scaled  rank  of  AGE,  IND_CONC,  and  GEO- 
JCONC. 

Employee  Downsizing 

Luthans  and  Sommer  (1999)  find  that  following  employee  downsizing,  employees  experience 
a  decline  in  both  loyalty  and  commitment  to  the  firm.  We  predict  that  employees,  especially 


Of  course,  it  is  possible  that  a  highly  regarded  firm  is  more  likely  to  avoid  organizational  misconduct  or  have  processes 
in  place  that  resolve  employee  allegations  before  they  are  disclosed  publicly. 

^  Note  that  the  “Best  Places”  list  is  only  available  from  1998,  while  the  “Most  Admired”  list  is  available  throughout  our 
entire  sample  period. 
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former  employees  who  have  been  let  go,  are  more  likely  to  make  public  allegations  following 
layoffs.  Further,  layoffs  can  increase  the  level  of  animosity  between  the  firm  and  existing  employ¬ 
ees,  and  if  existing  employees  perceive  their  job  as  being  less  secure,  the  potential  cost  of  blowing 
the  whistle  decreases.  We  proxy  for  downsizing  {DOWNSIZE)  by  using  the  scaled  rank  of  the 
average  percentage  change  in  the  number  of  employees  over  the  three  years  prior  to  the  whistle¬ 
blowing  event. 

Monetary  Incentive  to  Report  Financial  Misdeeds  (Qui  Tam) 

In  certain  instances,  whistle-blowers  have  the  potential  to  benefit  financially  by  uncovering 
corporate  misdeeds.  Under  the  Federal  False  Claims  Act,  citizens  who  help  uncover  fraud  in 
government  contracts  or  government  programs  may  be  awarded  a  portion  of  the  recovered  funds. 
Since  these  whistle-blower  provisions  went  into  effect  in  1986,  the  government  has  recovered  over 
$15.6  billion  in  settlements  and  judgments  from  these  qui  tarn-related  cases.  Of  this  amount,  over 
16  percent  of  the  recovered  funds  have  been  distributed  to  the  individuals  who  helped  bring  the 
fraud  to  light  (TAF  2009). 

The  majority  of  qui  tarn  cases  fall  into  one  of  two  industries:  healthcare  and  defense  contract¬ 
ing.  Over  70  percent  of  the  funds  recovered  as  part  of  the  qui  tarn  provisions  relate  to  matters  in 
which  the  Department  of  Health  and  Human  Services  is  the  primary  agency,  while  over  15  percent 
of  the  recovered  funds  pertain  to  matters  in  which  the  Department  of  Defense  is  the  primary 
agency.  We  argue  that  the  qui  tarn  provisions  included  in  the  Federal  False  Claims  Act  provide 
employees  in  the  healthcare  and  defense  industries  with  increased  incentives  to  blow  the  whistle. 
We  proxy  for  these  qui  tarn  incentives  and  set  QUITAM  equal  to  1  if  the  firm  is  in  a  healthcare 
industry  (two-digit  SIC  code  equal  to  80)  or  if  the  firm  has  been  on  the  Department  of  Defense’s 
“100  Companies  Receiving  the  Largest  Dollar  Volume  of  Prime  Contract  Awards”  list  in  any  of 
the  three  years  prior  to  the  whistle-blowing  event. 

Firm  Size 

The  media  is  generally  more  interested  in  information  about  firms  that  are  large  (Miller  2006). 
It  is  also  possible  that  regulators  (e.g.,  OSHA)  are  similarly  more  likely  to  pay  attention  to 
information  about  large  firms.  As  a  result,  employees  of  large  firms  are  likely  to  be  more  confident 
that  their  allegations  will  have  an  audience,  and  we  therefore  predict  that  employees  of  large  firms 
will  be  more  likely  to  make  allegations  of  financial  wrongdoing.  Our  proxy  for  firm  size  {SIZE)  is 
the  scaled  rank  of  sales  in  the  year  prior  to  the  whistle-blowing  event. 

Internal  Control  Weaknesses 

Internal  controls  weaknesses  are  defined  as  “a  significant  likelihood  that  a  material  misstate¬ 
ment  of  the  annual  or  interim  financial  statements  will  not  be  prevented  or  detected”  (PCAOB 
2004).  We  argue  that  employees  are  more  likely  to  resort  to  external  whistle-blowing  when  the 
firm  has  poor  internal  controls.  To  capture  the  extent  of  a  firm’s  internal  control  weaknesses 
(/CW),  we  use  the  scaled  rank  of  the  fitted  value  from  a  model  of  the  determinants  of  internal 
control  weaknesses,  as  outlined  by  Doyle  et  al.  (2007)  and  described  in  the  Appendix. 

External  Monitoring 

Managers  are  more  likely  to  engage  in  financial  misconduct  when  the  quality  of  external 
monitoring  is  poor.  To  capture  such  external  monitoring  we  use  two  empirical  proxies:  (1)  the 
presence  of  block-holders  {BLOCK)  and  (2)  public  pension  funds  ownership  {PP).  Consistent  with 
prior  literature  (e.g.,  Cremers  et  al.  2005),  we  argue  that  firms  for  which  BLOCK  and  PP  are  low 
are  characterized  by  relatively  poor  external  monitoring.  We  use  the  averaged  scaled  rank  of 
BLOCK  and  PP  measured  in  the  year  prior  to  the  whistle-blowing  event  and  label  the  averaged 
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rank  EXT_M0N1T0R.  We  expect  a  negative  association  between  whistle-blowing  allegations  and 
monitoring  by  powerful  shareholders. 

Governance 

Managers  are  more  likely  to  indulge  in  wrongdoing  in  the  presence  of  weak  corporate  gov¬ 
ernance  and  strong  private  incentives.  To  capture  the  quality  of  internal  governance,  we  obtain 
board  data  from  the  IRRC  database,  and  use  the  averaged  scaled  rank  of  seven  empirical  proxies 
(described  in  detail  in  the  Appendix)  related  to  (1)  the  CEO’s  private  incentives  and  (2)  overall 
board  structure.  We  label  the  averaged  rank  as  GOVERNANCE. 

Model  for  Assessing  Characteristics  of  Firms  Subject  to  Whistle-Blowing  Allegations 

Next,  we  estimate  a  logistic  model  and  regress  the  presence  or  absence  of  a  whistle-blowing 
allegation  on  the  above-mentioned  proxies.  We  introduce  industry  fixed  effects  (based  on  two-digit 
SIC  codes  other  than  for  SIC  code  80  covered  by  QUITAM)  and  year  fixed  effects  as  controls  for 
potential  unspecified  factors.  Model  1  (where  firm  and  year  subscripts  are  suppressed)  links 
whistle-blowing  events  to  managerial  incentives  to  commit  financial  misdeeds  and  the  whistle¬ 
blowers’  likelihood  of  coming  forward: 

PriWhistle-blowing)  = /3o  +  IB^CM_PRESSURE  +  P2GROWTH+  /3^PAST_PERF 

+  I34REPUTATION  +  ^^COMMUNICATION  +  /S^DOWNSIZE 

+  /SjQUITAM  +  p^SIZE  -t-  jd^ICW-v  p^^XT_MONITOR 

+  Pw  GO  VERNANCE  -t-  Industry  -1-  Year  -1-  error  ( 1 ) 

where  Pr{Whistle-blowing)  is  the  probability  of  a  whistle-blowing  event  for  this  binary  dependent 
variable,  i.e.,  1  =  a  whistle-blowing  event  and  0  =  none.  Detailed  definitions  for  the  independent 
variables  are  in  the  Appendix. 

III.  WHISTLE-BLOWING  SAMPLES  AND  CONTROL  FIRMS 
Whistle-Blowing  Samples 

Our  whistle-blower  sample  comes  from  two  sources,  as  illustrated  in  Figure  1.  The  first 
source,  labeled  the  Press  sample,  consists  of  81  whistle-blowing  events  drawn  from  a  Lexis-Nexis 
search  of  every  combination  of  the  following  two  groups  of  search  terms  (1)  “whistle,”  “whistle¬ 
blowing,”  “whistleblower,”  “whistle-blower,”  and  (2)  “financial,”  “accounting,”  “reporting,” 
“fraud,”  “accounting  fraud”  over  the  calendar  years  1989  to  2004.  Note  that  the  Press  sample  label 
does  not  imply  that  employee  whistle-blowers  directly  informed  the  press.  These  are  merely 
instances  of  whistle-blowing  reported  in  the  media. 

The  second  source,  labeled  the  OSHA  sample,  consists  of  137  whistle-blowing  discrimination 
cases  gathered  from  written  requests  to  the  Occupational  Health  and  Safety  Adnunistration 
(OSHA).^  While  OSHA  handles  other  varieties  of  whistle-blower  allegations  besides  those  who 


®  While  the  vast  majority  of  OSHA  cases  were  not  resolved  in  favor  of  the  employee  whistle-blower  (Moberly  2007;  New 
York  Times  2009),  this  does  not  negate  the  importance  or  credibility  of  the  allegations.  First,  OSHA’s  rulings  were 
decisions  about  whether  the  whistle-blower  was  discriminated  against — not  rulings  on  whether  the  alleged  financial 
impropriety  was  taking  place.  It  is  entirely  possible  that  the  underlying  allegation  of  impropriety  was  credible  and 
important,  even  though  OSHA  determined  the  employee  was  not  treated  unfairly.  Second,  Moberly  (2007)  finds  that  one 
of  the  reasons  employee  whistle-blowers  have  not  been  successful  at  winning  claims  is  because  “OSHA  tended  to 
misapply  Sarbanes-Oxley’s  burden  of  proof  regarding  causation,  to  the  substantial  detriment  of  employees.”  Thus, 
despite  the  limited  success  employee  whistle-blowers  have  had  in  their  discrimination  cases  filed  with  OSHA,  it  remains 
unclear  whether  these  employees  were  raising  concerns  about  legitimate  cases  of  financial  malfeasance,  e.g.,  a  New  York 
Times  (2009)  editorial  argues  that  “Most  often,  the  reason  for  the  dismissal  has  nothing  to  do  with  the  merits  of  the 
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claim  to  have  uncovered  fraud  and  file  for  protection  under  SOX,  we  asked  for  and  received  data 

n 

related  only  to  financial  whistle-blowing  under  the  purview  of  SOX.  These  cases  represent 
whistle-blowers’  complaints  of  discrimination  in  the  workplace  (e.g.,  demotion,  termination)  for 
having  voiced  allegations  of  financial  impropriety.  In  essence,  the  OSHA  sample  reflects  the  very 
whistle-blowers  SOX  Section  806  was  intending  to  protect.  While  we  are  unable  to  observe 
whistle-blowing  allegations  that  did  not  result  in  alleged  employee  discrimination  (because  these 
allegations  were  not  reported  to  OSHA)  and  did  not  otherwise  become  public,  each  OSHA  case 
represents  a  previously  internal  allegation  of  financial  wrongdoing  being  made  public.  Figure  1 
highlights  the  external  whistle-blowing  allegations  we  are  able  to  observe. 

The  Press  sample  differs  from  the  OSHA  sample  in  several  ways.  First,  the  Press  sample 
likely  reflects  the  media’s  bias  in  covering  more  visible,  important,  or  sensational  whistle-blowing 
allegations.  In  contrast,  the  OSHA  sample  is  likely  to  be  less  biased  but  is  more  likely  to  include 
relatively  frivolous  complaints.  Second,  Callahan  and  Dworkin  (1994)  find  that  misdeeds  reported 
to  the  media  are  likely  to  involve  larger  sums  of  money  than  those  reported  to  other  outlets.  Third, 
observations  in  the  Press  sample,  by  definition,  represent  a  clear  public  revelation  of  the  whistle¬ 
blowing  allegation  whereas  the  OSHA  whistle-blowing  data  are  less  likely  to  be  widely  dissemi¬ 
nated  in  local  and  national  media  given  the  information  has  to  be  obtained  via  a  special  request 
from  a  government  agency.  Fourth,  we  can  ascertain  the  date  on  which  the  whistle-blowing 
allegation  was  made  public  based  on  the  news  story  in  the  press;  such  event-dating  is  difficult  with 
the  OSHA  sample.  Hence,  we  conduct  a  study  of  the  stock  market  reaction  surrounding  the 
whistle-blowing  event  only  for  the  Press  sample.  However,  to  the  extent  possible,  we  examine  the 
long-run  consequences  of  both  types  of  whistle-blowing  events. 

We  start  with  an  initial  Press  (OSHA)  sample  of  81  (137)  unique  firms.^  Panel  A  of  Table  1 
presents  an  annual  breakdown  of  the  firms  subject  to  whistle-blowing  actions.  The  number  of 
whistle-blowing  actions  in  the  Press  sample  is  relatively  stable  at  a  median  value  of  4  per  year, 
except  for  a  peak  of  23  events  in  1998  and  13  events  in  2002.  Because  the  OSHA  sample  begins 
in  August  2002,  this  year  has  only  12  events,  followed  by  62  events  in  2003  and  63  events  in  2004. 
Panel  B  of  Table  1  lists  the  nature  of  whistle-blowing  allegations  in  both  samples.  These  diverse 
allegations  range  from  clearly  “earnings  management”  to  potential  earnings  management  (e.g., 
“overbilling”  with  the  motivation  to  boost  revenue)  to  other  accounting  decisions  that  affect 
disclosure  (e.g.,  “improper  disclosure”  and  “accounting  irregularities”)  to  other  types  of  financial 
impropriety  not  directly  related  to  financial  reporting  (e.g.,  “insider  trading,”  “price-fixing”  and 
“tax  fraud”). 

Control  Samples 

We  rely  on  two  control  samples.  The  purpose  of  the  first  control  sample  (labeled  “broad 
control  sample”)  is  to  benchmark  our  whistle-blowing  firms  against  a  broad  set  of  firms  that  have 
not  been  publicly  accused  of  financial  improprieties  by  employee  whistle-blowers.  This  sample 
consists  of  all  firms  that  have  non-missing  CRSP  and  Compustat  data  for  all  of  the  independent 
variables  listed  in  model  (1).  All  observations  that  have  non-missing  data  in  the  year  before  the 
whistle-blowing  event  for  one  of  our  sample  firms  are  eligible  for  inclusion  in  the  broad  control 


case.”  Our  study  provides  direct  evidence  on  this  question. 

The  Sarbanes-Oxley  Act  did  not  assign  the  job  of  protecting  whistle-blowers  to  the  Securities  and  Exchange  Commis¬ 
sion.  Instead,  the  task  went  to  the  Labor  Department’s  OSHA  because  the  law’s  drafters  reasoned  that  OSHA  had  a 
record  of  helping  workers  penalized  for  voicing  concerns.  For  example,  OSHA  has  the  responsibility  of  enforcing  13 
additional  whistle-blower  statutes  covering  truckers,  airline  mechanics,  nuclear-power-plant  employees,  and  others. 
The  number  of  firms  used  in  each  empirical  test  is  reported  in  the  tables  that  follow  and  varies  depending  on  data 
availability. 
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TABLE  1 


Whistle-Blowing  Allegations  by  Year  and 

Type 

Panel  A: 

Whistle-Blowing  Allegations  by  Year 

Press 

OSHA 

Press 

OSHA 

1989 

1 

NA 

1997 

2 

NA 

1990 

0 

NA 

1998 

23 

NA 

1991 

1 

NA 

1999 

2 

NA 

1992 

3 

NA 

2000 

5 

NA 

1993 

4 

NA 

2001 

4 

NA 

1994 

5 

NA 

2002 

13 

12 

1995 

5 

NA 

2003 

8 

62 

1996 

5 

NA 

2004 

0 

63 

81 

137 

Panel  B:  Whistle-Blowing  by  Type  of  Allegation 


Press 

% 

OSHA 

% 

Earnings  Management 

24 

29.6% 

16 

11.7% 

Improper  Disclosure 

1 

1.2% 

4 

2.9% 

Overbilling 

40 

49.4% 

3 

2.2% 

Insider  Trading 

3 

3.7% 

1 

0.7% 

Price  Fixing 

6 

7.4% 

0 

0.0% 

Tax  Fraud 

2 

2.5% 

0 

0.0% 

Securities  Law  Violations 

0 

0.0% 

10 

7.3% 

Other  Accounting-Related 
Allegations 

1 

1.2% 

26 

19.0% 

Corporate  Governance  Issues 

0 

0.0% 

3 

2.2% 

Fraud 

0 

0.0% 

14 

10.2% 

Other 

4 

4.9% 

18 

13.1% 

No  Info 

0 

0.0% 

42 

30.7% 

81 

100.0% 

137 

100.0% 

sample  at  the  first  pass.  We  next  impose  the  following  filters:  (1)  no  observation  relating  to  any  of 
the  sample  firms  can  be  included  in  the  broad  control  sample,  and  (2)  no  control  firm  can  appear 
in  the  broad  control  sample  more  than  once  (i.e.,  no  more  than  one  firm-year  observation  for  each 
control  firm).  Whenever  we  found  multiple  observations  from  the  same  firm  in  the  control  sample, 
we  randomly  selected  just  one  firm-year. 

One  drawback  to  benchmarking  the  whistle-blowing  sample  to  a  broad  sample  of  firms  is  that 
firms  subject  to  a  whistle-blowing  event  are  inherently  different  from  the  average  firm.  Thus,  we 
construct  a  second  control  group  by  comparing  whistle-blowing  firms  to  other  firms  tarnished  by 
allegations  of  financial  misdeeds  that  have  not  been  exposed  to  a  public  whistle-blowing  event.  In 
particular,  we  identify  firms  that  were  subject  to  a  lawsuit  during  the  years  1989-2006  (“lawsuit 
sample”)  and  firms  that  filed  earnings  restatements  between  1997-2005  based  on  two  GAO  reports 
(“restatement  sample”).  We  emphasize  comparisons  between  our  whistle-blowing  samples  and  this 
second  control  group  because  they  allow  us  to  more  directly  assess  differences  between  firms  that 
face  whistle-blowing  allegations  and  other  tarnished  firms  that  avoid  public  allegations. 

We  rely  on  an  updated  version  of  the  lawsuit  sample  of  1,119  usable  firms  compiled  by 
DuCharme  et  al.  (2004)  covering  the  period  April  1988  through  May  2006  to  identify  firms  that 
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were  sued  in  connection  with  alleged  financial  misdeeds.  The  restatement  sample  is  drawn  from 
two  lists  of  firms  complied  by  the  General  Accounting  Office  (GAO  2002,  2006)  through  a 
Lexis-Nexis  search  with  variations  of  the  word  “restate.”  Data  limitations  on  several  of  the  inde¬ 
pendent  variables  constrain  the  restatement  sample  to  1,304  firm-year  observations.® 

IV.  CHARACTERISTICS  OF  FIRMS  FACING  EXTERNAL  WHISTLE-BLOWING 

ALLEGATIONS  (Ql) 

In  this  section,  we  present  evidence  on  the  characteristics  of  the  whistle-blowing  targets 
versus  the  two  control  samples  discussed  in  Section  III.  Given  the  Press  and  OSHA  samples  differ, 
we  separately  compare  each  whistle-blowing  sample  to  the  control  samples. 

Whistle-Blowing  Firms  versus  the  Broad  Control  Sample 

In  Panel  A  of  Table  2,  we  compare  whistle-blowing  firms  to  the  broad  control  sample  of  firms 
that  have  avoided  external  whistle-blowing  allegations.  We  estimate  multivariate  model  (1)  sepa¬ 
rately  for  the  Press  sample  in  columns  1  and  2  and  the  OSHA  sample  in  columns  3  and  4.  Columns 
1  and  3  present  results  without  the  external  monitoring  and  internal  governance  variables  because 
data  restrictions  on  these  variables  severely  restrict  our  sample  sizes.  Results  in  columns  2  and  4 
include  external  monitoring  and  internal  governance  variables. 

We  find  that,  relative  to  the  broad  control  sample,  whistle-blowing  firms  in  both  Press  and 
OSHA  samples  (1)  were  growing  more  rapidly  (with  statistically  significant  differences  reported  in 
columns  1,  2,  and  3),  (2)  had  relatively  unclear  internal  communication  channels  (all  four  col¬ 
umns),  (3)  came  from  QUITAM  environments  where  the  whistle-blower  had  an  opportunity  to 
share  in  the  proceeds  of  any  fraud-related  settlement  with  the  U.S.  Government  (columns  1,  3,  and 
4),  and  (4)  were  larger  (all  four  columns).  Results  in  columns  2  and  4  suggest  that  whistle-blowing 
targets  also  had  relatively  weak  external  monitoring.  We  also  find  some  evidence  that  capital 
market  pressures  were  associated  with  whistle-blowing  (columns  2  and  3).  Two  variables  were 
only  significant  for  the  OSHA  sample.  OSHA  firms  were  more  likely  to  (1)  have  a  strong  repu¬ 
tation  (columns  3  and  4),  and  (2)  have  undergone  employee  downsizing  (column  3). 

Whistle-Blowing  Firms  versus  Lawsuit  and  Restatement  Firms 

In  Panel  B  of  Table  2,  we  compare  the  whistle-blowing  sample  to  a  combined  sample  com¬ 
posed  of  the  lawsuit  and  restatement  control  samples. Although  both  whistle-blowing  firms  and 
these  control  firms  (lawsuit  and  restatement  firms)  are  likely  tarnished,  only  the  whistle-blowing 
targets  experienced  a  public  allegation.  Hence,  our  analysis  in  Panel  B  focuses  on  differences 
between  firms  where  employees  made  external  whistle-blowing  allegations  and  lawsuit  and  re¬ 
statement  firms  where  a  public  whistle-blowing  allegation  presumably  could  have  been  made. 

Compared  to  lawsuit  and  restatement  firms,  whistle-blowing  firms  (1)  were  growing  more 
rapidly  (significant  in  all  four  columns  of  Panel  B),  (2)  were  relatively  more  successful  in  their 
prior  stock  return  performance  (all  four  columns),  (3)  had  experienced  employee  downsizing 
(columns  2,  3,  and  4),  and  (4)  were  larger  (all  four  columns).^  In  addition,  OSHA  firms  (1)  were 
more  highly  regarded  (columns  3  and  4),  (2)  came  from  QUITAM  environments  (columns  3  and 


We  eliminate  firms  that  overlap  the  whistle-blowing  treatment  sample  and  the  tarnished-firm  control  sample  (lawsuits 
and  restatements)  to  avoid  any  mechanical  correlation  between  these  samples. 

In  an  effort  to  capture  the  period  of  alleged  financial  malfeasance  for  our  control  firms,  for  lawsuit  firms  we  calculate  the 
average  of  all  our  continuous  variables  over  the  years  of  alleged  wrongdoing.  For  restatement  firms  we  calculate  the 
average  of  all  our  continuous  variables  over  the  three  years  prior  to  the  restatement  announcement  (t-1  through  t-3). 
For  comparability,  we  also  use  prior  three-year  averages  for  the  continuous  variables  in  our  whistle-blowing  sample. 
We  acknowledge  that  the  finding  that  whistle-blowing  firms  are  characterized  by  relatively  more  successful  prior  stock 
return  performance  may  be  driven  by  the  fact  that  lawsuits  are  often  triggered  by  poor  stock  performance. 
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4),  and  (3)  had  relatively  weak  external  monitoring  (column  4).  Press  firms  appeared  to  have 
worse  governance  (column  2). 

In  general,  a  comparison  of  whistle-blowing  firms  to  lawsuit  and  restatement  firms  suggests 
whistle-blowing  firms  were  systematically  different  from  other  tarnished  firms.  These  unique 
environmental  factors  encouraged  whistle-blowers  to  make  their  allegations  public  and  help  ex¬ 
plain  why  some  firms  were  subject  to  whistle-blowing  allegations  while  other  firms  that  presum¬ 
ably  could  have  been  targeted  by  whistle-blowers  did  not  experience  such  allegations. 

V.  IMMEDIATE  (Q2)  AND  SUBSEQUENT  (Q3)  CONSEQUENCES  OF 
WHISTLE-BLOWING  ALLEGATIONS 
Immediate  Stock  Market  Reactions  to  Whistle-Blowing  Announcements 

Recall  that  we  can  compute  stock  market  reactions  only  for  the  Press  sample,  as  the  dates  on 
which  the  OSHA  whistle-blowing  events  become  public  are  not  generally  available.  Panel  A  of 
Table  3  presents  evidence  on  the  stock  market  reaction  to  whistle-blowing  events  measured  around 
five  days,  from  the  day  before  the  announcement  to  three  days  after  (day  —1  through  day  -t-3). 
RET  represents  raw  returns  while  CAR  signifies  market-adjusted  returns.  The  mean  five-day  ab¬ 
normal  market- adjusted  stock  market  reaction  was  —2.84  percent.  The  median  abnormal  reactions, 
while  smaller  in  magnitude,  were  also  significantly  negative.  Given  that  the  median  market  capi¬ 
talization  of  a  Press  sample  firm  was  $7.3  billion  (untabulated),  market  reactions  following 
whistle-blowing  allegations  resulted  in  significant  losses  in  shareholder  value,  on  average.  The 
frequency  of  negative  reactions  is  also  high,  at  67.9  percent  of  the  sample.'^ 

These  significant  negative  reactions  suggest  that,  on  average,  announcements  of  whistle¬ 
blowing  allegations  were  economically  important  for  the  targeted  firm  and  were  not  immediately 
seen  as  frivolous  by  stock  market  participants.  We  consider  subsequent  economic  consequences  of 
these  whistle-blowing  allegations  later  in  Section  V. 

Cross-Sectional  Determinants  of  the  Stock  Market  Reactions  to  Whistle-Blowing  Announce¬ 
ments 

In  Panel  B  of  Table  3,  we  attempt  to  explain  the  cross-sectional  distribution  of  stock  market 
reactions  to  a  whistle-blowing  event  by  including  five  factors  that  potentially  affect  investors’ 
reaction  to  whistle-blowing  allegations.  First,  we  expect  larger  firms  {SIZE)  to  suffer  larger  nega¬ 
tive  stock  price  reactions.  Larger  firms  have  deeper  pockets  and  are  more  likely  to  be  the  subject 
of  a  subsequent  class  action  lawsuit.  Second,  highly  regarded  firms  have  more  credibility  to  lose 
from  an  external  whistle-blowing  action  reported  in  .the  financial  press.  We  measure  a  firm’s 
reputation  based  on  whether  the  firm  is  ranked  as  most  admired  or  best  to  work  for  per  Eortune 
magazine  {REPUTATION) .  Third,  awareness  of  whistle-blowing  actions  likely  heightened  after  the 
passage  of  SOX,  and  we  expect  the  stock  market  to  react  more  negatively  to  a  whistle-blowing 
action  in  the  post-SOX  era.  We  define  a  variable,  SOX,  set  equal  to  1  if  the  whistle-blowing  action 
was  reported  in  the  press  after  the  calendar  year  2002.  Fourth,  we  argue  that  allegations  of 
earnings  management  and  overbilling  represent  especially  large  agency  issues,  and  we  predict 
these  allegations  are  likely  to  result  in  particularly  negative  stock  market  reactions.  EM  is  set  to  1 
if  the  allegation  pertains  to  earnings  management  (and  0  otherwise)  and  OVERBILLING  is  set  to 
1  if  the  allegation  involves  overbilling  (and  0  otherwise). 

Panel  B  of  Table  3  presents  evidence  on  the  cross-sectional  variation  in  stock  price  reactions 
around  whistle-blowing  allegations.  Two  of  our  predictions  are  consistently  significant.  We  find 
that  the  stock  market  viewed  whistle-blowing  allegations  relatively  negatively  for  firms  with 


We  find  qualitatively  similar  results  when  we  examine  three-day  (day  —1  through  day  -1-1)  returns  instead  of  five-day 
returns. 
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TABLE  3 


What  is  the  Immediate  Stock  Market  Reaction  to  Whistle-Blowing  Announcements?  (Q2) 
Panel  A:  Five-Day  Market  Reaction  (n  =  78) 


%  negative 

Mean 

Qi 

Median 

Q3 

p-value“ 

p-value*’ 

RET 

60.3% 

-0.0246 

-0.0478 

-0.0125 

0.0208 

0.055* 

0.033** 

CAR 

67.9% 

-0.0284 

-0.0592 

-0.0147 

0.0284 

0.024** 

0.013** 

Panel  B:  Cross-Sectional  Determinants  of  the  Market  Reaction 

Model :  CAR  =  /So  +  PiSIZE  +  P2REPUTATION  +  /S3SOA  -1-  p^EM  +  psOVERBILLING  +  e 


Independent  Variables 

Pred. 

Sign 

Coefficient 

t-statistic 

Intercept 

? 

0.070 

1.73* 

SIZE 

- 

-0.000 

-0.69 

REPUTATION 

- 

-0.024 

-0.31 

SOX 

- 

-0.041 

-0.87 

EM 

- 

-0.143 

-3.04*** 

OVERBILLING 

- 

-0.074 

-1.62* 

Adj.  r2 

8.9% 

F-statistic 

2.24 

0.063* 

n 

64 

*,  **,  ***  Significant  at  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

p- values  are  one-sided  for  variables  with  directional  predictions.  We  report  (1  -  p)  values  for  coefficients  that  assume  a 
sign  opposite  to  the  one  predicted. 

“  p-values  are  one-sided  associated  with  t-statistics. 

p-values  are  one-sided  associated  with  Wilcoxon  Z-statistics. 

Variable  Definitions  (where  firm  and  year  subscripts  are  suppressed): 

RET  =  five-day  cumulative  raw  returns  beginning  the  day  before  the  whistle-blowing  allegation; 

CAR  =  five-day  cumulative  abnonnal  returns  beginning  the  day  before  the  whistle-blowing  allegation; 

SIZE  =  total  sales  revenue  for  the  year  before  the  whistle-blowing  event; 

REPUTATION  =  an  indicator  variable  coded  as  1  if  the  firm  has  appeared  on  Fortune's  “Best  Companies  to  Work  For” 
list  or  on  Fortune's  “Most  Admired  Companies”  list  in  any  of  the  five  years  prior  to  the  whistle-blowing 
event,  and  coded  as  0  otherwise; 

SOX  =  an  indicator  variable  equal  to  1  if  the  allegation  was  made  later  after  2002,  0  otherwise; 

EM  =  an  indicator  variable  equal  to  1  if  the  allegation  is  related  to  earnings  management,  0  otherwise;  and 

OVERBILLNG  =  an  indicator  variable  equal  to  1  if  the  allegations  related  to  overbilling,  0  otherwise. 


alleged  earnings  management.  Specifically,  the  coefficient  on  EM  is  -0.143  (t-statistic  =  -3.04) 
indicating  that  allegations  related  to  earnings  management  resulted  in  an  average  incremental 
(overall)  - 14.3  percent  (-7.3  percent)  return  over  the  five-day  window.  Allegations  of  overbilling 
indicate  an  average  incremental  (overall)  -7.4  percent  (-0.4  percent)  five-day  return.  Hence,  it 
appears  that,  on  average,  whistle-blowing  events  were  far  from  trivial  for  targeted  firms,  particu¬ 
larly  for  firms  accused  of  managing  earnings. 

Subsequent  Consequences  of  Whistle-Blowing  Events  (Q3) 

In  this  section,  we  explore  four  potential  subsequent  consequences  of  whistle-blowing  events: 
(a)  earnings  restatements,  (b)  shareholder  lawsuits,  (c)  future  operating  performance,  and  (d) 
future  stock  market  performance.  If  a  whistle-blowing  event  reveals  financial  reporting  or  agency 
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problems  in  the  firm,  then  one  would  expect  firms  subject  to  such  allegations  to  be  more  likely  to 
file  future  earnings  restatements,  to  be  the  target  of  shareholder  lawsuits,  and  to  suffer  weak  future 
operating  and  stock  return  performance. 

We  recognize  the  issue  of  endogeneity  in  evaluating  the  consequences  of  whistle-blowing 
events.  In  other  words,  when  we  assess  future  economic  consequences  such  as  operating  or  stock 
return  performance,  it  is  possible  that  whistle-blowing  allegations  arise  from  many  of  the  same 
economic  forces  that  contribute  to  subsequent  problems  after  a  whistle-blowing  allegation.  Hence, 
it  would  be  inappropriate  to  treat  the  whistle-blowing  event  as  exogenous,  especially  given  that  we 
attempt  to  model  the  factors  associated  with  whistle-blowing  in  Section  IV. 

We  control  for  the  potential  endogeneity  between  whistle-blowing  allegations  and  future 
economic  consequences  by  comparing  whistle-blowing  targets  to  a  sample  of  control  firms 
matched  on  the  propensity  to  experience  a  whistle-blowing  event  (Armstrong  et  al.  2009;  Francis 
and  Lennox  2008).  The  primary  benefit  of  using  a  control  sample  matched  on  propensity  scores  is 
that  it  allows  us  to  compare  whistle-blowing  firms  to  a  set  of  firms  that  are  the  same  on  all 
observable  dimensions,  thus  allowing  us  to  more  clearly  attribute  any  observed  consequence  to  the 
whistle-blowing  event  itself,  rather  than  to  the  firm  characteristics  associated  with  whistle-blowing 
allegations.  From  a  practical  standpoint,  this  matched  sample  also  allows  us  to  compare  whistle¬ 
blowing  firms  to  control  firms  along  a  variety  of  dimensions  (e.g.,  restatements,  lawsuits,  operat¬ 
ing  performance,  stock  return  performance)  without  the  need  for  additional  control  variables.  In 
essence,  our  propensity-score  matching  procedure  directly  controls  for  any  relevant  difference 
between  whistle-blowing  and  control  firms. 

To  identify  the  propensity-score  matched  control  sample,  we  estimate  model  (1)  for  the 
whistle-blowing  firms  and  the  broad  control  sample  (as  outlined  in  Panel  A  of  Table  2),  and 
calculate  a  propensity  score  for  each  firm,  which  is  the  conditional  probability  of  receiving  the 
treatment  effect  (i.e.,  the  probability  that  the  firm  experienced  a  whistle-blowing  event)  given  all 
the  observable  data.'^  For  each  whistle-blowing  target,  we  select  two  control  firms  with  the  closest 
propensity  scores,  and  these  firms  constitute  the  propensity-score  matched  control  sample.  The 
result  is  that  each  whistle-blowing  firm  is  matched  to  two  firms  that  were  not  subject  to  whistle¬ 
blowing  allegations  but  are  similar  along  all  other  relevant  dimensions. Since  the  matched 
control  sample  is  similar  along  every  observable  dimension,  any  difference  in  outcome  between 
whistle-blowing  and  control  samples  can  be  attributed  to  the  whistle-blowing  event. 

We  measure  “covariate  balance”  to  ensure  that  the  whistle-blowing  and  matched  control 
samples  are  similar  across  all  dimensions  except  the  variable  of  interest  (i.e.,  being  the  target  of 
whistle-blowing  allegations).'^  In  general,  untabulated  results  suggest  that  in  comparing  whistle¬ 
blowing  firms  to  these  control  firms,  we  effectively  control  for  the  potential  endogeneity  linking 
whistle-blowing  allegations  and  future  consequences.  However,  we  do  find  a  significant  difference 
in  the  distribution  (but  not  the  mean)  of  SIZE  when  comparing  whistle-blowing  and  control 
samples.  Therefore,  to  be  conservative,  we  control  for  SIZE  in  subsequent  tests. 


In  order  to  maximize  the  number  of  observations  available  for  the  propensity  score  matching  procedure,  we  estimate 
model  (1)  excluding  EXT_MONITOR  and  GOVERNANCE. 

We  choose  two  matched  firms  for  each  whistle-blowing  target  in  an  effort  to  maximize  the  number  of  firms  in  this 
control  sample  while  ensuring  that  there  is  no  statistical  difference  in  the  propensity  scores  between  treatment  and 
control  firms. 

We  formally  assess  covariate  balance  by  testing  whether  the  means  and  medians  of  the  covariates  used  in  model  (1) 
differ  between  the  whistle-blowing  and  controls  samples.  The  untabulated  results  suggest  that  the  propensity-score 
matched  samples  resemble  the  whistle-blowing  samples  along  virtually  all  dimensions.  Specifically,  there  is  no  signifi¬ 
cant  difference  in  means  of  any  of  the  covariates  between  whistle-blowing  and  matched  control  firms  (for  either  the 
Press  or  OSHA  samples).  Furthermore,  the  Kolmogorov-Smimov  test  fails  to  reject  the  null  hypothesis  of  similar 
distributions  for  each  of  these  variables,  with  the  exception  of  SIZE. 
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Do  Whistle-Blowing  Announcements  Lead  to  Subsequent  Earnings  Restatements?  (Q3a) 

To  assess  subsequent  consequences  (in  this  and  the  following  lawsuit-based  tests),  we  rely  on 
the  restatement  and  lawsuit  samples  discussed  in  Section  III  with  one  major  exception.  Unlike 
Section  III,  we  do  not  eliminate  whistle-blowing  firms  that  experienced  a  restatement  or  lawsuit, 
as  the  purpose  of  this  analysis  is  to  document  the  existence  of  such  a  link  should  it  exist. 

The  results  in  Panel  A  of  Table  4  indicate  that  whistle-blowing  firms  in  the  Press  sample 
experienced  significantly  more  subsequent  restatements  (p  =  0.013)  in  the  three  years  following  a 
whistle-blowing  action  (17.9  percent)  than  propensity-score  matched  control  firms  (7.5  percent). 
Panel  B  of  Table  4  reports  that  whistle-blowing  firms  in  the  OSHA  sample  also  experienced  more 
restatements  (13.7  percent)  than  propensity-score  matched  control  firms  (9.5  percent),  but  the 
difference  is  not  significant  at  conventional  levels  (p  =  0.141).  However,  when  we  look  only  at  the 
subset  of  future  restatements  that  represent  accounting  irregularities  (as  defined  by  Hennes  et  al. 
[2008]),  we  find  that  OSHA  firms  were  more  likely  to  experience  an  accounting  irregularity  than 
their  matched  counterparts  (p  =  0.053,  untabulated). 

Because  the  whistle-blowing  and  propensity-score  matched  control  samples  had  different 
SIZE  distributions,  we  also  controlled  for  SIZE.  These  results,  reported  in  Panel  C  of  Table  4, 
support  our  earlier  conclusions.  In  sum,  firms  exposed  to  whistle-blowing  events  reported  in  the 
press  were  more  likely  to  experience  a  subsequent  restatement  than  were  matched  control  firms 
with  similar  underlying  characteristics. 

Do  Whistle-Blowing  Announcements  Lead  to  Subsequent  Shareholder  Lawsuits?  (Q3b) 

Table  5  provides  evidence  on  whether  whistle-blowing  allegations  were  a  precursor  to  share¬ 
holder  lawsuits.  The  sample  analyzed  for  this  test  again  includes  67  Press  firms  and  their  134 
propensity-score  matched  control  firms,  and  95  OSHA  firms  along  with  their  190  matched  control 
firms.  The  results  in  Panel  A  of  Table  5  indicate  that  whistle-blowing  firms  in  the  Press  sample 
experienced  significantly  more  subsequent  lawsuits  (p  =  0.004)  following  a  whistle-blowing 
action  (26.9  percent)  than  propensity-score  matched  control  firms  (11.9  percent).  Panel  B  of  Table 
5  reports  that  whistle-blowing  firms  in  the  OSHA  sample  also  experienced  more  lawsuits  (11.6 
percent)  than  propensity-score  matched  control  firms  (8.4  percent),  but  the  difference  is  not  sig¬ 
nificant  at  conventional  levels  (p  =  0.196).  Controlling  for  SIZE  in  Panel  C  yields  similar  con¬ 
clusions.  In  summary,  the  evidence  in  Tables  4  and  5  suggests  that  a  whistle-blowing  event 
reported  in  the  press  can  be  an  early  warning  indicator  of  future  earnings  restatements  or  share¬ 
holder  lawsuits,  and  that  whistle-blowing  allegations  are  not  trivial  and/or  baseless,  on  average. 

Are  Whistle-Blowing  Events  Associated  with  Relatively  Poor  Future  Operating  Performance‘s 
(Q3c) 

In  this  and  the  following  section,  we  evaluate  whether  whistle-blowing  allegations  serve  as  an 
early  warning  of  negative  future  operating  or  stock  price  performance.  The  link  between  a  whistle¬ 
blowing  event  and  negative  future  performance  can  be  motivated  in  at  least  three  ways.  First, 
given  our  earlier  evidence  that  whistle-blowing  can  be  a  significant  indicator  of  subsequent  eam- 


16 


We  verified  that  the  whistle-blowing  announcement  preceded  the  earnings  restatement  before  the  announcement  entered 
the  empirical  analyses.  However,  note  that  the  GAO  report  discloses  the  date  of  the  restatement  announcement,  and  not 
the  years  being  restated.  Data  on  the  year  being  restated  is  generally  patchy  and  unreliable.  Hence,  we  did  not  pursue  an 
mvesfigation  into  the  year  for  which  the  financial  statements  were  restated. 

The  time-windows  and  data  selection  for  the  restatement  tests  need  some  elaboration.  We  assume  that  whistle-blowing 
events  increase  the  probability  of  a  restatement  announcement  within  the  next  three  years  (including  the  year  of  the 
allegation).  Because  the  GAO  restatement  data  are  available  only  from  1997  to  2005,  we  are  forced  to  delete  observa¬ 
tions  that  occur  m  and  before  1993.  Thus,  the  sample  analyzed  for  these  tests  includes  67  Press  firms  and  their  134 
propensity-score  matched  control  firms,  and  95  OSHA  firms  along  with  their  190  propensity-score  matched  control 
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TABLE  4 

Long-Term  Consequences:  Are  Whistle-Blowing  Allegations  Associated  with  Subsequent 

Restatements?  (Q3a) 


Panel  A:  Restatement  Frequency — Press  versus  Propensity-Score  Matched  Sample 
_ Press  Sample _  Propensity-Score  Matched  Sample  Significance^ 


#  of 

Restatements 

#of 

Firms 

%  with 
Restatements 

#of 

Restatements 

#of 

Firms 

%  with 
Restatements 

p-value 

12 

Cl 

17.9% 

10 

134 

7.5% 

0.013** 

Panel  B:  Restatement  Frequency — OSHA  versus  Propensity-Score  Matched  Sample 
_ OSHA  Sample _  Propensity-Score  Matched  Sample  Significance^ 


#  of 

Restatements 

#of 

Firms 

%  with 
Restatements 

#  of 

Restatements 

#of 

Firms 

%  with 
Restatements 

p-value 

13 

95 

13.7% 

18 

190 

9.5% 

0.141 

Panel  C:  Logistic  Regressions 

RESTATE, =  ;eo  +  +  P2SIZE,_y  + 


Press  versus  Propensity-Score  OSHA  versus  Propensity-Score 


Independent  Variables 

Pred. 

Sign 

Matched  Sample 

Matched  Sample 

Coefficient 

p-value 

Coefficient 

p-value 

WB 

-f 

1.010 

0.016** 

0.406 

0.157 

SIZE 

7 

0.000 

0.470 

0.000 

0.621 

Pscudo-R^ 

4.1% 

0.8% 

Likelihood  Ratio  Statistic 

5.432 

0.066* 

1.467 

0.480 

ROC  Curve  Statistic 

0.666 

0.560 

n 

npRESS  = 

66 

HqSHA  = 

94 

nCONTROL  = 

=  124 

I^CONTROL  = 

=  182 

*,  **,  ***  Significant  at  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

p-values  are  one-sided  for  variables  with  directional  predictions.  We  report  (1  —  p)  values  for  coefficients  that  assume  a 
sign  opposite  to  the  one  predicted. 

“  p-values  are  associated  with  the  Pearson  Chi-square  test  statistic,  and  test  for  differences  in  the  proportion  of  firms  that 
experience  a  restatement. 

Variable  Definitions  (where  firm  subscripts  are  suppressed): 

RESTATE, ^2,  =  1  if  the  firm  experienced  a  restatement  in  the  event  year  or  in  the  three  subsequent  years,  0  otherwise; 
WB,  =  1  if  the  firm  experienced  a  whistle-blowing  event,  0  otherwise;  and 
SIZE,_i  =  total  sales  revenue  (Compustat  annual  #12)  for  the  year  before  the  whistle-blowing  event. 


ings  restatements  and  lawsuits,  it  is  perhaps  natural  to  expect  negative  future  performance  subse¬ 
quent  to  a  whistle-blowing  event.  Second,  managers  may  engage  in  defensive  actions  to  mitigate 
negative  publicity  and  other  consequences  of  a  whistle-blowing  event.  Such  defensive  efforts  can 
distract  managers  from  normal  profit-seeking  activities  and  siphon  funds  away  from  strategic 
investments.  Third,  a  whistle-blowing  event  can  hurt  the  firm’s  image  with  its  stakeholders,  result 
in  a  loss  of  confidence  in  management,  increase  the  probability  of  management  turnover,  and/or 
increase  the  perception  that  additional  problems  are  lurking. 
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TABLE  5 


Long-term  Consequences:  Are  Whistle-Blowing  Allegations  Associated  with  Subsequent 

Lawsuits?  (Q3b) 


Panel  A:  Lawsuit  Frequency — Press  versus  Propensity-Score  Matched  Sample 

Press  Sample  Propensity-Score  Matched  Sample  Significance® 


#of 

Lawsuits 

#of 

Firms 

%  with 
Lawsuits 

#of 

Lawsuits 

#  of 
Firms 

%  with 
Lawsuits 

p-value 

18 

67 

26.9% 

16 

134 

11.9% 

0.004*** 

Panel  B:  Restatement  Frequency — OSHA  versus  Propensity-Score  Matched  Sample 

OSHA  Sample  Propensity-Score  Matched  Sample  Significance® 

#  of  %  with  #  of  %  with 

#  of  Lawsuits  Firms  Lawsuits  #  of  Lawsuits  Firms  Lawsuits  p-value 

11  95  11.6%  16  190  8.4%  0.196 


Panel  C:  Logistic  Regressions 

LA  WSUIT,^3  =  +  WB,  +  fi2SIZE,_i  +  £,^3 


Independent 

Variables 

Pred. 

Sign 

Press  versus  Propensity-Score 
Matched  Sample 

OSHA  versus  Propensity-Score 
Matched  Sample 

Coefficient 

p-value 

Coefficient 

p-value 

WB 

-t- 

0.849 

0.017** 

0.234 

0.293 

SIZE 

7 

0.000 

0.025** 

0.000 

0.308 

Pseudo-R^ 

6.2% 

0.8% 

Likelihood  Ratio  Statistic 

10.598 

0.005*** 

1.484 

0.476 

ROC  Curve  Statistic 

0.728 

0.584 

n 

npRESS  =  66 

^OSHA  = 

94 

^CONTROL  =124 

r>CONTROL  = 

=  182 

*,  ***  Significant  at  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

p-values  are  one-sided  for  variables  with  directional  predictions.  We  report  (1  -  p)  values  for  coefficients  that  assume  a 
sign  opposite  to  the  one  predicted. 

p-values  are  associated  with  the  Pearson  Chi-square  test  statistic,  and  test  for  differences  in  the  proportion  of  firms  that 
experience  a  lawsuit. 

Variable  Definitions  (where  firm  subscripts  are  suppressed): 

LAWSUIT —  1  if  the  firm  experienced  a  lawsuit  in  the  event  year  or  in  the  three  subsequent  years,  0  otherwise; 

WB,  =  1  if  the  firm  experienced  a  whistle-blowing  event,  0  otherwise;  and 
SIZE,_^  =  total  sales  revenue  (Compustat  annual  #12)  for  the  year  before  the  whistle-blowing  event. 


In  analyzing  the  link  between  whistle-blowing  events  and  future  operating  performance,  we 
calculate  cumulative  return  on  assets  (ROA)  for  three  time  windows  following  the  whistle-blowing 
event  ranging  from  one  to  three  years  (i.e.,  ROA  for  year  +1;  years  -t-1  and  +2;  and  years  +1, 
+2,  and  +3).  The  data  in  Panels  A  and  B  of  Table  6  indicate  that  both  Press  and  OSHA  firms 
report  lower  ROA  in  the  year  immediately  following  the  whistle-blowing  allegation.  Beyond  the 
first  year,  whistle-blowing  firms  did  not  report  statistically  lower  ROA.  In  Panels  C  and  D,  we 
control  for  SIZE,  finding  similar  results,  except  that  Press  firms  also  reported  lower  ROA  in  the 
two  years  after  the  whistle-blowing  event. 
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TABLE  6 

Long-Term  Consequences:  Are  Whistle-Blowing  Allegations  Associated  with  Poor  Future 

Operating  Performance?  (Q3c) 

Panel  A:  Future  ROA — Press  versus  Propensity-Score  Matched  Sample 


Means 

Medians 

Press 

Control 

p-value“ 

Press 

Control 

p-value'’ 

ROAj^i 

0.020 

0.049 

0.015** 

0.035 

0.043 

0.020** 

(58) 

(101) 

(58) 

(101) 

ROAi^2 

0.061 

0.095 

0.106 

0.081 

0.094 

0.201 

(53) 

(81) 

(53) 

(81) 

0.099 

0.128 

0.255 

0.109 

0.098 

0.501 

(44) 

(59) 

(44) 

(59) 

Panel  B:  Future  ROA — OSHA  versus  Propensity-Score  Matched  Sample 

Means 

Medians 

OSHA 

Control 

p-value^ 

OSHA 

Control 

p-value’’ 

ROAf^i 

0.017 

0.044 

0.070* 

0.035 

0.048 

0.030** 

(84) 

(155) 

(84) 

(155) 

ROAf^2 

0.057 

0.063 

0.445 

0.071 

0.087 

0.264 

(40) 

(59) 

(40) 

(59) 

^OAi+j 

0.160 

0.115 

0.619 

0.233 

0.229 

0.526 

(6) 

(8) 

(6) 

(8) 

Panel  C:  OLS  Regressions- 

—Press  versus  Propensity-Score  Matched  Sample 

^OAt+„  =  Pa  +  PiWBf  +  P2SIZEf_i  +  £(+„ 

Independent 

Variables 

Pred. 

Sign 

ROAt+i 

ROA, ^2 

^0A,^2 

Coeff. 

p-value 

Coeff.  p-value 

Coeff. 

p-value 

WB 

_ 

-0.025 

0.032** 

-0.037  0.097* 

-0.024 

0.303 

SIZE 

? 

0.000 

0.040** 

0.000  0.012**  0.000 

0.140 

Adj.  R2 

3.1% 

4.1% 

0.3% 

F-statistic 

3.41 

0.036** 

3.66  0.029* 

*  1.13 

0.327 

n 

IlpRESS  = 

55 

npRESS  =  49 

npRESS  -  41 

^CONTROL 

=  96 

nCONTROL  =  ?6 

^CONTROL  =  56 

Panel  D:  OLS  Regressions- 

—OSHA  versus  Propensity-Score  Matched  Sample 

ROA,^„  =  Pa  +  PiWBt  +  P2SIZE,_i  + 

Independent 

Variables 

Pred. 

Sign 

ROAf^i 

ROAt+2 

ROAt^2 

Coeff. 

p-value 

Coeff.  p-value 

!  Coeff. 

p-value 

WB 

— 

-0.035 

0.029** 

-0.021  0.317 

0.018 

0.457 

SIZE 

? 

0.000 

0.168 

0.000  0.274 

0.000 

0.685 

Adj.  R2 

1.1% 

-0.8% 

-15.4% 

F-statistic 

2.29 

0.103 

0.64  0.527 

0.13 

0.878 

n 

Hqsha  = 

82 

Hqsha  =  39 

^OSHA  -  6 

r>CONTROL  ■ 

=  153 

nCONTROL  =  58 

ncONTROL  =  8 
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*,  **,  ***  Significant  at  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

Coefficients  (t-statistics)  are  reported,  p-values  are  one-sided  for  variables  with  directional  predictions.  We  report  ( 1  —  p) 
values  for  coefficients  that  assume  a  sign  opposite  to  the  one  predicted. 

^  p-values  are  associated  with  t-statistics. 

^  p-values  are  associated  with  Wilcoxon  z-statistics. 

Variable  Definitions  (where  firm  subscripts  are  suppressed): 

ROA,+„  =  buy-and-hold  return  on  assets,  starting  with  ROA  in  the  year  after  the  event  date,  and  ending  n  years 
after  the  whistle-blowing  event; 

WB,  =  1  if  the  firm  experienced  a  whistle-blowing  event,  0  otherwise;  and 
SIZEf^i  =  total  sales  revenue  (Compustat  annual  #12)  for  the  year  before  the  whistle-blowing  event. 


Are  Whistle-Blowing  Events  Associated  with  Relatively  Poor  Future  Stock  Price  Perfor¬ 
mance?  (Q3d) 

Table  7  presents  data  on  future  stock  return  performance  of  firms  subject  to  whistle-blowing 
allegations.  In  particular,  we  consider  monthly  returns  over  three  years  following  the  month  in 
which  the  whistle-blowing  allegation  becomes  public.  Thus,  this  event  window  does  not  overlap 
with  our  immediate  consequences  tests  reported  earlier  in  Section  V.  Panel  A  of  Table  7  reveals 
Press  firms  reported  lower  abnormal  returns  over  the  one,  two,  and  three  years  following  the 
whistle-blowing  event.  Panel  B  reports  some  evidence  consistent  with  OSHA  firms  also  reporting 
lower  returns  over  the  subsequent  two  or  three  years. 

In  Panels  C  and  D  of  Table  7,  we  regress  raw  monthly  returns  RET  on  (1)  SIZE  because 
whistle-blowing  firms  and  the  propensity-score  matched  control  firms  did  not  exhibit  the  same 
distribution  along  this  one  dimension,  and  (2)  the  other  three  Fama-French  factors,  including 
momentum.  We  include  a  WB  indicator  equal  to  1  for  firms  in  the  whistle-blowing  sample,  and  0 
otherwise,  which  allows  for  a  different  intercept  for  whistle-blowing  firms  and  control  firms.  The 
regressions  are  conducted  using  the  firm-month  as  the  unit  of  analysis.  The  variable  of  interest  in 
these  analyses  is  WB.  If  whistle-blowing  allegations  were  associated  with  poor  future  stock  return 
performance,  then  we  ought  to  observe  a  negative  coefficient  on  WB,  after  controlling  for  other 
factors  influencing  stock  returns.  The  results  reported  in  Panels  C  and  D  of  Table  7  are  generally 
consistent  with  the  univariate  results.  Press  firms  exhibited  lower  returns  in  year  + 1  and  year  +2, 
while  OSHA  firms  exhibited  lower  returns  in  year  +2  and  year  +3.  In  general,  we  find  that  firms 
targeted  by  whistle-blowers  suffered  lower  returns  in  subsequent  years,  which  in  turn  suggests  that 
whistle-blowing  allegations  were  not  frivolous  and  without  merit,  on  average. 

VI.  DO  WHISTLE-BLOWING  TARGETS  RESPOND  BY  CHANGING  CORPORATE 

GOVERNANCE?  (Q4) 

In  this  section,  we  investigate  our  fourth  research  question  (Q4):  What  actions  (if  any)  do 
whistle-blowing  targets  take  to  improve  corporate  governance  following  the  revelation  of  a 
whistle-blowing  allegation?  It  is  not  clear  whether  firms  will  take  meaningful  actions  to  improve 
corporate  governance  following  a  whistle-blowing  event.  For  example,  a  targeted  firm  can  easily 
initiate  cosmetic  changes  such  as  replacing  one  set  of  inside  directors  with  another  set  of  insiders. 
In  extreme  cases,  a  target  firm  might  even  fire  the  whistle-blower. 
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Changes  in  Governance  Variables 

We  hand-collect  governance  data  for  the  Press  and  OSHA  whistle-blowing  samples  from  their 
10-Ks  and  proxy  statements/^  To  limit  data  collection  to  manageable  levels,  we  focus  on  six 
aspects  of  governance,  details  of  which  have  been  discussed  earlier  in  Section  II:  (1)  the  number 
of  directors  on  the  board  (DIRECTORS);  (2)  the  proportion  of  insiders  on  the  board  (%INS_DIR); 
(3)  the  proportion  of  the  board  composed  of  busy  directors  where  busy  is  defined  as  three  addi¬ 
tional  directorships  (%BUSY);  (4)  the  percentage  attendance  at  board  meetings  (%ATTEND);  (5) 
the  proportion  of  firms  where  the  CEO  is  also  the  chairman  of  the  board  {CEO=COB%),  and  (6) 
the  proportion  of  firms  with  a  new  CEO  (NEW_CEO%).  We  calculate  three-year  changes  in  these 
six  governance  variables  from  the  year  before  the  whistle-blowing  event  (year  -1)  to  two  years 
after  the  event  occurs  (year  +2).  Of  the  81  (137)  possible  Press  (OSHA)  firms,  we  are  unable  to 
find  10-Ks  and/or  proxy  statements  for  24  (47)  firms. 

Since  governance  improvements  were  encouraged  in  the  period  following  the  accounting 
scandals  and  SOX,  we  also  collect  governance  data  for  a  sample  of  matched  firms  that  were  not 
subject  to  a  whistle-blowing  allegation.  We  select  matched  firms  with  the  same  four-digit  SIC 
code,  within  the  same  stock  exchange,  and  with  net  sales  as  close  as  possible  to  the  whistle¬ 
blowing  firm’s  net  sales  for  the  year  before  the  whistle-blowing  event. 

Table  8  reports  the  multivariate  analyses  that  compare  each  governance  change  between  the 
Press  sample  and  their  matched-firms  counterparts.  Control  variables  include:  size  (SALES),  past 
stock  market  performance  (RET),  and  the  change  in  firm  size  over  the  corresponding  three-year 
period  (ASIZE)  to  control  for  the  possibility  that  changes  in  governance  (e.g.,  smaller  board  size) 
were  simply  driven  by  changes  in  firm  size.  We  find  that,  compared  to  the  matched  control  sample, 
firms  exposed  to  whistle-blowing  allegations  via  the  press  were  more  likely  to  reduce  the  size  of 
the  board,  reduce  insider  participation  on  the  board,  have  less  busy  board  members,  and  were  more 
likely  to  replace  the  CEO. 

In  untabulated  findings,  we  find  no  evidence  that  OSHA  firms  made  changes  in  governance. 
Thus,  on  average,  whistle-blowing  targets  exposed  via  the  press  appeared  to  change  governance 
on  four  dimensions  that  exceeded  the  level  of  market-wide  governance  improvements  occurring  in 
the  post-SOX  era,  while  firms  targeted  by  whistle-blowers  via  OSHA  (a  less  public  venue)  did  not. 
This  suggests  the  visibility  of  the  allegation  played  a  role  in  firms’  responses  to  whistle-blowing 
allegations. 


VIL  CONCLUSIONS 

Whistle-blowing  has  received  considerable  attention  in  recent  years  after  (1)  whistle-blowers 
were  responsible,  in  part,  for  revealing  the  accounting  scandals  at  Enron  and  WorldCom,  and  (2) 
provisions  in  SOX  were  enacted  to  protect  employee  whistle-blowers.  Still,  little  is  known  about 
the  nature  of  firms  that  are  subject  to  whistle-blowing  and  whether  such  allegations  are  economi¬ 
cally  significant  events  with  meaningful  consequences  for  the  targeted  firms.  Unlike  other  research 


18 

Note  that  we  hand-collect  proxy  data  to  assess  subsequent  governance  changes,  rather  than  use  the  IRRC  data  used  in 
19  order  to  maximize  the  number  of  useable  observations  in  this  analysis 

If  we  cannot  find  a  match  firm  within  the  same  four-digit  SIC  code,  then  we  relax  the  matching  criteria  to  include  firms 
with  the  same  three-digit  SIC  code,  and  if  necessary,  the  same  two-digit  SIC  code 

We  employ  a  matched  control  sample  in  this  analysis  to  ensure  that  any  observed  governance  changes  are  not  the  result 
of  market- wide  improvements  m  governance  (rather  than  improvements  specific  to  whistle-blowing  targets).  Our  match¬ 
ing  criteria  are  consistent  with  this  concern.  We  do  not  match  on  propensity  scores  (as  with  our  consequences  analyses) 
because  propensity  matches  are  not  necessarily  aligned  in  calendar  time,  and  because  we  want  to  examine  whether 
w^st  e-b  owing  targets  improved  governance  relative  to  a  broad  cross-section  of  firms.  We  acknowledge  that  comparing 
whistle-blowing  firms  to  the  average  firm  results  in  a  somewhat  lower  benchmark  than  would  comparing  them  to  a 
propensity-score  iriatched  sample.  However,  we  feel  this  is  appropriate  because  our  sample  period  already  represents  a 
penod  of  improved  governance,  such  that  the  average  firm  was  likely  making  governance  improvements 
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TABLE  8 

Do  Whistle-Blowing  Allegations  Lead  to  Subsequent  Changes  in  Corporate  Governance? 

(Q4) 


Multivariate  Analysis — Press  Sample: 


Dependent 

Variables® 

agov,^2 

Pred. 

Sign 

of 

WB 

=  ySo  +  PiWB,  -b  P2REV,_i  +  +  p^ASIZEt^2  +  e^+2 

Independent  Variables 

Adj.  R2 

n 

Intercept 

WB 

REV 

RET 

ASIZE 

ADIRECTORS 

— 

-0.14 

-1.29 

0.00 

0.59 

-0.00 

4.2% 

70 

(0.738) 

(0.025)** 

(0.527) 

(0.230) 

(0.756) 

A%INS_DIR 

- 

0.05 

-0.09 

-0.00 

0.03 

0.00 

30.8% 

70 

(0.003)*** 

(<0.001)***  (0.283) 

(0.078)* 

(0.829) 

A%BUSY 

- 

0.02 

-0.07 

0.00 

-0.04 

-0.00 

-2.4% 

70 

(0.565) 

(0.091)* 

(0.925) 

(0.399) 

(0.807) 

A%ATTEND 

-t- 

0.01 

0.02 

-0.00 

0.01 

0.00 

-1.0% 

66 

(0.368) 

(0.111) 

(0.889) 

(0.122) 

(0.878) 

A%CEO=COB%^ 

- 

NA 

0.01 

0.00 

0.16 

-0.00 

0.2% 

72 

(0.506) 

(0.831) 

(0.745) 

(0.820) 

/ANEW_CEO%‘' 

-b 

-1.19 

1.34 

0.00 

0.80 

-0.00 

17.0% 

68 

(0.005)*** 

(0.013)** 

(0.154) 

(0.100) 

(0.121) 

*,  **,  ***  Significant  at  10  percent,  5  percent,  and  1  percent  levels,  respectively. 

p- values  are  reported  in  parentheses,  p-values  are  one-sided  for  variables  with  directional  predictions.  We  report  (1  —  p) 

values  for  coefficients  that  assume  a  sign  opposite  to  the  one  predicted. 

^  We  compare  the  value  of  each  governance  variable  in  year  —1  (pre-allegation)  to  its  value  in  year  +2  (post- allegation). 

^  We  report  the  results  of  an  ordinal  logistic  regression,  including  the  pseudo-R^. 

We  report  the  results  of  a  logistic  regression,  including  the  pseudo-R^. 

Definition  of  variables  (where  firm  subscripts  are  suppressed): 

ADIRECT0RS,+2  =  number  of  directors  in  year  +2  minus  the  number  of  directors  in  year  —1; 

\%INS_DIR,+2  ~  percent  of  inside  directors  in  yeai'  +2  minus  the  percent  of  inside  directors  in  year  —1; 

A%BUSYi+2  =  percent  of  “busy”  directors  in  year  +2  minus  the  percent  of  busy  directors  in  year  —1; 

A%ATTEND,^2  —  percent  board  meeting  attendance  in  year  +2  minus  the  percent  board  meeting  attendance  in 
year  —1; 

ACEO=  COB%i^2  -  an  ordinal  variable  coded  as  1  if  the  CEO  was  not  the  COB  in  year  +2  but  was  in  year  -1, 

coded  as  —1  if  the  CEO  was  the  COB  in  year  +2  but  was  not  in  year  —1,  and  coded  as  0 

otherwise; 

ANEW_CE0%,+2  =  an  indicator  variable  coded  as  1  if  the  firm  replaced  its  CEO  in  year  0,  year  -1-1,  or  year  +2, 
and  0  otherwise; 

WB,  =  1  if  the  firm  experienced  a  whistle-blowing  event,  0  otherwise; 

REV;_i  =  total  sales  revenue  (Compustat  annual  #12)  for  the  year  before  the  whistle-blowing  event; 

RET,_i  =  market-adjusted  buy-and-hold  returns  for  12  months,  beginning  with  month  -12  and  ending 
with  month  —1,  where  month  0  is  the  event  month;  and 
ASIZE,+2  ~  total  assets  in  year  -t-2  minus  total  assets  in  year  —1. 


to  date,  our  focus  is  on  (1)  characteristics  of  firms  targeted  by  employee  whistle-blowers,  (2)  the 
economic  consequences  of  such  whistle-blowing  revelations,  and  (3)  firm  responses  to  such  alle¬ 
gations  via  subsequent  governance  changes. 

Specifically,  we  report  new  empirical  evidence  on  218  employee  whistle-blowing  events 
between  1989  and  2004.  Our  paper  adds  to  the  literature  on  corporate  governance  in  four  ways. 
First,  we  conduct  a  cross-sectional  analysis  of  the  characteristics  of  firms  subject  to  whistle- 
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blowing.  Our  evidence  is  broadly  consistent  with  whistle-blowers  targeting  firms  that  were  large, 
growing,  successful,  and  highly  regarded.  Whistle-blowing  tended  to  be  more  prevalent  in  firms 
that  had  downsized  and  in  firms  with  worse  governance.  Finally,  and  not  surprisingly,  when  the 
potential  benefits  to  the  whistle-blower  increased  (via  the  potential  to  share  in  any  fraud-related 
settlement),  employees  were  more  likely  to  make  public  allegations  of  wrongdoing. 

Second,  we  present  evidence  that,  on  average,  whistle-blowing  allegations  had  an  immediate 
negative  economic  consequence  for  target  firms.  The  average  market-adjusted  return  in  the  five 
days  around  the  day  the  allegation  became  public  was  —2.8  percent.  Further,  the  stock  market 
reaction  was  substantially  more  negative  when  the  whistle-blower  alleged  earnings  management 
(—7.3  percent). 

Third,  we  find  that  whistle-blowing  targets  had  more  earnings  restatements  (Press  sample), 
more  shareholder  lawsuits  (Press  sample),  and  relatively  poor  operating  and  stock  return  perfor¬ 
mance  (press  and  OSHA  samples)  compared  to  a  sample  of  propensity-score  matched  firms  with 
similar  characteristics.  Whistle-blower  allegations  appear  to  be  an  early  indicator  of  future  nega¬ 
tive  economic  consequences  for  targeted  firms. 

Fourth,  we  find  that,  on  average,  whistle-blowing  targets  exposed  in  the  press  improved 
several  dimensions  of  governance  relative  to  the  year  before  the  whistle-blowing  event  and  rela¬ 
tive  to  a  matched  sample  of  control  firms.  Interestingly,  these  governance  improvements  were  not 
apparent  for  firms  subject  to  whistle-blowing  allegations  that  were  not  widely  disseminated 
(OSHA  sample).  In  sum,  whistle-blowing  is  far  from  a  trivial  nuisance  for  firms  targeted  by 
allegations  of  corporate  misconduct. 

Our  research  represents  early  work  on  the  role  of  financial  whistle-blowing  and  provides 
indirect  evidence  on  the  effectiveness  of  Sarbanes-Oxley  (SOX)  Section  806  as  a  mechanism  to 
expose  agency  problems  at  firms.  We  find  that  public  whistle-blowing  events  are  relatively  infre¬ 
quent,  and  a  majority  of  the  whistle-blowing  allegations  we  identify  are  not  directly  related  to  the 
types  of  fraudulent  behavior  that  precipitated  the  SOX  legislation.  Future  research  could  examine 
(1)  the  impact  of  the  whistle-blowing  provisions  in  SOX  on  the  incidence  and  types  of  whistle¬ 
blowing  allegations  and  (2)  whether  investors,  regulators,  and  other  stakeholders  benefit  from  (or 
overreact  to)  whistle-blowing  allegations.  Additionally,  future  research  could  examine  how  OSHA 
is  performing  in  its  role  of  adjudicating  SOX-related  whistle-blowing  complaints.  Finally,  if  one 
could  obtain  the  data,  an  examination  of  the  frequency,  importance,  and  consequences  of  internal 
whistle-blowing  would  be  a  valuable  contribution  to  the  literature. 


APPENDIX 

VARIABLE  DEFINITIONS 

Capital  Market  Pressure 

FIN_NEED  =  an  indicator  variable  coded  as  1  if  the  firm’s  free  cash  flow  in  the  year 
prior  to  the  whistle-blowing  event  is  less  than  -0.1,  and  0  otherwise. 
Similar  to  Dechow  et  al.  (1996),  we  calculate  free  cash  flow  as  cash  flow 
less  the  average  capital  expenditure  (Compustat  annual  #128)  over  the 
last  three  years,  deflated  by  average  total  assets. 

M&A  =  an  indicator  variable  coded  as  1  if  the  firm  was  involved  in  a  merger  or 
acquisition  (Compustat  AFTN  #1)  in  year  t  or  in  any  one  of  the  three 
years  prior  to  the  whistle-blowing  event,  and  0  otherwise. 

ABSDACC  =  the  absolute  value  of  discretionary  accruals  based  on  the  modified  Jones 
(1991)  model  for  the  fiscal  year  before  the  whistle-blowing  event. 
Discretionary  accruals  are  calculated  as  the  fitted  values  from  the  following 
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Regression  (2):  TOTACC  =  (3^  +  |3^^REV+  I32PPE  +  P^CEO  +  /3^R0A 
+  error.  This  regression  is  estimated  each  year  for  every  two-digit  SIC  code 
conditional  on  having  at  least  10  firms  in  an  industry.  Note  that  TOTACC 
=  total  accruals,  defined  as  income  before  extraordinary  items  minus  cash 
flow  from  operations  plus  extraordinary  items  (CompUSA  annual  #18- 
Compustat  annual  #308  +  Compustat  annual  #124),  AREV  =  the  change  in 
revenue  (Compustat  annual  #12),  PPE  =  plant,  property,  and  equipment 
(Compustat  annual  #7),  CEO  =  cash  from  operations  (Compustat  annual 
#308),  ROA  =  return  on  assets  (Compustat  #18).  We  scale  each  variable 
in  Regression  (2)  by  total  assets  (Compustat  annual  #6)  at  the  beginning  of 
the  period. 

CM_PRESSURE  =  the  average  scaled  rank  of  the  following  variables:  EIN_NEED,  M&A,  and 
ABSDACC.  Each  variable  is  coded  such  that  higher  values  are  consistent 
with  greater  capital  market  pressure.  This  measure  of  capital  market  pressure 
is  averaged  over  the  number  of  variables  with  non-missing  data. 

Growth 

GROWTH  =  the  scaled  rank  of  sales  growth,  defined  as  the  average  growth  in  sales 
(Compustat  annual  #12)  over  the  three  years  prior  to  the  whistle-blowing 
event. 

Past  Performance 

PAST_PERE  =  the  scaled  rank  of  market-adjusted  buy-and-hold  returns  for  12  months, 
beginning  from  month  —12  and  ending  month  —1,  where  month  0  is 
the  month  the  whistle-blowing  was  reported  in  the  press  or  to  OSHA. 

Reputation 

REPUTATION  =  an  indicator  variable  coded  as  1  if  the  firm  has  appeared  on  Eortune’s 
“Best  Companies  to  Work  For”  listor  on  Eortune's  “Most  Admired 
Companies”  list  in  any  of  the  five  years  prior  to  the  whistle-blowing  event, 
and  coded  as  0  otherwise. 


Unclear  Communication  Channels 

AGE  =  the  number  of  years  the  firm  has  been  listed  on  CRSP  as  of  the  year  of 
the  whistle-blowing  event. 

INDjCONC  =  the  Herfindahl-Hirschman  index  based  on  the  revenue  for  each  of  the 
firm’s  industry  segments,  calculated  as  the  sum  of  squares  of  each 
industry  segment’s  revenue  as  a  percentage  of  total  firm  revenue.  This 
variable  is  calculated  for  the  year  prior  to  the  whistle-blowing  event.  See 
Bushman  et  al.  (2004). 

GEOJCONC  =  the  Herfindahl-Hirschman  index  based  on  the  revenue  for  each  of  the 
firm’s  geographic  segments,  calculated  as  the  sum  of  squares  of  each 
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geographic  segment’s  revenue  as  a  percentage  of  total  firm  revenue. 

This  variable  is  calculated  for  the  year  prior  to  the  whistle-blowing  event. 
See  Bushman  et  al.  (2004). 

COMMUNICATION  =  the  average  scaled  rank  of  the  following  variables:  AGE,  IND_CONC, 

and  GEO_CONC.  We  reverse  the  sign  of  the  rank  of  AGE  such  that 
lower  values  are  consistent  with  worse  internal  communication  channels. 
This  measure  of  unclear  communication  channels  is  averaged  over  the 
number  of  variables  with  non-missing  data. 


Employee  Downsizing 

DOWNSIZE  =  the  scaled  rank  of  employee  growth,  defined  as  the  average  growth  in  the  total 
employees  (Compustat  annual  #29)  over  the  three  years  prior  to  the 
whistle-blowing  event. 

Qui  Tam 

QUITAM  =  an  indicator  variable  coded  as  1  if  the  firm  (1)  is  in  the  healthcare  industry 

(2-digit  SIC  code  =  80),  or  (2)  appeared  on  the  “100  Companies  Receiving  the 
Largest  Dollar  Volume  of  Prime  Contract  Awards”  list  in  any  of  the  three 
years  prior  to  the  whistle-blowing  event,  and  0  otherwise. 

Firm  Size 

SIZE  =  the  scaled  rank  of  total  sales  revenue  (Compustat  annual  #12)  for  the  year  before 
the  whistle-blowing  event. 


Internal  Control  Weaknesses 

ICW  =  the  scaled  rank  of  the  fitted  value  from  the  following  model,  as  estimated  by  Doyle  et 
al.  (2007):  ICW=  ^2  +  IS2MARKETCAP  -1-  I32EIRM_AGE  +  ^oLOSSES 
+  (32SEGMENTS  +  ^2E0REIGN_TRANS  +  I32EXTREME_SG  +  (32RESTRUCTURE 
+  error,  where  MARKETCAP  =  the  log  of  the  firm’s  market  capitalization,  FIRM_AGE 
=  the  log  of  the  number  of  years  the  firm  has  CRSP  data,  LOSSES  =  an  indicator 
variable  equal  to  1  if  earnings  before  extraordinary  items  in  the  two  most  recent  years 
sum  to  less  than  zero,  and  0  otherwise,  SEGMENTS  =  the  log  of  the  number  of 
operating  and  geographic  segments  reported  by  the  Compustat  Segments  database, 
FOREIGNJTRANS  —  an  indicator  variable  equal  to  1  if  the  firm  has  nonzero  foreign 
translation,  and  0  otherwise,  EXTREME_SG  =  an  indicator  variable  equal  to  1  if 
year-over-year  industry-adjusted  sales  growth  falls  into  the  top  quintile,  and  0  otherwise, 
and  RESTRUCTURE  =  the  aggregate  restructuring  charge  in  the  two  most  recent 
years,  scaled  by  the  firm’s  market  capitalization.  ICW  is  measured  as  of  the  year 
immediately  before  the  whistle-blowing  event.  We  use  coefficient  values  as  reported  in 
Doyle  et  al.  (2007).  Because  of  limited  data  availability,  we  set  SEGMENTS  equal 
to  0  if  the  necessary  data  is  not  available. 
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External  Monitoring 

BLOCK  =  percentage  5  percent  block-holder  ownership  of  the  firm  in  the  year  prior 
to  the  whistle-blowing  event  obtained  from  Compact  Disclosure. 

PP  =  percentage  pension  ownership  of  the  firm  in  the  year  prior  to  the 
whistle-blowing  event  obtained  from  Cremers  et  al.  (2005). 

EXT_MONITOR  =  the  average  scaled  rank  of  the  following  variables:  BLOCK  and  PP.  Each 
variable  is  coded  such  that  lower  values  are  consistent  with  worse  external 
monitoring.  This  measure  of  external  governance  is  averaged  over  the 
number  of  variables  with  non-missing  data. 

Governance 

EXECCOMP  =  the  value  of  the  CEO’s  in-the-money  exercisable  stock  options,  scaled  by 
the  CEO’s  wealth.  CEO  wealth  is  defined  as  the  sum  of  the  CEO’s 
salary,  bonus,  the  value  of  stock  ownership,  and  the  value  of  all  in-the-money 
options.  This  variable  is  measured  for  the  year  prior  to  the  whistle-blowing 
event. 

DIRECTORS  =  the  number  of  directors  on  the  firm’s  board  of  directors  in  the  year  before 
the  whistle-blowing  event. 

INSIDERJPCT  =  the  percentage  of  directors  who  are  insiders  in  the  year  before  the 
whistle-blowing  event. 

CEO=COB  =  an  indicator  variable  coded  as  1  if  the  firm’s  CEO  is  also  the  chairman  of 
the  board  (COB)  in  the  year  before  the  whistle-blowing  event,  and  0 
otherwise. 

BUSY_PCT  =  the  percentage  of  directors  who  are  “busy”  in  the  year  before  the 

whistle-blowing  event,  where  “busy”  is  defined  as  serving  on  at  least  three 
other  corporate  boards. 

ATTEND  =  the  percentage  of  board  members  who  attended  at  least  75  percent  of  the 
board  committee  meetings  in  the  year  before  the  whistle-blowing  event. 

NEWjCEO  =  an  indicator  variable  coded  as  1  if  the  firm  has  a  new  CEO  in  the  year 
before  the  whistle-blowing  event,  and  0  otherwise. 

GOVERNANCE  =  the  average  scaled  rank  of  the  following  variables:  EXECCOMP, 

DIRECTORS,  INSIDER_PCT,  ATTEND,  CEO=COB,  BUSY_PCT,  and 
NEWjCEO.  Each  variable  is  coded  such  that  lower  values  are  consistent  with 
worse  internal  monitoring.  This  measure  of  internal  governance  is  averaged 
over  the  number  of  variables  with  non-missing  data. 
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ABSTRACT:  Audit  standards  require  auditors  to  conduct  fraud  brainstorming  sessions 
on  every  audit.  The  Public  Company  Accounting  Oversight  Board  has  raised  concerns 
about  auditors’  fraud  judgments  and  the  quality  of  their  brainstorming  sessions.  We 
develop  a  measure  of  brainstorming  quality  to  examine  how  it  affects  auditors’  fraud 
decision-making  processes.  We  test  our  measure  using  field  survey  data  of  auditors’ 
actual  brainstorming  sessions  for  179  audit  engagements.  Respondents  report  consid¬ 
erable  variation  in  the  quality  of  brainstorming  in  practice.  We  find  some  evidence  that 
high-quality  brainstorming  improves  the  relations  between  fraud  risk  factors  and  fraud 
risk  assessments.  We  also  determine  that  brainstorming  quality  positively  moderates 
the  relations  between  fraud  risk  assessments  and  fraud-related  testing.  Our  results 
suggest  that  the  benefits  of  brainstorming  do  not  apply  uniformly,  because  low-quality 
sessions  likely  incur  the  costs  of  such  interactions  without  receiving  the  attendant  ben¬ 
efits.  By  documenting  best  practices  from  high-quality  brainstorming  sessions,  our  find¬ 
ings  can  inform  auditors  on  how  to  improve  their  consideration  of  fraud. 

Keywords:  brainstorming  quaiity;  fraud;  fraud  risk  assessments;  fraud  risk  response. 
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1.  INTRODUCTION 

Fraud  detection  is  among  the  highest  priorities  for  the  accounting  profession  and  standard- 
setters  (Elliott  2002;  PCAOB  2003,  2007,  2008).  Statement  on  Auditing  Standards  (SAS) 
No.  99  (AICPA  2002),  Consideration  of  Fraud  in  a  Financial  Statement  Audit,  requires 
brainstorming  sessions  on  every  audit  as  a  means  of  improving  auditors’  fraud  judgments.  How¬ 
ever,  the  Public  Company  Accounting  Oversight  Board  (PCAOB)  has  expressed  concerns  about 
the  actual  conduct  and  quality  of  brainstorming  sessions  in  practice  (PCAOB  2007).  Because  these 
sessions  are  a  relatively  new  requirement,  little  is  known  about  how  they  occur  in  practice  and 
how  they  influence  fraud-related  judgments.  Accordingly,  our  study  employs  a  field  survey  of 
actual  audit  engagements  to  pursue  three  research  objectives.  First,  we  study  how  audit  teams  are 
conducting  brainstorming  sessions  in  practice  and  develop  a  measure  of  brainstorming  quality. 
This  descriptive  analysis  should  provide  a  basis  for  further  modeling  of  brainstorming  and  future 
empirical  research  in  team  decision-making.  Second,  we  investigate  the  comprehensive  fraud 
decision-making  process  that  audit  teams  are  employing  in  practice,  from  the  consideration  of 
client  fraud  risk  factors  to  their  development  of  fraud  risk  assessments  and  fraud-related  audit 
testing.  Third,  we  examine  how  the  quality  of  these  brainstorming  sessions  influences  auditors’ 
consideration  of  fraud  and  whether  variation  in  this  quality  has  important  consequences  for  the 
resulting  fraud  judgments. 

This  investigation  is  important  for  several  reasons.  First,  prior  psychology  research  suggests 
that  the  effectiveness  of  judgments  and  decisions  likely  depends  on  the  quality  of  a  team’s  inter¬ 
action  (e.g.,  Stasser  1999).  We  leverage  the  survey  method  to  collect  complex  and  rich  data  about 
brainstorming  session  quality  and  its  underlying  characteristics.  Our  measure  of  brainstorming 
session  quality  is  multifaceted  (i.e.,  21 -item  scale)  and  is  developed  from  a  review  of  the  psychol¬ 
ogy  literature  on  brainstorming  and  team  decision-making,  the  accounting  literature  on  auditors’ 
judgments  and  fraud,  recent  PCAOB  inspection  reports,  and  professional  guidance.  Our  brain¬ 
storming  session-quality  measure  consists  of  three  overall  elements:  attendance  and  communica¬ 
tion,  structure  and  timing,  and  engagement  team  effort.  To  our  knowledge,  this  is  the  first  study  in 
accounting  or  elsewhere  to  develop  a  measure  of  brainstorming  quality  and  empirically  examine 
its  effects  on  team  judgments. 

Second,  prior  experimental  studies  have  examined  certain  parts  of  the  brainstorming  process, 
but  have  not  examined  the  entire  decision-making  process  (e.g..  Carpenter  2007;  Hoffman  and 
Zimbelman  2009).  Moreover,  the  accounting  literature  suggests  that  the  benefits  derived  from 
brainstorming  depend  on  the  brainstorming  method  used  (Carpenter  2007;  Hoffman  and  Zimbel¬ 
man  2009;  Lynch  et  al.  2009;  Hunton  and  Gold  2010).  Carpenter  (2007)  and  Hoffman  and  Zim¬ 
belman  (2009)  document  benefits  to  brainstorming  versus  not  brainstorming.  However,  contem¬ 
porary  studies  by  Lynch  et  al.  (2009)  and  Hunton  and  Gold  (2010)  suggest  that  the  type  of 
brainstorming  is  an  important  consideration  for  the  benefits  to  be  realized,  as  they  find  that  some 
effective  brainstorming  methods  are  not  predominant  in  practice.’  While  this  prior  research  has 


'  Lynch  et  al.  (2009)  conclude  that  brainstorming  effectiveness  is  higher  for  teams  using  group  support  systems  relative 
to  teams  using  traditional  face-to-face  brainstorming.  In  our  sample,  80  percent  of  our  observations  used  traditional 
face-to-face  brainstorming.  Hunton  and  Gold  (2010)  find  that  both  the  nominal  group  (where  there  is  no  communication 
among  team  members)  and  round  robin  brainstorming  techniques  are  more  effective  than  open  brainstorming.  However, 
in  our  sample,  we  find  that  91  percent  of  our  engagements  employed  open  brainstorming.  Thus,  while  these  two  studies 
provide  means  in  which  brainstorming  can  be  improved,  our  descriptive  data  suggest  that  such  changes  would  be 
significant  and  likely  a  substantial  challenge  for  firms  to  undertake.  Our  finding  of  a  preponderance  of  face-to-face 
communication  and  open  brainstorming  in  practice  likely  suggests  that  unidentified  institutional  factors  cause  audit 
teams  to  employ  these  two  potentially  less  effective  methods  (e.g.,  open  brainstorming  [versus  round  robin]  allows  a 
partner  to  better  control  the  discussion). 
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provided  some  evidence  on  auditor  fraud  judgments  in  experimental  settings,  there  is  no  empirical 
evidence  on  how  audit  teams  are  implementing  brainstorming  sessions  in  practice  and  whether  the 
relations  among  their  team  judgments  are  improved  by  the  quality  of  brainstorming  sessions 
required  by  SAS  No.  99.  We  contribute  to  the  literature  by  using  a  field  survey  to  collect  auditors’ 
client-specific  brainstorming  session  data  and  the  related  fraud  judgments.  Because  this  method¬ 
ology  allows  us  to  collect  the  underlying  dynamics  of  audit  teams’  actual  brainstorming  sessions, 
and  the  resulting  risk  factors  identified,  fraud  risk  assessments,  and  related  testing,  we  are  able  to 
examine  multiple  fraud  decisions  and  relationships  across  a  number  of  diverse  audit  engagements. 
Thus,  we  provide  insight  into  the  comprehensive  fraud  decision-making  process  that  audit  teams 
employ  in  practice. 

Third,  our  investigation  is  important  to  auditors  and  standard-setters.  PCAOB  inspection 
reports  indicate  variation  among  audit  teams’  brainstorming  sessions,  and  auditors’  testing  is  often 
not  responsive  to  their  fraud  risk  assessments  (PCAOB  2007).  Our  study  empirically  assesses  the 
PCAOB ’s  findings  by  documenting  how  auditors  are  implementing  SAS  No.  99  in  practice  and  by 
investigating  the  consequences  of  brainstorming  quality  on  the  link  between  fraud  risk  assess¬ 
ments  and  subsequent  testing.  Thus,  this  study’s  findings  can  inform  audit  practice  and  standard- 
setters  as  they  reconsider  the  professional  guidance  on  auditors’  consideration  of  fraud  (PCAOB 
2004,  2007). 

Our  theoretical  model  is  based  on  the  psychology  and  accounting  literatures,  PCAOB  inspec¬ 
tion  reports,  and  professional  guidance.  Based  on  this  model,  we  hypothesize  that  brainstorming 
quality  positively  moderates  the  relations  between  (1)  fraud  risk  factors  and  related  fraud  risk 
assessments  and  (2)  fraud  risk  assessments  and  subsequent  audit  testing.  We  test  our  hypotheses 
with  data  from  179  audit  engagements  for  which  highly  experienced  audit  partners  and  managers 
from  the  Big  4  firms  and  an  international  firm  completed  a  field  survey. 

Prior  experimental  research  has  typically  found  that  when  fraud  risk  factors  are  present,  fraud 
risk  assessments  tend  to  be  higher  (e.g.,  Nieschwietz  et  al.  2000).  Further,  prior  experimental 
research  has  documented  that  auditors  have  difficulty  responding  to  fraud  risk  assessments  by 
altering  the  nature  of  their  testing  (e.g.,  Zimbelman  1997;  Glover  et  al.  2003).  Actual  audit 
engagements  are  rich  in  context  and  the  fraud  risk  assessment  is  typically  determined  after  en¬ 
gagement  team  deliberation.  Such  an  environment  is  difficult  to  create  in  an  experimental  setting. 
Therefore,  there  is  a  need  to  complement  experimental  fraud  research  with  research  employing 
non-experimental  methods.  Our  field  survey  results  indicate  that  fraud  risk  factors  identified  for  a 
diverse  sample  of  clients  are  positively  correlated  with  auditor  fraud  risk  assessments.  In  addition, 
we  find  some  evidence  that  brainstorming  quality  positively  moderates  the  relations  between  fraud 
risk  factors  and  fraud  risk  assessments.  We  also  find  that  auditors’  responses  to  fraud  risk  assess¬ 
ments  are  contingent  on  the  quality  of  brainstorming  sessions.  Specifically,  when  audit  teams 
engage  in  higher  quality  brainstorming  sessions,  fraud  risk  assessments  are  more  positively  asso¬ 
ciated  with  changes  to  the  nature,  staffing,  timing,  and  extent  of  fraud-related  audit  procedures. 

While  several  experimental  studies  have  predicted  a  link  between  fraud  risk  and  related  audit 
testing,  they  have  typically  not  found  this  link  for  the  nature  or  type  of  procedures  employed  (e.g., 
Zimbelman  1997;  Glover  et  al.  2003;  Asare  and  Wright  2004).  We  complement  this  literature  by 
documenting  the  important  role  brainstorming  quality  can  play  in  improving  the  link  between 
auditors’  fraud  risk  assessments  and  their  subsequent  testing;  including  the  choice  of  the  nature  of 
procedures.  We  consequently  illustrate  that  important  and  tangible  benefits  can  be  derived  from 
higher  quality  brainstorming.  On  the  other  hand,  our  analyses  suggest  that  under-auditing  can 
occur  when  brainstorming  session  quality  is  low.  Thus,  some  engagement  teams  likely  incur  the 
cost  of  brainstorming  without  receiving  the  attendant  benefits  of  such  interactions.  These  new 
findings  triangulate  and  extend  prior  studies  by  offering  important  insights  on  the  costs  and 
benefits  of  fraud  brainstorming. 
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Legal  restrictions  prohibit  the  PCAOB  from  distributing  descriptive  evidence  about  brain¬ 
storming  sessions  in  practice  (PCAOB  2007).  Thus,  it  is  difficult  for  individuals  outside  the 
PCAOB  to  discern  if  the  PCAOB ’s  concerns  about  brainstorming  quality  are  widespread  or  per¬ 
haps  isolated  to  only  a  few  audit  engagements.  Our  study  captures  brainstorming  data  that  is  often 
not  documented  and  thus  not  available  for  PCAOB  inspection  teams  to  review  (e.g.,  participation 
levels  of  engagement  team  members).  Our  survey  data  validate  the  concerns  raised  by  the  PCAOB 
by  documenting  considerable  variation  in  the  reported  quality  of  brainstorming  sessions  in  prac¬ 
tice.  More  importantly,  we  find  evidence  that  there  are  consequential  effects  of  conducting  low- 
and  high-quality  sessions.  We  also  analyze  data  related  to  the  misappropriation  of  assets  and 
fraudulent  financial  reporting  that  were  detected  by  the  audit  teams  in  our  sample  (i.e.,  instances 
of  actual  fraud  detection).  In  additional  analyses,  we  examine  audit  team  effectiveness  by  analyz¬ 
ing  the  links  between  ex  ante  fraud  judgments  and  ex  post  fraud  detection.  Last,  our  data  reveal 
best  practices  for  how  auditors  can  improve  the  quality  of  their  brainstorming  sessions  (e.g., 
attendance  of  an  IT  audit  specialist,  occurrence  of  the  session  early  in  the  planning  process). 

Section  II  describes  the  background  and  hypotheses  development.  Sections  III  and  IV  provide 
the  methods  and  results  of  the  study,  respectively.  Section  V  concludes  the  paper. 

11.  BACKGROUND  AND  HYPOTHESES  DEVELOPMENT 
A  Model  of  Auditors’  Use  of  Brainstorming  in  the  Consideration  of  Fraud 

Based  on  SAS  No.  99,  as  well  as  prior  accounting  and  psychology  research,  we  develop  a 
model  of  auditors’  use  of  brainstorming  in  the  consideration  of  fraud.  SAS  No.  99  provides 
guidance  to  improve  the  likelihood  that  auditors  will  detect  fraud  using  a  multi-phase  approach 
(AICPA  2002,  2003).  First,  auditors  collect  information  related  to  the  risk  of  material  misstatement 
due  to  fraud  (e.g.,  data  on  equity-based  compensation).  Using  such  information,  auditors  brain¬ 
storm  to  identify  fraud  risk  factors  (e.g.,  incentives),  synthesize  this  information  to  develop  a  fraud 
risk  assessment,  and  develop  a  response  to  the  risk  assessments  such  as  altering  the  staffing  of  the 
engagement,  or  modifying  the  nature,  timing,  and  extent  of  audit  procedures.  Brainstorming  ses¬ 
sions  are  intended  to  aid  auditors  in  linking  fraud  risk  factors  to  risk  assessments  and,  in  turn, 
foster  the  development  of  appropriate  audit  responses.  As  such,  the  model  depicted  in  Figure  1 
indicates  that  brainstorming  should  influence  both  phases  of  the  fraud  decision-making  process 
such  that  the  relations  among  fraud  risk  factors,  risk  assessments,  and  responses  are  positively 
moderated  by  the  quality  of  the  brainstorming  session. 

Fraud  Risk  Factors 

Fraud  risk  factors  are  commonly  categorized  along  three  dimensions  of  the  fraud  triangle: 
incentives,  opportunities,  and  rationalizations  (AICPA  2002).  Auditors  synthesize  their  under¬ 
standing  of  these  risk  factors  with  other  information  (e.g.,  preliminary  analytical  procedures)  to 
develop  a  fraud  risk  assessment  (AICPA  2002).  Prior  archival  research  on  fraud  risk  factors 
documents  relations  between  fraudulent  financial  reporting  and  financial  statement  elements,  cor¬ 
porate  governance  quality,  nonfinancial  measures,  and  other  observable  data  (e.g.,  Beasley  1996; 
Beneish  1997;  McVay  2006;  Brazel  et  al.  2009). 

Fraud  Risk  Assessments 

Experimental  studies  have  examined  the  relationships  between  fraud  risk  factors  and  auditors’ 
fraud  risk  assessments.  While  auditors’  risk  assessments  are  generally  higher  when  fraud  risk 
factors  are  present  (Nieschwietz  et  al.  2000),  more  recent  studies  report  that  context  has  an 
important  influence  on  auditors’  judgments.  Wilks  and  Zimbelman  (2004)  find  that  auditors  who 
decompose  fraud  risk  assessments  are  more  sensitive  to  incentive  and  opportunity  risks  than 
auditors  who  provide  holistic  fraud  risk  assessments,  but  only  in  a  low  fraud  risk  context.  Asare 
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FIGURE  1 

Auditors’  Use  of  Brainstorming  in  the  Consideration  of  Fraud 
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This  figure  provides  a  model  of  audit  teams’  use  of  brainstorming  in  their  consideration  of  fraud.  This  model  is 
based  on  psychology  and  accounting  research  as  well  as  the  SAS  No.  99  framework. 


and  Wright  (2004)  observe  that,  given  a  case  where  fraud  was  perpetrated,  auditors  who  use  a 
fraud  risk  checklist  make  less  effective  fraud  risk  assessments  than  those  who  use  no  checklist. 
Since  the  context  of  the  experimental  setting  was  important  in  these  studies,  and  these  studies  were 
conducted  before  the  implementation  of  SAS  No.  99,  it  is  important  to  re-examine  the  relations 
between  fraud  risk  factors  and  fraud  risk  assessments  using  non-experimental  methods.  Further, 
none  of  these  studies  consider  the  possible  influence  of  brainstorming  quality  on  the  links  between 
risk  factors  and  risk  assessments. 

Fraud  Risk  Responses 

Although  SAS  No.  99  requires  auditors  to  design  audit  procedures  that  are  responsive  to  fraud 
risk  assessments,  research  has  not  consistently  found  such  a  relationship.  For  example,  Zimbelman 
(1997)  illustrates  limited  evidence  of  an  increase  in  the  extent  of  audit  procedures  and  no  evidence 
of  a  change  in  the  nature  of  procedures  in  response  to  fraud  risk  assessments.  While  Glover  et  al. 
(2003)  find  stronger  evidence  of  an  increase  in  the  extent  of  audit  procedures;  they  also  fail  to 
observe  evidence  of  a  change  in  the  type  of  procedures  employed  in  response  to  fraud  risk. 
Additionally,  Asare  and  Wright  (2004)  report  no  association  between  fraud  risk  assessments  and 
planned  audit  procedures.  However,  these  studies  were  conducted  prior  to  SAS  No.  99  and,  thus, 
exclude  the  possibility  that  brainstorming  could  improve  the  sensitivity  of  fraud  risk  responses  to 
fraud  risk  assessments. 
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Brainstorming  Quality 

Carpenter  (2007)  and  Hoffman  and  Zimbelman  (2009)  experimentally  study  the  influence  of 
brainstorming  sessions  on  the  link  between  fraud  risk  and  fraud  risk  assessment  and  the  link 
between  risk  assessment  and  the  nature  of  the  audit  procedures  selected,  respectively.  Carpenter 
(2007)  examines  hierarchical  teams  that  brainstorm  and  concludes  that  teams  generate  lists  with 
more  quality  fraud  risks  than  do  individual  auditors.  Further,  these  teams  provide  more  effective 
fraud  risk  assessments  (i.e.,  higher  when  fraud  was  present  than  when  it  was  not)  than  auditors 
working  alone.  Hoffman  and  Zimbelman  (2009)  investigate  the  role  that  brainstorming  plays  with 
regard  to  audit  managers’  planned  audit  procedures.  They  document  that  brainstorming  (versus  no 
brainstorming)  leads  to  more  effective  modification  of  standard  audit  procedures  in  a  specific  high 
fraud  risk  case.  While  both  of  these  studies  provide  important  insights,  neither  captures  the  col¬ 
lective  fraud  process  (i.e.,  the  relations  among  fraud  risk  factors,  risk  assessments,  and  respons^es), 
nor  do  they  examine  actual  audit  teams  where  partners’  judgments  are  likely  very  important. 

More  importantly,  contemporary  experimental  research  by  Lynch  et  al.  (2009)  and  Hunton 
and  Gold  (2010)  find  that  the  brainstorming  methods  that  are  predominant  in  practice  (e.g., 
face-to-face  communication  and  open  brainstorming)  are  less  effective  than  alternative  methods 
(e.g.,  electronic  communication  and  round  robin  brainstorming).  For  a  sample  of  audits  where 
open  brainstorming  and  face-to-face  discussion  were  the  norm,  we  examine  if  higher  brainstorm¬ 
ing  quality  can  mitigate  the  negative  effects  documented  by  these  contemporary  studies.  Further¬ 
more,  the  PCAOB  has  expressed  concern  about  brainstorming  quality  in  practice  (PCAOB  2007). 
However,  because  the  PCAOB  is  legally  restricted  from  distributing  data  from  their  inspections,  it 
is  difficult  for  those  outside  the  PCAOB  to  discern  if  brainstorming  quality  issues  are  widespread 
or  isolated,  and  the  extent  that  session  quality  varies  between  engagements.  Therefore,  it  is  an 
open  empirical  question  as  to  whether  differences  in  brainstorming  quality  actually  found  in 
practice  are  large  enough  to  lead  to  meaningful  differences  in  audit  teams’  fraud  risk  assessments 
and  audit  testing. 

There  is  a  substantial  body  of  psychology  literature  on  the  role  of  brainstorming.  In  general, 
this  research  reports  mixed  findings,  with  evidence  suggesting  that  brainstorming  can  lead  to  both 
process  gains  and  losses  (e.g..  Hill  1982;  Diehl  and  Stroebe  1987).  Most  of  these  experimental 
studies  were  conducted  with  college  students  who  held  no  task-specific  expertise  or  knowledge 
and  were  asked  to  brainstorm  about  mundane  problems  (e.g.,  Diehl  and  Stroebe  1987).  However, 
SAS  No.  99  requires  auditors  with  specific  training  and  experience  to  conduct  brainstorming 
sessions,  a  task  with  much  at  risk.  Past  studies  of  brainstorming  also  did  not  include  hierarchical 
teams,  eliminating  the  potential  costs  and  benefits  of  normal  hierarchies  (cf.,  Hinsz  et  al.  1997). 
Hierarchical  teams  of  auditors  are  likely  to  develop  synergies  by  building  on  one  another’s  ideas 
(e.g..  Carpenter  2007).  Conversely,  the  hierarchy  of  an  audit  team  might  exacerbate  problems  that 
can  occur  during  brainstorming,  such  as  evaluation  apprehension,  group  think,  and  production 
blocking  (e.g.,  Beasley  and  Jenkins  2003). 

Psychology  research  has  documented  links  between  cooperative  outcome  interdependence, 
effectiveness  of  team  communications,  and  resultant  decisions.  Under  cooperative  outcome  inter¬ 
dependence,  team  members  succeed  or  fail  together,  directly  benefiting  from  each  other’s  perfor¬ 
mance  (De  Dreu  2007).  Audit  teams  have  cooperative  outcome  interdependence  because  their 


^  Neither  Carpenter  (2007)  nor  Hoffman  and  Zimbelman  (2009)  include  audit  partners  in  their  brainstorming  teams. 
Given  the  authority  level  assigned  to  audit  partners,  it  is  likely  that  they  wield  substantial  influence  over  fraud-related 
judgments.  In  our  sample,  partners  attended  every  brainstorming  session  and  were  the  largest  contributors  to  the  sessions 
(see  Table  3).  Also,  whether  the  partner  led  the  session  and  the  amount  of  partner  contribution  were  positively  correlated 
with  a  number  of  fraud-related  judgments  (e.g.,  nature  of  testing;  see  Table  5).  Our  use  of  survey  data  from  actual  audit 
engagements  allows  us  to  include  the  role  of  partners. 
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team  interaction  in  a  brainstorming  session  depends  on  cooperation  as  opposed  to  competition. 
Cooperative  outcome  interdependence  is  related  to  the  teams’  ability  to  discuss  shared  and  un¬ 
shared  information  (Stasser  1999).  Thus,  the  quality  of  information  sharing  or  brainstorming 
among  team  members  is  likely  to  influence  the  effectiveness  of  the  discussion  and,  in  turn,  the 
decisions  of  the  team. 

Building  on  professional  guidance  and  the  accounting  and  psychology  literatures,  our  model 
in  Figure  1  posits  that  the  quality  of  the  brainstorming  session  (1)  positively  moderates  the 
relations  between  auditors’  fraud  judgments  and  (2)  is  based  on  three  elements:  attendance  and 
communication,  brainstorming  structure  and  timing,  and  engagement  team  effort. 

Attendance  and  Communication 

Attendance  and  communication  affect  the  quality  of  team  interaction  and  discussion.  Postmes 
et  al.  (2001)  suggest  that  teams  that  promote  critical  thought  are  more  likely  to  ensure  careful 
evaluation  of  information  and  arrive  at  higher  quality  decisions.  Schippers  et  al.  (2003)  find  that 
the  quality  of  team  communication  is  enhanced  when  teams  are  more  diverse  and  when  they 
encourage  open  communication.  Stasser  (1999)  illustrates  that  openness  to  ideas  improves  deci¬ 
sion  quality  by  encouraging  the  sharing  of  information.  In  line  with  this  research,  we  expect  that 
as  more  members  of  the  engagement  team  attend  and  participate  in  the  brainstorming  session, 
there  will  be  greater  diversity  and  more  sharing  of  information.  This  should  improve  the  overall 
quality  of  the  session  and  the  responsiveness  of  fraud  judgments. 

Brainstorming  Structure  and  Timing 

The  structure  and  timing  of  team  discussions  are  important  to  the  quality  of  team  judgments. 
For  example,  increased  time  pressure  exacerbates  the  tendency  of  teams  to  seek  cognitive  closure, 
reduces  motivation,  and  increases  reliance  on  heuristics  (e.g.,  Kruglanski  and  Freund  1983;  De 
Dreu  2003).  Further,  De  Dreu  (2003)  suggests  that  pressure  reduces  team  motivation  to  process 
information  in  a  systematic  and  deliberate  way,  leading  to  reduced  team  effectiveness.  Although 
time  pressure  is  ever-present  in  the  audit  environment,  auditors  can  potentially  mitigate  its  nega¬ 
tive  effects  by,  for  example,  conducting  brainstorming  sessions  early  in  the  planning  phase  of  the 
audit  rather  than  later.  Sessions  held  earlier  in  the  planning  process  are  expected  to  positively 
influence  auditors’  fraud  judgments  as  the  engagement  team  will  have  more  time  to  implement  the 
ideas  endorsed  during  the  session. 

Engagement  Team  Effort 

Engagement  team  effort  is  also  an  important  determinant  of  the  quality  of  teams’  brainstorm¬ 
ing  sessions.  There  are  two  qualitatively  different  mechanisms  of  information  processing:  shallow, 
less  critical,  heuristic  information  processing  and  more  argument-based,  effortful,  systematic  in¬ 
formation  processing  (Chaiken  and  Trope  1999).  If  the  goal  is  to  discover  popular  beliefs  and 
preferences,  team  interaction  tends  to  be  preference-driven,  leaving  little  opportunity  for  innova¬ 
tive  decision  making.  In  contrast,  information-driven  teams  employ  effortful  systematic  process¬ 
ing,  communicating  and  integrating  relevant  information,  to  arrive  at  higher  quality  decisions 
(Stasser  and  Birchmeier  2003).  Such  an  effortful  process  focus  is  evident  in  SAS  No.  99-related 
guidance,  which  suggests  that,  for  example,  auditors  individually  identify  risks  and  potential  audit 
responses  prior  to  brainstorming  (AICPA  2003).  These  requirements  should  enhance  each  team 
member’s  involvement  in  the  fraud  audit  process,  augment  their  client-specific  knowledge,  and 
improve  their  contribution  to  the  brainstorming  session.  Therefore,  we  expect  that  the  more  effort 
auditors  exert  prior  to  and  during  brainstorming,  the  higher  the  quality  of  team  interaction  and 
resultant  fraud-related  judgments. 
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In  summary,  higher  quality  brainstorming  sessions  are  expected  to  improve  (1)  the  links 
between  fraud  risk  factors  and  related  assessments  and  (2)  the  relations  between  fraud  risk  assess¬ 
ments  and  subsequent  testing.  That  is,  we  expect  session  quality  to  positively  moderate  both 
relations,  as  stated  in  the  following  hypotheses: 

HI:  Fraud  risk  factors  are  more  positively  related  to  fraud  risk  assessments  in  high-quality 
brainstorming  sessions  than  in  low-quality  brainstorming  sessions. 

H2:  Fraud  risk  assessments  are  more  positively  related  to  fraud  risk  responses  in  high-quality 
brainstorming  sessions  than  in  low-quality  brainstorming  sessions. 

III.  METHOD 

Participants 

We  approached  contacts  in  the  Big  4  firms  and  one  other  international  firm  to  request  partici¬ 
pation  in  the  study.  Each  firm  agreed  to  participate.  A  secure  Internet  survey  link  was  sent  to  each 
contact  who  distributed  the  survey  to  partners,  directors,  senior  managers,  and  managers  in  their 
offices.  A  total  of  367  auditors  received  the  survey  link,  of  which  179  completed  the  survey, 
resulting  in  a  response  rate  of  48.8  percent.  This  rate  exceeds  the  response  rates  in  other  auditing 
surveys  (e.g.,  16  percent  response  rate  by  Nelson  et  al.  [2002];  29.7  percent  response  rate  by 
Dowling  [2009]);  however,  it  is  lower  than  the  exceptionally  high  70.5  percent  achieved  by 
Gibbins  et  al.  (2001).^  Participants  held  positions  of  partner  (56),  director  (2),  senior  manager  (60), 
and  manager  (61).  All  179  brainstorming  sessions  in  our  study  reported  to  have  an  engagement 
partner  in  attendance. 

Research  Instrument 

Similar  to  the  method  used  by  Gibbins  et  al.  (2001),  Nelson  et  al.  (2002),  and  Dowling 
(2009),  participants  completed  a  field  survey  that  required  them  to  select  one  recently  completed 
audit  for  which  a  fraud  brainstorming  session  was  held.  They  then  responded  to  questions  related 
to  the  consideration  of  fraud  based  upon  their  experiences  on  that  engagement.  Participants  pro¬ 
vided  measures  of  fraud  risk  factors,  their  overall  fraud  risk  assessment  for  the  engagement, 
measures  of  audit  responses  to  the  fraud  risk  assessment,  and  objective  and  subjective  data  re¬ 
garding  the  quality  of  the  audit’s  fraud  brainstorming  session(s). 

Fraud  Risk  Factors 

Table  1  provides  descriptive  data  related  to  fraud  risk  factors.  Fraud  risk  factors  elicited  from 
participants  were  measured  according  to  SAS  No.  99  and  prior  research  (e.g.,  Wilks  and  Zimbel- 
man  2004)  that  characterize  fraud  risk  factors  as  incentives,  opportunities,  or  rationalizations.  We 
measure  incentives  with  seven  questions  related  to  market  and  debt  incentives  and  opportunity 
with  five  measures  of  corporate  governance  quality,  internal  control  strength,  and  accrual  levels. 
We  were  unable  to  find  research  that  suggests  rationalization  can  be  measured  with  multiple  items, 
so  we  use  one  measure  consistent  with  the  wording  in  SAS  No.  99.  To  facilitate  comparisons 
across  participants,  we  measure  all  but  one  of  these  items  (i.e.,  public  versus  private  company 
status)  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high. 

Factor  analyses  indicate  that  the  items  associated  with  market-related  incentives  and  debt- 
related  incentives  load  on  separate  factors.  Because  participants  were  allowed  to  provide  data  from 


^  Because  not  all  individuals  responded  to  our  survey,  we  examine  the  potential  for  nonresponse  bias.  Oppenheim  (1992) 
recommends  comparing  data  from  late  respondents  to  early  respondents  as  a  way  of  assessing  this  bias.  Accordingly,  we 
compare  the  responses  from  the  first  quartile  of  respondents  to  those  of  the  last  quartile  of  respondents.  There  are  no 
statistically  significant  differences  between  early  and  late  responses  on  any  of  our  hypothesized  variables. 
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TABLE  1 

Descriptive  Statistics:  Fraud  Risk  Factors 


Response 
[n  =  179] 


Item** 

Mean 

Std.  Dev. 

Market  Incentive 

Percentage  of  publicly  traded  clients 

67.03 

(47.00) 

Incentives  or  pressures  for  client  to  commit  fraud 

4.48 

(2.14) 

Pressure  for  client  to  meet  earnings  estimates 

4.89 

(2.57) 

Extent  of  equity-based  management  compensation 

4.28 

(2.52) 

Extent  of  budget-based  management  compensation 

4.68 

(2.40) 

Debt  Incentive 

Pressure  for  client  to  meet  debt  covenants 

3.72 

(2.68) 

Amount  of  debt 

4.93 

(2.75) 

Opportunity 

Opportunity  for  fraud  to  be  perpetrated  at  client 

4.90 

(1.92) 

Strength  of  corporate  governance 

6.85 

(2.26) 

Likelihood  of  management  override  of  controls 

4.32 

(2.12) 

Overall  internal  control  effectiveness 

6.74 

(2.05) 

Difference  between  net  income  and  cash  flow  from  operations 

4.58 

(2.34) 

Rationalization 

Extent  to  which  management  is  enabled  to  rationalize  fraud 

3.57 

(2.07) 

With  the  exception  of  whether  the  client  chosen  was  publicly  traded  (coded  1)  or  private  (coded  0),  all  items  are 
measured  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high.  Also,  due  to  the  positive  wording  of  two 
opportunity  items  (strength  of  corporate  governance  and  overall  internal  control  effectiveness),  responses  for  these  two 
items  are  reverse-coded  for  data  analysis. 


audits  of  both  publicly  traded  and  privately  held  companies,  we  decompose  the  incentive  eonstruct 
into  “market  incentive”  and  “debt  incentive.”  We  measure  the  four  fraud  risk  factors  as  the  mean 
of  the  participant’s  responses  to  the  items  classified  under  each  factor."^ 

Fraud  Risk  Assessment  and  Fraud  Risk  Responses 

Similar  to  Carpenter  (2007),  participants  provided  the  overall  fraud  risk  assessment  for  the 
engagement  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high.  The  non-tabulated 
mean  fraud  risk  assessment  (standard  deviation)  for  the  sample  was  4.92  (2.23).  Figure  2  provides 
a  frequency  distribution  of  fraud  risk  assessments  and  illustrates  substantial  variation  among  the 
risk  assessments,  which  ranged  from  2  to  10  for  our  sample.  After  providing  risk  assessments, 
participants  were  asked  to  provide  data  on  how  the  engagement  team  responded  to  the  assessment 
by  modifying  the  nature,  staffing,  timing,  and  extent  of  subsequent  audit  procedures.  Table  2 


With  only  three  exceptions,  factor  analyses  of  items  combined  to  create  fraud  risk  factors  and  fraud  risk  responses 
provide  factor  loadings  in  excess  of  0.50  and  Cronbach’s  alpha  levels  exceeding  the  generally  accepted  threshold  of  0.70 
(Nunnally  1978).  For  all  combined  measures,  the  percents  of  variances  explained  exceed  60.0  percent.  With  respect  to 
the  exceptions:  (1)  the  factor  loading  for  a  measure  of  total  accruals  (0.283)  was  unacceptable  and  therefore  excluded 
from  subsequent  analyses,  (2)  the  remaining  four  measures  of  opportunity  provide  a  moderate  reliability  level  (Cron¬ 
bach’s  alpha  =  0.62)  that  is  acceptable  for  testing  relationships  (e.g.,  Bamber  et  al.  1989),  and  (3)  the  Cronbach’s  alpha 
level  for  staffing  (0.423)  was  low  and  we  therefore  provide  results  for  staffing  with  the  two  staffing  items  combined  and 
separated. 
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FIGURE  2 

Frequency  Distribution  of  Fraud  Risk  Assessments 


This  figure  illustrates  fraud  risk  assessments  across  the  sample  of  actual  audit  engagements.  Fraud  risk  assess¬ 
ment  is  measured  on  a  scale  ranging  from  1  (“extremely  low”)  to  10  (“extremely  high”). 


provides  descriptive  statistics  on  the  items  used  to  measure  fraud  risk  responses.  The  “nature”  of 
responses  was  measured  with  two  items:  (1)  the  degree  to  which  the  number  of  testing  procedures 
was  increased,  and  (2)  the  degree  to  which  the  types  of  procedures  were  changed.  “Staffing”  was 
determined  by  both  the  degree  to  which  more  experienced  staff  or  forensic  specialists  were  used 
on  the  engagement.  The  “timing”  of  audit  responses  was  measured  by  the  degree  to  which  the 
timing  of  audit  procedures  was  changed  on  the  engagement.  The  “extent”  of  auditors’  responses 
was  measured  with  three  items:  the  extent  to  which  (1)  budgeted  audit  hours,  (2)  sample  sizes,  and 
(3)  audit  documentation  review  were  increased.  All  eight  items  were  measured  on  a  scale  where  1 
=  none  and  10  =  extensively.  We  measure  the  four  fraud  risk  responses  as  the  mean  of  the 
participant’s  answers  to  the  underlying  items. 

Quality  of  the  Fraud  Brainstorming  Session 

We  develop  a  21 -item  measure  to  determine  the  quality  of  brainstorming  sessions.  Table  3 
provides  descriptive  statistics  for  the  21  items  used  to  measure  quality  based  on  three  elements: 
attendance  and  communication,  brainstorming  structure  and  timing,  and  engagement  team  effort. 
Responses  for  each  of  the  items  were  coded  either  1  (high  quality)  or  0  (low  quality),  based  upon 
the  participant’s  response.  Similar  to  DeFond  et  al.’s  (2005)  multiple  measure  of  corporate  gov- 
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TABLE  2 

Descriptive  Statistics:  Fraud  Risk  Responses 


Response 
[n  =  179] 


Item® 

Mean 

Std.  Dev. 

Nature 

Increase  the  number  of  audit  procedures 

5.26 

(2.43) 

Change  the  types  of  audit  procedures 

5.56 

(2.41) 

Staffing 

Use  more  experienced  auditors  to  perform  audit  procedures 

4.36 

(2.78) 

Increase  the  use/consultation  of  forensic  specialists 

2.79 

(2.73) 

Timing 

Change  the  timing  of  audit  procedures 

3.83 

(2.70) 

Extent 

Increase  budgeted  hours 

3.35 

(2.43) 

Increase  sample  sizes 

3.73 

(2.56) 

Increase  the  extent  of  audit  documentation  review 

4.71 

(2.80) 

All  items  are  measured  on  a  scale  where  1  =  none  and  10  =  extensively. 


emance  quality,  we  weigh  each  item  equally  to  avoid  adding  additional  subjectivity  to  the  mea¬ 
sure.  Thus,  we  rate  the  total  quality  of  a  brainstorming  session  on  a  scale  from  0-21.  The  non- 
tabulated  mean  quality  score  for  our  sample  was  10.56  (SD  =  3.85),  with  a  minimum  score  of  3 
and  a  maximum  score  of  18.  Figure  3  provides  a  frequency  distribution  of  the  brainstorming 
session  quality  scores,  indicating  substantial  variation  in  brainstorming  quality. 

Attendance  and  Communication 

Our  measure  includes  1 1  items  related  to  which  professionals  led/attended  the  brainstorming 
sessions  and  the  extent  of  their  contribution.  For  example,  proper  tone  at  the  top  and  a  highly 
qualified  person  leading  the  session  are  important  to  the  success  of  the  brainstorming  process 
(AICPA  2003;  Landis  et  al.  2008).  Further,  whole  engagement  team  participation  allows  more 
experienced  auditors  to  share  their  fraud  insights/experiences  with  less  experienced  team  members 
(e.g.,  AICPA  2003).  Therefore,  if  the  brainstorming  session  is  led  by  either  the  engagement  partner 
or  a  fraud  specialist,  the  engagement  receives  a  coding  of  1  (i.e.,  high  quality)  for  this  item.  We 
measure  the  respondent’s  perceived  level  of  contribution  to  the  brainstorming  session  from  staff, 
seniors,  managers,  partners,  and  various  specialists,  as  well  the  level  of  the  engagement  manage¬ 
ment’s  openness  to  all  ideas  submitted  during  the  brainstorming  session.  These  items  are  measured 
on  a  scale  from  1  (extremely  low)  to  10  (extremely  high)  and  responses  related  to  these  items  are 
coded  high  if  they  exceed  the  sample  mean  for  the  item  (means  provided  in  Table  3). 

Brainstorming  Structure  and  Timing 

The  survey  contains  four  measures  of  brainstorming  structure  and  timing.  For  example,  prac¬ 
titioner  guidance  (e.g.,  AICPA  2003;  Beasley  and  Jenkins  2003)  suggests  that  the  use  of  agendas 
should  increase  the  effectiveness  and  efficiency  of  brainstorming  sessions.  Conversely,  Asare  and 
Wright  (2004)  find  that  the  use  of  fraud  checklists  can  impair  auditor  performance.  Therefore, 
responses  are  coded  as  high  quality  if  an  agenda  (checklist)  is  (is  not)  used. 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1284 


Brazel,  Carpenter,  and  Jenkins 


TABLE  3 


Descriptive  Statistics:  Quality  of  Fraud  Brainstorming  Session  (21  Items) 


Response 
[n  =  179] 


Item 

Attendance  and  Communication 

1 .  %  of  engagements  where  the  session  was  led  by  the  partner  or  forensic 
specialist 

2.  %  of  engagements  where  all  levels  of  engagement  team  attended  sessions 

3.  %  of  engagements  where  a  fraud  specialist  attended  the  primary  session 

4.  %  of  engagements  where  an  IT  audit  specialist  attended  the  primary  session 

5.  %  of  engagements  where  a  tax  professional  attended  the  primary  session 

6.  Level  of  staff  contribution  to  the  session  [n  =  13 

7.  Level  of  senior  contribution  to  the  session  [n  =  170]“’ 

8.  Level  of  manager  contribution  to  the  session^ 

9.  Level  of  partner  contribution  to  the  session^ 

10.  Level  of  fraud  specialist  contribution  to  the  session  [n  =  56]“’ 

1 1 .  Level  of  openness  to  ideas  submitted  during  the  session^ 

Brainstorming  Structure  and  Timing 

12.  %  of  engagements  where  an  agenda  was  used  during  the  session 

13.  %  of  engagements  where  a  checklist  was  not  used  during  the  session 

14.  %  of  engagements  where  the  primary  session  was  held  pre-planning  or 
early  in  planning 

15.  %  of  engagements  where  a  session  was  held  at  the  end  of  the  prior  year’s 
audit 

Engagement  Team  Effort 

16.  Time  spent  by  the  engagement  team  preparing  for  the  session  (in  hours) 

17.  %  of  engagements  that  had  more  than  one  session 

18.  Total  length  of  time  for  the  sessions  (in  minutes) 

19.  %  of  engagements  where  team  members  were  asked  to  identify  risks  prior 
to  the  session 

20.  Extent  of  discussion  about  how  management  might  perpetrate  fraud“ 

21.  Extent  of  discussion  about  audit  responses  to  fraud  risk“ 


Mean 

Std.  Dev. 

59.78 

(49.22) 

72.63 

(44.50) 

31.29 

(46.34) 

69.27 

(46.34) 

62.56 

(48.59) 

4.32 

(2.13) 

6.03 

(1.91) 

7.93 

(1.48) 

8.41 

(1.48) 

7.01 

(2.31) 

8.54 

(1.42) 

83.80 

(36.60) 

28.49 

(45.34) 

64.81 

(47.95) 

16.20 

(36.03) 

9.10 

(15.51) 

49.16 

(50.13) 

93.68 

(99.16) 

67.60 

(47.00) 

7.25 

(1.69) 

7.38 

(1.60) 

“  Items  are  measured  on  a  scale  where  1  =  extremely  low  and  10  —  extremely  high. 

Mean  represents  the  average  response  in  sessions  where  the  professional(s)  (e.g.,  staff,  fraud  specialist)  was  present 
(versus  not  present).  Manager(s)  and  partner(s)  were  present  for  all  observations. 


Engagement  Team  Effort 

We  measure  engagement  team  effort  with  six  items.  To  illustrate,  we  deem  responses  as  high 
if  the  time  spent  by  the  engagement  team  preparing  for  the  brainstorming  session  exceeds  the 
mean  for  our  sample,  if  the  engagement  has  more  than  one  brainstorming  session,  or  if  the  total 
length  of  time  for  the  brainstorming  session(s)  exceeds  the  mean  for  our  sample.^ 


^  Factor  analyses  performed  on  the  21  categorical  response  items  provide  significant  factor  loadings  (p  <  0.05)  for  20  of 
the  21  items  on  their  respective  constructs.  Further,  each  of  these  20  response  items  provide  significant  factor  loadings 
(p  <  0.05)  on  the  overall  construct  of  brainstorming  quality.  Both  the  Comparative  Fit  Index  of  0.93  and  the  Tucker- 
Lewis  Index  of  0.93  are  above  0.90,  which  is  indicative  of  a  good  fit  for  our  overall  construct  of  brainstorming  quality 
(Kline  2004).  Only  one  item,  percentage  of  engagements  where  all  levels  of  the  engagement  team  attended  sessions,  did 
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FIGURE  3 

Frequency  Distribution  of  Brainstorming  Session  Quality  Scores 

30  1 - — — - 
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Brainstorming  Session  Quality  Score 


This  figure  illustrates  brainstorming  session  quality  scores  across  the  sample  of  actual  audit  engagements. 
Brainstorming  quality  scores  are  formed  using  a  2 1  -item  measure  with  each  item  being  equally  weighted  to 
determine  a  session’s  quality.  Consequently,  session  quality  scores  can  range  from  0  to  21  points. 


Control  Variables 

Table  4  provides  descriptive  statistics  for  our  control  variables.  Prior  research  has  found  that 
client  size,  industry,  and  complexity  impact  auditor  risk  assessments  and  testing  decisions  (e.g., 
O’Keefe  et  al.  1994).  We  control  for  size  by  including  a  measure  of  the  client’s  revenues  in  our 
analyses.  We  asked  respondents  to  provide  the  client’s  industry  and,  similar  to  Johnstone  and 
Bedard  (2003),  create  dichotomous  industry  control  variables  to  control  for  industry  effects.  As 
suggested  by  the  audit  fee  and  resource  allocation  literatures  (e.g.,  Hackenbrack  and  Knechel 
1997),  we  measure  client  complexity  as  the  number  of  financial  statement  auditors  assigned  to  the 
engagement  divided  by  our  size  measure.  Because  we  investigate  fraud-related  judgments,  we 
control  for  whether  fraud  was  detected  on  the  engagement.  We  also  control  for  the  effects  of  other 
auditor  or  engagement  team-related  variables  that  could  influence  audit  judgments  (e.g.,  Bonner 
1990).  Specifically,  with  dichotomous  firm  control  variables,  we  simultaneously  control  for  the 
specific  audit  firm  which  performed  the  audit  and  the  inclusion  of  one  non-Big  4  firm  in  our 
sample.  Also,  our  model  includes  the  participant’s  general  audit  experience,  position  in  the  firm. 


not  load  satisfactorily  (p  =  0.70).  Omitting  this  item  from  the  factor  analyses  and  our  session-quality  measure  in  our 
regression  analyses  does  not  qualitatively  change  our  results.  We  include  this  item  in  our  analyses  because  the  item  was 
specifically  identified  as  a  deficiency  by  the  PCAOB  (PCAOB  2007). 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1286 


Brazel,  Carpenter,  and  Jenkins 


TABLE  4 


Descriptive  Statistics:  Control  Variables 

Response 
[n  =  179] 


Item 

Client  Size 

Manufacturing  industry 
Retail  industry 

Govemment/not-for-profit  industry 
Energy  industry 

High  tech/communications  industry 

Healthcare/pharmaceutical  industry 

Financial  services  industry 

Miscellaneous  industry 

Client  complexity 

Fraud  detect 

Audit  firm  A 

Audit  firm  B 

Audit  firm  C 

Audit  firm  D 

Audit  firm  E 

Audit  experience 

Position 

Client  experience 
Industry  experience 
Engagement  team  expertise 
Fraud  training 
Fraud  experience 
Restart 
Time  taken 


Mean 

Std.  Dev. 

3.06 

(1.42) 

0.20 

(0.40) 

0.11 

(0.31) 

0.19 

(0.39) 

0.08 

(0.28) 

0.14 

(0.35) 

0.07 

(0.26) 

0.07 

(0.25) 

0.14 

(0.35) 

0.93 

(0.65) 

0.24 

(0.42) 

0.14 

(0.35) 

0.19 

(0.39) 

0.32 

(0.47) 

0.16 

(0.37) 

0.19 

(0.39) 

1.70 

(0.86) 

2.29 

(1.23) 

3.35 

(2.05) 

8.69 

(6.73) 

7.50 

(1.81) 

1.42 

(0.66) 

1.92 

(0.71) 

0.08 

(0.28) 

22.29 

(15.51) 

“  Items  were  measured  as  follows: 

Client  size  is  based  on  client  revenues  and  coded  as  follows:  1  =  <  $100  million,  2  =  $100  million  —$500  million,  3  — 
>  $500  million  -$1  billion,  4  =  >  $1  billion  -$5  billion,  5  =  >  $  5  billion.  Industry  variables  are  dichotomous  (1,0) 
industry  variables  for  the  following  industries:  manufacturing,  retail,  government/not-for-profit,  energy,  high  tech/ 
communications,  healthcare/pharmaceuticals,  financial  services,  and  miscellaneous.  Client  complexity  is  measured  as 
number  of  financial  statement  auditors  assigned  to  the  engagement  (coded  as  follows:  1  =  0-5  auditors,  2  =  6-10  auditors, 
3  =  11-15  auditors,  4  =  16-20  auditors,  5  =  >  20  auditors)/client  size.  Fraud  detect  is  coded  1  if  the  participant  reported 
that  fraud  was  detected  at  the  client,  0  otherwise.  Audit  firm  variables  are  dichotomous  (1,0)  firm  variables  for  the  four 
international  and  one  national  (i.e.,  non-Big  4)  firm  that  participated  in  our  study.  Audit  experience  is  coded  as  follows:  1 
=  5-10  years,  2  =  11-15  years,  3  =  >  15  years.  Position  is  coded  as  follows:  1  =  manager,  2  =  senior  manager,  3  = 
director,  4  =  partner.  Client  experience  is  measured  as  the  number  of  years  the  respondent  had  served  the  client.  Industry 
experience  is  coded  as  follows:  1  =  0-5  years,  2  =  6-10  years,  3  =  11-15  years,  4  =  16-20  years,  5  =  >  20  years. 
Engagement  team  expertise  is  measured  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high.  Fraud  training  is 
measured  by  the  number  of  hours  of  fraud  training  the  respondent  had  received  in  the  last  year  and  coded  as  follows:  1  = 
0-8  hours,  2  =  9-16  hours,  3  =  >  16  hours.  Fraud  experience  is  determined  by  the  number  of  engagements  the  respondent 
served  on  which  frauds  were  identified  and  coded  as  follows:  1  =  0,  2  =  1—2,  3  =  >2.  Restart  is  coded  1  if  the  participant 
stopped  and  restarted  the  survey,  0  otherwise.  Time  taken  is  the  time  taken  by  the  164  respondents  who  did  not  stop  and 
restart  the  survey. 
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experience  with  the  client  and  the  client’s  industry,  extent  of  fraud  training,  experience  level  with 
actual  financial  statement  fraud,  their  perception  of  the  entire  engagement  team’s  expertise  level, 
whether  the  respondent  stopped  and  started  the  survey,  and  time  taken  to  complete  the  survey. 

Regression  Models 

To  test  HI  and  H2,  we  estimate  the  following  models  via  ordinal  logistical  regression  and 
examine  the  italicized  interactions: 

HI: 

FR  =  l3o  +  fijMI  +  /32DI  +  PsOPP  +  I34RAT+  PsSESSION  QUALITY  +  yS^M/ 

X  SESSION  QUALITY  +  IB7DI  X  SESSION  QUALITY  +  fisOPP  X  SESSION  QUALITY 
+  (3gRAT  X  SESSION  QUALITY  +  /3ia,32CONTROL  VARIABLES  +  e. 

H2: 

NATURE  =fio  +  f^iFR  +  /32SESSION  QUALITY  +  ft^ER  X  SESSION  QUALITY 

+  /34_26Control  variables  +  8. 

STAPLING  =  ySo  +  I3}FR  +  ^2SESSION  QUALITY  +  fsER  X  SESSION  QUALITY 

+  /34_26COntrol  variables  +  8. 

TIMING  =  (So  +  /3jER  +  I32SESSION  QUALITY  +  PsLR  X  SESSION  QUALITY 
+  /34_26C0NTR0L  variables  +  8. 

EXTENT  =  /3o  +  PiER  +  jSjSESSION  QUALITY  +  fSjER  X  SESSION  QUALITY 
+  y84_26CONTROL  VARIABLES  +  8. 

where: 


FR  =  Fraud  Risk  Assessment,  measured  on  a  scale  where  1  =  extremely 
low  and  10  =  extremely  high; 

MI,  DI,  OPP,  RAT  =  Market  Incentive,  Debt  Incentive,  Opportunity,  and  Rationalization; 

see  the  “Fraud  Risk  Factors”  section  for  information  on  measurement; 

SESSION  QUALITY  =  see  the  “Quality  of  the  Fraud  Brainstorming  Session”  section  for 

information  on  measurement;  this  variable  is  coded  1  if  session 
quality  for  the  engagement  exceeds  the  mean  for  the  sample  (10.56), 

0  otherwise;  the  use  of  an  indicator  variable  is  described  in  the 
“Correlation  Matrix  and  Multicollinearity”  section  below; 

NATURE,  STAFFING, 

TIMING,  EXTENT  =  see  the  “Fraud  Risk  Assessment  and  Fraud  Risk  Responses”  section 
for  information  on  measurement; 

CONTROL  VARIABLES  {CLIENT  SIZE,  INDUSTRY  VARIABLES,  CLIENT  COMPLEXITY, 
FRAUD  DETECT,  AUDIT  FIRM  VARIABLES,  AUDIT  EXPERIENCE,  POSITION,  CLIENT  EX¬ 
PERIENCE,  INDUSTRY  EXPERIENCE,  ENGAGEMENT  TEAM  EXPERTISE,  FRAUD  TRAIN¬ 
ING,  FRAUD  EXPERIENCE,  RESTART,  TIME  TAKEN) 

=  see  the  “Control  Variables”  section  and  Table  4  for  information  on 
measurement. 
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IV.  RESULTS 

Descriptive  Statistics  of  Brainstorming  Practices 

Given  our  sample  of  179  recently  completed  audit  engagements  across  five  public  accounting 
firms,  Table  3  provides  descriptive  evidence  regarding  how  selected  fraud  brainstorming  sessions 
are  being  performed  in  practice.  First,  our  participants  report  that  a  partner  or  forensic  specialist 
led  the  session  only  60  percent  of  the  time,  pointing  toward  the  important  role  audit  managers  and 
seniors  probably  play  in  determining  the  quality  of  the  session.  Second,  consistent  with  recent 
PCAOB  inspection  team  reports  (PCAOB  2007),  we  find  evidence  that  the  attendance  of  special¬ 
ists  (fraud,  tax,  IT)  in  sessions  varies.  Third,  our  results  are  consistent  with  the  hierarchical 
structure  of  engagement  teams  in  that  the  level  of  contribution  to  the  session  increases  with 
seniority.  Fourth,  consistent  with  PCAOB  inspection  team  findings  (PCAOB  2007),  for  35  percent 
of  engagements,  sessions  were  held  after  the  early  stages  of  planning.  Last,  engagement  team 
preparation  for  the  sessions  tended  to  be  substantial,  sessions  lasted  an  average  of  1.5  hours,  and 
49  percent  of  engagements  conducted  more  than  one  session. 

Correlation  Matrix  and  Multicollinearity 

Table  5  presents  a  correlation  matrix.  To  present  a  parsimonious  correlation  matrix,  brain¬ 
storming  quality  items  and  control  variables  are  excluded  from  Table  5  if  they  are  not  significantly 
correlated  (p  <  0.05)  with  at  least  four  of  the  hypothesized  variables. 

We  test  our  hypotheses  via  interactions  within  five  ordinal  logistical  regression  models.  These 
interactions  are  the  cross  product  term  of  two  independent  variables  (e.g.,  fraud  risk  X  session 
quality).  In  such  regressions,  the  interactive  term  is  likely  to  covary  with  the  main  effects  consti¬ 
tuting  the  interaction  term.  These  regressions  often  exhibit  multicollinearity  problems  (Echambadi 
and  Hess  2007).  Likewise,  when  we  interact  our  21-item  measure  for  session  quality  with  risk 
factors  (HI)  and  risk  assessments  (H2),  diagnostics  suggest  multicollinearity  exists  between  the 
interaction  terms  and  the  variables  that  constitute  the  interaction  (e.g.,  variance  inflation  factors 
[VIFs]  in  excess  of  10,  condition  indices  in  excess  of  30  with  more  than  50  percent  of  the  variance 
of  coefficient  estimates  associated  with  these  indices).  Multicollinearity,  similar  to  a  small  sample 
size,  biases  against  finding  statistically  significant  main  effects  and  interactions  in  multivariate 
regression  (e.g.,  Belsley  et  al.  1980;  Hair  et  al.  1998;  Wooldridge  2009). 

Prior  studies  have  mitigated  multicollinearity  effects  by  transforming  their  continuous  or 
scaled  moderating  variable  into  an  indicator  variable  (e.g..  Core  et  al.  2002;  Ali  et  al.  2008; 
Cardinaels  2008).  Thus,  we  transform  our  21 -item  measure  of  brainstorming  quality  into  an 
indicator  variable,  coding  each  engagement’s  total  session  quality  as  high  or  1  (low  or  0)  if  the 
total  score  for  the  engagement  exceeds  (is  less  than)  the  mean  for  our  sample  (versus  using  the 
21 -item  measure).^  We  use  this  indicator  variable  to  examine  the  main  and  interactive  effects  of 
session  quality  in  all  of  our  tabulated  and  non-tabulated  regression  analyses.  Tests  of  multicol¬ 
linearity  for  our  HI  regression  analyses  (with  the  session  quality  indicator  variable)  indicate  that 
all  but  one  of  the  independent  variables  for  HI  provide  satisfactory  multicollinearity  diagnostics 
(e.g.,  Belsley  et  al.  1980;  Neter  et  al.  1996;  Kennedy  2008).  The  sole  exception  is  the  VIF  for  the 
interaction  between  opportunity  and  session  quality,  which  is  11.71.  Using  condition  indices  and 


®  Alternatively,  we  could  have  split  the  sample  at  the  median  session  quality  score  (11.00)  and  excluded  firms  at  the 
median  from  our  analyses.  However,  we  would  then  have  eliminated  21  observations  with  a  session  quality  of  11.00 
from  our  analyses.  Given  our  relatively  small  sample  size  (179),  our  large  number  of  control  variables  (23),  that  the 
mean  and  median  for  our  sample  are  approximately  the  same  (10.56  and  11.00,  respectively),  and  partitioning  the 
sample  at  the  mean  can  be  performed  without  eliminating  observations  (i.e.,  no  observations  had  a  session  quality  score 
=  10.56),  we  use  the  mean  session  quality  score  to  create  the  indicator  variable  for  session  quality. 
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variances  to  assess  multicollinearity  (e.g.,  Belsley  et  al.  1980;  Core  et  al.  2002),  we  find  that  only 
one  of  the  33  condition  indices  for  HI  exceeds  the  standard  threshold  of  30,  and  over  50  percent 
of  the  variances  for  the  intercept  and  session  quality  are  associated  with  this  index  (Belsley  et  al. 
1980;  Kennedy  2008).  This  is  likely  the  result  of  session  quality  being  included  in  the  model  for 
HI  five  times  (one  main  effect  and  four  interactions).^  Thus,  for  testing  HI,  multicollinearity  is 
likely  to  bias  against  finding  significant  main  and  interactive  effects  for  session  quality,  and 
caution  must  be  taken  in  interpreting  null  findings  for  this  variable.  For  H2  analyses  (with  the 
session  quality  indicator  variable),  both  VIFs  and  condition  indices/variances  indicate  no  problems 
with  multicollinearity,  as  all  measures  are  below  acceptable  thresholds  (Neter  et  al.  1996;  Belsley 
et  al.  1980;  Kennedy  2008).^ 

Hypotheses  Testing 

Fraud  Risk  Factors,  Fraud  Risk  Assessments,  and  Brainstorming  Session  Quality 

Hypothesis  1  posits  that  fraud  risk  factors  are  more  positively  related  to  fraud  risk  assess¬ 
ments  in  high-quality  brainstorming  sessions  than  in  low-quality  sessions.  Thus,  HI  is  supported 
by  a  regression  result  that  provides  positive  and  significant  interactions  between  fraud  risk  factors 
and  session  quality  on  fraud  risk  assessments.  Table  6  provides  the  results  of  HI  testing.  For 
parsimony,  only  control  variables  with  p-values  <  0.10  are  included  in  Tables  6  and  7.  While  the 
main  effects  of  market  incentive,  opportunity,  and  rationalization  are  all  positive  and  significant 
(p-values  <  0.05),  their  interactions  with  brainstorming  session  quality  are  not.  Thus,  while  we 
find  that  three  of  the  four  risk  factors  are  positively  associated  with  fraud  risk  assessments,  we  do 
not  observe  that  these  relations  are  stronger  under  higher  quality  brainstorming. 

However,  diagnostics  suggest  that  multicollinearity  influences  the  main  and  interactive  effects 
of  session  quality  in  the  regression  model  for  HI  (specifically  related  to  its  interaction  with 
opportunity).  As  suggested  by  Bradshaw  et  al.  (2004)  and  Covrig  et  al.  (2007),  we  complement 
our  analysis  of  HI  with  two  separate  regressions  with  session  quality  as  low  and  high  (two 
subsamples).  We  partition  the  sample  at  the  mean  brainstorming  quality  score  (10.56),  and  re¬ 
perform  the  statistical  tests  described  above  for  each  of  the  two  groups  (removing  the  brainstorm¬ 
ing  quality  variable).  If  session  quality  positively  moderates  the  effect  of  fraud  risk  factors  on 
fraud  risk  assessments,  then  the  main  effects  of  fraud  risk  factors  on  fraud  risk  assessments  should 
be  more  significant  for  the  high-quality  subsample.  We  find  this  to  be  the  case  for  the  fraud  risk 
factors  of  debt  incentive  and  opportunity.  For  the  high-quality  group,  tests  of  HI  (not  tabulated) 
indicate  at  least  marginally  significant  and  positive  relations  between  debt  incentive  and  opportu¬ 
nity  on  fraud  risk  assessments  (p-values  =  0.06  and  0.04,  respectively).  For  the  low-quality  group, 
the  relations  between  debt  incentive  and  opportunity  on  fraud  risk  assessments  are  not  significant. 
Thus,  there  is  limited  evidence  supporting  Hl.^ 


’’  Given  the  significant  correlations  between  our  fraud  risk  factors  (see  Table  5)  and  that  auditors  simultaneously  incor¬ 
porate  fraud  risk  factors  into  one  fraud  risk  assessment,  not  including  all  four  fraud  risk  factors  in  the  same  multivariate 
regression  could  yield  spurious  results. 

*  To  ensure  that  our  interaction  results  are  stable  across  various  specifications,  we  analyze  the  following  models:  (1)  with 
both  main  and  interactive  effects  for  continuous  hypothesized  variables,  (2)  with  both  main  and  interactive  effects  for 
continuous  hypothesized  variables  (mean-centered),  (3)  with  only  interactive  effects  for  continuous  hypothesized  vari¬ 
ables  (no  mean-centering),  and  (4)  with  both  main  and  interactive  effects  for  continuous  hypothesized  variables  (no 
mean-centering,  but  session  quality  as  an  indicator  variable  (the  tabulated  model)).  Our  primary  inferences  are  similar 
across  all  four  models:  the  interactions  for  HI  are  largely  not  supported  and  the  interactions  for  H2  are  largely 
supported.  It  should  be  noted  that  Echambadi  and  Hess  (2007)  conclude  that  mean-centering  does  not  alleviate  col- 
linearity  problems  in  multivariate  regressions. 

®  Notwithstanding  our  expectation,  we  find  only  limited  evidence  that  the  quality  of  brainstorming  sessions  moderates  the 
relations  between  risk  factors  and  assessments.  Anecdotal  evidence  from  conversations  with  partners  at  the  Big  4  firms 
suggests  one  possible  explanation  for  our  results.  These  partners  report  that  engagement  team  management  come  to 
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TABLE  6 

HI  Testing:  Regression  Results  for  Fraud  Risk  Assessment^ 


Independent  Variable** 

Estimated 

Coefficient 

Wald- 

Statistic 

p-value 

MI 

0.271 

2.96 

0.043 

DI 

0.095 

1.07 

0.150 

OPP 

0.317 

3.13 

0.039 

RAT 

0.249 

4.35 

0.019 

SESSION  QUALITY 

0.409 

0.11 

0.739 

MI  X  SESSION  QUALITY 

0.001 

0.00 

0.499 

DI  X  SESSION  QUALITY 

0.018 

0.02 

0.443 

OPP  X  SESSION  QUALITY 

0.086 

0.14 

0.356 

RAT  X  SESSION  QUALITY 

-0.132 

0.71 

0.398 

CLIENT  SIZE 

-0.252 

3.10 

0.078 

ENGAGEMENT  TEAM 

0.200 

3.75 

0.053 

EXPERTISE 

ERAUD  TRAINING 

0.649 

7.23 

0.007 

RESTART 

-1.580 

3.51 

0.061 

Model  Chi-square  statistic  = 
79.60  (p-value  =  <  .001) 

Pseudo  =  .  362 


^  Fraud  risk  assessment  {FR)  is  measured  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high. 

^  The  “Method”  section  provides  information  on  the  measurement  of  market  incentive,  debt  incentive,  opportunity, 
rationalization.  SESSION  QUALITY  is  coded  1  if  the  total  session  quality  of  the  engagement  (described  in  the  “Method” 
section)  exceeded  the  sample  mean  (10.56),  0  otherwise.  CLIENT  SIZE  is  based  on  client  revenues  and  coded  as 
follows:  1  =<  $100  million,  2  =  $100  million-$500  million,  3  =>  $500  million-$l  billion,  4  =>  $1  billion-$5 
billion,  5  =>  $5  billion.  ENGAGEMENT  TEAM  EXPERTISE  is  measured  on  a  scale  where  1  =  extremely  low  and  10 
=  extremely  high.  FRAUD  TRAINING  is  measured  by  the  number  of  hours  of  fraud  training  the  respondent  had 
received  in  the  last  year  and  coded  as  follows:  1  =  0-8  hours,  2  =  9-16  hours,  3  =  >  16  hours.  RESTART  is  coded  1 
if  the  participant  stopped  and  restarted  the  survey,  0  otherwise. 


Fraud  Risk  Assessments,  Fraud  Risk  Responses,  and  Brainstorming  Session  Quality 

Hypothesis  2  posits  that  fraud  risk  assessments  are  more  positively  related  to  fraud  risk 
responses  in  high-quality  brainstorming  sessions  than  in  low-quality  sessions.  Table  7  presents 
results  related  to  H2.  For  nature,  staffing,  timing,  and  extent,  the  interaction  terms  for  fraud  risk 
assessment  and  session  quality  are  all  positive  and  at  least  marginally  significant  (p-values  <0.10, 
see  Panels  A-D),  thus  supporting  H2.  When  brainstorming  session  quality  is  higher,  audit  teams 
are  more  likely  to  add  or  change  testing  procedures,  employ  more  senior  staffing/fraud  specialists, 
change  the  timing  of  tests,  and  increase  the  extent  of  testing  in  response  to  higher  fraud  risk 


brainstorming  sessions  with  a  lot  of  information  about  the  likelihood  of  fraud  based  on  prior  years’  work  and  client 
continuation  discussions.  Specifically,  partners  indicate  that  fraud  risk  assessments  are  often  based  on  the  presence  or 
absence  of  specific  risk  factors  such  that  assessments  are  effectively  predetermined  by  the  partners  prior  to  the  brain¬ 
storming  sessions.  In  such  instances,  brainstorming  serves  primarily  to  inform  team  members  about  the  known  risk 
factors,  to  discuss  other  possible  factors  that  might  not  have  been  pre-identified  and,  more  importantly,  to  give  them  an 
opportunity  to  discuss  how  best  to  respond  to  fraud  risk.  How  previous  client  experiences,  client  continuation  discus¬ 
sions,  and  other  factors  affect  the  brainstorming  process  and  fraud-related  judgments  is  a  question  for  future  research. 


The  Accounting  Review 
American  Accounting  Association 


July  2010 


Auditors  ’  Use  of  Brainstorming  in  the  Consideration  of  Fraud:  Reports  from  the  Field 


1293 


TABLE  7 

H2  Testing:  Regression  Results  for  Fraud  Risk  Responses® 

Panel  A:  Nature 

Estimated 

Wald- 

Independent  Variable'’ 

Coefficient 

Statistic 

p-value 

FR 

0.229 

5.22 

0.022 

SESSION  QUALITY 

-0.148 

0.05 

0.832 

FR  X  SESSION  QUALITY 

0.196 

2.26 

0.067 

CLIENT  SIZE 

0.273 

3.92 

0.048 

HEALTHCARE/PHARMACEUTICAL  INDUSTRY 

-1.903 

7.92 

0.005 

CLIENT  COMPLEXITY 

0.643 

5.68 

0.017 

ERAUD  DETECTION 

0.667 

3.98 

0.046 

AUDIT  E IRMA 

-1.294 

6.14 

0.013 

AUDIT  EIRM  C 

-0.888 

3.88 

0.049 

AUDIT  EIRME 

-1.294 

6.14 

0.013 

ENGAGEMENT  TEAM  EXPERTISE 

0.162 

3.38 

0.066 

INDUSTRY  EXPERIENCE 

0.055 

3.42 

0.064 

Model  Chi-square  statistic  =  87.35  (p-value  <  .001) 

Pseudo  =  .390 

Panel  B:  Staffing 

Estimated 

Wald- 

Independent  Variable 

Coefficient 

Statistic 

p-value 

ER 

0.154 

2.30 

0.130 

SESSION  QUALITY 

-0.130 

0.03 

0.854 

FR  X  SESSION  QUALITY 

0.260 

3.84 

0.025 

CLIENT  SIZE 

0.504 

12.52 

<0.001 

CLIENT  COMPLEXITY 

0.599 

5.00 

0.025 

ERAUD  DETECTION 

0.792 

5.44 

0.020 

Model  Chi-square  statistic  =  94.62  (p-value  <  .001) 

Pseudo  =  .414 

Panel  C:  Timing 

Estimated 

Wald- 

Independent  Variable 

Coefficient 

Statistic 

p-value 

ER 

0.101 

0.95 

0.330 

SESSION  QUALITY 

-0.939 

1.71 

0.191 

ER  X  SESSION  QUALITY 

0.328 

5.96 

0.008 

MANUEACTURING  INDUSTRY 

1.097 

4.39 

0.036 

AUDIT  FIRM  A 

-1.29 

5.13 

0.023 

AUDIT  FIRM  B 

-1.90 

13.37 

<0.001 

CLIENT  EXPERIENCE 

-0.137 

3.19 

0.074 

ERAUD  EXPERIENCE 

-0.426 

3.23 

0.072 

Model  Chi-square  statistic  =  78.31  (p-value  <  .001) 

Pseudo  R^  =  .358 

(continued  on  next  page) 
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Panel  D:  Extent 

Independent  Variable 

FR 

SESSION  QUALITY 
FR  X  SESSION  QUALITY 
CLIENT  SIZE 
RETAIL  INDUSTRY 
FRAUD  DETECTION 
AUDIT  FIRM  B 
AUDIT  FIRM  C 
AUDIT  FIRM  E 
AUDIT  EXPERIENCE 
CLIENT  EXPERIENCE 

Model  Chi-square  statistic  =  85.52  (p-value  <  .001) 
Pseudo  =  .  383 


Estimated 

Coefficient 

Brazel,  Carpenter,  and  Jenkins 

Wald 

-statistic  p-value 

0.259 

6.64 

0.010 

0.095 

0.19 

0.891 

0.199 

2.32 

0.064 

0.234 

2.92 

0.088 

-1.163 

3.52 

0.061 

0.920 

7.50 

0.006 

-1.094 

4.91 

0.027 

-1.072 

5.66 

0.017 

-1.225 

5.58 

0.018 

-0.763 

4.59 

0.032 

-0.130 

3.10 

0.078 

^  The  “Method”  section  provides  information  on  the  measurement  of  the  fraud  risk  responses  of  NATURE,  STAFFING, 
TIMING,  and  EXTENT. 

FR  is  fraud  risk  assessment,  measured  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high.  SESSION 
QUALITY  is  coded  1  if  the  total  session  quality  of  the  engagement  (described  in  the  “Method”  section)  exceeds  the 
sample  mean  (10.52),  0  otherwise.  CLIENT  SIZE  is  based  on  client  revenues  and  coded  as  follows:  1  =<  $100  million, 
2  =  $100  million-$500  million,  3  =>  $500  million-$l  billion,  4  =>  $1  billion-$5  billion,  5  =>  $5  billion. 
HEALTHCARE/PHARMACEUTICAL  INDUSTRY  is  a  dichotomous  industry  variable  coded  1  if  healthcare/ 
phannaceutical  industry  client,  0  otherwise.  CLIENT  COMPLEXITY  is  measured  as  the  number  of  financial  statement 
auditors  assigned  to  the  engagement  (coded  as  follows:  1  =  0-5  auditors,  2  =  6-10  auditors,  3  =  11-15  auditors,  4  = 
16-20  auditors,  5  =  >  20  auditors)/CL/£'/V7’  SIZE.  FRAUD  DETECTION  is  coded  1  if  the  participant  reported  that 
fraud  was  detected  at  the  client,  0  otherwise.  AUDIT  FIRM  variables  are  dichotomous  (1,0)  firm  variables  for  the  four 
international  and  one  national  (i.e.,  non-Big  4)  firm  that  participated  in  our  study.  ENGAGEMENT  TEAM  EXPERTISE 
is  measured  on  a  scale  where  1  =  extremely  low  and  10  =  extremely  high.  INDUSTRY  EXPERIENCE  is  coded  as 
follows:  1  =  0-5  years,  2  =  6-10  years,  3  =  11-15  years,  4  =  16-20  years,  5  =  >  20  years.  MANUFACTURING 
INDUSTRY  is  a  dichotomous  industry  variable  coded  1  if  manufacturing  industry  client,  0  otherwise.  CLIENT  EXPE¬ 
RIENCE  is  measured  as  the  number  of  years  the  respondent  had  served  the  client.  FRAUD  EXPERIENCE  is  determined 
by  the  number  of  engagements  the  respondent  served  on  which  frauds  were  identified  and  coded  as  follows:  1  =  0,  2 
=  1-2,  3  =  >  2.  RETAIL  INDUSTRY  is  a  dichotomous  industry  variable  coded  1  if  retail  industry  client,  0  otherwise. 
AUDIT  EXPERIENCE  is  coded  as  follows:  1  =  5-10  years,  2  =  11-15  years,  3  =  >  15  years. 


assessments.  Consistent  with  H2,  a  higher  quality  brainstorming  session  appears  to  substantially 
improve  the  risk-based  strategy  prescribed  by  SAS  No.  99  (i.e.,  a  positive  relation  between  risk 
assessments  and  responses).'® 

To  further  illustrate  that  relations  between  fraud  risk  assessments  and  responses  are  stronger 
when  session  quality  is  high,  we  examine  the  response  of  nature  in  cases  where  risk  assessment  is 
“high”  (i.e.,  7-10  on  our  ten-point  scale  or  the  top  quartile  of  risk  assessments).  The  mean 
responses  under  high-  and  low-quality  sessions  are  8.19  and  5.37,  respectively  (p  <  0.01),  sug¬ 
gesting  that  under-auditing  could  be  occurring  when  brainstorming  quality  is  low." 


As  noted  in  footnote  4,  the  reliability  statistic  for  staffing  was  below  a  satisfactory  level.  Therefore,  we  examine  the 
interaction  posited  by  H2  in  relation  to  our  two  measures  of  staffing:  increase  use/consultation  of  fraud  specialists  and 
use  of  more  experienced  auditors.  Non-tabulated  results  indicate  that  the  interaction  term  for  fraud  risk  assessment  and 
brainstorming  quality  is  positive  and  marginally  significant  (p-values  <  0.10)  for  both  measures. 

”  In  order  to  provide  results  comparable  to  prior  experimental  research  examining  frauds  at  public  companies  (e.g.. 
Carpenter  2007),  we  examine  our  hypotheses  with  only  the  publicly  traded  companies  in  our  sample  (sample  size  = 
120).  Non-tabulated  regression  results  are  qualitatively  similar  to  those  tabulated  in  this  study. 
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Supplemental  Discussion  and  Analyses 

Specific  Brainstorming  Quality  Items  Driving  Effects 

While  our  21 -item  measure  of  brainstorming  quality  is  motivated  by  the  literature  and  appears 
to  improve  the  responsiveness  of  fraud  judgments,  it  might  not  be  practical  for  audit  teams  to 
adjust  all  21  items  at  once  to  improve  their  brainstorming  sessions.  We  therefore  examine  if  a 
smaller  number  of  items  could  be  identified  that  would  produce  the  same  moderating  effects  as  our 
21 -item  measure.  As  the  correlation  matrix  (Table  5)  indicates,  it  appears  that  seven  brainstorming 
items  are  significantly  correlated  with  a  number  of  the  hypothesized  variables  in  this  study.  These 
items  are:  (1)  whether  the  session  was  led  by  the  partner  or  forensic  specialist,  (2)  whether  an  IT 
audit  specialist  attended  the  primary  session,  (3)  whether  the  engagement’s  primary  session  was 
held  pre-planning  or  early  in  planning,  (4)  the  extent  of  discussion  about  how  management  might 
perpetrate  fraud,  (5)  the  extent  of  discussion  about  audit  responses  to  fraud  risk,  (6)  the  level  of 
manager  contribution  to  the  session,  and  (7)  the  level  of  partner  contribution  to  the  session.  These 
are  all  controllable  inputs  that  can  be  easily  fostered  by  engagement  management. 

In  testing  HI  and  H2,  we  replace  the  brainstorming  quality  indicator  variable  based  upon  21 

items  with  one  based  upon  these  seven  items.  Non-tabulated  regression  results  reveal  that  this 

seven-item  measure  positively  moderates  the  effect  of  rationalization  on  fraud  risk  assessments  (p 

=  0.03).  Recall  that  in  Table  6,  using  a  session-quality  measure  based  on  21  items,  all  interactions 

between  fraud  risk  factors  and  session  quality  are  not  significant.  Similarly,  the  indicator  variable 

for  session  quality  based  upon  seven  items  positively  moderates  the  relations  between  fraud  risk 

assessments  and  the  response  factors  of  nature  and  staffing  (p-values  <  0.05).  It  is  especially 

promising  that  concentrating  on  the  aforementioned  seven  items  could  improve  the  often  tenuous 

link  between  fraud  risk  assessments  and  the  nature  of  testing.  While  our  results  using  only  the 

seven  items  do  not  provide  exactly  the  same  results  as  those  tabulated  with  the  21 -item  measure, 

it  appears  that  ensuring  these  seven  items  occur  is  a  practical  first  step,  or  set  of  best  practices,  to 

12 

improve  the  quality  of  brainstorming. 

Control  Variables 

With  respect  to  the  direct  effects  of  control  variables  on  fraud  risk  assessment  in  Table  6,  we 
observe  a  negative  relation  between  client  size  and  fraud  risk  assessments.  Consistent  with  Bea¬ 
sley  et  al.  (1999),  who  find  that  fraud  can  be  more  prevalent  at  smaller  companies;  auditors  tend 
to  assess  fraud  risk  higher  for  smaller  clients.  We  find  that  engagement  team  expertise  has  a 
positive  association  with  risk  assessments,  suggesting  that  auditors  with  more  expertise  are  as¬ 
signed  to  riskier  engagements.  We  also  find  a  positive  effect  for  fraud  training  on  risk  assessment. 
Finally,  participants  who  stopped  and  restarted  the  survey  provided  lower  fraud  risk  assessments 
than  other  participants.  Given  this  result,  we  re-perform  our  analyses  of  HI  and  H2  without  the 
participants  who  stopped  and  restarted  and  observe  qualitatively  similar  results. 

The  mean  fraud  risk  assessment  for  our  sample  (4.92  on  a  ten-point  scale)  could  be  consid¬ 
ered  high,  as  the  base  rate  for  fraud  is  fairly  low.  On  the  other  hand,  given  the  relatively  high  rate 
of  fraud  detection  in  our  sample  (24  percent),  a  higher  mean  fraud  risk  assessment  might  be 
expected.  In  fact,  the  mean  fraud  risk  assessment  for  the  subset  of  audits  where  fraud  was  detected 
(n  =  43)  is  a  surprisingly  low  5.44,  and  is  only  marginally  higher  than  the  mean  for  the  non-fraud 
subset  (mean  =  4.75;  p  =  0.09;  non-tabulated  t-test).  Moreover,  fraud  detection  is  not  signifi- 


When  we  perform  the  same  analyses  with  the  remaining  14  items,  there  are  no  significant  results  for  HI  or  H2.  This 
suggests  that  the  seven  items  described  above  are  driving  our  results. 

It  is  unclear  why  these  participants  provided  lower  fraud  risk  assessments.  A  review  and  analysis  of  the  data  for  these 
participants  reveal  no  clear  explanation  of  the  variable’s  significance. 
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cantly  associated  with  fraud  risk  assessment  in  Table  6  (non-tabulated  p  >  0.50).  Thus,  while 
auditors  in  our  sample  were  generally  adept  at  incorporating  fraud  risk  factors  into  fraud  risk 
assessments,  given  ex  post  evidence  of  fraud  detection,  our  results  suggest  that  auditors  still  could 
improve  the  accuracy  of  their  fraud  risk  assessments. 

With  respect  to  fraud  responses.  Table  7  illustrates  that,  as  client  size  increases,  auditors 
appear  to  be  more  likely  to  adjust  the  nature,  staffing,  and  extent  of  testing.  The  larger  audit 
fees/budgets  for  bigger  clients  typically  allow  for  more  testing  when  needed.  Larger  clients  also 
tend  to  be  more  prestigious  and,  if  needed  to  respond  to  fraud  risks,  engagement  leaders  can 
typically  acquire  additional  audit  professionals  for  the  engagement.  The  negative  relation  between 
the  healthcare/pharmaceutical  industry  and  the  nature  of  testing  likely  reflects  that  this  industry 
was  the  only  industry  in  our  sample  not  to  have  fraud  detected  during  the  audit  (possibly  due  to  the 
industry  being  heavily  regulated). 

As  one  would  expect,  as  client  complexity  increases,  audit  teams  tend  to  alter  the  nature  of 
testing  and  use  more  competent  staff.  Of  particular  interest  is  the  positive  link  between  the 
detection  of  fraud  and  the  nature,  staffing,  and  extent  of  procedures.  These  responses  appear  to  be 
integral  to  detecting  fraud,  and  it  could  be  important  for  auditors  to  adjust  their  testing  in  multiple 
ways  to  improve  fraud  detection.  It  is  also  interesting  to  note  that  fraud  detection  is  not  signifi¬ 
cantly  associated  with  the  timing  of  tests.  Consistent  with  this  finding,  we  observe  negative 
relations  between  both  client  experience  and  fraud  experience  and  the  timing  of  testing.  These 
combined  findings  suggest  that  experienced  auditors  likely  question  the  effectiveness  of  changing 
the  timing  of  testing  to  detect  fraud.  However,  this  relation  could  be  industry  specific,  as  we  find 
a  positive  association  between  the  manufacturing  industry  and  the  timing  of  tests.  Given  the  risks 
associated  with  manipulating  ending  inventory,  auditors  of  manufacturing  clients  appear  to  be 
more  apt  to  change  the  timing  of  their  inventory  examinations. 

We  find  that  audit  firms  are  related  to  a  number  of  our  responses.  Firms  have  different  audit 
approaches  or  structures,  and  these  differences  can  lead  to  different  audit  judgments  (Prawitt  1995; 
Hyatt  and  Prawitt  2001).  As  one  would  expect,  higher  levels  of  engagement  team  expertise  and 
industry  experience  are  positively  associated  with  the  difficult  task  of  changing  the  nature  of 
testing.  Last,  we  observe  negative  associations  between  both  audit  experience  and  client  experi¬ 
ence  and  the  extent  of  the  response.  Given  the  positive  relationship  between  fraud  detection  and 
the  extent  of  testing  (i.e.,  evidence  that  changing  the  extent  of  procedures  can  be  an  effective  fraud 
detection  tool),  these  combined  findings  suggest  a  negative  effect  of  longer  auditor  tenure  on  audit 
quality.  Such  relations  could,  to  some  extent,  be  mitigated  by  the  Sarbanes-Oxley  Act  of  2002, 
which  mandates  a  five-year  rotation  for  the  lead  and  concurring  audit  partners  on  publicly  traded 
clients  (U.S.  House  of  Representatives  2002). 

Other  Data  Issues 

As  previously  noted,  fraud  was  detected  on  24  percent  of  the  engagements  in  our  sample,  a 
much  higher  rate  than  reported  by  previous  studies  (e.g.,  Loebbecke  et  al.  1989;  Nieschwietz  et  al. 
2000).  This  high  rate  of  fraud  could  be  due  to  the  fact  that  fraud  cases  were  more  salient  to 
participants  when  they  selected  an  audit  as  the  basis  for  responding  to  our  survey.  We  asked  for 
detailed  information  on  the  types  and  sizes  of  the  frauds  that  were  detected.  However,  few 
participants  chose  to  provide  this  information.  One  respondent  reported  a  fraud  related  to  man¬ 
agement’s  manipulation  of  payroll  reports,  while  another  reported  a  fraud  related  to  payments  to 
a  fictitious  vendor.  In  comparison  to  the  other  industries  in  our  sample,  both  retail  and 
govemmental/not-for-profit  entities  had  unusually  high  rates  of  fraud  detection  (36  percent  and  32 
percent,  respectively).  In  retail,  the  misappropriation  of  retail  inventory,  whether  material  or  not, 
could  have  contributed  to  its  high  rate.  For  governmental  and  not-for-profit  entities,  the  incentive 
to  fraudulently  misstate  financial  statements  is  lower  than  in  other  industries.  Thus,  it  is  possible 
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that  some  frauds  in  our  sample  were  related  to  misappropriations  of  assets  (versus  fraudulent 
financial  reporting)  and  were  likely  immaterial  from  the  auditor’s  perspective. 

Given  the  high  rate  of  fraud  in  our  sample,  we  consider  whether  our  primary  results  hinge  on 
recalled  audits  that  detected  fraud.  We  examine  this  concern  in  two  ways.  First,  we  control  for 
whether  fraud  was  detected  in  all  of  our  analyses.  We  find  support  for  H2  even  though  the  fraud 
detection  control  variable  is  significant  in  several  of  our  analyses  (see  Table  7,  Panels  A,  B,  and 
D).  Second,  in  non-tabulated  analyses,  we  remove  the  43  engagements  where  fraud  was  detected 
and  find  qualitatively  similar  results  (i.e.,  HI  (H2)  is  not  (is)  supported). Thus,  our  results  do  not 
appear  to  be  an  artifact  of  engagements  for  which  fraud  was  detected. 

We  also  examine  whether  our  brainstorming  data  presented  in  Table  3  are  significantly 
skewed  by  these  engagements.  Non-tabulated  t-tests  for  each  of  the  21  items  find  that  none  of  the 
means  are  significantly  different  between  audits  where  fraud  was  or  was  not  detected.  Likewise, 
correlations  between  the  fraud  detection  control  variable  and  the  21 -item  measure  and  indicator 
versions  of  brainstorming  session  quality  are  not  significant  (p-values  >  0.50).  Last,  the  mean 
brainstorming  quality  score  for  the  fraud  sample  (10.56)  is  actually  equal  to  the  mean  for  our 
entire  sample  (10.56)  and  is  not  significantly  different  than  the  mean  for  the  non-fraud  sample 
(10.57;  p  >  0.50).  Thus,  it  appears  that  the  descriptive  brainstorming  data  presented  in  Table  3  is 
not  significantly  skewed  by  our  fraud  observations  and  that,  for  audits  where  fraud  was  detected, 
participants  were  not  biased  toward  providing  responses  reflecting  high-quality  brainstorming 
sessions.  In  summary,  these  additional  analyses  suggest  that  our  results  are  not  primarily  attribut¬ 
able  to  the  audits  in  our  sample  where  fraud  was  detected. 

From  an  association  perspective,  it  might  be  that  audits  with  higher  fraud  risk  assessments 
prompt  auditors  to  conduct  higher  quality  brainstorming  sessions  and  carefully  link  fraud  judg¬ 
ments  as  precautionary  measures.  Consequently,  our  results  might  not  be  reflective  of  audits  with 
lower  fraud  risk  assessments.  In  Table  5,  we  And  that  session  quality  is  only  marginally  correlated 
with  fraud  risk  assessments  (p  =  0.09)  and,  in  a  multivariate  setting  (Table  6),  we  And  that  fraud 
risk  assessments  and  session  quality  are  not  significantly  related  (p  >  0.50).  Still,  we  re-perform 
our  regressions  with  only  audits  where  the  fraud  risk  assessment  ranged  from  2-6  (removing 
engagements  with  the  highest  fraud  risk  assessments  (7-10  or  the  top  quartile  of  fraud  risk 
assessments)).  As  noted  in  Figure  2,  no  respondents  provided  a  fraud  risk  assessment  of  1.  Non- 
tabulated  results  related  to  this  subset  of  less  risky  audits  are  qualitatively  similar  to  those  tabu¬ 
lated.  Thus,  our  results  do  not  appear  to  be  driven  by  the  riskier  audit  engagements  in  our  sample. 

V.  CONCLUSION 

SAS  No.  99  requires  brainstorming  sessions  on  every  audit  (AICPA  2002),  but  the  PCAOB 
has  raised  concerns  about  the  quality  of  these  sessions  in  practice  (PCAOB  2007).  This  is  the  first 
study,  to  our  knowledge,  to  consider  whether  and  how  the  quality  of  brainstorming  influences  team 
decision-making. 

Our  study  makes  several  contributions  to  research  and  practice.  Leveraging  the  field  survey 
method,  we  develop  and  test  a  measure  of  brainstorming  quality  using  data  collected  from  179 
actual  audit  engagements.  We  find  that  fraud  risk  factors  are  positively  correlated  with  fraud  risk 
assessments,  and  we  find  some  evidence  that  brainstorming  quality  moderates  these  relations. 
More  importantly,  we  find  that  when  brainstorming  quality  is  perceived  to  be  higher,  risk  assess¬ 
ments  are  more  positively  related  to  the  nature,  staffing,  timing,  and  extent  of  audit  procedures. 
Achieving  a  high  level  of  brainstorming  quality  appears  to  improve  the  audit  team’s  consideration 


In  tests  of  HI,  excluding  the  43  engagements  where  fraud  was  detected,  all  independent  variables  of  interest  are  not 
significant  (p-values  >  0.50).  In  tests  of  H2,  all  independent  variables  of  interest  are  significant  (p-values  <  0.10). 
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of  fraud  by  cultivating  a  broader  set  of  responses  to  identified  fraud  risks.  However,  we  also 
observe  that  under-auditing  can  occur  when  brainstorming  quality  is  low. 

Prior  and  contemporary  studies  provide  evidence  that  the  extent  that  auditors  benefit  from 
brainstorming  depends  on  the  brainstorming  method  employed  (Carpenter  2007 ;  Hunton  and  Gold 
2010).  Archival  research  on  fraud  has  investigated  data  from  multiple  public  companies,  but  has 
not  examined  auditors’  judgments  in  practice  (e.g.,  Beasley  1996).  Our  results  thus  triangulate 
prior  experimental  and  archival  research  and  extend  the  fraud  literature  by  providing  new  and 
important  insights  on  the  costs  and  benefits  of  fraud  brainstorming  for  actual  audit  teams.  Further, 
the  descriptive  data  contained  herein  further  enhance  our  understanding  of  the  fraud  decision¬ 
making  and  brainstorming  processes.  These  data  should  provide  a  basis  for  further  modeling  of 
brainstorming  and  future  empirical  research  in  team  decision-making. 

While  it  might  seem  obvious  (and  arguably  required  by  SAS  No.  99)  that  high-quality  brain¬ 
storming  sessions  should  exist  in  practice,  our  results  and  recent  PCAOB  findings  indicate  sub¬ 
stantial  variation  in  brainstorming  quality  (PCAOB  2007).  Indeed,  auditors  continue  to  seek  better 
brainstorming  methods  (Landis  et  al.  2008).  Thus,  we  contribute  to  practice  by  providing  auditors 
and  regulators  with  descriptive  data  that  (1)  complement  previous  PCAOB  inspections  and  (2) 
highlight  best  practices  for  auditors  to  improve  their  fraud  judgments  (e.g.,  attendance  of  an  IT 
audit  specialist,  and  occurrence  of  the  session  early  in  planning). 

Our  study  is  subject  to  several  limitations.  Allowing  participants  to  select  their  own  audits 
could  have  biased  their  responses  toward  engagements  that  were  memorable,  such  as  when  fraud 
was  found.  Our  approach  might  have  also  led  participants  to  select  sessions  they  perceived  as  high 
quality  or  to  report  in  such  a  way  as  to  bolster  their  perception  of  having  conducted  a  high-quality 
session.  Our  sample  contains  a  fraud  detection  rate  of  24  percent,  which  is  likely  higher  than  the 
actual  rate  of  fraud  in  the  population  and  is  higher  than  the  rate  reported  in  earlier  studies. 
Unfortunately,  few  participants  reported  details  related  to  the  frauds  they  identified,  so  we  have 
little  understanding  of  their  nature.  Sensitivity  analyses  to  control  for  these  engagements  suggest 
that  reported  results  are  not  driven  by  engagements  on  which  fraud  was  detected  or  fraud  risk  was 
high.  We  also  find  that,  for  audits  where  fraud  was  detected,  participants  were  not  biased  to  report 
higher  quality  brainstorming  sessions. 
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ABSTRACT:  McVay  (2006)  concludes  that  managers  opportunistically  shift  core  ex¬ 
penses  to  special  items  to  inflate  current  core  earnings,  resulting  in  a  positive  relation 
between  unexpected  core  earnings  and  income-decreasing  special  items.  However, 
she  further  notes  that  this  relation  disappears  when  contemporaneous  accruals  are 
dropped  from  the  core  earnings  expectations  model.  McVay  (2006)  calls  for  research  to 
improve  the  core  earnings  expectations  model  and  to  provide  additional  cross-sectional 
tests  of  classification  shifting.  Using  a  core  earnings  expectations  model  that  is  not 
dependent  on  accrual  special  items,  we  show  that  classification  shifting  is  more  likely  in 
the  fourth  quarter  than  in  interim  quarters.  We  also  find  more  evidence  of  classification 
shifting  when  the  ability  of  managers  to  manipulate  accruals  appears  to  be  constrained 
and  in  meeting  a  range  of  earnings  benchmarks.  Overall,  our  evidence  provides  broad 
support  for  McVay’s  (2006)  conclusion  that  managers  engage  in  classification  shifting. 
Our  study  also  sheds  new  understanding  of  the  conditions  under  which  managers  are 
more  likely  to  employ  classification  shifting. 

Keywords:  classification  shifting;  speciai  items;  earnings  thresholds;  accruals  ma¬ 
nipulation  constraints. 
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I.  INTRODUCTION 

McVay  (2006)  hypothesizes  that  managers  have  incentives  to  report  core  expenses  (de¬ 
fined  as  cost  of  goods  sold  and  selling,  general,  and  administrative  expenses)  as  income- 
decreasing  special  items  in  an  attempt  to  inflate  core  profitability.  Managers  are  moti¬ 
vated  to  manage  earnings  in  this  way  because  market  participants  may  be  particularly  interested  in 
core  earnings  (or  pro  forma  earnings)  rather  than  bottom-line  GAAP  earnings  (Bradshaw  and 
Sloan  2002;  Gu  and  Chen  2004;  Kinney  and  Trezevant  1997),  and  core  earnings  typically  receive 
higher  valuation  multiples  than  non-core  earnings  (Lipe  1986).  By  shifting  core  expenses  to 
special  items,  the  firm  increases  core  earnings,  while  bottom-line  net  income  remains  unaffected.^ 
McVay  (2006)  presents  empirical  evidence  supportive  of  classification  shifting  in  the  form  of 
a  positive  relation  between  income-decreasing  special  items  and  unexpected  core  annual  earnings. 
As  income-decreasing  special  items  increase,  the  firm  tends  to  report  higher  than  expected  core 
earnings.  However,  McVay  (2006,  Table  10)  also  reports  that  the  positive  relation  disappears  (and, 
in  fact,  becomes  negative)  when  contemporaneous  accruals  are  dropped  from  the  core  earnings 
expectations  model.  She  further  cautions  that  the  initial  evidence  in  favor  of  classification  shifting 
may  be  attributable  to  the  use  of  contemporaneous  accruals  in  the  core  earnings  expectations 
model.  More  specifically,  the  core  earnings  expectations  model  (which  includes  accruals  as  a 
control  for  performance)  could  induce  a  mechanical  relation  between  unexpected  core  earnings 
and  special  items  (which  are  mainly  accrual-based).  Thus,  there  is  currently  some  uncertainty 
regarding  the  extent  to  which  income-decreasing  special  items  include  shifted  core  expenses.^ 
Research  in  the  area  of  classification  shifting  is  relatively  new,  and  it  is  essential  that  subse¬ 
quent  research  provides  improvements  to  the  core  earnings  expectations  model.  We  first  extend 
McVay’s  (2006)  core  earnings  expectations  model  and  show  that  as  further  controls  for  perfor¬ 
mance  are  included,  the  evidence  in  favor  of  classification  shifting  increases.  We  also  present  a 
series  of  empirical  examinations  of  unexpected  core  earnings  at  the  quarterly  level  that  are  robust 
to  the  underlying  bias  discussed  in  McVay  (2006).  As  research  in  this  area  grows  and  researchers 
develop  fundamental  views,  it  is  important  that  findings  from  initial  studies  are  validated.  Our  tests 
provide  broad  support  for  McVay’s  conclusion  of  classification  shifting  by  managers.^ 

A  central  aspect  of  our  analysis  is  the  use  of  quarterly  rather  than  annual  data.  This  distinction 
allows  us  to  use  the  differential  incentives  to  manipulate  fourth  versus  interim  quarter  earnings  as 
a  platform  for  constructing  tests  of  classification  shifting.  These  examinations  are  inherently  tests 
of  a  joint  hypothesis — that  managers  both  engage  in  classification  shifting  and  are  more  prone  to 
do  so  in  fourth  quarters.  Consequently,  our  results  also  contribute  to  the  literature  on  differential 
incentives/constraints  facing  managers  attempting  to  manipulate  fourth  quarter  earnings.  Using 
quarterly  data,  we  demonstrate  that  classification  shifting  is  more  pronounced  in  the  fourth  quarter 
than  in  interim  quarters. 

We  next  consider  that  classification  shifting  is  one  means  of  earnings  manipulation,  while 
others  exist  (e.g.,  accrual  manipulation).  To  the  extent  that  managers  are  more  constrained  in  their 
ability  to  manipulate  accruals  (Brown  and  Pinello  2007),  they  may  use  classification  shifting  as  an 


'  Unlike  income  classification  shifting,  accrual  manipulation  and  real  activities  management  affect  reported  bottom-line 
GAAP  earnings. 

McVay  (2006)  provides  several  approaches  to  test  for  classification  shifting,  some  of  which  are  not  necessarily  expected 
to  be  affected  by  the  mechanical  relation  between  unexpected  core  earnings  and  (accrual)  special  items.  She  finds  (1) 
subsequent  reversal  of  shifted  earnings  is  more  likely  when  there  is  no  opportunity  to  shift  in  the  subsequent  year,  (2) 
shifting  is  more  likely  when  firms  just  meet  the  analyst  forecast,  especially  for  high-growth  firms,  (3)  shifting  is  more 
likely  when  the  type  of  special  item  is  easier  to  shift,  and  (4)  abnormal  returns  in  the  subsequent  year  tend  to  be 
^  negatively  related  to  shifted  earnings  as  these  earnings  recur  in  core  earnings,  disappointing  investors. 

In  this  study,  we  limit  our  tests  to  the  relation  between  unexpected  core  earnings  and  special  items  in  the  current  quarter. 
We  do  not  examine  the  subsequent  reversal  of  unusually  high  core  earnings  associated  with  special  items. 
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alternative  to  inflate  core  earnings.  Finally,  we  examine  classification  shifting  to  meet  earnings 
thresholds  such  as  analysts’  forecasts,  one-year-ago  same-quarter  earnings,  and  zero  earnings.  We 
do  so,  however,  using  a  core  earnings  expectations  model  that  does  not  rely  on  contemporaneous 
accruals. 

In  the  next  section,  we  discuss  the  background  literature  and  analysis  of  the  classification 
shifting  issue  identified  in  McVay  (2006).  In  Section  III,  we  develop  our  hypotheses.  Section  IV 
details  the  data,  sample,  and  descriptive  statistics.  Section  V  describes  the  research  design  and 
reports  tests  of  hypotheses,  while  Section  VI  concludes  the  study. 

II.  BACKGROUND  LITERATURE  AND  ANALYSIS 

Earnings  management  has  been  the  subject  of  considerable  academic  research.  Evidence  of 
managers  engaging  in  earnings  management  through  accrual  manipulation  has  been  shown  in 
many  different  contexts,  for  many  different  accruals,  and  in  response  to  many  managerial  incen¬ 
tives,  as  reviewed  by  Healy  and  Wahlen  (1999).  A  second  channel  through  which  earnings  could 
be  manipulated  is  real  activities  management,  such  as  providing  temporary  price  discounts  to 
increase  sales,  cutting  discretionary  expenditures  such  as  research  and  development  and  advertis¬ 
ing,  and  overproducing  inventory  to  reduce  cost  of  goods  sold  (e.g.,  Baber  et  al.  1991;  Bushee 
1998;  Gunny  2005;  Roychowdhury  2006;  Cohen  and  Zarowin  2008).  Unlike  accrual  manipula¬ 
tion,  which  involves  potential  accounting  fraud  that  brings  about  litigation  risk  to  the  firm  and/or 
employment  risk  to  the  manager,  real  activities  manipulation  does  not  violate  GAAP  but  does 
sacrifice  firms’  future  economic  benefits. 

McVay  (2006)  investigates  a  third  form  of  earnings  management.  She  tests  whether  managers 
manipulate  core  earnings  by  reclassifiying  ordinary  operating  expenses  as  special  items.  To  the 
extent  that  investors  fixate  on  core  earnings  instead  of  bottom-line  GAAP  earnings,  managers  may 
employ  classification  shifting  to  fool  financial  statement  users  by  manipulating  earnings  numbers 
within  the  income  statement.  This  is  because  financial  statement  users  appear  to  recognize  and 
distinguish  the  closeness  to  sales  of  individual  line  items  and  weigh  them  differently  (e.g.,  Lipe 
1986;  Eairfield  et  al.  1996;  Francis  et  al.  1996).  Various  studies  document  that  analysts  and 
investors  pay  more  attention  to  “street”  or  pro  forma  earnings  as  defined  by  managers  and  view 
them  as  more  value-relevant  (Bradshaw  and  Sloan  2002;  Bhattacharya  et  al.  2004;  Gu  and  Chen 
2004).  Thus,  it  is  likely  that  managers  take  advantage  of  the  market’s  focus  on  core  earnings 
instead  of  bottom-line  GAAP  earnings  to  misclassify  some  core  expense  items  in  the  income 
statement.  McVay  (2006)  finds  that  managers  opportunistically  shift  core  expenses  to  income- 
decreasing  special  items,  and  investors  act  as  if  they  are  surprised  in  the  following  period  when 
these  expenses  shift  back  into  core  earnings  in  the  following  year. 

For  empirical  tests  of  classification  shifting,  firms’  earnings  performance  poses  a  challenge. 
This  occurs  because  the  magnitude  and  frequency  of  income-decreasing  special  items  are  mark¬ 
edly  higher  among  firms  experiencing  poor  performance  (McVay  2006).  Thus,  there  is  a  natural 
negative  relation  between  earnings  performance  and  the  amount  of  income-decreasing  special 
items.  However,  classification  shifting  predicts  a  positive  relation  between  unexpected  core  earn¬ 
ings  and  the  amount  of  income-decreasing  special  items  (i.e.,  as  core  expenses  are  shifted  to 
special  items,  unexpected  core  earnings  and  income-decreasing  special  items  both  increase). 

The  existing  body  of  empirical  evidence  supportive  of  classification  shifting  centers  around 
measures  of  expected  core  earnings,  as  developed  in  McVay  (2006).  A  common  feature  of  these 
models  is  the  inclusion  of  contemporaneous  accruals  in  the  formation  of  expected  core  earnings 
values.  This  inclusion  follows  from  the  DeAngelo  et  al.  (1994)  linkage  of  accruals  to  underlying 
economic  performance.  As  discussed  by  McVay  (2006),  conditioning  core  earnings  expectations 
on  contemporaneous  accruals  is  problematic  because  this  number  includes  accrual  special  items. 
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As  these  same  accrual  special  item  amounts  also  appear  within  the  total  special  item  amount,  a 
situation  is  created  wherein  the  dependent  variable  (i.e.,  unexpected  core  earnings)  that  is  partially 
determined  by  variable  X  (accrual  special  items)  is  regressed  on  a  variable  (income-decreasing 
special  items)  that  is  also  partially  determined  by  variable  X.  This  potentially  creates  a  mechanical 
(positive)  relation  between  the  dependent  variable  and  independent  variable. 

To  illustrate,  the  basic  McVay  design  employs  a  two-stage  regression  procedure.  In  the  first 
stage,  expected  firm-year  core  earnings,  E{CE),  is  derived  as  a  function  of  accruals  in  that  year 
multiplied  by  an  industry-determined  multiplier  The  value  of  this  multiplier  averages  0.22  (see 
McVay  2006,  Table  4).  Consequently,  a  dollar  of  special  item  accrual  expense/loss  reduces  E(CE) 
by  $0.22  on  average.  The  difference  between  actual  core  earnings  and  E(CE)  is  identified  as 
“unexpected  core  earnings”  and  is  regressed  on  special  items  (SI)  in  a  second  stage  analysis  of  the 
form: 

[Core  Earnings,- ^  -  £'(C£,- ,)]  =  ao+  ajSIi  t  +  e,- ( 1 ) 

A  dollar  of  special  item  accrual  expense  does  not  impact  core  earnings  (from  which  special 
items  are  excluded  by  construction)  but,  on  average,  reduces  E{CEiJ),  and  therefore  increases 
[Core  Earnings,-, -^(CEj,)],  by  13^*81^.^  Definitionally,  the  special  item  accrual  also  increases 
Sli  t,  forcing  ai  >  0  in  cases  where,  in  fact,  no  relation  may  exist  between  SI  and  true  unexpected 
core  earnings  magnitudes.^’® 

As  McVay  (2006)  acknowledges  (see  her  Table  10),  when  accruals  are  removed  from  the 
expectations  model,  the  positive  relation  between  unexpected  core  earnings  and  special  items 
becomes  negative.  The  flipped  sign  of  the  relation  could  be  due  to  either  insufficient  control  for 
performance  or  the  possible  bias  described  above.  In  the  tests  detailed  in  the  next  section,  we 
exclude  contemporaneous  accruals  to  avoid  introducing  the  potential  bias  and  attempt  to  better 
control  for  the  performance  effect. 


III.  HYPOTHESES 

We  rely  on  comparative  analyses  of  relative  shifting  activities  between  firm-groups  that  are  a 
priori  more  versus  less  likely  to  engage  in  classification-shifting  activities.  Specifically,  we  use 
quarterly  data,  as  opposed  to  annual  data  in  McVay  (2006),  and  partition  the  sample  based  on 
accrual  manipulation  constraints  and  earnings  thresholds.  To  rule  out  the  possibility  of  spurious 
accrual  special  item  effects  driving  our  results,  we  eliminate  current-period  accruals  from  the  core 
earnings  expectations  model.^  It  is  likely  that  the  overall  relation  between  unexpected  core  earn¬ 
ings  and  special  items  includes  both  classification  shifting  and  a  performance-driven  effect.  How¬ 
ever,  for  certain  firms  and  in  certain  quarters,  classification  shifting  may  be  more  prevalent  be¬ 
cause  of  greater  managerial  incentives  or  opportunities.  Breaking  down  the  data  by  specific 
quarters  and  by  different  levels  of  managerial  incentives  and  opportunities  gives  us  a  better 


This  statement  holds  so  long  as  /3^  is  positive.  Empirically,  this  assumption  is  true  most  of  the  time  (81  5  percent  in 
McVay  [2006]). 

Gu  and  Chen  (2004,  footnote  10)  recognize  that  the  interrelationships  between  non-cash  special  items  and  accruals  can 
create  problems  in  research  designs  that  include  both  variables.  They  show  (in  their  Table  6,  Panel  B)  that  the  persis¬ 
tence  coefficient  of  special  items  changes  sign  after  the  inclusion  of  the  accruals  variable,  which  they  interpret  as  the 
effect  of  noncash  nonrecurring  items  being  included  in  accruals. 

It  is  also  straightforward  to  show  that  in  a  changes  analysis  (i.e.,  unexpected  change  in  core  earnings)  negative  accrual 
special  items  tend  to  increase  expected  change  in  core  earnings  and,  as  a  consequence,  decrease  unexpected  change  in 
core  earnings.  This  produces  a  mechanical  negative  relation  between  unexpected  change  in  core  earnings  and  income- 
decreasing  special  items. 

In  addition,  in  a  sensitivity  test  reported  below,  we  specifically  address  whether  the  relation  between  our  estimate  of 
unexpected  core  earnings  and  income-decreasing  special  items  is  related  to  the  amount  of  contemporaneous  accrual 
special  items.  We  find  no  evidence  that  it  is. 
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understanding  of  whether  and  when  classification  shifting  is  more  likely  to  occur. 

Our  motivation  for  using  quarterly  data  comes  from  prior  literature  that  suggests  that  earnings 
management  activities  are  likely  to  differ  between  the  fourth  quarter  and  the  interim  quarters. 
Brown  and  Pinello  (2007)  specifically  examine  whether  accruals  are  managed  more  in  the  fourth 
quarter  relative  to  interim  quarters.  They  contend  that  since  the  annual  reporting  process  is  subject 
to  a  more  rigorous  audit  compared  to  interim  quarters,  it  is  more  difficult  for  managers  to  ma¬ 
nipulate  accruals  upward  at  the  end  of  the  year.  Instead,  managers  are  more  likely  to  resort  to 
expectations  management  in  the  fourth  quarter  as  a  way  to  meet  analysts’  forecasts.  Their  results 
confirm  these  views.  However,  Brown  and  Pinello  (2007)  conclude  that  the  extent  of  expectations 
management  in  the  fourth  quarter  seems  too  small  to  explain  the  ability  to  meet  fourth  quarter 
earnings  expectations,  given  the  decrease  in  accrual  manipulation  ability.  Our  test  of  classification 
shifting  in  the  fourth  quarter  provides  an  additional  explanation  for  their  puzzling  result.  During 
the  year,  managers  are  more  likely  to  manipulate  accruals  to  meet  analysts’  expectations,  but  at  the 
end  of  the  year,  managers’  ability  to  manipulate  accruals  is  constrained  (and  it  is  more  difficult  to 
manage  expectations).  This  difficulty  leads  managers  to  resort  to  an  alternative  earnings  manage¬ 
ment  technique — classification  shifting. 

Consistent  with  the  views  expressed  by  Brown  and  Pinello  (2007),  Cornell  and  Landsman 
(1989)  find  that  both  price  reactions  and  analysts’  revisions  are  greater  around  earnings  announce¬ 
ment  in  the  fourth  quarter  than  in  interim  quarters.  They  conclude  that  audited  annual  announce¬ 
ments  are  more  informative.  Prior  research  also  provides  evidence  that  the  frequency  and  magni¬ 
tudes  of  special  items  are  significantly  greater  in  the  fourth  quarter  (Burgstahler  et  al.  2002; 
Kinney  and  Trezevant  1997).*  We  conjecture  that  these  results  indicate  that  not  only  do  managers 
have  more  incentives  to  employ  classification  shifting  in  the  fourth  quarter,  but  they  also  have 
greater  opportunity  to  camouflage  core  expenses  with  other  (true)  special  items.  This  discussion 
leads  to  our  first  hypothesis; 

HI:  Managers  shift  core  expenses  to  special  items  more  in  the  fourth  quarter. 

We  next  consider  that  managers’  propensity  to  shift  core  expenses  in  the  current  quarter  may 
be  affected  by  previous  accruals  manipulation  activity.  To  the  extent  that  managers  have  already 
managed  accruals  upward  in  previous  quarters,  the  ability  to  further  manage  accruals  upward  in 
the  current  quarter  is  constrained.  As  a  result,  earnings  management  through  income  classification 
shifting  may  provide  the  better  (or  only)  alternative.  Barton  and  Simko  (2002)  measure  firms’ 
previous  earnings  management  using  the  beginning  I6vel  of  net  operating  assets  (scaled  by  sales). ^ 
They  find  that  firms  with  higher  net  operating  assets  (i.e.,  more  optimistic  reporting  in  prior 
periods)  are  less  likely  to  report  earnings  that  just  meet  or  beat  analysts’  forecasts  (see  Barton  and 
Simko  2002,  Table  5,  Panel  B).  Related  to  our  study,  to  the  extent  that  accruals  manipulation  is 
constrained,  managers  would  be  more  likely  to  inflate  core  earnings  through  classification  shifting. 
We  predict  this  relation  as  our  second  hypothesis: 


®  We  find  similar  results  for  our  sample  of  firms.  The  average  magnitude  of  income-decreasing  special  items  (scaled  by 
sales)  in  the  fourth  quarter  is  5.36  percent,  compared  to  only  1.64  percent  in  the  first  three  quarters.  We  also  find  that 
income-decreasing  special  items  are  reported  for  31.7  percent  of  the  fourth  quarter  observations,  compared  to  only  15.6 
percent  of  the  observations  for  the  first  three  quarters. 

®  The  intuition  behind  their  use  of  net  operating  assets  is  that  the  balance  sheet  accumulates  the  effects  of  previous 
accounting  choices.  Therefore,  firms  with  higher  than  expected  net  operating  assets  are  more  likely  to  have  reported 
optimistic  earnings  in  the  past.  To  validate  their  measure.  Barton  and  Simko  (2002)  report  that  firms  with  larger  net 
operating  assets  at  the  beginning  of  the  current  year  reported  more  positive  abnormal  accruals  in  the  previous  20 
quarters.  Using  Dechow  et  al.’s  (2008)  Fraud  Score,  Omer  et  al.  (2008)  also  find  that  the  level  of  net  operating  assets 
provides  a  measure  of  prior  earnings  management. 
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H2:  Managers  shift  core  expenses  to  special  items  more  when  the  ability  to  manipulate 
accruals  is  constrained. 

Finally,  we  examine  the  extent  to  which  managers  use  classification  shifting  as  a  way  to  meet 
three  earnings  benchmarks:  analysts’  forecasts,  earnings  of  the  same  quarter  in  the  prior  year,  and 
profit.  In  general,  prior  studies  find  evidence  consistent  with  earnings  management  at  each  thresh¬ 
old  (e.g.,  Burgstahler  and  Dichev  1997a;  Degeorge  et  al.  1999;  Payne  and  Thomas  2003;  Brown 
and  Caylor  2005).  There  are  clear  market  incentives  for  meeting  versus  missing  the  analyst 
forecast  threshold  (e.g.,  Kasznik  and  McNichols  2002).  Dechow  and  Skinner  (2000)  note  that 
earnings  management  is  greater  when  such  actions  allow  managers  to  meet  or  beat  analysts’ 
forecasts  than  when  they  otherwise  would  not.  On  a  quarterly  basis,  Bartov  et  al.  (2002)  find  that 
firms  meeting  or  beating  analysts’  earnings  expectations  enjoy  a  higher  return  over  the  quarter  than 
firms  that  fail  to  meet  these  expectations.  They  find  that  the  equity  premium  to  firms  meeting  or 
beating  expectations  is  only  marginally  affected  by  whether  reported  quarterly  earnings  are  genu¬ 
ine  or  are  the  result  of  earnings  or  expectations  management.  Therefore,  the  ex  post  equity  pre¬ 
mium  following  quarterly  earnings  announcements  provides  an  ex  ante  incentive  for  managers  to 
manipulate  earnings  numbers. 

Other  studies  document  that  analysts  and  investors  pay  more  attention  to  “street”  or  pro  forma 
earnings  as  defined  by  managers  and  view  them  as  more  value-relevant  (Bradshaw  and  Sloan 
2002;  Bhattacharya  et  al.  2004;  Gu  and  Chen  2004).  Bradshaw  and  Sloan  (2002)  show  that  the 
market  responds  to  street  earnings  more  than  traditional  GAAP  earnings  and  that  managers  have 
taken  a  proactive  role  in  defining  street  earnings  when  communicating  to  investors.  Analysts  also 
tend  to  exclude  special  items  from  their  forecasts.  Thus,  it  is  likely  that  managers  take  advantage 
of  the  market’s  focus  on  core  earnings  instead  of  bottom-line  GAAP  earnings  to  shift  some  core 
expense  items  in  the  income  statement. 

For  similar  reasons,  managers  may  use  classification  shifting  to  report  quarterly  earnings  that 
meet  or  just  beat  earnings  of  the  same  quarter  in  the  prior  year  or  that  just  meet  or  beat  zero 
earnings.  This  could  occur  because  managers  have  incentives  (e.g.,  stock  premium,  bonuses, 
reputation,  job  security)  to  avoid  quarterly  earnings  decreases  and  losses.  In  summary,  we  expect 
that  managers  shift  core  expenses  to  income-decreasing  special  items  more  when  quarterly  earn¬ 
ings  of  their  firms  just  meet  or  beat  (1)  the  analysts’  consensus  forecasts,  (2)  earnings  of  the  same 
quarter  in  the  prior  year,  or  (3)  zero  earnings.  Our  hypothesis  related  to  classification  shifting 
around  earnings  benchmarks  follows: 

H3:  Managers  shift  core  expenses  to  special  items  more  when  quarterly  earnings  just  meet  or 
beat  earnings  benchmarks. 

Degeorge  et  al.  (1999)  suggest  that  managers  first  attempt  to  report  a  profit,  then  to  report 
earnings  increases  and,  last,  they  are  concerned  with  meeting  or  beating  analysts’  forecasts.  Payne 
and  Thomas  (2003)  find  evidence  consistent  with  earnings  management  mostly  for  the  zero  earn¬ 
ings  threshold  followed  by  the  analyst  forecast  threshold.  In  contrast,  Graham  et  al.  (2005)  report 
that  managers  most  prefer  to  avoid  reporting  an  earnings  decrease.  Brown  and  Caylor  (2005)  show 
that  achieving  the  analyst  forecast  threshold  has  become  the  most  important  over  time.  Because 
prior  research  provides  mixed  conclusions,  we  have  no  a  priori  expectations  on  which  of  the  three 
thresholds  should  show  the  most  evidence  of  classification  shifting. 
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IV.  DATA,  SAMPLE  SELECTION,  AND  DESCRIPTIVE  STATISTICS 

We  collect  data  for  the  years  1988  to  2007  from  Compustat  Industrial  Quarterly  File.  Ana¬ 
lysts’  forecast  data  are  obtained  from  the  I/B/E/S  Detail  File.  Each  firm-quarter  observation  is 
required  to  have  sufficient  data  to  calculate  variables  in  model  (2)  and  model  (3),  as  discussed 
below.  Following  McVay  (2006),  we  eliminate  firm-quarter  observations  that  have  annual  sales  of 
less  than  $1  million.  This  allows  our  sample  to  be  as  comparable  to  McVay’s  (2006)  as  possible. 
In  addition,  eliminating  these  firms  reduces  the  extreme  impact  that  small  firms  can  have  on  the 
results.  We  also  exclude  firms  that  had  a  change  in  fiscal  year,  to  ensure  that  quarterly  data  are 
comparable  across  years.  Further,  requiring  a  minimum  of  15  observations  per  industry-year- 
quarter  ensures  a  sufficiently  large  sample  to  estimate  expected  core  earnings.  Industry  classifica¬ 
tions  are  based  on  Fama  and  French  (1997).  The  full  sample  consists  of  132,393  firm-quarter 
observations.  The  subsample  with  available  analysts’  forecasts  has  67,980  firm-quarters. 

Table  1  lists  the  definitions  of  variables  used  in  the  analyses,  and  Table  2  provides  descriptive 
statistics  for  these  variables.  The  mean  (median)  of  core  earnings  for  all  firm-quarters,  scaled  by 
sales,  is  0.038  (0.085).  The  mean  (median)  of  special  items  as  a  percentage  of  sales  is  2.59  (0.0) 
percent.  Income-increasing  special  items  are  set  to  zero  and  are  not  included  in  the  analyses.  The 
mean  (median)  of  unexpected  core  earnings  deflated  by  sales  is  0.000  (0.005).  The  calculation  of 
unexpected  core  earnings  is  discussed  in  detail  in  the  following  section.  Table  3  presents  correla¬ 
tions  among  the  main  variables. 


TABLE  1 

Variable  Definitions  with  Corresponding  Compustat  Data  Item  Numbers 
Variable  Definition 


CE„ 


=  Core  Earnings,  calculated  as  sales  (#2)  minus  cost  of  goods  sold  (#30)  minus  selling, 
general,  and  administrative  expenses  (#1)  in  quarter  q. 


UE_CEg  —  Unexpected  Core  Earnings,  calculated  as  the  reported  core  earnings  (CE^)  minus  the 

expected  core  earnings  estimated  by  industry-year-quarter,  excluding  firm  i: 

CEg  =  ySo  +  /SiC£g_4  +  /32C£^_i  +  Id-iATO^  +  l3^ACCRUALSq_^  +  ^^CCRUALS^.^ 
+  fi(,ASALESg  +  lfNEG_ASALESg  +  fd^RETURNSg.^  +  /BgRETURNSg  +  Sg. 


%SIg 

ACCRUALSg 

RETURNSg 

ATOg 


ASALESg 

NEG_ASALESg 


Special  Items  (#32)  as  a  percentage  of  sales  (#2).  Income-decreasing  special  items  are 
multiplied  by  —  1 ,  where  special  items  are  income-decreasing,  and  are  set  to  0  where 
special  items  are  income-increasing. 

Accruals,  calculated  as  net  income  before  extraordinary  items  (#8)  minus  cash  from 
operations  (#108). 

Three-month  market-adjusted  return  corresponding  to  the  fiscal  quarter. 

Asset  Turnover  Ratio,  defined  as  SALESg  /  {{NOAg  +  NOAg_i)  /  2),  where  NOAg,  or  net 
operating  assets,  is  operating  assets  minus  operating  liabilities.  Operating  assets  are 
calculated  as  total  assets  (#44)  less  cash  and  short-term  investments  (#36).  Operating 
liabilities  are  calculated  as  total  assets  (#44)  less  total  debt  (#45  and  #51),  less  book 
value  of  common  and  preferred  equity  (#55  and  #56),  less  minority  interest  (#53). 
Average  NOA  is  required  to  be  positive. 

Percentage  Change  in  Sales,  calculated  as  {SALESg  -  SALES q_f) !  SALESg_^. 

ASALESg  if  the  percentage  change  in  sales  is  less  than  0,  and  0  otherwise. 
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TABLE  2 


Descriptive  Statistics 


Variable 

Mean 

Median 

Standard 

Deviation 

25% 

75% 

CE, 

0.038 

0.085 

0.346 

0.023 

0.154 

UE_CEq 

0.000 

0.005 

0.189 

-0.041 

0.056 

%SI, 

2.59% 

0.00% 

0.112 

0.00% 

0.00% 

CE,_4 

0.025 

0.086 

0.391 

0.023 

0.154 

CE,_i 

0.033 

0.084 

0.351 

0.021 

0.155 

ATO, 

2.873 

1.010 

6.885 

0.459 

2.259 

ACCRUALS,_4 

-0.093 

-0.066 

0.541 

-0.241 

0.063 

ACCRUALS,.^ 

-0.111 

-0.067 

0.519 

-0.248 

0.056 

ASALES, 

19.26% 

8.54% 

0.539 

-4.39% 

26.56% 

RETURNSq^i 

0.002 

-0.034 

0.331 

-0.183 

0.126 

RETURNS, 

0.001 

-0.033 

0.332 

-0.184 

0.126 

See  Table  1  for  variable  definitions.  The  full  sample  consists  of  132,393  firm-quarter  observations.  The  sample  with 
analysts’  forecast  data  available  has  67,980  firm-quarter  observations.  All  variables  are  winsorized  at  1st  and  99th 
percentile. 


V.  RESEARCH  DESIGN  AND  RESULTS 
Measuring  Unexpected  Core  Earnings 

To  measure  expected  core  earnings  for  firm  i,  we  estimate  model  (2)  within  each  industry- 
year-quarter  excluding  firm 


CEq  =  y8o  +  P\CEq_^  -f  +  P-iATOq  +  l3^ACCRUALSq_^  -t-  p^CCRUALS^_\ 

+  /3e^SALESg  +  f3jNEG_ASALES^  +  fisRETURNS^_i  -t-  ISgRETURNS^  +  .  (2) 

Model  (2)  is  a  variant  of  the  model  used  by  McVay  (2006).  The  intent  of  the  core  earnings 
expectations  model  is  to  capture  the  natural  relation  between  core  earnings  (CE)  and  firm  perfor¬ 
mance,  leaving  the  unexplained  portion  (i.e.,  unexpected  core  earnings)  as  a  measure  of  abnormal 
performance.^*  Lagged  core  earnings  are  included  because  core  earnings  are  persistent.*^  We 
include  core  earnings  from  the  last  quarter  and  four  quarters  ago.  The  last  quarter  is  closer  to  the 
current  quarter,  so  it  better  captures  firms’  current  economic  environments.  Thus,  it  may  provide  a 
better  indication  of  normal  core  earnings  for  the  current  quarter.  We  also  include  core  earnings 
four  quarters  ago  because  quarterly  earnings  exhibit  a  seasonal  pattern  for  many  firms,  such  that 
earnings  of  the  same  quarter  one  year  ago  may  provide  a  better  control  for  current  performance. 


Variables  are  defined  in  Table  1 . 

"  Reported  core  earnings  are  calculated  as  sales  minus  cost  of  goods  sold  and  selling,  general,  and  administrative 
expenses.  As  a  sensitivity  test,  we  also  consider  using  I/B/E/S  actual  quarterly  earnings  in  place  of  Compustat  core 
quarterly  earnings.  While  the  use  of  I/B/E/S  data  substantially  reduces  our  sample,  we  continue  to  find  support  for  our 
hypotheses.  So  that  our  results  are  directly  comparable  to  those  in  McVay  (2006),  we  employ  Compustat  core  earnings 
as  our  primary  analysis. 

In  Table  3,  the  Spearman  (Pearson)  correlation  between  CE^  and  is  0.7902  (0.7351).  The  Spearman  (Pearson) 
correlation  between  CE^  and  CE^_^  is  0.7357  (0.6530). 
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TABLE  3 

Spearman/Pearson  Correlation  Matrix 


CEq 

CEq i 

CEq^ 

ACCRUALSq i 

ACCRUALSq_4 

ATOq 

CEq 

1.0000 

0.7351 

0.6530 

-0.2750 

-0.2806 

0.0308 

CEq_i 

0.7902 

1.0000 

0.6446 

-0.2084 

-0.3043 

0.0295 

CEg_4 

0.7357 

0.6999 

1.0000 

-0.3019 

-0.2415 

0.0135 

ACCRUALSq_i 

-0.2947 

-0.2654 

-0.3129 

1.0000 

0.3171 

0.0381 

ACCRUALSq_4 

-0.3055 

-0.3441 

-0.2722 

0.0342 

1.0000 

0.0238 

ATOq 

-0.0554 

-0.0558 

-0.0921 

0.1460 

0.1039 

1.0000 

RETURNS^ 

0.1748 

0.1503 

0.0675 

-0.0396 

-0.0361 

0.0789 

RETURNSq.^ 

0.1627 

0.1678 

0.0764 

-0.0166 

-0.0538 

0.0819 

ASALESq 

0.2474 

0.1928 

0.0393 

0.0800 

-0.0094 

0.1573 

NEG_ASALESq 

-0.2621 

-0.2150 

-0.0900 

-0.0486 

0.0244 

-0.1836 

UE_CEq 

0.2392 

-0.0089 

0.0151 

-0.0121 

-0.0046 

-0.0346 

%SIq 

-0.0616 

-0.0327 

-0.0163 

-0.0339 

-0.0685 

-0.1000 

RETURNSq 

RETURNSq_i 

ASALESq 

NEG_ASALESq 

UEjCEq 

%SIq 

CEq 

0.0916 

0.0708 

0.0847 

-0.2010 

0.4476 

-0.2293 

CEq_i 

0.0678 

0.0846 

0.0011 

-0.1468 

0.0227 

-0.1580 

CEq_, 

0.0264 

0.0274 

-0.2388 

-0.0139 

0.0099 

-0.1493 

ACCRUALSq_^ 

-0.0073 

0.0033 

0.0650 

-0.0411 

-0.0020 

0.0156 

ACCRUALSq_4 

-0.0104 

-0.0232 

0.0229 

0.0132 

-0.0068 

0.0112 

ATOq 

0.0242 

0.0284 

0.0488 

-0.0764 

-0.0077 

-0.0434 

RETURNSq 

1.0000 

-0.0138 

0.0688 

-0.0915 

-0.0067 

-0.0755 

RETURNS  q_i 

0.0147 

1.0000 

0.0896 

-0.1000 

-0.0052 

-0.0601 

ASALESq 

0.1262 

0.1421 

1.0000 

-0.4780 

-0.0130 

-0.0474 

NEG_ASNLESq 

-0.1277 

-0.1356 

-0.8093 

1.0000 

0.0014 

0.1107 

UE  CEq 

-0.0277 

-0.0115 

-0.0972 

0.0936 

1.0000 

-0.0942 

%SIq 

-0.0740 

-0.0559 

-0.0961 

0.0951 

-0.0119 

1.0000 

Variable  definitions  are  in  Table  1.  There  are  132,393  firm-quarter  observations.  Spearman  (Pearson)  correlations  are  below 
(above)  the  diagonal.  All  variables  are  winsorized  at  1st  and  99th  percentile.  Amounts  in  bold  are  significant  at  the  0.01 
level. 


Since  asset  turnover  {ATO)  tends  to  be  inversely  related  to  profit  margins  (Nissim  and  Penman 
2001)  and  the  definition  of  core  earnings  closely  parallels  profit  margins,  we  include  ATO  in  the 
model.  Sloan  (1996)  finds  that  accrual  levels  are  an  explanatory  variable  for  future  performance, 
so  we  use  lagged  operating  accruals  {ACCRUALS q_i  and  ACCRUALS q_f)  as  a  control  for  current 
operating  performance.  For  reasons  discussed  earlier,  we  do  not  include  current-period  accruals. 
Also  similar  to  McVay  (2006),  we  include  percentage  changes  in  sales  {AS ALES)  and  allow 
different  slopes  for  sales  increases  and  decreases  {NEC).  The  reason  for  this  inclusion  is  that,  even 
though  core  earnings  are  scaled  by  sales,  as  sales  grow,  fixed  costs  become  smaller  per  sales 
dollar.  Those  costs  increase  more  when  activity  rises  than  they  decrease  when  activity  falls  by  an 
equivalent  amount  (Anderson  et  al.  2003). 

In  addition  to  the  variables  proposed  by  McVay  (2006),  we  also  add  the  current  quarter’s 
returns  {RETURNS)  and  the  previous  quarter’s  returns  to  the  model.  The  use  of  current  quarter’s 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1312 


Fan,  Barua,  Cready,  and  Thomas 


returns  controls  for  current  performance.  The  reason  for  including  prior-period  returns  is  that  the 
market  may  detect  deteriorating  performance  and  decrease  its  expectations  of  core  earnings  prior 
to  it  being  reported  in  the  current  quarter. 

We  obtain  coefficients  for  model  (2)  using  all  observations  in  a  particular  industry-year- 
quarter  excluding  firm  i.  Expected  core  earnings  for  firm  i  are  measured  using  these  coefficients 
multiplied  by  the  actual  values  of  the  variables  in  the  model  for  firm  i.  This  model  controls  for  the 
macroeconomic  and  industry  shocks  as  well  as  the  seasonal  effect  on  expected  core  earnings.  We 
then  obtain  unexpected  core  earnings,  calculated  as  the  difference  between  reported  and  expected 
core  earnings: 

UE_CE^  =  CE^-E{CE^).  (3) 

We  then  estimate  the  relation  between  UEjCE  and  %S1'. 

UE_CEq  -  aQ+  aj%SIq  +  s^,  (4) 

where  %SIq  is  special  items  as  a  percentage  of  sales  in  the  current  quarter.  Income-decreasing 
special  items  are  special  items  multiplied  by  —1,  where  special  items  are  income-decreasing. 
Income-increasing  special  items  are  set  to  zero  (McVay  2006). 

If  the  estimated  coefficient  on  %SIq  (aj)  is  positive,  then  classification  shifting  has  a  domi¬ 
nating  effect  (i.e.,  as  income-decreasing  special  items  increase,  unexpected  core  earnings  in¬ 
crease).  In  other  words,  managers  shift  core  expenses  to  special  items  such  that  reported  core 
earnings  would  be  above  expectations.  If  the  estimated  coefficient  is  negative,  then  firm  perfor¬ 
mance  would  be  the  dominating  effect.  In  other  words,  when  income-decreasing  special  items 
increase,  unexpected  core  earnings  actually  decrease.  This  would  be  consistent  with  prior  research 
findings  that  firms  incurring  large  write-offs  or  corporate  restructuring  charges  tend  to  be  poor 
performers  (Elliott  and  Shaw  1988;  DeAngelo  et  al.  1994;  Carter  2000). 

To  demonstrate  how  controlling  for  performance  in  model  (2)  affects  the  relation  between 
UE_CE  and  %SI,  we  start  with  a  model  that  provides  no  control  for  performance  and  then 
progressively  add  more  controls.  Eirst,  we  set  UE_CEq  equal  to  CE^.  This  is  the  same  as  setting 
E{CE)  equal  to  0  and  providing  no  control  for  performance.  As  shown  in  the  first  column  of  results 
in  Table  4,  the  coefficient  of  %S1  is  —0.7099  (t  =  —85.71;  p  <  0.01).  Recall  that  classification 
shifting  suggests  a  positive  relation  between  UEjCE  and  %SI,  whereas  a  performance  effect 
would  predict  that  income-decreasing  special  items  are  a  sign  of  poor  performance  (i.e.,  a  negative 
relation  between  UEjCE  and  %SI).  Thus,  as  expected  we  find  strong  evidence  of  a  performance- 
driven  relation  between  core  earnings  and  income-decreasing  special  items. 

In  the  second  column  of  Table  4,  we  use  a  core  earnings  expectations  model  that  relies  on  the 
assumption  that  expected  core  earnings  in  the  current  quarter  are  a  function  of  only  core  earnings 
in  the  same  quarter  one  year  ago. 

CEq  =  Po  +  PlCEq_4  +  Eq.  (5) 

Model  (5)  is  estimated  within  each  industry-year-quarter  excluding  firm  /.  Coefficient  esti¬ 
mates,  along  with  lagged  core  earnings,  are  used  to  calculate  unexpected  core  earnings  for  firm  i. 
We  then  estimate  the  relation  between  this  measure  of  unexpected  core  earnings  and  %SI.  The 
coefficient  on  %SI  decreases  in  magnitude  to  —0.3531  (t  =  —60.28;  p  <  0.01). 


Consistent  with  McVay  (2006),  we  identify  £(C£,)  as  “expected”  and  UE_CE^  as  “unexpected”  core  earnings.  Since 
EiCE^)  is  partially  determined  by  contemporaneous  independent  variables,  these  two  variables  are  not,  in  fact,  ex  ante 
in  nature  and  so  constitute  ex  post  measures  of  normal  and  abnormal  core  earnings. 
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TABLE  4 

Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percentage  of  Sales  Using 
Alternative  Measures  of  Expected  Core  Earnings 


Dependent  Variable  = 

UEjCEq  =  CEq  -  E(CEg) 

EiCEq)  =  0 

E(CEq)  =f(CEq 4) 

E(CEq)  =/(McVay 
model) 

E(CEq)  =/(full 
model) 

Intercept 

0.0564 

0.0091 

0.0063 

0.0041 

(59.25) 

(13.57) 

(10.82) 

(6.67) 

%SIq 

-0.7099 

-0.3531 

-0.2630 

-0.1586 

(-85.71) 

(-60.28) 

(-48.06) 

(-16.38) 

R2 

5.26% 

2.67% 

1.71% 

0.89% 

Variable  definitions  are  in  Table  1.  The  full  sample  consists  of  132,393  firm-quarter  observations. 

In  column  1,  unexpected  core  earnings  are  defined  as  core  earnings  in  quarter  q. 

In  column  2,  unexpected  core  earnings  are  defined  as  core  earnings  minus  expected  core  earnings,  where  expected  core 
earnings  are  calculated  using  the  coefficients  from  the  model  shown  below,  estimated  for  each  industry-year-quarter, 
excluding  firm  i: 

CEg  =  I3q  +  Pi  . 

In  column  3,  unexpected  core  earnings  are  defined  as  core  earnings  minus  expected  core  earnings,  where  expected  core 
earnings  are  calculated  using  the  coefficients  from  the  model  shown  below,  estimated  for  each  industry-year-quarter, 
excluding  firm  i: 

CE^  =  ySo  -I-  /3iC£^_4  +  +  /3yACCRUALS^_^  +  ji^SSALES^  +  /3sNEG_SSALES^  +  e^. 

In  column  4,  unexpected  core  earnings  are  defined  as  core  earnings  minus  expected  core  earnings,  where  expected  core 
earnings  are  calculated  using  the  coefficients  from  the  model  shown  below,  estimated  for  each  industry-year-quarter, 
excluding  firm  i: 

CE^  =  /3o  +  l3iCE^_4  +  fiCE^-i  +  P-i^TO^  +  jS^ACCRUALS^^^  +  fip^CCRUALS^^i  -i-  /3(,ASALES^  +  /3^NEG_^SALES^ 

-I-  fgRETURNS^.i  +  fgRETURNS^  +  e^. 


All  variables  are  winsorized  at  the  1st  and  99th  percentile.  Amounts  reported  are  regression  coefficients  (with  t-statistics  in 
parentheses). 


Next,  we  use  a  core  earnings  expectations  model  that  approximates  the  one  used  in  McVay 
(2006),  except  that  it  excludes  current-period  accruals  and  is  estimated  using  quarterly  data. 

CE^  =  ySo  +  (3jCEq_4  +  f32ATOq  +  /33ACCRUALSq_4  +  /34ASALESg  +  ft5NEG_ASALESq  +  s^. 

(6) 

As  shown  in  the  third  column  of  Table  4,  the  coefficient  on  %SI  decreases  in  magnitude  to 
-0.263  (t  =  -48.06;  p  <  0.01). 

In  the  final  column,  we  report  results  after  using  the  full  core  earnings  expectations  model  as 
specified  in  model  (2).  The  estimated  coefficient  on  %SIq  is  —0.1586  (t  =  —16.38;  p  <  0.01), 
indicating  a  strong  performance-dominated  relation.  As  the  results  in  Table  4  clearly  demonstrate. 
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current  performance  and  special  items  have  a  strong  negative  relation.  However,  as  one  provides 
additional  controls  for  performance,  the  performance  effect  diminishes. 

Tests  of  Hypotheses 
Test  of  HI 

The  first  hypothesis  predicts  that  shifting  is  more  pervasive  in  the  fourth  quarter.  As  discussed 
in  Section  II,  prior  research  provides  evidence  that  accrual  manipulation  is  more  constrained  in  the 
fourth  quarter,  leaving  managers  to  resort  to  alternative  earnings  management  techniques  such  as 
income  classification  shifting.  Thus,  we  expect  that  the  relation  between  unexpected  core  earnings 
and  special  items  will  be  more  positive  (or  less  negative)  for  fourth  quarter  observations,  com¬ 
pared  to  observations  for  the  first  three  quarters. 

Table  5  presents  the  results  using  subsamples  with  observations  from  the  fourth  quarter  and 
those  from  the  first  three  quarters.  The  results  show  a  less  negative  coefficient  for  the  fourth 
quarter  subsample  compared  to  the  subsample  for  the  first  three  quarters.^®  Recall  that  a  more 
positive  (or  less  negative)  relation  between  unexpected  core  earnings  and  special  items  is  evidence 
of  more  classification  shifting.  Comparing  the  coefficients  of  —0.1440  for  the  fourth  quarter  and 
—0.1819  for  the  first  three  quarters  indicates  that  classification  shifting  is  more  pervasive  in  the 
fourth  quarter.  The  difference  between  the  two  estimated  coefficients  (0.0379)  is  statistically 
significant  (t  =  4.05;  p  <  0.01).  This  result  provides  support  for  HI  that  classification  shifting  is 
more  pervasive  in  the  fourth  quarter,  even  though  the  performance-driven  effect  is  still  the  domi¬ 
nating  factor. 

Test  of  H2 

Our  next  hypothesis  (H2)  suggests  that  those  managers  who  are  more  constrained  in  their 
ability  to  manipulate  accruals  because  of  previous  accrual  management  activity  are  more  likely  to 
engage  in  classification  shifting  in  the  current  quarter.  To  the  extent  that  one  earnings  management 
option  is  not  available,  managers  are  likely  to  implement  the  next.  Following  Barton  and  Simko 
(2002),  we  measure  managers’  accrual  manipulation  constraint  using  net  operating  assets  at  the 
beginning  of  the  quarter  (scaled  by  sales).  They  demonstrate  that  firms  that  have  optimistically 
reported  accruals  in  prior  periods  tend  to  have  higher  net  operating  assets.  We  classify  firms  in  the 
high  (low)  group  when  their  net  operating  assets  are  above  or  equal  to  (below)  the  median  for  the 


To  find  direct  evidence  of  classification  shifting,  ideally  one  would  find  a  positive  relation  between  UEjCE  and  %SI. 
Since  we  do  not  find  this  for  the  full  version  of  the  model  but  our  results  and  anecdotal  evidence  suggests  that  it  is 
occurring,  this  provides  a  fruitful  avenue  for  future  research.  As  the  literature  advances  and  expectations  models  better 
control  for  the  performance  effect,  the  relation  between  UE_CE  and  %SI  is  expected  to  become  positive. 

To  determine  whether  the  relation  between  our  estimate  of  unexpected  core  earnings  from  Model  (2)  and  income- 
decreasing  special  items  is  mechanically  driven  by  contemporaneous  accrual  special  items,  we  estimate  accrual  special 
items  as  other  funds  from  operations  (#81)  plus  gains  and  losses  on  the  sale  of  property,  plant,  and  equipment  and 
investments  (#102).  We  split  the  sample  into  those  with  estimated  accrual  special  items,  scaled  by  sales,  greater  (less) 
than  0.5  percent  (—0.5  percent).  We  also  create  a  third  sample  with  estimated  accrual  special  items  between  0.5  percent 
and  —0.5  percent.  We  find  the  relations  between  UEjCE  and  %Sl  across  these  three  groups  do  not  differ  significantly. 
For  estimated  accrual  special  items  less  than  —0.5  percent  (greater  than  0.5  percent)  [between  —0.5  percent  and  0.5 
percent],  the  coefficient  on  %SI  (i.e.,  a,  in  Model  (4))  is  -0.1542  (-0.1506)  [-0.1736]. 

In  Table  5  and  all  remaining  tables,  differences  in  coefficients  between  Q4  and  Ql-3  are  tested  using  an  indicator 
variable  for  fourth  quarter  observations.  For  example,  for  Table  5  we  estimate  the  following  model:  UE_CE^  =  Ug 
+  aj%Sf  +  a2Q4  +  a^TcSf  *  Q4  +  e^,  where  the  prediction  is  >  0.  In  the  tables,  we  present  results  separately  for  Q4 
and  Ql-3  to  make  the  results  for  each  subsample  more  evident. 

The  increased  evidence  of  classification  shifting  in  the  fourth  quarter  might  be  a  result  of  increased  opportunity  (i.e., 
firms  report  more  special  items  in  the  fourth  quarter)  or  a  result  of  increased  incentives.  However,  either  explanation 
supports  HI  that  classification  shifting  is  more  likely  in  the  fourth  quarter. 
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TABLE  5 

Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percent  of  Sales: 
Fourth  Quarter  (Q4)  versus  First  Three  Quarters  (Ql-3) 


Independent  Variable 


Q4 


Ql-3 


Test  (HI):  Q4 
>  Ql-3 


Intercept 


0.0070 

(5.41)** 

-0.1440 


0.0032 

(5.72)** 

-0.1819 


%Sf 


0.0379 

(4.05)** 


R2 

Number  of  Observations 


(-19.12)** 

1.08% 

33,454 


(-28.54)** 

0.82% 

98,939 


*,  **  Indicates  significant  at  the  0.05  and  0.01  levels,  respectively,  using  a  two-tailed  test. 

Variable  definitions  are  in  Table  1.  All  variables  are  winsorized  at  the  1st  and  99th  percentile.  Amounts  reported  are 
regression  coefficients  (with  t-statistics  in  parentheses). 


industry-year-quarter.  If  classification  shifting  occurs,  then  we  expect  to  find  more  evidence  of  it 
for  firms  with  higher  net  operating  assets  (relative  to  the  industry-quarter  median)  because  these 
firms  are  more  constrained  in  their  ability  to  manipulate  accruals.*^ 

Results  are  reported  in  Table  6.  We  test  H2  for  the  full  sample  and  for  subsamples  based  on 
Q4  and  Ql-3.  The  subsample  tests  also  allow  us  to  re-examine  HI  within  the  context  of  H2.  The 
interactive  effect  provides  a  stronger  test  of  classification  shifting.  If  classification  shifting  is 
occurring,  then  we  expect  it  to  be  more  pervasive  in  fourth-quarter  observations  with  high  NOA. 
The  results  for  the  full  sample  and  the  subsamples  support  H2.  For  the  full  sample,  we  find  that  the 
relation  between  unexpected  core  earnings  and  special  items  is  significantly  less  negative  for  firms 
with  higher  net  operating  assets.  The  difference  is  significant  (t  =  9.88;  p  <  0.01).  The  same  is 
also  true  for  observations  in  Q4  (t  =  5.52;  p  <  0.01)  and  in  Ql-3  (t  =  6.77;  p  <  0.01). 
Furthermore,  fourth  quarter  observations  with  high  NOA  show  the  most  evidence  in  favor  of 
classification  shifting  relative  to  the  other  observations.  Supporting  H2,  we  conclude  that  when 
managers’  ability  to  manipulate  accruals  is  constrained,  classification  shifting  is  more  likely. 


Test  of  H3 


For  the  next  set  of  tests,  we  split  the  sample  based  on  reported  earnings  relative  to  earnings 
benchmarks.  Hypothesis  3  predicts  that  classification  shifting  is  more  prevalent  in  subsamples  that 
just  meet  or  beat  the  analyst  forecast,  one-year-ago  same-quarter  earnings,  and  zero  earnings, 
respectively.  In  particular,  the  coefficients  on  %SIq  are  expected  to  be  more  positive  (or  less 
negative)  for  subsamples  with  observations  that  have  these  characteristics. 


Of  course,  our  use  of  this  measure  requires  the  joint  assumption  that  net  operating  assets  provide  a  reliable  measure  of 
prior  upward  accrual  manipulation  and  that  prior  upward  accrual  manipulation  reduces  the  ability  of  managers  to 
manipulate  accruals  in  the  current  period.  To  the  extent  that  this  joint  assumption  is  false,  inferences  from  our  results  are 
limited.  Other  studies  employing  net  operating  assets  find  evidence  consistent  with  it  being  a  measure  of  managers’ 
accrual  manipulation  constraint  (e.g.,  Mosebach  and  Simko  2005;  Ettredge  et  al.  2007;  Omer  et  al.  2008). 

While  not  the  intended  purpose  of  our  study,  these  results  are  also  consistent  with  a  portfolio  approach  to  managing 
earnings  (e.g.,  Jiambalvo  1996;  Fields  et  al.  2001;  Lin  et  al.  2006;  Cohen  and  Zarowin  2008;  Cohen  et  al.  2008). 
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TABLE  6 


Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percent  of  Sales: 
High  Net  Operating  Assets  (HighNOA)  versus  Low  NOA  (LowNOA) 


Independent  Variable 

Full  Sample 

Q4 

Ql-3 

Test  (HI):  ( 
>  Ql-3 

HighNOA 

0.0130 

0.0218 

0.0103 

(17.41)** 

(12.05)** 

(12.97)** 

LowNOA 

-0.0051 

-0.0084 

-0.0040 

(-6.80)** 

(-4.62)** 

(-5.04)** 

%SIg  *  HighNOA  (A) 

-0.1153 

-0.1019 

-0.1441 

0.0422 

(-17.83)** 

(-9.44)** 

(-16.55)** 

(3.22)** 

%SIg  *  LowNOA  (B) 

-0.2063 

-0.1848 

-0.2305 

0.0457 

(-31.49)** 

(-17.70)** 

(-24.67)** 

(3.42)** 

Test  (H2):  A  >  B 

0.0910 

0.0829 

0.0864 

(9.88)** 

(5.52)** 

(6.77)** 

R2 

1.25% 

1.77% 

1.07% 

Number  of  Observations 

132,393 

33,454 

98,939 

*,  **  Indicates  significant  at  the  0.05  and  0.01  levels,  respectively,  using  a  two-tailed  test. 

Variable  definitions  are  in  Table  1.  HighNOA  {LowNOA)  is  an  indicator  variable  that  equals  1  if  a  firm-quarter  observation 
has  net  operating  assets  at  the  beginning  of  the  quarter  above  or  equal  to  (below)  the  median  for  the  industry-year-quarter, 
and  0  otherwise.  All  variables  are  winsorized  at  1st  and  99th  percentile.  Amounts  reported  are  regression  coefficients  (with 
t-statistics  in  parentheses). 


We  use  the  I/B/E/S  Detail  File  to  determine  whether  firms  just  meet  or  beat  analysts’  fore¬ 
casts.  The  analyst  forecast  is  defined  as  the  median  of  the  last  three  individual  analysts’  forecasts 
after  the  previous  quarter’s  earnings  announcement  and  before  the  current  quarter’s  earnings 
announcement.  Firms  reporting  an  earnings  forecast  error  equal  to  $0.00  or  $0.01  are  considered 
as  just  meeting  or  beating  analysts’  forecasts.  These  observations  include  approximately  26  per¬ 
cent  of  our  sample.  For  the  prior  period  earnings  benchmark,  we  define  firms  as  just  meeting  or 
beating  if  they  report  a  change  in  reported  operating  income  per  share  (Compustat  data  item  #177) 
from  $0.00  to  $0.03,  constituting  approximately  17  percent  of  our  quarterly  observations.  For  the 
zero  earnings  benchmark,  firms  that  report  operating  income  per  share  from  $0.00  to  $0.04  are 
defined  as  those  just  meeting  the  threshold,  which  equals  approximately  nine  percent  of  our 
quarterly  observations. 

Results  for  testing  H3  are  presented  in  Tables  7-9.  In  Table  7  for  the  full  sample,  the  coeffi¬ 
cient  of  %SI  is  —0.0956  for  firms  that  just  meet  or  beat  the  analyst  forecast  threshold  {JustMBF) 
versus  —0.1288  for  other  firms  (NotJustMBF).  The  difference  in  coefficients  of  0.0332  is  statis¬ 
tically  significant  (t  =  2.17;  p  =  0.03).  The  difference  increases  to  0.0581  (t  =  2.01;  p  =  0.04)  for 
observations  in  the  fourth  quarter  and  falls  to  0.0251  (t  —  1.36;  p  =  0.17)  in  the  interim  quarters. 


We  use  the  I/B/E/S  data  unadjusted  for  stock  splits  for  reasons  discussed  in  Payne  and  Thomas  (2003). 
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TABLE  7 


Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percent  of  Sales: 
Just  Meet  or  Beat  Forecasts  (JustMBF)  versus  Not  Just  Meet  or  Beat  Forecasts 

(NotJustMBF) 


Independent  Variable 

Full  Sample 

Q4 

Ql-3 

Test  (HI):  C 
>  Ql-3 

JustMBF 

0.0083 

0.0133 

0.0069 

(7.68)** 

(4.98)** 

(5.98)** 

NotJustMBF 

0.0061 

0.0118 

0.0042 

(9.33)** 

(7.75)** 

(5.93)** 

%SF  *  JustMBF  (A) 

-0.0956 

-0.0533 

-0.1270 

0.0737 

(-6.79)** 

(-1.98)* 

(-7.62)** 

(2.54)** 

%SF  *  NotJustMBF  (B) 

-0.1288 

-0.1114 

-0.1521 

0.0407 

(-21.57)** 

(-10.78)** 

(-19.84)** 

(3.38)** 

Test  (H2);  A  >  B 

0.0332 

0.0581 

0.0251 

(2.17)* 

(2.01)* 

(1.36) 

R2 

0.84% 

0.92% 

0.94% 

Number  of  Observations 

67,980 

17,563 

50,417 

Indicates  significant  at  the  0.05  and  0.01  levels,  respectively,  using  a  two-tailed  test. 

Variable  definitions  are  in  Table  1.  JustMBF  {NotJustMBF)  is  an  indicator  variable  that  equals  1  (0)  if  a  firm-quarter 
observation  has  an  analyst  forecast  error  of  $0.00  or  $0.01,  and  0  (1)  otherwise.  Analyst  forecast  error  is  defined  as  actual 
earnings  per  share  as  reported  by  I/B/E/S  less  the  median  of  the  last  three  individual  analyst  forecast  after  the  previous 
quarter’s  earnings  announcement  and  before  the  current  quarter’s  earnings  announcement.  All  variables  are  winsorized  at 
1st  and  99th  percentile.  Amounts  reported  are  regression  coefficients  (with  t-statistics  in  parentheses). 


We  also  note  that  the  differences  in  Q4  versus  Ql-3  are  significant  for  both  JustMBF  and  Not¬ 
JustMBF  firms.  Overall,  we  conclude  that  finding  the  most  evidence  of  classification  shifting  in 
the  fourth  quarter  for  firms  that  just  meet  or  beat  the  analyst  forecast  supports  H3. 

We  then  split  the  full  sample  into  the  subsample  with  firm-quarter  earnings  that  just  meet  or 
beat  prior  period  earnings  {JustMBF)  and  the  subsample  with  firm-quarter  earnings  that  do  not  just 
meet  or  beat  prior  period  earnings  {NotJustMBF).  Managers  of  firms  with  earnings  that  ex  post  ']nsX 
meet  or  beat  earnings  in  the  same  quarter  of  previous  year  are  expected  to  have  had  stronger 
incentives  ex  ante  to  shift  core  expenses  to  special  items,  especially  in  the  fourth  quarter.  The 
results  in  Table  8  show  that  for  the  full  sample  and  for  each  of  the  subsamples,  the  relation 
between  unexpected  core  earnings  and  special  items  is  less  negative  for  JustMBF  firms.  These 
results  are  consistent  with  H3.  We  also  find  stronger  evidence  of  classification  shifting  to  just 
meeting  or  beating  prior  period  earnings  in  the  fourth  quarter. 


As  an  additional  test  using  analyst  data,  we  consider  the  extent  to  which  analysts  exclude  income-decreasing  special 
items.  We  do  this  by  comparing  I/B/E/S  actual  earnings  to  reported  earnings  per  share  excluding  extraordinary  items 
from  Compustat.  To  the  extent  that  analysts  do  not  incorporate  income-decreasing  special  items  in  their  forecasts,  we 
expect  I/B/E/S  actual  earnings  to  be  greater  than  reported  earnings.  For  our  sample  of  observations  with  I/B/E/S  data  (n 
=  67,980),  we  find  that  approximately  23  percent  have  I/B/E/S  earnings  greater  than  reported  earnings  per  share 
excluding  extraordinary  items.  Furthermore,  if  classification  shifting  is  meant  to  mislead  financial  statement  users,  then 
we  expect  to  find  more  evidence  of  classification  shifting  when  LT3/E/S  earnings  exclude  expenses.  This  is  what  we  find. 
When  I/B/E/S  earnings  are  greater  than  reported  earnings,  we  find  that  the  relation  between  unexpected  core  earnings 
and  special  items  is  less  negative  for  the  full  sample  (t  =  4.12;  p  <  0.01),  in  Q4  (t  =  2.33;  p  =  0.02),  and  m  Ql-3  (t 
=  3.63;  p  <  0.01). 
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TABLE  8 

Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percent  of  Sales: 

Just  Meet  or  Beat  Prior  Earnings  (JustMBP)  versus  Not  Just  Meet  or  Beat  Prior  Earnings 

(NotJustMBP) 


Independent  Variable 

Full  Sample 

Q4 

Ql-3 

Test  (HI): 
>  Ql-3 

JustMBP 

0.0120 

0.0156 

0.0113 

(9.15)** 

(4.21)** 

(8.41)** 

NotJustMBP 

0.0027 

0.0057 

0.0017 

(4.52)** 

(4.04)** 

(2.72)** 

%5/q  *  JustMBP  (A) 

-0.0995 

-0.0741 

-0.1373 

0.0632 

(-6.70)** 

(-3.06)** 

(-6.69)** 

(2.10)* 

%SIq  *  NotJustMBP  (B) 

-0.1630 

-0.1484 

-0.1860 

0.0376 

(-32.74)** 

(-18.23)** 

(-27.09)** 

(3.73)** 

Test  (H3):  A  >  B 

0.0635 

0.0743 

0.0487 

(4.05)** 

(2.91)** 

(3.36)** 

R2 

0.93% 

1.11% 

0.88% 

Number  of  Observations 

126,932 

31,923 

95,009 

*,  **  Indicates  significant  at  the  0.05  and  0.01  levels, 

respectively,  using  a 

two-tailed  test. 

Variable  definitions  are  in  Table  1.  JustMBP  {NotJustMBP)  is  an  indicator  variable  that  equals  1  (0)  if  a  firm-quarter 
observation  has  a  reported  change  in  operating  income  per  share  (Compustat  data  item  #177)  in  the  current  quarter 
compared  to  the  same  quarter  in  the  prior  year  between  $0.00  and  $0.03,  and  0  (1)  otherwise.  All  variables  are  winsorized 
at  1st  and  99th  percentile.  Amounts  reported  are  regression  coefficients  (with  t-statistics  in  parentheses). 


Table  9  reports  results  for  firms  that  just  meet  or  beat  the  zero  earnings  threshold  (JustMBZ) 
versus  other  firms  {Not JustMBZ).  For  the  full  sample,  the  coefficient  on  %SI  for  JustMBZ  firms  is 
positive  (0.0304)  and  significant  (t  =  1.96;  p  =  0.05),  while  the  coefficient  on  %SI  for  Not  Just¬ 
MBZ  firms  is  significantly  negative  (-0.1762;  t  =  -36.60;  p  <  0.01).  The  difference  is  significant 
(t  =  12.71;  p  <  0.01).  We  also  find  significant  differences  in  Q4  and  in  Ql-3.  For  JustMBZ  firms, 
the  coefficient  on  %SI  is  positive  in  Q4  (0.0400)  and  negative  in  Ql-3  (-0.0039),  but  the 
difference  is  not  significant  (t  =  1.39;  p  =  0.16),  probably  due  to  lower  power  tests  resulting  from 
the  small  number  of  observations  that  just  meet  or  beat  the  zero  earnings  threshold.  The  difference 
between  Q4  and  Ql-3  for  NotJustMBZ  is  significantly  positive  (t  =  3.73;  p  <  0.01).  Overall,  the 
evidence  supports  classification  shifting  for  firms  that  just  meet  or  beat  the  zero  earnings  bench¬ 
mark,  again,  supporting  H3. 

As  a  final  test,  we  consider  the  three  earnings  thresholds  jointly.  That  is,  we  test  for  income 
classification  shifting  when  firm-quarter  observations  just  meet  or  beat  any  of  the  three  earnings 
thresholds.  This  joint  test  presumably  contains  less  measurement  error  because  evidence  in  favor 
of  classification  shifting  is  found  for  each  of  the  three  earnings  thresholds  in  Tables  7-9.  For 
example,  in  our  test  of  the  zero  earnings  threshold  in  Table  9,  the  explicit  assumption  is  that  firms 
that  are  classified  as  not  just  meeting  or  beating  the  benchmark  {NotJustMBZ)  are  less  likely  to 
have  shifted  core  expenses.  However,  these  firms  may  just  meet  or  beat  the  analyst  forecast  (prior 
period  earnings)  threshold.  These  firms  may  shift  in  response  to  these  alternative  earnings  bench¬ 
marks,  but  our  test  in  Table  9  would  not  properly  test  this. 

Because  we  require  information  for  all  earnings  thresholds  jointly,  the  sample  size  reduces  to 
66,034  firm-quarter  observations.  We  first  note  that  for  this  smaller  sample,  approximately  12.6 
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TABLE  9 


Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percent  of  Sales: 

Just  Meet  or  Beat  Zero  Earnings  (JustMBZ)  versus  Not  Just  Meet  or  Beat  Zero  Earnings 

(NotJustMBZ) 


Test  (HI):  Q4 


Independent  Variable 

Full  Sample 

Q4 

Ql-3 

>  Ql-3 

JustMBZ 

0.0388 

0.0514 

0.0355 

(22.46)** 

(11.41)** 

(19.76)** 

NotJustMBZ 

0.0005 

0.0031 

-0.0003 

(0.90) 

(2.30)* 

(-0.47) 

%S1„  *  JustMBZ  (A) 

0.0304 

0.0400 

-0.0039 

0.0439 

(1.96)* 

(1.57) 

(-0.18) 

(1.39) 

%SIa  *  NotJustMBZ  (B) 

-0.1762 

-0.1622 

-0.1980 

0.0358 

(-36.60)** 

(-20.65)** 

(-29.76)** 

(3.67)*^ 

Test  (H3):  A  >  B 

0.2066 

0.2022 

0.1941 

(12.71)** 

(7.59)** 

(8.63)** 

R2 

1.45% 

1.78% 

1.32% 

Number  of  Observations 

132,393 

33,454 

98,939 

*,  **  Indicates  significant  at  the  0.05  and  0.01  levels,  respectively,  using  a  two-tailed  test. 

Variable  definitions  are  in  Table  1.  JustMBZ  (NotJustMBZ)  is  an  indicator  variable  that  equals  1  (0)  if  a  firm-quarter 
observation  has  a  reported  operating  income  per  share  in  the  current  quarter  between  $0.00  and  $0.04,  and  0(1)  otherwise. 
All  variables  are  winsorized  at  1st  and  99th  percentile.  Amounts  reported  are  regression  coefficients  (with  t-statistics  in 
parentheses). 


(26.8)  [33.3]  percent  of  the  observations  that  are  classified  as  not  just  meeting  or  beating  the 
analyst  forecast  (prior  period  earnings)  [zero  earnings]  threshold  actually  just  meet  or  beat  one  of 
the  other  two  thresholds.  Thus,  we  know  that  our  tests  in  Tables  7—9  contain  some  measurement 
error  and  inferences  could  be  affected.  The  results  of  our  joint  tests  are  reported  in  Table  10.  We 
find  for  the  full  sample,  for  Q4,  and  for  Ql-3  that  the  coefficient  on  %SI  is  significantly  less 
negative  for  firms  that  just  meet  or  beat  any  earnings  threshold  {JustMB)  than  for  other  firms 
{NotJustMB).  We  also  note  that  the  coefficient  on  %SI  is  significantly  less  negative  in  Q4  than  in 
Q1-Q3  for  JustMB  firms  and  for  NotJustMB  firms.  The  results  in  Table  10  provide  broad  support 

for  H3. 

VI.  CONCLUSION 

This  study  addresses  the  issue  of  whether  and  when  managers  use  classification  shifting  to 
manage  core  earnings.  Prior  research  provides  evidence  that  investors  and  analysts  tend  to  fixate 
on  core  earnings  (Bradshaw  and  Sloan  2002;  Gu  and  Chen  2004;  Kinney  and  Trezevant  1997)  and 
that  earnings  reported  higher  in  the  income  statement  tend  to  have  higher  valuation  multiples  (Lipe 
1986).  Thus,  the  extent  to  which  managers  shift  core  expenses  to  special  items  is  important  for 
understanding  how  managers  may  take  advantage  of  the  market’s  fixation  on  core  earnings  num¬ 
bers. 

McVay  (2006)  documents  a  positive  relation  between  unexpected  core  earnings  and  income- 
decreasing  special  items,  concluding  that  managers  shift  core  expenses  to  special  items  to  improve 
core  earnings.  However,  McVay  (2006)  notes  that  this  finding  does  not  hold  (and  in  fact  becomes 
negative)  when  contemporaneous  total  accruals  (which  include  accrual  special  items)  are  excluded 
from  core  earnings  expectations  model.  Thus,  there  is  some  debate  of  the  extent  to  which  man- 
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TABLE  10 

Regression  of  Unexpected  Core  Earnings  on  Special  Items  as  a  Percent  of  Sales: 

Just  Meet  or  Beat  Any  of  the  Three  Earnings  Benchmarks  (JustMB)  versus  Not  Just  Meet 
or  Beat  Any  of  the  Three  Earnings  Benchmarks  (NotJustMB) 


Independent  Variable 

Full  Sample 

Q4 

Ql-3 

Test  (HI):  Q4 
>  01-3 

JustMB 

0.0108 

0.0196 

0.0083 

(11.84)** 

(8.56)** 

(8.63)** 

NotJustMB 

0.0040 

0.0078 

0.0027 

(5.56)** 

(4.64)** 

(3.49)** 

%Slq  *  JustMB  (A) 

-0.0812 

-0.0445 

-0.1140 

0.0695 

(-6.96)** 

(-2.02)* 

(-8.19)** 

(2.90)** 

%SIq  *  NotJustMB  (B) 

-0.1355 

-0.1185 

-0.1569 

0.0384 

Test  (H3):  A  >  B 

(-21.42)** 

(-10.90)** 

(-19.15)** 

(3.01)** 

0.0543 

0.0740 

0.0429 

(4.09)** 

(3.00)** 

(2.65)** 

0.93% 

1.13% 

0.98% 

Number  of  Observations 

66,034 

16,995 

49,039 

*,  **  Indicates  significant  at  the  0.05  and  0.01  levels,  respectively,  using  a  two-tailed  test. 

Variable  definitions  are  in  Table  1.  JustMB  {NotJustMB)  is  an  indicator  variable  that  equals  1  (0)  if  a  firm-quarter 
observation  has  any  of  the  following:  an  analyst  forecast  error  of  $0.00  or  $0.01,  a  reported  change  in  operating  income  per 
share  (Compustat  data  item  #177)  in  the  current  quarter  compared  to  the  same  quarter  in  the  prior  year  between  $0.00  and 
$0.03,  or  a  reported  operating  income  per  share  in  the  current  quarter  between  $0.00  and  $0.04,  and  0  (1)  otherwise.  All 
variables  are  wmsorized  at  1st  and  99th  percentile.  Amounts  reported  are  regression  coefficients  (with  t-statistics  in 
parentheses). 


agers  shift  core  expenses  to  special  items.  McVay  (2006)  calls  for  research  to  improve  the  core 
earnings  expectations  model  and  to  provide  additional  cross-sectional  tests  of  classification  shift¬ 
ing. 

Using  quarterly  data,  we  first  extend  McVay’s  (2006)  core  earnings  expectations  model  by 
dropping  current-period  accruals  and  including  additional  controls  for  performance.  We  develop 
alternative  measures  for  unexpected  core  earnings  based  on  different  models  for  expected  core 
earnings.  We  then  examine  the  relation  between  unexpected  core  earnings  and  income-decreasing 
special  items  (measured  as  a  positive  number).  Without  sufficient  control  for  performance,  we  first 
observe  a  strong  negative  relation,  as  opposed  to  a  positive  relation  that  indicates  classification 
shifting.  However,  this  negative  relation  significantly  diminishes  with  additional  performance 
controls. 

Next,  we  provide  additional  cross-sectional  tests  to  evaluate  increased  classification  shifting 
in  specific  settings.  First,  we  show  that  classification  shifting  is  more  prevalent  in  the  fourth 
quarter  than  in  interim  quarters.  This  finding  is  consistent  with  managers  adopting  alternative 
earnings  management  techniques  in  the  fourth  quarter  (Cohen  et  al.  2009)  and  that  it  is  more 
difficult  to  achieve  earnings  thresholds  using  accrual  manipulation  in  the  fourth  quarter  (Brown 
and  Pinello  2007).  Second,  we  document  that  classification  shifting  is  more  prominent  when 
managers  appear  to  be  constrained  in  their  ability  to  manipulate  curi’ent-period  accruals  because  of 
prior  upward  accrual  manipulation  (Barton  and  Simko  2002).  This  result  is  consistent  with  the 
reasoning  that  managers  have  a  portfolio  of  choices  for  manipulating  reported  performance.  To  the 
extent  that  one  option  for  manipulating  earnings  is  limited,  managers  will  turn  to  the  next.  While 
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accraal  manipulation  and  real  activities  management  have  been  extensively  investigated  in  the 
literature,  income  classification  shifting  has  received  relatively  little  attention  and  represents  a 
viable  opportunity  for  managers  to  manipulate  reported  earnings. 

In  addition,  we  find  greater  evidence  of  classification  shifting  for  samples  of  firms  that  just 
meet  or  beat  analysts’  forecasts,  one-year-ago  same-quarter  earnings,  and  zero  earnings.  These 
results  are  consistent  with  managers  attempting  to  avoid  market  penalties  (Barth  et  al.  1999; 
Bartov  et  al.  2002;  Kasznik  and  McNichols  2002)  and  other  negative  consequences  associated 
with  missing  earnings  targets.  These  results  are  also  consistent  with  managers  having  more  incen¬ 
tive  to  shift  income  when  investors  are  likely  giving  more  weight  to  earnings  performance  (e.g., 
Hayn  1995;  Burgstahler  and  Dichev  1997b;  Barth  et  al.  1998;  Collins  et  al.  1999;  Barua  et  al. 
2006). 

Clearly,  the  extent,  timing,  and  incentives  under  which  managers  engage  in  classification 
shifting  is  an  important  area  of  research.  To  the  extent  that  managers  attempt  to  fool  investors  by 
shifting  core  expenses  to  special  items,  it  is  important  for  researchers  to  better  understand  the 
effects  on  investors,  auditors,  regulators,  and  other  market  participants.  Because  this  area  of 
research  is  relatively  new,  there  is  ample  opportunity  for  future  research  to  explore  the  settings 
under  which  classification  shifting  occurs  and  to  improve  upon  models  to  measure  expected  core 
earnings  or  core  expenses.^^  Although  we  improve  upon  the  model  in  McVay  (2006),  we  have  not 
been  able  to  fully  control  for  performance.  Future  research  may  consider  further  advancements  of 
the  model  to  facilitate  research  on  classification  shifting. 
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Joint  Effects  of  Principles-Based  versus 
Rules-Based  Standards  and  Auditor  Type  in 
Constraining  Financial  Managers’  Aggressive 
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ABSTRACT:  Managers  sometimes  implement  accounting  standards  (such  as  the  lease 
standard)  opportunistically  to  move  debt  off  balance  sheet.  Regulators  and  standard- 
setters  are  considering  the  adoption  of  principles-based  accounting  standards  to  re¬ 
duce  such  opportunism.  We  report  the  results  of  an  experiment  in  which  experienced 
financial  managers,  with  incentives  to  structure  a  transaction  off  balance  sheet,  take  a 
reporting  position  on  how  a  lease  is  to  be  reported.  We  manipulate  the  type  of  account¬ 
ing  standards  (principles-based,  rules-based)  and  the  type  of  auditor  (principles- 
oriented,  rules-oriented,  or  client-oriented).  Results  show  that  for  a  rules-based  stan¬ 
dard,  auditor-type  does  not  influence  participants’  propensity  to  report  the  transaction 
off  balance  sheet.  However,  for  a  principles-based  standard,  auditor-type  matters  in 
that  this  propensity  is  lowest  when  the  auditor  is  principles-oriented  as  opposed  to 
rules-  or  client-oriented.  Our  results  suggest  that  a  move  toward  more  principles-based 
standards  is  likely  to  result  in  improved  financial  reporting  quality  only  when  there  is  a 
corresponding  shift  in  auditors’  mindsets  toward' being  more  principles-oriented. 

Keywords:  principles-based  standards;  rules-based  standards;  auditor-type;  off  bal¬ 
ance  sheet  debt;  transaction  structuring. 


We  thank  Steve  Glover,  John  Ferguson,  The  FEI  National,  Edmonton  and  Calgary  Chapter,  The  Institute  of  Chartered 
Accountants  of  Alherta,  and  CMA  Alberta  for  their  assistance  in  obtaining  participants  for  the  study.  We  appreciate  helpful 
comments  from  Laureen  Maines  (editor),  Steve  Kachelmeier  (senior  editor),  two  anonymous  referees,  as  well  as  workshop 
participants  at  the  University  of  Alberta,  Chinese  University  of  Hong  Kong,  University  of  Hawaii,  Peking  University, 
University  of  Toronto,  Tsinghua  University,  and  Yale  University.  We  also  thank  Wei  Chen,  Krista  Fiolleau,  Kris  Hoang, 
Elaine  Wang,  and  Bo  Zhou  for  research  assistance,  and  the  Social  Science  and  Humanities  Research  Council  of  Canada 
(SSHRC)  and  Nanyang  Technological  University  for  their  financial  support  of  this  study. 

Editor’s  note:  Accepted  by  Laureen  Maines. 

Submitted:  July  2008 
Accepted:  December  2009 
Published  Online:  June  2010 


1325 


1326 


Jamal  and  Tan 


1.  INTRODUCTION 

In  this  study,  we  investigate  the  joint  effects  of  principles-  versus  rules-based  standards  and  the 
type  of  audit  partner  overseeing  the  audit  on  the  proclivity  of  managers  to  undertake  aggres¬ 
sive  reporting.  Specifically,  we  examine  whether  the  extent  to  which  principles-based  (as 
opposed  to  rules-based)  standards  mitigate  managers’  aggressive  reporting  is  contingent  on 
auditor-type.  This  issue  is  important  because  of  alleged  benefits  of  principles-based  standards,  the 
intention  of  the  Securities  and  Exchange  Commission  (SEC)  to  move  toward  a  principles-based 
standards  framework  (SEC  2003),  and  concerns  that  auditors  have  become  rules-oriented  (Finan¬ 
cial  Accounting  Standards  Board  [EASE]  2004;  Essaides  2006). 

We  adopt  the  definitions  in  the  SEC  2003  report.  Study  Pursuant  to  Section  108(d)  of  the 
Sarbanes-Oxley  Act  of  2002  on  Adoption  by  the  United  States  Financial  Reporting  System  of  A 
Principles-Based  Accounting  System,  and  consider  standards  to  be  principles-based  if  they  are 
based  on  a  consistently  applied  conceptual  framework  with  a  clearly  stated  accounting  principle, 
have  an  appropriate  but  not  excessive  level  of  implementation  guidance,  and  possess  few  bright- 
line  tests  as  well  as  exceptions  to  the  principles  underlying  the  standard.^  One  salient  feature  of 
principles-based  standards  is  that,  as  compared  to  rules-based  standards,  they  narrow  reporting 
options  since  reliance  on  an  accounting  principle  tied  to  a  conceptual  framework  likely  leads  to  a 
preferred  accounting  treatment  and  reduces  exceptions  (SEC  2008).  Indeed,  commenting  on  the 
benefits  of  principles-based  standards,  regulators  have  indicated  that  “with  management  and  au¬ 
ditors  held  accountable  for  fulfilling  the  narrowly  circumscribed  substantive  accounting  objectives 
built  into  objectives-oriented  standards,  we  believe  that  the  degrees  of  freedom  to  achieve  ‘de¬ 
sired’  accounting  results  is  minimized  relative  to  either  a  rules-based  or  a  principles-only  ap¬ 
proach”  (SEC  2003,  Section  1(c)).  Whether  this  is  the  case  is  an  empirical  question.  To  investigate 
our  research  question,  we  use  the  context  of  existing  lease  standards  as  an  example  of  rules-based 
standards,  and  a  new  proposed  lease  accounting  treatment  as  an  example  of  principles-based 
standards. 

While  there  has  been  much  debate  about  the  effects  of  principles-  versus  rules-based  stan¬ 
dards,  less  attention  has  been  paid  to  the  role  of  the  auditor  in  implementing  generally  accepted 
accounting  principles  (GAAP).  We  posit  that  auditor-type  matters  in  the  implementation  of 
principles-  versus  rules-based  standards.  Although  auditor-type  varies  in  many  ways,  we  focus  on 
three — whether  the  auditor  is  relatively  more  rules-oriented,  principles-oriented,  or  client-oriented. 
These  three  types  of  auditors  likely  exist  regardless  of  GAAP  (principles-  versus  rules-based) 
regimes.  We  consider  a  rules-oriented  auditor  to  be  one  with  a  relatively  greater  concern  toward 
adhering  to  well-specified  rules  rather  than  capturing  the  underlying  substance  of  the  transaction. 
Compliance  with  the  letter  of  the  rule  rather  than  fair  presentation  would  be  the  orientation  of  such 
an  auditor.  An  excessive  emphasis  on  following  the  letter  of  the  rule  is  of  concern  to  regulators  and 
standard- setters  in  the  U.S.  (Essaides  2006;  Scott  2006).  A  principles-oriented  auditor  is  one  who 
focuses  on  the  underlying  substance  of  a  transaction  rather  than  literally  following  the  rules.  This 
fits  the  profile  of  the  kind  of  auditors  who  regulators  and  standard- setters  would  like  to  see 
(Millman  2005),  and  regulators  have  indicated  that  with  principles-based  standards,  “there  is  a 
need  to  train  preparers  and  auditors  in  understanding  the  substance  of  the  class  of  transactions” 


*  The  SEC  2003  report  also  refers  to  principles-based  standards  as  objectives-oriented  standards.  It  notes  that  there  is 
current  confusion  between  principles-based  standards  (which  the  SEC  advocates)  and  principles-only  standards,  and 
states  that  two  distinguishing  features  of  principles-only  standards  are  that  they  involve  little  implementation  guidance, 
and  require  significant  judgment  in  their  implementation  without  providing  a  sufficient  structure  to  frame  these  judg¬ 
ments.  In  fact,  the  SEC  report  indicates  that  properly  constructed  principles-based  standards  may  involve  less  judgment 
than  either  rules-based  or  principles-only  standards. 
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(SEC  2003,  Section  3(i)).  Finally,  even  in  a  post-Sarbanes-Oxley  Act  (SOX)  world,  it  is  likely  that 
client-oriented  auditors  exist  as  motivations  to  please  clients  cannot  be  completely  eliminated 
(Moore  et  al.  2006). 

We  posit  that  the  joint  consideration  of  both  GAAP-type  (principles-based,  rules-based)  and 
auditor-type  (principles-,  rules-,  or  client-oriented)  is  necessary  to  limit  managers’  propensity  to 
engage  in  aggressive  accounting  reporting  practices.  Our  underlying  premise  is  that  accounting 
standards  serve  as  a  source  of  negotiation  power,  in  that  they  justify  and  legitimatize  a  preferred 
stand  by  an  auditor  or  manager  (French  and  Raven  1959;  Fiske  and  Berdahl  2007).  A  rules-based 
standard  vests  negotiation  power  with  the  client  insofar  as  the  rules  in  the  accounting  standard 
permit  the  client’s  preferred  stand,  even  if  doing  so  does  not  reflect  the  underlying  economics  of 
the  transaction.  An  auditor  who  is  more  principles-oriented  and  wishes  to  reflect  the  economic 
substance  of  a  client’s  transaction  in  the  financial  statements  is  constrained  from  doing  so  because 
he/she  has  less  negotiation  power  in  this  situation.  Thus,  with  rules-based  standards,  the  nature  of 
the  auditor  does  not  constrain  the  client’s  propensity  to  engage  in  aggressive  reporting.  In  contrast, 
with  a  principles-based  standard,  auditor-type  plays  a  part.  The  client  knows  that  given  a 
principles-based  standard,  the  principles-oriented  auditor  is  more  likely  and  better  able  to  oppose 
aggressive  reporting.  Thus,  with  a  principles-based  standard,  the  client’s  propensity  to  engage  in 
aggressive  reporting  is  more  likely  to  be  reduced  with  a  principles-oriented  auditor  than  either  a 
rules-oriented  or  client-oriented  auditor. 

We  conduct  an  experiment  to  investigate  how  auditor- type  and  GAAP-type  jointly  influence 
financial  officers’  proclivity  to  engage  in  aggressive  reporting.  We  consider  accounting  reporting 
to  be  aggressive  if  it  is  biased  toward  an  outcome  consistent  with  management’s  incentives — 
specifically,  reporting  a  lease-like  transaction  off  balance  sheet  rather  than  as  a  capital  lease  (debt) 
in  our  experiment.  We  employ  an  experimental  approach  because  archival  data  either  do  not  exist 
within  current  accounting  regimes  (e.g.,  U.S.  GAAP  has  not  converted  to  a  principles-based 
regime)  or  entail  confounds  when  accounting  judgments  and  market  reactions  are  compared  be¬ 
tween  two  jurisdictions. 

The  participants  in  our  experiment  are  chief  financial  officers  (CFOs)  and  highly  experienced 
CFO-type  financial  managers.  In  our  experiment,  the  financial  manager  has  an  objective  of  car¬ 
rying  out  a  lease-like  transaction  without  reporting  additional  debt  on  the  balance  sheet.  We 
examine  a  setting  where  the  proposed  transaction  has  not  yet  been  carried  out,  thus  giving  the 
manager  an  opportunity  to  structure  the  transaction  before  interacting  with  the  auditor.  We  ma¬ 
nipulate  two  between- subjects  variables:  the  type  of  GAAP  (principles-based  or  rules-based)  and 
the  type  of  audit  partner  (principles-oriented,  rules-oriented,  or  client-oriented)  with  whom  the 
manager  anticipates  he/she  will  interact.  Participants  are  asked  to  assume  that  they  operate  under 
either  existing  Canadian  GAAP  (that  is  rules-based)  or  a  proposed  International  Accounting  Stan¬ 
dards  Board  (lASB)  guidance  consistent  with  the  recommendation  of  the  G4-fl  (that  is 
principles-based).^  The  nature  of  the  proposed  transaction  is  such  that  the  bright-line  criteria  under 
the  rules-based  standard  permit  it  to  be  reported  off  balance  sheet  (as  desired  by  the  manager), 
thus  serving  as  a  baseline  condition  in  which  we  expect  the  manager  to  report  the  transaction  off 
balance  sheet  as  an  operating  lease,  and  the  auditor  to  have  no  effect  on  the  aggressiveness  of 
financial  reporting.  In  contrast,  with  a  principles-based  standard,  we  expect  a  GAAP-type  X 
auditor-type  interaction  in  that  the  likelihood  of  aggressive  financial  reporting  is  lower  when  the 
auditor  is  principles-oriented  than  when  the  auditor  is  either  rules-oriented  or  client-oriented. 


^  The  G4+ 1  was  an  association  of  the  standard-setting  bodies  of  Australia,  Canada,  New  Zealand,  U.K.,  and  U.S.  with  the 
International  Accounting  Standards  Committee  (lASC)  participating  as  an  observer.  The  G4-i-l  was  disbanded  on 
February  1,  2001  after  the  formation  of  the  International  Accounting  Standards  Board  (lASB). 
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Our  results  indicate  that  when  GAAP  is  rules-based,  the  type  of  audit  partner  (even  one  who 
is  principles-oriented)  does  not  influence  the  managers’  propensity  to  take  the  lease  off  balance 
sheet.  However,  when  GAAP  is  principles-based,  auditor-type  matters — the  manager’s  propensity 
to  take  the  lease  off  balance  sheet  is  lower  when  the  audit  partner  is  principles-oriented  compared 
to  the  situation  when  the  audit  partner  is  either  rules-oriented  or  client-oriented.  In  fact,  this 
propensity  to  take  the  lease  off  balance  sheet  is  lowest  when  a  principles-based  accounting  stan¬ 
dard  is  adopted  and  the  auditor  enforces  the  substance  of  the  transaction  (i.e.,  the  auditor  is 
principles-oriented),  compared  to  any  other  combination  of  GAAP-type  or  auditor-type. 

Our  study  makes  a  contribution  to  the  literature  by  providing  what  we  believe  is  the  first 
evidence  on  how  principles-based  versus  rules-based  standards  and  the  type  of  auditor  jointly 
constrain  managers’  aggressive  reporting.  Our  findings  provide  useful  evidence  on  the  debate 
relating  to  the  efficacy  of  principles-  versus  rules-based  standards.  We  also  provide  ex  ante  evi¬ 
dence  to  help  standard-setters  evaluate  the  effectiveness  of  a  proposed  lease  accounting  standard. 
While  the  FASB  and  lASB  believe  that  this  narrowing  of  lease-reporting  options  (to  only  capital 
leases)  results  in  reduced  opportunities  to  engage  in  opportunistic  transaction  structuring  (FASB/ 
lASB  2009),  our  results  suggest  that  this  desired  outcome  is  dependent  on  the  orientation  of 
auditors.  At  a  broader  level,  our  study  also  informs  extant  research  on  auditor-client  negotiation, 
which  has  largely  focused  on  the  reporting  of  transactions  that  have  already  occurred  (e.g., 
Sanchez  et  al.  2007;  Tan  and  Trotman  2010).  Our  results  suggest  a  more  complex  picture  than  that 
assumed  in  the  current  literature — an  anticipated  negotiation  with  an  auditor  can  influence  the  way 
that  the  client  structures  a  transaction  that  has  yet  to  occur. 

Section  II  discusses  the  previous  literature  and  develops  our  hypotheses.  Section  III  describes 
the  experiment.  Section  IV  presents  the  results.  Section  V  summarizes  the  findings  and  presents  a 
discussion  of  implications  and  future  research  opportunities. 

II.  BACKGROUND  AND  HYPOTHESIS  DEVELOPMENT 
Current  Lease  Accounting  Standards 

In  this  study,  we  examine  a  context  in  which  management  of  an  oil  and  gas  company  is 
contemplating  a  leasing  transaction,  and  prefers  to  classify  the  lease  as  an  operating  lease  (to  keep 
the  debt  off  balance  sheet).  Current  lease  accounting  standards  (Statement  of  Financial  Accounting 
Standards  (SFAS)  No.  13  in  the  U.S.,  International  Accounting  Standard  (IAS)  17  by  lASB,  and 
the  Canadian  Institute  of  Chartered  Accountants  Handbook  (CICA  HB)  Section  3065  in  Canada) 
have  been  considered  to  be  examples  of  rules-based  standards  (Tweedie  2007;  SEC  2008).  They 
identify  four  tests  to  determine  whether  a  lease  is  classified  as  operating  or  capital:  ownership 
transfer,  lease  term  for  a  majority  of  asset’s  useful  life,  present  value  of  commitments  accounts  for 
substantial  part  of  asset’s  fair  value,  and  bargain  purchase  option.  The  primary  concern  about 
current  lease  accounting  standards  is  that  they  permit  two  classes  of  leases — an  operating  lease 
that  is  off  balance  sheet  and  a  capital  lease  that  is  on  the  balance  sheet — even  though  in  both  cases, 
the  right  to  use  the  leased  item  and  obligation  to  pay  rentals  give  rise  to  assets  and  liabilities 
(Nailor  and  Lennard  2000;  FASB/IASB  2009).  Hence,  the  existing  standards  are  inconsistent  with 
the  underlying  conceptual  framework  (FASB/IASB  2009).  Furthermore,  the  four  criteria  used  in 
these  standards  are  bright-line  tests  that  can  be  interpreted  as  conditions  for  classifying  leases 
(SEC  2008,  25).  These  attributes  render  current  lease  standards  more  rules-based  than  principles- 
based. 

In  February  2000,  the  standard-setters  of  the  G4-M  countries  (with  the  lASC  as  an  observer) 
issued  a  report  that  sought  to  close  the  loophole  caused  by  preparers  gaming  the  distinction 
between  capital  and  operating  leases  in  the  current  lease  standards  (Nailor  and  Lennard  2000).  The 
G4-M  report  recommends  that  organizations  report  in  the  balance  sheet  the  fair  value  of  all  assets 
and  liabilities  contained  in  a  lease  agreement,  a  stand  that  has  found  support  by  academics 
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(Ryan  et  al.  2001)  and  standard- setters  (Tweedie  2007).  In  effect,  this  recommendation  results  in 
all  leases  being  classified  as  capital  leases,  and  eliminates  consideration  of  the  traditional  four 
lease-classification  criteria.  More  recently,  the  lASB  and  FASB  issued  a  joint  discussion  paper 
making  a  similar  proposed  stand  and  arguing  that  current  lease  standards  are  conceptually  flawed 
and  inconsistent  (FASB/IASB  2009).  The  proposal  is  for  a  narrow  implementation  approach — that 
is,  the  recommendation  is  made  in  the  context  of  existing  lease  standards.  Hence,  with  respect  to 
lASB  standards,  the  existing  guidance  contained  in  IAS  17  applies,  except  that  the  four  indicators 
listed  in  IAS  17  would  no  longer  apply.  For  ease  of  exposition,  we  call  this  the  IAS  17-plus 
guidance,  and  consider  it  to  be  principles-based  since  it  specifies  a  clear  principle  (all  leases  are 
debt  that  should  be  reported  on  the  balance  sheet)  and  eliminates  tests  that  could  justify  different 
reporting  for  similar  transactions.  In  contrast,  the  CICA  HB  3065  guidance,  like  existing  U.S. 
lease  standards,  is  rules-based  in  that  it  contains  “bright-line”  criteria  and  is  inconsistent  with  the 
conceptual  framework  (FASB/IASB  2009). 

In  our  study,  the  structure  of  the  proposed  lease  is  such  that,  based  on  the  “bright-line”  criteria 
listed  in  CICA  HB  3065,  the  stated  lease  term  and  present  value  of  commitments  enables  the 
transaction  to  be  classified  as  an  operating  lease  and  be  taken  off  the  balance  sheet,  although  there 
is  a  penalty  after  25  years  if  the  company  does  not  renew  the  lease.  This  penalty  creates  some 
ambiguity  that  could  be  used  to  argue  in  favor  of  capital  or  operating  lease  treatment.  In  contrast, 
application  of  IAS  17-plus  suggests  that  the  substance  of  the  transaction  is  a  lease,  resulting  in  the 
recognition  of  the  assets  and  liabilities  of  the  lease  in  the  balance  sheet  (i.e.,  the  transaction  is 
considered  a  capital  lease).  However,  there  are  several  ways  to  get  around  this  prescribed  treat¬ 
ment.  There  are  scope  limitations  in  that  extractive  assets  (e.g.,  oil  and  gas)  as  well  as  executory 
contracts  are  excluded,  and  there  are  transition  provisions  and  other  definitions  that  can  be  used  to 
get  around  the  treatment  prescribed  in  the  standard.  To  achieve  a  desired  reporting  treatment,  the 
manager  can  also  alter  the  terms  of  the  lease,  or  take  the  transaction  off  balance  sheet  by  making 
the  transaction  something  other  than  a  lease  (e.g.,  by  getting  a  utility  to  build  the  power  plant  and 
then  entering  into  a  long-term  purchase  contract  to  buy  power).  Thus,  a  manager  could  justify 
either  on  or  off  balance  sheet  treatment  under  either  a  rules-based  or  principles-based  standard. 

Principles-Oriented,  Rules-Oriented,  and  Client-Oriented  Auditors 

We  argue  that  discussions  of  the  effects  of  principles-  and  rules-based  standards  that  do  not 
also  consider  the  effects  of  auditor-type  are  incomplete.  We  identify  three  auditor-types 
(principles-oriented,  rules-oriented,  and  client-oriented)  that  correspond  to  profiles  of  auditors 
discussed  by  standard- setters  and  regulators. 

An  auditor  intent  on  enforcing  an  accounting  standard  can  do  so  by  being  more  strict  and  firm 
in  his/her  stand  that  the  underlying  principles  of  the  standard  are  adhered  to  and  that  the  economic 
substance  of  the  transaction  is  captured,  even  if  the  specific  rules  in  the  standard  are  not  met.  This 
corresponds  to  the  profile  of  an  “ideal”  auditor  envisioned  by  some  regulators  and  standard- setters 
(Millman  2005;  SEC  2003).  We  consider  this  auditor  to  be  principles-oriented. 

Alternatively,  the  auditor  can  be  relatively  strict  and  firm  in  ensuring  that  the  specific  rules 
underlying  the  standard  are  adhered  to,  with  less  consideration  for  the  underlying  principles  of  the 
standard  or  the  economic  substance  of  the  transaction.^  We  consider  this  auditor  to  be  rules- 
oriented.  Anecdotal  evidence  from  discussions  with  senior  partners  from  public  accounting  firms, 
as  well  as  public  positions  adopted  by  regulators  (Taub  2005;  Public  Company  Accounting  Over¬ 
sight  Board  [PCAOB]  2007),  suggests  that  auditors  who  fit  the  profile  of  being  rules-oriented  are 


^  These  two  dimensions  are  not  necessarily  opposing,  and  an  auditor  can  clearly  do  both  when  the  rules  and  principles  do 
not  conflict. 
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relatively  common  (FASB  2004,  6-7).  In  fact,  one  of  the  main  concerns  about  SOX  reforms  and 
PCAOB  inspections  is  that  the  auditor  had  become  risk-averse  and  started  to  interpret  GAAP 
literally  rather  than  focusing  on  the  underlying  substance  of  the  transaction  (Grout  et  al.l994; 
Essaides  2006;  Scott  2006;  PCAOB  2007).  This  rules-oriented  auditor-type  has  been  blamed  for 
multiple  restatements  by  companies,  inadequate  fraud  detection  work  (PCAOB  2007),  excessive 
internal  controls  testing  to  achieve  SOX  404  certification,  and  unnecessary  implementation  costs 
for  SOX  reforms  (Millman  2005).'^ 

Finally,  the  auditor  could  be  client-oriented  in  terms  of  having  a  proclivity  to  take  the  side  of 
the  client.  Prior  research  indicates  that  auditors  have  incentives  to  please  clients  given  the  auditor- 
client  fee-dependence  relationship  (Moore  et  al.  2006),  and  that  auditors  exploit  ambiguities  in 
accounting  standards  to  justify  allowing  clients  to  engage  in  opportunistic  interpretations  of  GAAP 
(Hackenbrack  and  Nelson  1996).  While  regulators’  concerns  about  auditor  independence  and  audit 
quality  in  recent  years  may  constrain  the  extent  to  which  auditors  are  pro-client,  questionnaire 
results  also  indicate  that  CFOs  generally  feel  that  they  “win”  when  there  is  a  dispute  with  the 
auditor  (e.g.,  Gibbins  et  al.  2005).  Recent  empirical-archival  research  using  post-SOX  data  pro¬ 
vides  evidence  that  audit  quality  continues  to  be  influenced  by  incentives  arising  from  the  auditor- 
client  fee-dependence  relationship  (Lim  and  Tan  2008). 

Accounting  Standards  as  Source  of  Negotiation  Power 

Financial  statements  are  considered  to  be  the  outcome  of  negotiations  between  auditors  and 
client  (Antle  and  Nalebuff  1991;  Gibbins  et  al.  2001),  and  negotiation  research  in  psychology 
suggests  that  both  the  stand  of  the  auditor  with  respect  to  an  accounting  issue  and  auditor  nego¬ 
tiation  power  (relative  to  the  manager)  are  major  determinants  of  such  negotiation  outcomes 
(Bacharach  and  Lawler  1981;  Pruitt  and  Carnevale  1993).  An  auditor  who  is  either  principles- 
oriented  or  rules-oriented  can  be  considered  to  have  a  pre-negotiation  stand  with  respect  to  the 
treatment  of  an  accounting  issue  (whether  by  adherence  to  principles  or  rules).  Technically,  an 
auditor’s  stand  determines  the  negotiated  outcome  as  the  auditor  exercises  the  final  prerogative  in 
terms  of  what  is  ultimately  recorded  in  audited  financial  statements.  However,  many  issues  in 
accounting  involve  a  range  of  acceptable  outcomes  that  are  subjected  to  discussions  or  negotia¬ 
tions  with  client  managers,  and  the  extent  to  which  an  auditor’s  initial  preferred  stand  prevails 
during  auditor-client  negotiation  is  influenced  by  the  auditor’s  negotiation  power  relative  to  the 
manager. 

Accounting  standards  serve  as  a  source  of  this  negotiation  power,  in  that  they  justify  and 
legitimatize  a  preferred  stand  by  an  auditor  or  manager  (French  and  Raven  1959;  Fiske  and 
Berdahl  2007).  Although  auditors  have  the  final  say  in  terms  of  the  treatment  of  an  accounting 
issue,  the  auditor’s  stand  has  to  be  justifiable  and  is  constrained  by  applicable  accounting  stan¬ 
dards.  Thus,  accounting  standards  serve  as  a  source  of  negotiation  power  for  the  manager.  Finding 
authoritative  support  for  one’s  position  is  rated  as  the  most  important  factor  affecting  auditor-client 
negotiations  (Gibbins  et  al.  2001).  In  addition,  from  the  manager’s  perspective,  there  is  a  marked 
difference  in  terms  of  the  perceived  norms  attributable  to  a  principles-  versus  rules-based  standard. 


The  most  recent  example  in  which  auditors  are  cited  as  being  excessively  cautious  and  misapplying  the  spirit  of  the  rules 
is  in  the  conduct  of  SOX  404  internal  control  audits.  The  PCAOB  replaced  its  Auditing  Standard  No.  2  with  Auditing 
Standard  No.  5  on  July  25,  2007  to  induce  auditors  to  use  more  (risk-based)  judgment  when  conducting  internal  control 
audits.  This  change  in  auditing  standard  occurred  after  widespread  complaint  that  the  auditors  had  narrowly  applied  the 
guidance  in  Auditing  Standard  No.  2  and  caused  firms  to  incur  excessive  costs  (Weirich  and  Rouse  2007).  With  respect 
to  SOX  404  audits,  the  actions  of  the  auditor  were  seen  as  being  defensive  and  inconsistent  with  the  spirit  of  the  auditing 
standard. 
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and  psychology  research  indicates  that  such  norms  influence  managers’  judgments  (Cialdini  2007; 
Coleman  1990).  For  a  principles-based  standard,  managers  may  infer  that  the  spirit  of  the  standard 
is  to  record  the  underlying  economics  of  a  transaction.  For  a  rules-based  standard,  managers  may 
perceive  that,  given  the  specificity  of  the  rules,  it  is  more  legitimate  to  patently  apply  explicit 
exceptions  or  loopholes  in  the  rules  to  achieve  their  preferred  way  of  recording  a  transaction. 

Joint  Effect  of  Standards  and  Auditor-Type 

The  discussion  above  suggests  that,  in  situations  where  a  manager  disagrees  with  a  preferred 
stand  by  the  auditor,  the  manager  can  use  the  power  associated  with  accounting  standards  to 
initiate  negotiations  and  counter  the  auditor’s  stand  (e.g.,  Fiske  and  Berdahl  2007;  Magee  et  al. 
2007).  Thus,  although  the  orientation  of  an  auditor  (e.g.,  principles-  versus  rules-oriented)  likely 
determines  the  stand  the  auditor  takes  with  respect  to  an  accounting  issue,  the  nature  of  the 
accounting  standard  can  undermine  or  support  the  degree  to  which  the  auditor  can  enforce  his/her 
preferred  stand. 

To  the  extent  that  the  rules  in  a  rules-based  standard  permit  the  manager’s  preferred  stand,  the 
manager  can  use  them  to  justify  his/her  preferred  accounting  treatment.  Auditor-type  is  unlikely  to 
matter  with  a  rules-based  standard.  For  example,  a  rules-oriented  auditor  will  adhere  to  the  appli¬ 
cation  of  the  rules,  and  therefore  agree  with  the  manager’s  preferred  accounting  treatment.  A 
client-oriented  auditor  is  also  unlikely  to  curtail  managers’  opportunistic  accounting  practices  in 
the  presence  of  rules-based  standards.  A  client-oriented  auditor’s  stand  is  influenced  by  what  the 
manager  prefers  as  long  as  it  is  not  GAAP-inconsistent.  In  a  rules-based  regime,  a  client-oriented 
auditor  is  likely  to  permit  interpretations  of  the  rules  and  associated  loopholes  that  favor  the 
manager’s  preferred  accounting  treatment.  Finally,  a  principles-oriented  auditor  may  prefer  not  to 
adopt  the  client’s  proposed  accounting  treatment  to  the  extent  that  application  of  the  rules  is 
contrary  to  the  economic  substance  of  the  transaction  or  underlying  accounting  principles.  How¬ 
ever,  the  principles-oriented  auditor’s  negotiation  power  is  effectively  constrained  by  the  rules  in 
the  accounting  standards,  and  the  manager  is  likely  to  achieve  his/her  preferred  accounting  treat¬ 
ment.  This  discussion  suggests  the  following  hypothesis: 

Hla:  When  accounting  standards  are  rules-based,  managers’  likelihood  of  adopting  an  op- 
poitunistic  accounting  treatment  will  not  differ  as  a  function  of  whether  the  auditor  is 
rules-oriented,  client-oriented,  or  principles-oriented. 

Just  as  GAAP-type  can  constrain  the  effects  of  auditor-type  (as  in  the  illustration  above),  the 
reverse  is  also  true — the  efficacy  of  a  particular  type  of  accounting  standard  can  be  constrained  by 
auditor-type.  This  is  the  case  with  a  principles-based  standard.  As  argued  by  standard- setters  and 
regulators,  an  accounting  standard  that  is  principles-based  can  potentially  reduce  opportunities  for 
managers  to  achieve  their  incentive-driven  accounting  choices.  However,  this  is  contingent  on 
auditor-type.  If  the  auditor  is  more  rules-oriented,  then  the  underlying  principle  of  the  principles- 
based  standard  is  not  likely  to  be  enforced,  and  managers  can  use  seemingly  rule-like  provisions 
or  exceptions  in  the  standard  to  achieve  their  desired  accounting  choices,  even  if  these  are  incon¬ 
sistent  with  underlying  accounting  principles.  More  importantly,  to  the  extent  that  managers  en¬ 
gage  in  transaction  structuring  to  get  around  the  standard  and  achieve  their  preferred  accounting 
treatment,  the  rules-oriented  auditor  is  unlikely  to  object  because  he/she  is  less  likely  to  attend  to 
the  underlying  economics  of  the  transaction  or  accounting  principles.  Similarly,  a  client-oriented 
auditor  is  sympathetic  to  the  client,  and  will  not  object  to  the  use  of  exceptions  or  transaction 
structuring  even  with  a  principles-based  standard. 

This  discussion  suggests  that  the  presence  of  only  a  principles-oriented  auditor  (absent  a 
principles-based  standard)  or  a  principles-based  standard  (absent  a  principles-oriented  auditor)  is 
unlikely  to  curtail  aggressive  accounting  practices  of  managers.  Rather,  a  combination  of  both  a 
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principles-oriented  auditor  and  a  principles-based  standard  is  necessary.  In  particular,  a  principles- 
based  GAAP  standard  supports  the  principles-oriented  auditor’s  stand  and  strengthens  his/her 
negotiation  power  relative  to  that  of  the  manager.  Managers  who  are  aware  that  they  are  dealing 
with  a  principles-oriented  auditor  in  the  context  of  a  principles-based  standard  should  therefore  be 
less  likely  to  attempt  opportunistic  interpretations  of  GAAP  to  achieve  a  preferred  accounting 
treatment — the  auditor  is  unlikely  to  support  such  GAAP  interpretations,  and  a  principles-based 
accounting  standard  offers  little  negotiation  power  to  the  manager. 

Our  hypothesis  is  formally  stated  below: 

Hlb:  When  accounting  standards  are  principles-based,  managers’  likelihood  of  adopting  an 
opportunistic  accounting  treatment  will  be  lower  when  the  auditor  is  principles- 
oriented  than  when  the  auditor  is  either  rules-oriented  or  client-oriented. 

III.  RESEARCH  METHOD 

Participants 

Participants  are  90  financial  managers  in  Canada,  with  mean  business  experience  of  19.5 
years.  Sixty-three  participants  (70  percent)  have  current  or  prior  experience  as  chief  financial 
officers  (mean  experience  as  chief  financial  officer  =  6.7  years),  and  26  (29  percent)  of  them  have 
worked  as  CFOs  in  a  listed  company.  The  remaining  27  participants  are  financial  controllers/ 
managers  or  equivalent  with  significant  business  experience  (mean  =  10.8  years).  Several  profes¬ 
sional  associations — Financial  Executives  International  (FEI)  national  office  in  Toronto  as  well  as 
FEI  chapters  in  Calgary  and  Edmonton,  Institute  of  Chartered  Accountants  of  Alberta,  and  Certi¬ 
fied  Management  Accountants  (Alberta) — assisted  the  researchers  in  identifying  potential  partici¬ 
pants.  These  organizations  distributed  email  messages  describing  the  research  study  to  their  mem¬ 
bers,  advertised  the  study  in  their  professional  newsletters,  and  encouraged  members  to  participate 
in  the  study.  Executives  of  these  organizations,  and  some  prominent  corporate  directors  who  have 
national  professional  networks,  also  provided  an  endorsement  of  the  study.  One  key  inducement 
for  financial  managers  to  participate  in  the  study  was  the  promise  of  personalized  feedback  from 
a  Big  4  audit  firm  partner  who  would  read  their  memo.^  One  of  the  researchers  then  contacted  169 
potential  participants  by  phone  or  via  email  to  seek  their  participation.  Upon  their  consent,  163 
research  instruments  were  sent  to  participants  by  courier  mail.  Ninety  completed  instruments  were 
returned  to  the  researcher,  for  a  response  rate  of  55  percent.^ 

Design 

We  employ  a  2  X  3  between-subjects  design,  with  GAAP-type  (principles-based,  rules-based) 
and  Auditor-type  (principles-oriented,  rules-oriented,  and  client-oriented)  as  independent  vari¬ 
ables.  For  all  treatments,  participants  are  told  to  assume  that  they  are  the  CFO  of  Oilsands,  Inc.,  a 
publicly  listed  oil  extraction  company.  The  GAAP-type  manipulation  varies  the  accounting  stan¬ 
dard  applicable  to  the  client.  In  the  rules-based  condition,  participants  are  told  that  Oilsands 
operates  under  Canadian  GAAP.  The  manipulation  instructs  participants  to  assume  that  the  client 
is  “operating  under  current  Canadian  GAAP.  HB  3065  proposes  a  series  of  very  specific  tests  (e.g.. 


^  We  recruited  three  Big  4  firm  audit  partners  (two  from  firm  1  and  one  from  firm  2)  to  provide  feedback  to  participants 
(one  client-oriented,  one  principles-oriented,  and  one  rules-oriented).  A  partner  in  a  Big  4  firm  gave  us  his  opinion  about 
which  audit  partners  were  client-oriented,  principles-oriented,  and  rules-oriented.  We  then  called  these  audit  partners  and 
asked  them  to  rate  themselves  as  being  either  client-,  principles-,  or  rules-oriented.  If  both  our  informant’s  judgment  and 
the  self-rating  by  the  partner  matched,  then  we  asked  the  partner  to  provide  feedback  in  that  role. 

®  We  sent  out  two  separate  calls  for  participation  and  administrations  of  the  experiment.  In  the  first  (second)  administra¬ 
tion,  111  (52)  research  instruments  were  sent,  and  61  (29)  were  returned. 
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present  value  of  lease  payments  >  90%  of  fair  market  value  of  the  asset;  lease  term  >  75%  of 
useful  life  of  asset)  to  determine  the  type  of  lease  involved  (capital,  operating)  and  its  associated 
valuation  and  disclosure  in  the  financial  statements.” 

In  the  principles-based  condition,  participants  are  told  that  Oilsands  operates  under  Interna¬ 
tional  Accounting  Standards  (IAS),  and  that  under  IAS,  the  new  proposed  lease  standard  based  on 
a  G4-I-1  research  study  (IAS  17)  removes  the  four  conditions  in  the  current  lease  standard,  and 
requires  the  present  value  of  all  leases  to  be  capitalized  and  amortized  (all  leases  are  capital 
leases).  Thus,  our  GAAP-type  manipulation  captures  the  key  features  of  a  principles-  (versus 
rules-)  based  standard  (SEC  2003).  As  opposed  to  the  rules-based  standard,  the  principles-based 
standard  has  a  clearly  stated  accounting  principle  (leases  are  associated  with  corresponding  assets 
and  liabilities  that  should  be  recorded  on  the  balance  sheet),  no  bright-line  tests,  and  no  exceptions 
to  the  principles  underlying  the  standard  (although  loopholes  remain  available  for  moving  the 
transaction  off  balance  sheet  due  to  scope,  valuation  methods,  and  other  exceptions  in  the  existing 
IAS  17  standard). 

The  Auditor-type  manipulation  varies  whether  the  audit  partner  in  charge  of  the  audit  is 
known  to  be  principles-oriented,  rules-oriented,  or  client-oriented.  The  principles-oriented  audit 
partner  is  described  as  “neutral  and  objective.  He  is  an  economic  substance  auditor  who  makes 
judgments  based  on  what  he  assesses  to  be  the  economic  substance  of  a  transaction.  In  fact,  on  a 
couple  of  occasions,  it  has  been  said  that,  compared  to  the  average  partner,  he  is  more  concerned 
about  capturing  the  substance  of  a  transaction  than  about  strictly  following  the  Handbook.”  The 
rules-oriented  audit  partner  is  described  as:  “a  very  technical  person.  He  is  known  to  be  a  strictly 
by-the-Handbook  auditor  who  follows  Handbook  rules  very  closely.  In  fact,  on  a  couple  of  occa¬ 
sions,  it  has  been  said  that,  compared  to  the  average  partner,  he  is  more  concerned  about  strictly 
following  the  Handbook  than  about  the  economic  substance  of  the  transaction.”  Finally,  the  client- 
oriented  audit  partner  is  described  as  “entrepreneurial  and  client  service-oriented,  and  open  to 
client  concerns.  He  is  a  client-oriented  person  who  cares  about  the  client’s  views.  In  fact,  on  a 
couple  of  occasions,  it  has  been  said  that  he  grants  the  client  a  little  more  latitude  than  the  average 
partner.” 

Procedure 

The  case  was  developed  using  disclosures,  as  provided  by  a  large  publicly  listed  oil  company, 
of  a  significant  off  balance  sheet  transaction.  The  case  materials  contain  background  information 
along  with  a  summary  of  selected  income  and  balance  sheet  items  related  to  Oilsands,  a  major  oil 
extraction  company  in  Canada.^  Participants  are  told  that  Oilsands  had  earlier  made  an  announce¬ 
ment  about  a  major  facility  expansion.  The  chief  executive  officer  indicates  the  need  to  first  build 
a  new  power  plant,  and  that  his  preference  is  for  the  power  plant  transaction  to  be  structured  “off 
balance  sheet”  as  an  operating  lease  rather  than  a  capital  lease.  The  reason  is  that  Oilsands  intends 
to  seek  additional  debt  financing  for  the  major  facility  expansion  later  on,  and  the  CEO  is  con¬ 
cerned  that  showing  the  power  plant  transaction  as  debt  may  jeopardize  the  company’s  ability  to 
obtain  additional  debt  financing. 

We  provided  participants  with  details  about  the  proposed  leasing  arrangement  with  a  financial 
institution.  The  present  value  of  the  lease  payments  is  87.5  percent  of  the  power  plant’s  fair  market 
value.  The  lease  is  for  25  years,  which  is  about  71.4  percent  of  the  power  plant’s  useful  life  of  35 
years.  There  is  no  ownership  transfer  at  the  end  of  the  lease  period,  but  there  is  an  option  to 


’  Participants  in  the  first  administration  of  the  experiment  also  completed,  in  random  order,  a  case  relating  to  the 
underlying  basis  (e.g.,  based  on  direct  hours,  time  spent,  billings  to  date)  to  recognize  revenue  from  a  construction 
contract  under  Canadian  GAAP.  We  find  no  main  or  interactive  effect  of  order  on  our  dependent  variable  when  we 
include  order  as  an  additional  independent  variable.  We  find  no  effect  of  auditor-type  for  this  construction  contract  case. 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1334 


Jamal  and  Tan 


purchase  the  power  plant  at  the  end  of  the  lease  period  at  its  fair  market  value.  Oilsands  incurs  a 
penalty  payment  of  20  percent  of  the  fair  market  value  at  the  end  of  the  lease  if  it  does  not  do  so. 
We  make  this  design  choice  for  two  reasons.  First,  this  setting  keeps  the  accounting  reporting 
flexibility  constant  (at  a  low  level)  for  both  rules-based  and  principles-based  standards.  For  in¬ 
stance,  the  transaction  meets  the  explicitly  stated  bright-line  tests  listed  in  the  rules-based  Cana¬ 
dian  GAAP  for  classifying  the  transaction  as  an  operating  lease  (present  value  of  lease  payments 
<  90  percent  of  fair  market  value  of  the  asset;  lease  term  <  75  percent  of  useful  life  of  asset;  no 
bargain  purchase  option;  no  ownership  transfer).  Hence,  there  is  little  reporting  flexibility  under 
Canadian  (rules-based)  GAAP  going  by  the  bright-line  rules  that  point  toward  the  lease  being 
classified  as  an  operating  lease,  consistent  with  management’s  incentives.  Similarly,  under  the 
proposed  IAS  (principles-based)  GAAP,  there  is  also  little  reporting  flexibility  as  the  transaction 
will  be  required  to  be  reported  on  the  balance  sheet,  inconsistent  with  management’s  incentives. 
Second,  because  our  focus  is  on  the  potential  benefits  of  a  principles-based  standard  (relative  to 
the  rules-based  standard  as  a  comparison),  this  creates  a  situation  where  there  is  conflict  between 
the  accounting  treatment  the  manager  prefers  and  that  proposed  by  the  principles-based  standard. 
Such  a  setting  enables  us  to  assess  whether  auditor-type  (specifically,  a  rules-oriented  auditor)  can 
undermine  the  benefits  of  a  principles-based  standard.  Similarly,  in  the  case  of  the  rules-based 
standard  where  there  is  no  conflict  between  managers’  preference  and  the  accounting  treatment 
suggested  by  the  rules,  this  setting  enables  us  to  assess  whether  auditor-type  (specifically,  a 
principles-oriented  auditor)  can  overcome  the  rules  and  enforce  the  underlying  economics  of  the 
transaction. 

Participants  also  read  a  draft  disclosure  note  prepared  by  the  accounting  department  on  the 
lease  transaction,  classified  as  an  operating  lease.  They  were  told  that  the  CEO  has  reviewed  the 
draft,  and  felt  that  the  draft  note  drew  too  much  attention  to  the  power  plant  transaction.  After 
reading  the  case,  participants  were  required  to  indicate  how  they  would  set  up  the  power  plant 
transaction,  the  basis  for  their  recommendations,  and  how  Oilsands  should  disclose  the  transaction 
in  the  financial  statements.  They  were  also  asked  to  write  a  memo  to  convince  the  auditor  on  the 
propriety  of  the  accounting  treatment  of  this  lease. 

Our  primary  dependent  variable  relates  to  whether  participants  seek  to  take  the  transaction  off 
balance  sheet  (that  is,  not  as  a  capital  lease)  in  accordance  with  management’s  incentives.  This 
dependent  variable  is  important  because  regulators  have  been  concerned  about  material  off  bal¬ 
ance  sheet  transactions  and  management’s  attempts  at  transaction  structuring  (see  interview  with 
Sir  David  Tweedie,  as  reported  in  Sawers  2008).  To  measure  this,  we  asked  an  open-ended 
question  related  to  how  participants  suggest  the  power  plant  transaction  be  set  up,  given  the  CEO’s 
concern.^  Two  doctoral  students  who  are  chartered  accountants  independently  coded  the  responses 
into  three  categories:  operating  lease,  transaction  structuring  to  take  the  transaction  off  balance 
sheet,  and  capital  lease.  There  was  100  percent  agreement.  The  coders  were  not  aware  of  the 
experimental  conditions  associated  with  the  memos.  For  the  purpose  of  our  dependent  variable 
(whether  transaction  is  taken  off  balance  sheet),  the  first  two  categories  were  coded  as  off  balance 
sheet  transactions. 


IV.  RESULTS 

Manipulation  Checks 

As  a  check  on  the  GAAP  manipulation,  we  asked  participants  whether  the  company  operates 
under  Canadian,  U.S.,  or  lASB  GAAP.  Four  participants  answered  the  question  incorrectly.  The 


We  asked  an  open-ended  question  to  allow  participants  flexibility  in  structuring  the  transaction  (including  non-lease 
transaction).  Asking  a  close-ended  question  with  pre-specified  options  would  limit  their  responses  to  the  options  pro¬ 
vided,  and  also  provide  a  cue  that  non-lease  transactions  are  possible. 
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manipulation  check  question  related  to  auditor-type  asks  participants  to  indicate  whether  the  audit 
partner  can  best  be  characterized  as  strict  (Mr.  GAAP),  neutral  and  objective,  or  entrepreneurial 
and  client  service-oriented.  Five  participants  answered  this  question  incorrectly.  We  obtain  the 
same  results  after  excluding  these  participants  who  failed  the  manipulation  check. 

Test  of  Hypothesis 

Table  1,  Panel  A  presents  the  descriptive  statistics  for  the  proportion  of  participants  who  opted 
to  record  the  transaction  off  balance  sheet;  that  is,  not  as  a  capital  lease.  Table  1,  Panels  B  and  C 
show  the  categorical  ANOVA  results  along  with  the  planned  contrasts.  Hypothesis  la  predicts  that 
with  rules-based  standards,  auditor-type  does  not  matter  in  terms  of  participants’  taking  the  trans¬ 
action  off  balance  sheet,  while  Hlb  predicts  that  auditor-type  matters  with  principles-based  stan¬ 
dards.  This  suggests  an  interaction  between  GAAP-type  and  Auditor-type.  As  shown  in  Table  1, 
Panel  B,  we  find  a  statistically  significant  interaction  between  GAAP-type  and  Auditor-type  (p  = 
0.010).^ 

As  Table  1,  Panel  C  shows,  when  standards  are  rules-based,  we  find  no  simple  main  effect  of 
auditor-type  (p  =  0.423),  and  all  possible  pairwise  comparisons  are  not  statistically  significant  (all 
p  >  0.208).  These  results  are  consistent  with  Hla.  In  contrast,  Hlb  predicts  with  principles-based 
standards,  auditor-type  matters  in  that  the  proportion  of  financial  managers  taking  the  transaction 
off  balance  sheet  is  lower  when  the  auditor  is  principles-oriented  than  when  the  auditor  is  either 
rules-oriented  or  client-oriented.  Results  show  that,  consistent  with  Hlb,  when  standards  are 
principles-based,  there  is  a  simple  main  effect  of  auditor-type  (p  =  0.010).  Pairwise  comparisons 
show  that,  as  predicted,  the  proportion  of  participants  taking  the  transaction  off  balance  sheet  is 
lower  with  a  principles-oriented  auditor  than  with  either  a  rules-oriented  auditor  (p  =  0.002; 
one-tailed)  or  client-oriented  auditor  (p  =  0.036;  one-tailed).  The  difference  in  mean  proportion 
when  the  auditor  is  rules-oriented  or  client-oriented  is  not  statistically  significant  (p  =  0.113). 

A  corollary  of  Hla  and  Hlb  is  that  a  combination  of  both  a  principles-oriented  auditor  and  a 
principles-based  standard  is  necessary  to  curb  managers’  aggressive  accounting  practices.  This 
suggests  that  the  proportion  of  participants  taking  the  transaction  off  balance  sheet  is  likely  to  be 
lowest  in  the  principles-based  standard/principles-oriented  auditor  condition.  A  specific  contrast 
test  comparing  the  mean  proportion  in  the  principles-based  standard/principles-oriented  auditor 
condition  compared  to  the  average  in  the  other  five  conditions  is  statistically  significant  (p  = 
0.001,  one-tailed).  We  also  made  pairwise  comparisons  between  the  principles-based  standard/ 
principles-oriented  auditor  condition  and  each  of  the  other  five  conditions,  and  find  that  the 
proportion  is  always  significantly  lower  in  the  principles-based  standard/principles-oriented  audi¬ 
tor  condition  than  in  the  other  conditions  (all  p  <  0.036,  one-tailed). 

Transaction  Structuring 

We  analyze  the  proportion  of  participants  who  opt  to  structure  their  transactions  to  achieve  the 
objective  of  having  the  transaction  taken  off  the  balance  sheet.  This  is  a  costly  option  as  it  involves 
the  firm  incurring  costs  to  restructure  their  original  planned  transaction  to  avoid  having  the  trans¬ 
action  being  recognized  on  the  balance  sheet.  Participants  who  engaged  in  transaction  restructur- 


®  To  assess  whether  our  results  vary  by  the  two  administrations  of  the  experiment,  we  test  whether  the  GAAP-type  X 
Auditor-type  interaction  predicted  in  HI  holds  when  we  include  Administration  (first,  second)  as  a  third  independent 
variable.  Results  of  a  categorical  ANOVA  with  proportion  of  participants  opting  to  take  the  transaction  off  balance  sheet 
shows  that  the  GAAP-type  X  Auditor-type  interaction  continues  to  be  significant  (p  =  0.008).  There  is  also  a  marginal 
GAAP-type  X  Auditor-type  X  Administration  interaction  (p  =  0.095) — we  find  that  the  two-way  GAAP-type  X 
Auditor-type  is  always  significant  in  both  administrations,  but  is  more  significant  in  Administration  1  (p  =  0.000)  and 
less  so  in  Administration  2  (p  =  0.025). 

An  overall  test  assessing  for  differences  among  any  of  these  five  conditions  is  not  statistically  significant  (p  =  0.339). 
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ing  proposed  a  variety  of  alternative  structures.  The  simplest  was  to  get  a  utility  to  construct  and 
run  a  power  plant  and  then  sign  a  long-term  contract  to  purchase  the  power  (an  off  balance  sheet 
commitment).  More  elaborate  structures  involved  setting  up  a  trust,  or  getting  together  with  other 
oil  companies  to  set  up  a  variable  interest  entity. 

Following  our  theory  and  experimental  design,  when  the  standard  is  rules-based,  there  is  little 
need  for  participants  to  engage  in  costly  transaction  restructuring  regardless  of  auditor-type  be¬ 
cause  the  rules  permit  the  classification  of  the  transaction  as  a  lease.  In  contrast,  when  the  standard 
is  principles-based  and  requires  the  transaction  to  be  reported  on  the  balance  sheet,  transaction 
restructuring  is  expected  to  be  higher  because  this  is  one  of  the  key  approaches  to  keep  the 
transaction  off  balance  sheet.  However,  this  outcome  is  dependent  on  auditor-type.  Even  with  a 
principles-based  standard,  transaction  restructuring  is  expected  to  be  low  when  the  auditor  is 
principles-oriented  because  participants  likely  anticipate  the  auditor  would  still  require  the  restruc¬ 
tured  transaction  to  be  recognized  in  the  balance  sheet.  The  rules-oriented  or  client-oriented 
auditor  is  less  likely  to  insist  on  this. 

Descriptive  statistics  are  shown  in  Table  2,  Panel  A.  Table  2,  Panel  B  shows  the  results  of  the 
categorical  ANOVA.  We  find  a  statistically  significant  main  effect  of  GAAP  (p  =  0.021),  but  no 
significant  main  effect  of  Auditor-type  (p  =  0.561)  or  interaction  effect  (p  =  0.213).  Table  2, 
Panel  C  reports  the  simple  effects  of  auditor-type  when  the  standard  is  either  rules-  or  principles- 
based. 

Overall,  only  23.3  percent  of  the  participants  opt  for  transaction  structuring.  When  the  stan¬ 
dard  is  rules-based,  a  very  low  proportion  of  participants  engage  in  transaction  restructuring  (11.1 
percent).  Consistent  with  our  expectation,  there  is  no  significant  simple  main  effect  of  auditor-type 
(all  p  =  0.805),  and  all  pairwise  comparisons  within  auditor-type  are  not  statistically  significant  (p 
>  0.550).  As  predicted,  when  the  standard  is  principles-based,  we  find  a  marginally  significant 
simple  main  effect  of  auditor-type  (p  =  0.092).  With  a  principles-based  standard,  the  mean  pro¬ 
portion  of  participants  undertaking  transaction  structuring  when  the  auditor  is  principles-oriented 
is  very  low  (13.3  percent),  and  is  significantly  lower  than  when  the  auditor  is  client-oriented  (p  = 
0.056;  one-tailed)  or  rules-oriented  (p  =  0.015;  one-tailed),  where  much  of  the  transaction  struc¬ 
turing  occurs  (40.0  percent  and  53.3  percent,  respectively).  A  specific  contrast  test  comparing  the 
mean  proportion  in  these  latter  two  conditions  versus  the  other  four  conditions  is  statistically 
significant  (p  =  0.001;  one-tailed). 

Thought  Units 

Our  theoretical  reasoning  implies  that  accounting  standards  (principles-  or  rules-based)  serve 
as  a  source  of  negotiation  power  in  that  they  help  justify  a  particular  accounting  choice.  However, 
the  need  for  and  ease  of  justification  likely  vai'y  with  the  preferred  outcome  of  the  manager. 
Managers  who  wish  to  record  the  transaction  as  a  capital  lease  should  find  it  easier  to  justify  their 
preference  when  the  accounting  standard  is  principles-based.  Likewise,  managers  who  prefer  to 
record  the  transaction  off  balance  sheet  should  find  it  easier  to  justify  their  preference  when  the 
accounting  standard  is  rules-based.  This  suggests  an  accounting  choice  by  GAAP-type  interaction. 
This  interaction  may  be  weaker  when  participants  in  the  principles-based/on  balance  sheet  (rules- 
based/off  balance  sheet)  condition  anticipate  interacting  with  a  principles-oriented  (rules-oriented) 
auditor  because  they  believe  such  an  auditor  is  already  convinced  and  there  is,  therefore,  less  need 
to  make  a  strong  justification.  On  the  other  hand,  there  may  be  no  moderating  effect  of  auditor- 
type  if  participants  perceive  that  all  auditors  expect  to  see  appropriate  justifications  for  managers’ 
accounting  choices. 

To  test  this  prediction,  we  first  content  code  participants’  rationale  memos  to  identify  the 
distinct  thought  units  documented  in  the  memos  that  participants  used  to  persuade  their  assigned 
audit  partner  regarding  the  propriety  of  the  proposed  accounting  treatment.  We  develop  a  coding 
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scheme  using  Koonce  et  al.  (1995)  as  a  guide.  The  same  two  coders  (doctoral  students  who  are 
chartered  accountants)  independently  coded  the  memos  written  by  participants.  Inter-rater  agree¬ 
ment  was  89  percent,  and  discrepancies  were  resolved  by  discussion.  The  coders  were  not  aware 
of  the  experimental  conditions  associated  with  the  memos. 

Overall,  eight  different  categories  of  thought  units  were  identified:  (1)  specific  arguments  on 
whether  the  lease  is/is  not  capital;  (2)  negotiating  the  terms  of  the  lease;  (3)  setting  up  a  trust  or 
special  purpose  entity;  (4)  nature  of  the  business  and  whether  it  falls  outside  scope  of  standard;  (5) 
details  of  nature  of  disclosure  after  selection  of  classification;  (6)  recapitulation  of  case  facts 
without  adding  new  insights;  (7)  greetings  and  pleasantries  unrelated  to  case  facts,  and  (8)  other 
thoughts.  We  find  no  main  or  interaction  effect  of  Auditor- type  or  GAAP- type  on  the  total  number 
of  thought  units  (smallest  p  =  0.296). 

We  compute  the  proportion  of  relevant  thought  units  as  a  proportion  of  total  thought  units 
(Proportionrelevant)  the  dependent  variable.  The  first  four  categories  ((1)  to  (4))  are  arguments 
directly  relevant  to  justifying  the  proposed  classification  by  the  manager.  The  remaining  four  ((5) 
to  (8))  are  less  relevant  for  convincing  the  auditor  to  agree  with  the  accounting  measurement 
proposed  by  the  manager.  For  example,  the  thought  units  on  disclosure  relate  to  discussion  on  the 
disclosure  of  lease  terms,  but  not  the  rationale  for  classifying  the  lease  as  operating  or  capital. 
These  disclosures  per  se  would  not  convince  the  auditor  to  allow  this  measurement  treatment.  We 
interpret  a  higher  Proportion^eievant  to  mean  that  the  participant  is  better  able  to  garner  support  or 
justification  for  his/her  proposed  classification. 

We  conduct  an  ANOVA  with  Auditor-type,  GAAP-type,  and  Accounting  Choice  (operating 
versus  capital  lease)  as  independent  variables.'^  Table  3,  Panel  A  reports  the  proportion  of  relevant 
thought  units  documented  for  all  managers,  while  Table  3,  Panels  B  and  C  report  the  correspond¬ 
ing  proportions  when  the  managers  select  a  capital  lease  option  and  report  off  balance  sheet, 
respectively.  The  ANOVA  results  are  shown  in  Table  3,  Panel  D.  We  find  a  significant  GAAP-type 
by  Accounting  Choice  interaction  (p  =  0.001),  and  the  p-values  are  not  statistically  significant 
either  for  the  other  main  effects  (smallest  p  =  0.153)  or  interaction  effects  (smallest  p  =  0.332). 
Further  analysis  indicates  that  when  participants  make  a  capital  lease  option  (Table  3,  Panel  B), 
Proportionrelevant  is  higher  for  a  principles-based  GAAP  than  a  rules-based  GAAP  (mean  =  0.801 
and  0.470,  respectively;  p  =  0.005;  one-tailed).  We  find  the  opposite  for  participants  who  select  an 
off  balance  sheet  treatment  (Table  3,  Panel  C);  Proportionrelevant  is  marginally  lower  for  principles- 
based  GAAP  than  rules-based  GAAP  (mean  =  0.475  and  0.602,  respectively;  p  =  0.030; 
one- tailed). 

These  results  are  consistent  with  the  notion  that  principles-  versus  rules-based  standards  serve 
as  sources  of  negotiation  power  insofar  as  they  influence  the  extent  that  financial  managers  can 
justify  their  accounting  choices  in  their  negotiations  with  the  auditor. 

V.  CONCLUSION 

This  study  investigates  the  effect  of  principles-based  versus  rules-based  accounting  standards 
on  the  reporting  positions  of  managers  who  interact  with  three  different  auditor-types  (an  auditor 
who  is  principles-oriented,  rules-oriented,  or  client-oriented).  Our  experiment  comparing  the  ef- 


“  When  we  run  an  overall  Auditor-type  by  GAAP-type  ANOVA  (without  partitioning  by  accounting  choice),  we  find  no 
significant  main  (smallest  p  =  0.498)  or  interaction  effect  (p  =  0.135).  One  participant  did  not  write  the  memo. 

In  fact,  when  participants  select  an  off  balance  sheet  transaction,  a  contrast  test  indicates  that  Proportion  relevant  is 
lowest  in  the  Principles-based  standard/Principles-oriented  auditor  condition  (mean  =  0.262)  than  that  in  the  other  five 
conditions  (mean  =  0.570;  p  =  0.023;  one-tailed).  This  finding  is  consistent  with  the  argument  that  the  Principles-based 
standard/Principles-oriented  auditor  condition  constrains  managers’  negotiation  power  (and  justifications  they  can  come 
up  with)  when  they  attempt  to  make  a  case  for  taking  the  transaction  off  balance  sheet.  A  caveat  to  this  analysis  is  that 
the  sample  size  in  the  Principles-based  standard/Principles-oriented  condition  is  very  small. 
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fects  of  a  proposed  standard  change  and  a  current  rules-based  GAAP  standard  provides  useful  ex 
ante  evidence  for  standard- setters  who  are  considering  adoption  of  principles-based  standards. 

Our  results  indicate  that  with  a  rules-based  standard,  auditor-type  does  not  influence  manag¬ 
ers’  propensity  to  take  the  proposed  lease  transaction  off  balance  sheet.  An  implication  of  this 
result  is  that  even  when  the  auditor  is  principles-oriented  and  seeks  to  enforce  the  economic 
substance  of  a  transaction,  he/she  is  constrained  from  doing  so  in  the  presence  of  a  rules-based 
standard.  We  find  that  with  a  principles-based  standard,  the  auditor-type  matters.  Specifically,  the 
propensity  to  take  the  proposed  lease  transaction  off  balance  sheet  is  lowest  when  the  auditor  is 
principles-oriented  compared  to  the  situation  when  the  auditor  is  rules-oriented  or  client-oriented. 
In  fact,  a  combination  of  a  principles-oriented  auditor  and  a  principles-based  standard  results  in 
the  lowest  incidence  of  the  proposed  lease  transaction  being  reported  off  balance  sheet  relative  to 
any  other  combination  of  auditor-type  and  standard-type.  These  results  are  consistent  with  the 
testimony  of  Sir  David  Tweedie  (2002),  in  which  he  argues  that  a  principles-based  accounting 
system  requires  both  preparers  and  auditors  to  work  together  to  reflect  the  economic  substance  of 
transactions  in  the  financial  statements.  Thus,  managers  may  still  engage  in  aggressive  reporting 
when  the  principles-oriented  auditor  operates  under  rules-based  standards,  or  when  a  principles- 
based  standard  is  enforced  by  rules-oriented  or  client-oriented  auditors. 

Our  analysis  of  participants’  transaction  restructuring  attempts  indicates  a  similar  picture. 
They  are  less  likely  to  engage  in  costly  restructuring  attempts  when  the  standard  is  rules-based  (as 
there  is  no  need  to)  or  when  the  standard  is  principles-based  and  auditor  is  principles-oriented 
(there  is  a  need  to  but  the  auditor  will  object).  Transaction  restructuring  attempts  are  more  preva¬ 
lent  when  the  standard  is  principles-based  (necessitating  costly  restructuring)  and  the  auditor  will 
be  supportive  (when  s/he  is  rules-oriented  or  client-oriented).  Thus,  potential  benefits  of  a 
principles-based  standard  can  be  obfuscated  by  auditors  who  are  rules-  and/or  client-oriented. 

Our  results  with  respect  to  financial  managers’  propensity  to  restructure  the  transaction  also 
add  to  extant  auditor-client  negotiation  literature.  The  results  suggest  that  the  auditor’s  bargaining 
power  relative  to  the  client  (vested  through  the  type  of  accounting  standards)  has  effects  on 
pre-negotiation  transactions  undertaken  by  the  client,  not  only  at  the  time  of  negotiation  as  has 
been  commonly  looked  at  in  the  literature.  Furthermore,  the  subject  matter  brought  to  the  table  for 
auditor-client  negotiation  may  have  been  the  outcome  of  a  strategic  decision  made  by  the  financial 
manager  in  response  to  the  type  of  auditor  who  will  engage  in  the  negotiation. 

The  principles-based  standard  that  we  examine  is  modeled  after  a  proposal  by  the  G4+1 
countries  to  report  all  leases  on  the  balance  sheet,  which  is  also  consistent  with  the  current 
proposal  by  the  FASB  and  lASB.  Our  results  suggest  that  the  benefit  of  the  proposed  lease 
accounting  standard  may  only  be  realized  if  we  also  have  an  auditor  who  focuses  on  reflecting  the 
substance  of  the  transaction  in  the  financial  statements.  A  primary  message  of  our  paper  is  that  a 
better  reporting  environment  is  created  only  if  both  the  accounting  standards  and  the  auditor  are 
principles-oriented.  Changing  only  one  of  these  variables  (accounting  rules  or  the  auditor)  is  less 
likely  to  be  effective.  For  instance,  when  the  standard  is  rules-based,  a  principles-oriented  audi¬ 
tor’s  ability  to  deter  aggressive  accounting  practices  is  limited  by  the  rules  in  the  standard;  in 
contrast,  when  the  standard  is  principles-based,  the  principles-oriented  auditor  is  better  able  to  do 
so  (relative  to  a  rules-oriented  or  client-oriented  auditor).  Thus,  proposals  for  changing  accounting 
standards  require  a  simultaneous  consideration  of  changes  in  the  mindset  of  the  auditor  as  well. 
Furthermore,  to  the  extent  that  auditors  are  currently  not  sufficiently  principles-oriented,  the  need 
for  training  auditors  to  be  so  becomes  increasingly  important  (SEC  2003). 

One  limitation  of  our  study  is  that  the  accounting  standard  we  examine  is  restricted  to  lease 
accounting,  which  may  not  be  representative  of  all  accounting  standards.  In  addition,  we  test  a 
proposed  lease  guidance,  and  not  an  existing  lease  standard.  It  is  possible  that  the  manner  by 
which  managers  react  to  a  proposed  standard  may  be  different  from  how  they  react  to  an  actual 
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standard.  For  instance,  once  they  develop  familiarity  with  such  a  standard,  they  may  develop  new 
ways  to  get  around  the  standard.  Also,  CICA  HB  3065  and  the  IAS  17-plus  guidance  likely  differ 
in  other  aspects  (e.g.,  wording,  format),  so  while  we  can  attribute  our  results  to  GAAP-type 
differences,  we  cannot  isolate  the  specific  aspect  leading  to  our  results.  Another  limitation  is  that 
auditor-type  (especially  client-oriented  auditors)  might  matter  more  when  managers  prefer  out¬ 
comes  that  are  inconsistent  with  both  the  letter  and  intent  of  the  standards.  Furthermore,  we  do  not 
determine  how  auditors  respond  to  principles-based  versus  rules-based  standards,  and  the  extent  to 
which  auditors  are  currently  principles-oriented  versus  rule-oriented.  In  our  study,  the  auditor  does 
not  negotiate  with  the  financial  manager.  Future  research  can  investigate  these  issues.  Our  study 
provides  a  starting  point  to  understand  how  it  is  necessary  to  investigate  both  the  characteristics  of 
accounting  standards  and  auditors  to  understand  reporting  decisions  of  managers. 
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ABSTRACT:  Prior  studies  employ  a  two-period  empirical  model  and  interpret  the  nega¬ 
tive  association  between  accruals  in  period  one  and  returns  in  period  two  as  evidence 
that  investors  misprice  the  information  contained  in  accruals.  In  contrast  to  prior  stud¬ 
ies,  I  employ  a  three-period  log-linear  model  decomposed  from  a  firm’s  book-to-market 
ratio  and  show  that  investors  do  not  misprice  the  information  contained  in  accruals.  My 
study  shows  that  in  the  four-year  period  prior  to  accrual  recognition,  equity  prices  tend 
to  be  driven  disproportionately  by  intangible  returns,  or  returns  not  explained  by  ac¬ 
counting  measures.  Accordingly,  the  relation  between  prior  period  intangible  returns 
and  future  period  returns  subsumes  the  relation  between  current  period  accruals  and 
future  returns.  In  addition,  I  link  the  accrual  anomaly  and  the  value/growth  anomaly  to  a 
common  economic  mechanism  (intangible  returns)  and  show  that  a  strong  negative 
relation  between  external  financing  activities  and  future  returns  is  not  subsumed  by  the 
accrual  anomaly. 

Keywords:  accrual  anomaly;  book-to-market;  external  financing;  investment;  market 
efficiency. 

Data  Availability:  The  data  used  are  available  from  the  sources  noted  in  the  study. 

I.  INTRODUCTION 

Beginning  with  Sloan  (1996),  the  accounting  literature  has  produced  an  extensive  stream  of 
literature  examining  the  negative  relation  between  current  period  accruals  and  future  pe¬ 
riod  returns.  From  this  stream  of  literature  two  primary  explanations  for  the  negative 
relation  (the  accrual  anomaly)  have  emerged.  The  “persistence”  explanation  builds  off  Sloan 
(1996)  and  explains  the  accrual  anomaly  as  caused  by  investor  inability  to  distinguish  the  pricing 
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implications  of  the  differential  persistence  of  the  accrual  and  cash  flow  components  of  current 
earnings.  The  “growth”  explanation  builds  off  Fairfield  et  al.  (2003)  and  suggests  that  investors 
misprice  the  growth  information  contained  in  accruals. 

While  these  two  streams  of  literature  are  distinct,  they  share  some  similarities.  First,  both 
explanations  suggest  that  investors  may  be  subject  to  behavioral  biases  that  prevent  them  from 
properly  incorporating  publicly  available  accounting  information  into  stock  prices.'’^  Second,  as 
noted  by  Fairfield  et  al.  (2003)  and  Zhang  (2007),  large  positive  accruals  tend  to  occur  during 
periods  of  growth  and  equity  issuance.  There  is  a  strong  negative  relation  between  equity  issuance 
activity  and  future  returns  (Loughran  and  Ritter  1995).  Despite  the  empirical  relation  between 
equity  issuance,  accruals,  and  future  returns,  accrual  studies  in  both  streams  have  been  relatively 
silent  as  to  the  effect  that  the  interaction  of  these  three  variables  on  inferences  made  between 
future  period  returns  and  current  period  accruals.^  Finally,  both  streams  examine  the  relation 
between  accruals  and  future  returns  primarily  within  the  confines  of  a  strict  two-period  model. 
For  example,  studies  in  these  streams  generally  examine  whether  variation  in  current  period 
accruals  are  associated  with  variation  in  future  returns  after  controlling  for  current  period  charac¬ 
teristics  and  factors  known  to  be  associated  with  future  returns. 

This  study  takes  a  different  approach  from  that  taken  in  prior  accrual  studies  and  finds  results 
suggestive  of  a  third  explanation.  Prior  research  has  viewed  a  firm’s  book-to-market  ratio  (B/M)  as 
a  characteristic  distinct  from  the  firm’s  current  period  accruals.  However,  this  view  overlooks  the 
fact  that  current  period  accruals  affect  B/M.  Consider  that  B/M  is  comprised  of  four  primary 
elements:  (1)  retained  earnings,  (2)  invested  equity,  (3)  expected  cash  flows,  and  (4)  expected 
returns.  Accordingly,  changes  in  each  of  these  elements  explain  changes  in  B/M.  Given  that 
accruals  explain  part  of  the  changes  in  retained  earnings,  and  that  changes  in  each  of  the  other 
three  elements  are  associated  with  future  returns,  this  study  examines  whether  the  evolution  of 
B/M  can  shed  new  light  on  the  negative  association  between  current  period  accruals  and  future 
period  returns. 

The  empirical  design  employed  in  this  study  builds  on  the  work  of  Daniel  and  Titman  (2006; 
hereafter,  DT).  DT  note  that  the  information  investors  incorporate  into  price  can  be  partitioned 
into  two  parts:  tangible  information  and  intangible  information.  By  assuming  that  a  firm’s  market 
return  over  a  given  period  is  a  good  proxy  for  total  information,  DT  classify  the  portion  of  a  firm’s 
realized  market  return  that  is  explained  by  summary  accounting  measures  as  tangible  information, 
or  tangible  returns.  Accordingly,  the  residual  realized  return  variation  that  is  not  explained  by 
summary  accounting  measures  is  termed  intangible  information,  or  intangible  returns.  DT  show  in 
a  two-period  empirical  model  that  the  strong  negative  relation  between  price-scaled  accounting 
variables  (e.g.,  book-to-market,  eamings-to-price)  and  future  returns  is  not  associated  with  total 
realized  growth  as  measured  by  accounting  variables.  Rather,  DT  suggest  that  the  return  reversal 
associated  with  price-scaled  accounting  variables  is  driven  by  realized  intangible  returns. 

I  expand  DT’s  empirical  model  into  a  three-period  model  to  investigate  the  relation  between 
prior  period  intangible  returns  in  period  one,  accruals  in  period  two,  and  future  returns  in  period 


*  Assuming  certain  conditions  relating  to  changes  in  a  firm’s  expected  returns  and  optimal  investment  in  response  to  these 
changes,  the  growth  explanation  does  not  necessarily  imply  investor  irrationality. 

^  Recent  work  by  Mashruwala  et  al.  (2006)  and  Khan  (2008)  suggest  rational  asset-pricing  explanations  of  the  accrual 
anomaly.  Specifically,  Mashruwala  et  al.  suggest  that  arbitrageurs  are  prevented  from  capitalizing  on  the  mispricing 
associated  with  accruals  due  to  high  arbitrage  risk,  while  Khan  proposes  a  four-factor  ICAPM-based  pricing  model  that 
explains  the  cross-sectional  future  return  variation  attributable  to  the  current  period  accruals. 

^  Recent  work  by  Cohen  and  Lys  (2006)  and  Dechow  et  al.  (2008)  suggest  that  the  negative  relation  between  future 
returns  and  accruals  subsumes  the  negative  relation  between  future  returns  and  external  financing  activities. 

Kothari  et  al.  (2006)  and  Zhang  (2007)  consider  the  relation  between  accruals  in  the  current  year  and  firm  growth  in 
prior  years,  the  current  year,  and  subsequent  years. 
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three.  Intangible  returns  can  generally  be  interpreted  as  capturing  future  growth  options  that  have 
not  been  recognized  by  accounting  performance  measures.  Given  the  discrete,  conservative  nature 
of  financial  accounting,  intangible  returns  will  tend  to  lead  tangible  returns  as  the  market  prices 
information  more  quickly  than  GAAP  recognizes  value.  By  examining  the  relation  between  cur¬ 
rent  period  accruals  and  prior  period  intangible  returns,  derived  from  the  book-to-market  ratio,  I 
provide  insight  into  a  possible  common  link  between  the  value/growth  anomaly  and  the  accrual 
anomaly. 

Consistent  with  prior  studies,  I  show  that  in  two-period  settings,  the  accrual  component  of  log 
book  returns  explains  a  significant  portion  of  future  return  variation.  Controlling  for  firm  growth 
captured  by  accounting  measures  of  growth  over  the  four-year  period  prior  to  accrual  recognition, 

I  show  that  the  accrual  component  of  log  book  returns  still  explains  significant  future  return 
variation.  However,  once  I  control  for  prior  period  intangible  returns  (PPIR),  I  show  that  the  return 
reversal  previously  attributed  to  accruals  is  subsumed  by  PPIR.  More  germane  to  the  accounting 
literature,  I  show  that  once  intangible  returns  from  period  one  are  controlled,  the  negative  relation 
between  accrual-based  returns  in  period  two  and  future  returns  in  period  three  is  insignificant.  This 
pattern  holds  for  multiple  definitions  and  decompositions  of  accruals. 

This  study  makes  several  contributions  to  the  existing  literature  examining  the  relation  be¬ 
tween  future  returns  and  accounting  measures  of  performance  such  as  accruals.  First,  I  provide 
empirical  evidence  that  investors  correctly  incorporate  the  information  contained  in  accruals  into 
price.  This  evidence  is  significant  given  the  widely  held  view  that  investors  misprice  the  growth 
information  contained  in  accruals  (Fairfield  et  al.  2003;  Zhang  2007)  and/or  the  persistence  char¬ 
acteristics  of  accruals  (Sloan  1996;  Richardson  et  al.  2005). 

Second,  this  study  contributes  to  research  examining  the  relation  between  external  financing 
activities,  accruals,  and  future  returns.  Recent  studies  examining  the  relation  between  these  three 
variables  are  mixed.  Cohen  and  Lys  (2006)  and  Dechow  et  al.  (2008)  suggest  that  the  negative 
relation  between  future  returns  and  external  financing  activities  is  essentially  subsumed  by  the 
accrual  anomaly,  while  Daniel  and  Titman  (2006)  show  that  their  composite  issuance  variable 
subsumes  any  relation  between  future  returns  and  accounting-based  performance  measures.  The 
empirical  evidence  reported  in  this  study  serves  to  link  these  two  disparate  findings.  Specifically, 
this  study  highlights  an  economically  significant  negative  relation  between  future  returns  and 
equity  issuance/repurchase  activity  and  intangible  returns  while  finding  no  significant  association 
between  future  returns  and  accruals. 

Third,  this  study  employs  an  empirical  design  that  addresses  some  of  the  concerns  raised  by 
Kraft  et  al.  (2006,  2007)  relating  to  accrual  anomaly  studies.  Specifically,  Kraft  et  al.  (2007)  show 
how  the  exclusion  of  variables  correlated  with  accruals  can  affect  inferences  with  future  returns 
from  Mishkin  (1983)  pricing  tests.  The  empirical  design  used  in  this  study  endogenizes  the  pricing 
of  all  prior  and  current  period  information.  This  empirical  design  structure  is  important  for  two 
reasons.  First,  it  mitigates  the  concern  that  significant  relations  between  explanatory  variables  and 
future  returns  are  caused  by  the  omission  of  relevant  variables.  Second,  the  inferences  drawn 
between  future  period  returns  and  current  period  accruals  are  not  contingent  on  the  rational  pricing 
of  all  information.  Despite  the  agnostic  stance  on  rational  pricing  this  empirical  design  allows,  the 
design  still  allows  for  inferences  to  be  drawn  about  whether  accruals  are  properly  incorporated 
into  price. 

Fourth,  this  study  provides  a  link  between  the  value/growth  anomaly  and  the  accrual  anomaly. 
Although  the  theories  used  to  explain  both  anomalies  are  similar,  these  anomalies  have  generally 
been  considered  distinct.  This  study  extends  the  idea  of  Beaver  (2002)  and  Desai  et  al.  (2004)  who 
suggest  that  the  accrual  anomaly  may  be  a  function  of  the  value/growth  anomaly,  providing  the 
economic  intuition  and  empirical  evidence  that  shows  these  two  anomalies  are  likely  driven  by  the 
same  underlying  construct— specifically,  intangible  returns. 
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Section  II  introduces  the  empirical  design  and  the  economic  intuition  supporting  it.  Section  III 
provides  empirical  evidence  examining  the  relation  between  PPIR,  current  period  accruals,  and 
future  returns.  Sections  IV  and  V  discuss  empirical  evidence  from  additional  robustness  tests. 
Section  VI  provides  concluding  remarks. 

II.  RESEARCH  DESIGN— METHODS  AND  INTUITION 
Research  Motivation  and  Variable  Intuition 

The  value/growth  anomaly  is  characterized  by  the  positive  association  between  future  returns 
and  price-scaled  accounting  variables  such  as  book-to-market  and  eaming-to-price.  One  of  the 
more  prominent  explanations  for  the  value/growth  anomaly,  proposed  by  Lakonishok  et  al.  (1994; 
hereafter,  LSV),  suggests  that  the  future  return  patterns  of  “value”  and  “growth”  firms  are  due  to 
investors  over-extrapolating  realized  accounting  measures  of  growth.  LSV  suggest  that  investors 
over-react  to  realized  earnings  and  sales,  bidding  the  price  of  firms  with  high  past  accounting 
growth  too  high.  In  subsequent  periods  when  investor  expectations  of  sales  and  earnings  are  not 
met,  the  book-to-market  ratios  of  “growth”  firms  increase  as  investors  reassess  their  earnings 
expectations  and  bid  prices  down. 

In  a  recent  study,  Daniel  and  Titman  (2006)  provide  empirical  evidence  countering  the  claims 
of  LSV.  DT  show  that  past  accounting  growth,  when  measured  on  a  per-share  basis,  is  unrelated 
to  future  return  patterns.  Further,  DT  show  that  the  future  return  pattern  LSV  associate  with 
five-year  past  accounting  growth  is  actually  due  to  information  that  is  unrelated  (i.e.,  orthogonal) 
to  past  accounting  returns.  This  information,  tagged  intangible  returns  by  DT,  is  strongly  associ¬ 
ated  with  future  returns  over  a  five-year  period  and  can  be  viewed  as  a  proxy  for  changes  in 
investor  expectations  of  future  earnings  and/or  changes  in  firm  risk. 

The  empirical  evidence  of  DT  raises  the  question  as  to  whether  inferences  between  other 
accounting  measures  associated  with  firm  growth  and  future  returns  are  also  affected  by  intangible 
returns  and/or  equity  issuance  activity.  Of  particular  interest  to  the  accounting  researchers  over  the 
past  15  years  is  the  negative  relation  between  future  period  returns  and  current  period  accruals, 
commonly  known  as  the  accrual  anomaly.  The  primary  intuition  used  to  explain  the  accrual 
anomaly  is  similar  in  many  respects  to  that  proposed  by  LSV.  Specifically,  firms  with  high  (low) 
current  accruals  tend  to  experience  lower  (higher)  future  earnings  and,  because  investors  do  not 
understand  this  relation,  lower  (higher)  returns  are  realized  in  future  periods.  However,  while  the 
intuition  used  to  explain  the  future  return  pattern  of  the  value/growth  anomaly  is  similar  to  that 
used  to  explain  the  accrual  anomaly,  the  empirical  patterns  are  distinct.  Whereas  the  value/growth 
anomaly  is  relatively  long  in  horizon  (up  to  five  years),  the  accrual  anomaly  is  relatively^  short, 
with  current  period  accruals  only  explaining  future  return  variation  for  about  one  year.  In  addition, 
prior  accrual  studies  show  that  simple  controls  for  value/growth  characteristics  (B/M)  or  risk 
factors  do  not  explain  the  accrual  anomaly.  Accordingly,  with  the  exception  of  Desai  et  al.  (2004), 
most  studies  have  viewed  the  value/growth  anomaly  and  the  accrual  anomaly  as  distinct. 

Examining  the  evolution  of  B/M  offers  an  alternative  and  possible  improvement  to  the  two- 
period  research  designs  used  in  prior  accrual  studies,  while  examining  possible  common  links 
between  the  accrual  anomaly  and  value/growth  anomaly.  By  construction,  B/M  represents  the 
intersection  of  four  variables:  (1)  retained  earnings,  (2)  equity  investment,  (3)  expected  future 
earnings,  and  (4)  expected  returns.  Accordingly,  changes  in  B/M  capture  changes  in  at  least  one  of 
these  elements.  While  the  accounting  literature  has  focused  on  understanding  the  relation  between 
accrual-based  changes  in  retained  earnings  and  future  returns,  the  finance  literature  has  focused  on 
understanding  the  relations  between  changes  in  the  other  three  elements  of  B/M  and  future  returns. 
For  example,  equity  issuance  and  repurchase  activity  affect  the  equity  base  of  the  firm,  and  prior 
studies  have  shown  that  these  activities  are  associated  with  future  returns  for  up  to  five  years 
(Loughran  and  Ritter  1995;  Ikenberry  et  al.  1995).  Further,  the  primary  argument  in  the  value/ 
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growth  literature  is  whether  the  positive  association  between  future  returns  and  price-scaled  ac¬ 
counting  variables  is  due  to  reversals  of  irrational  future  earnings  expectations  formed  in  prior 
periods  (LSV;  Barberis  et  al.  1998)  or  rational  shifts  in  risk  (Fama  and  French  1992,  1995). 

Prior  accrual  studies  have  not  necessarily  ignored  the  effects  that  changes  in  the  other  ele¬ 
ments  of  B/M  have  on  inferences  between  future  returns  and  accruals.  Most  studies  use  B/M  as  a 
control  variable  for  characteristics  associated  with  firm  growth  and  expected  returns.  However,  it 
is  possible  that  B/M  is  too  noisy  of  a  variable  to  control  for  effects  on  future  returns  due  to 
changes  in  equity  issuance,  changes  in  expected  earnings,  and  shifts  in  risk  (Fama  and  French 
2008).  To  illustrate,  consider  a  firm  that  experiences  a  positive  shock  to  expected  cash  flows  in 
period  one,  the  period  prior  to  the  recognition  of  abnormally  large  accruals.  Due  to  the  discrete, 
conservative  nature  of  accounting,  the  shock  to  future  cash  flows  will  tend  to  be  reflected  in 
market  value  earlier  than  book  value,  especially  for  positive  shocks,  and  B/M  will  be  lower  at  the 
end  of  period  one.^  In  the  subsequent  period,  period  two,  managers  capitalize  on  these  positive 
NPV  projects  and  expand  productive  capacity.  To  the  extent  that  managers  do  not  fund  these 
projects  with  existing  cash  or  debt  facilities,  managers  may  issue  equity.  With  this  decision  to 
expand  productive  capacity,  positive  accruals  are  recognized.  Since  accruals  are  just  one  element 
of  the  four  comprising  B/M,  properly  controlling  for  changes  in  the  other  three  elements  is 
necessary  to  distill  the  incremental  future  return  variation  that  is  explained  by  accruals  from  that 
explained  by  changes  in  the  other  B/M  elements  occurring  in  periods  contemporaneous  with,  and 
prior  to,  the  recognition  of  accruals. 

A  decomposition  of  current  B/M  into  different  components  offers  a  structural  way  to  define 
distinct  variables  that  capture  changes  in  each  of  these  elements.  In  log  form,  the  book-to-market 
ratio  at  time  t  is  equal  to  the  book-to-market  ratio  at  time  t-r  plus  log  change  in  book  value, 
minus  log  change  in  market  value. 

log(5j/M,)  =  bm^  =  bmi_T+  log(B/5,_7.)  - 

=  bmt.,,+  Alog(5)(_^,,  -  Alog(M)j_^,,.  (1) 


Log  change  variables  have  convenient  additive  properties  that  allow  a  cumulative  multi-period 
change  variable  to  be  decomposed  into  multiple  single-period  change  variables.  Using  this  prop¬ 
erty,  bm^  can  be  decomposed  into  the  following  five  year  decomposition: 


bruj  =  -I-  log  I 
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^  This  tendency  is  documented  by  Kothari  et  al.  (2006,  Table  2).  Their  evidence  shows  that  firms  classified  as  high-accmid 
firms  tend  to  experience  increasingly  larger  abnormal  returns  in  the  years  leading  up  to  classification  as  a  high-accrual 
firm.  Complementing  this  evidence,  they  show  that  firms  classified  as  low-accrual  firms  tend  to  expenence  increasingly 
negative  abnormal  returns  in  the  years  leading  up  to  classification  as  a  low-accrual  firm,  although  the  absolute  magm- 
tude  of  the  returns  is  smaller  relative  to  the  high-accrual  firms. 
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The  above  decomposition  (3)  shows  that  btrii  is  equal  to  ^m,_5  plus  several  sub-period,  accounting- 
based  log  changes  in  value,  minus  a  cumulative  market-based  log  change.  Note  that  logarithmic 
properties  allow  me  to  decompose  log  change  in  book  value  between  t-l  and  t  into  the  accrual 
and  non-accrual  component  of  log  book  growth.  The  definition  of  TACC  is  consistent  to  accruals 
as  described  in  Feltham  and  Ohlson  (1995)  and  with  total  accrual  variables  used  in  Richardson  et 
al.  (2005,  2006),  Dechow  et  al.  (2008),  and  other  recent  studies  examining  total  accruals.^’^ 
N_TACC  is  mathematically  equivalent  to  Alog(5),_j A\og{NOA)f_i  /.  Assuming  the  firm  has  no 
equity  issuances/repurchases  between  r-1  and  t,  change  in  book  value  equals  net  income,  and 
N_TACC  can  be  interpreted  in  a  manner  similar  to  free  cash  flow,  the  cash  component  of  earnings 
that  Dechow  et  al.  (2008)  suggest  investors  also  misprice.  For  robustness  purposes,  alternative 
accrual  and  non-accrual  definitions  are  examined  in  later  sections. 

Log  Returns  versus  Log  Changes 

The  log-linear  decomposition  of  the  book-to-market  ratio  is  useful  because  it  gives  simple 
variables  that  can  proxy  for  the  dynamics  of  B/M  evolution  discussed  above.  Consistent  with  DT, 
I  interpret  changes  in  accounting  values  as  representing  tangible  information  (e.g.,  TACC, 
N_TACC) .  To  proxy  for  intangible  information,  or  information  not  captured  by  summary  account¬ 
ing  measures,  I  need  to  construct  a  variable  that  represents  information  used  by  investors  that  is 
unrelated  to  accounting  information.  To  do  this,  I  regress  realized  log  market  returns  over  a  given 
period  of  time  on  tangible  return  components  of  B/M  over  that  same  time  period.  Assuming  log 
market  return  is  a  good  proxy  for  total  value-relevant  information,  I  interpret  the  fitted  value  from 
this  regression  as  tangible  information  and  the  residual  as  intangible  information. 

Before  running  these  regressions,  log  change  values  defined  in  Equation  (3)  above  need  to  be 
transformed  into  log  return  values.  Assuming  there  are  no  equity  issues,  dividends,  or  other 
corporate  actions  affecting  shareholder’s  equity,  log  change  in  market  value  between  t— rand  t  is 
equal  to  the  log  return  an  investor  earns  between  t-rand  t  (i.e.,  Alog(M),_^,  =  However, 

as  the  time  between  t— rand  t  increases,  Alog(M),_^(  will  generally  not  equal  To  convert 

log  change  in  market  value  into  a  log  return  metric,  I  introduce  the  variable  This  com¬ 
posite  equity  issuance  variable,  is  defined  consistent  with  the  composite  equity  issuance 

variable  of  DT  and  represents  the  difference  between  log  market  return  and  total  log  change 
in  market  value.  More  specifically: 

=  Alog(M),_^,,  -  (4) 

The  isSi_ji  variable  can  be  used  to  convert  Alog(R),_.j.,  into  a  log  return  measure.  For  example,  just 
as  —  —  a  log  book  return  variable  can  be  constructed  as 

—  ^®§(  A'®)/-r,i  —  The  variable  ret  gives  the  log  book  value  at  t  of  a  $1  investment  in 

book  equity  made  at  t-r.  Since  is  equal  to  Alog(5),_,^  minus  iss,_,„  the  one-year  log 

book  return  between  t-l  and  t  can  be  defined  as  follows: 

=  Alog(5),_, rACC  +  /V_7ACC-  (5) 


Richardson  et  al.  (2005,  2006)  argue  that  change  in  net  operating  assets  offers  a  more  complete  measure  of  accruals.  For 
^  robustness,  this  study  also  considers  the  more  common  working  capital  accruals  in  Table  4. 

Prior  studies  have  generally  defined  accruals  as  the  linear  change  in  accrual  accounts,  scaled  by  total  assets.  This  study 
defines  accruals  as  the  log  change  in  the  accrual  account  (e.g.,  Alog(A?OA),_,  „  Mog{Working  Capital), Conceptu¬ 
ally,  both  types  of  accrual  definitions  capture  the  dynamics  associated  with  one-year  growth  in  net  operating  assets  or 
net  working  capital.  However,  the  logarithmic  definition  can  be  interpreted  as  the  log  return  earned  by  equity  holders 
(i.e.,  the  accrual  component  of  earnings)  once  equity  issuance  is  controlled.  For  robustness  purposes,  the  linear  variable 
is  directly  examined  in  Table  5. 
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As  Equation  (5)  reflects,  the  log  book  return  between  t-\  and  t  can  be  interpreted  as  the  sum  of 
three  log  return  components;  total  accruals  iTACC),  non-accrual  growth  {N_TACC),  and  equity 
issuance.  When  these  three  components  are  used  as  explanatory  variables  of  future  monthly 
returns,  the  coefficient  on  TACC  will  represent  the  expected  average  monthly  return  in  period 
t+l  for  an  investor  who  realized  $1  of  log  accrual  return  between  period  t-1  and  t,  holding 
constant  log  book  return  attributable  to  N_TACC  and  Alternatively,  the  coefficient  on  the 

variable  will  capture  the  expected  average  monthly  return  in  period 
1+1  for  an  investor  who  invested  $1  in  the  new  equity  issued  at  t-1.^ 

Intangible  Value  Change 

Once  log  change  in  book  value  (Alog(5),_^j)  is  transformed  into  a  log  book  return  {ret^i_^J), 
I  construct  a  proxy  for  intangible  information  by  employing  a  simple  process  of  estimating  cross- 
sectional  annual  regressions,  defining  the  residuals  from  each  regression  as  intangible  returns. 
More  specifically,  to  compute  the  change  in  value  due  to  intangible  information  over  the  one  year 
current  period  (t-1,  t)  and  the  four  year  prior  period  (t-5,  t-1),  I  regress  the  realized  log  market 
returns  inclusive  of  dividends,  ret™-  on  and  the  components  of  the  ret^ 

same  period  of  time  (t— r,  t). 

=  To  +  +  JiTACCi  -t  y^N_TACCi  -l-  yAss^^{t~\,t)  +  u-i/,  (6a) 

=  To  +  y\bmat-5)  +  r2Alog(5),- -I-  y^isSi^(t-s,t-\)  +  “tr-i  •  (6b) 

Consistent  with  DT,  I  define  the  return  due  to  tangible  information  as  the  fitted  component  and 
intangible  information  as  the  residual  from  the  above  regressions: 

Ui  f  =  CPIR  =  current  period  intangible  return;  and 
Ui  t_i  =  PPIR  =  prior  period  intangible  return. 

By  construction,  total  log  return  is  equal  to  the  sum  of  tangible  returns  plus  intangible  returns.  “ 

III.  RESULTS 

Data  Definitions  and  Construction 

My  study  examines  the  relation  between  future  returns  and  accruals  in  firms  with  fiscal 
year-ends  between  1/31/67  and  12/31/06.  To  be  included  in  the  sample,  firms  must  appear  on  the 
CRSP/Compustat  merged  database  and  have  positive  book  value  of  equity  at  fiscal  year-end  for 


^  There  are  two  alternative  methods  for  transforming  total  log  change  in  book  value  into  ret* First,  ret'’,.^,  can  be 
computed  using  a  share  adjustment  factor  as  noted  in  the  CRSP/Compustat  data  manual.  Second,  can  be  com¬ 
puted  as  a  “plug:”  =  bm,-  bm,_^-  Alog{B),_^+ 

®  An  important  subtlety  to  note  is  that  TACC  and  N_TACC  can  be  interpreted  as  accrual  and  non-accrual  components  of 
earnings  in  multivariate  settings  that  include  Variation  in  Alog(S),_i,,  is  explained  by  earnings  for  that  period 

once  equity  issuance  and  repurchase  activity  is  controlled.  Hence,  in  multivariate  settings  that  include  iss,_i  t,  TACC  can 
be  viewed  as  the  total  accrual  component  of  earnings. 

Fitted  values  for  6a  and  6b  explain  17  percent  and  40  percent  of  the  cross-sectional  variation  of  reC',_i ,  and 
respectively. 

“  Another  interpretation  of  is  that  this  variable  captures  elements  of  intangible  information  not  captured  by  PPIR  or 
CPIR.  By  construction,  the  intangible  returns  defined  above  (CPIR  and  PPIR)  only  capture  the  intangible  information 
that  investors  use  to  price  securities.  Accordingly,  it  is  reasonable  to  assume  that  a  manager’s  intangible  information  is 
more  precise  and  extensive  than  the  empirical  proxies  defined  above.  Prior  literature  (Hovakimian  et  al.  (2001))  suggest 
that  firms  are  more  likely  to  issue  (repurchase)  equity  following  periods  when  stock  prices  increase  (decrease)  more 
dramatically  than  earnings.  Accordingly,  can  be  interpreted  as  an  additional  (and  possibly  independent)  measure 

of  intangible  information. 
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years  t-5,  t-\,  and  t.  However,  firms  with  negative  book  values  in  intermediate  years  are  not 
excluded.  In  addition,  firms  must  have  positive  net  operating  asset  balances  at  fiscal  year-end  of 
years  t- 1 ,  and  t.  Additional  accounting  measures  used  later  in  robustness  tests  affect  the  sample 
sizes  used  in  those  robustness  tests  accordingly,  although  not  significantly.  I  exclude  financial 
firms  (SIC  6000-6999),  utilities  (SIC  4900-4999),  firms  with  stock  prices  on  CRSP  of  less  than 
$5  per  share  as  of  the  last  day  of  trading  in  June  of  year  t+ 1 ,  and  firms  without  a  complete  series 
of  (60)  monthly  returns  between  fiscal  year-end  t-5  and  fiscal  year-end  t.  The  future  return  series 
begins  on  July  1,  t+1  and  delisting  returns  are  used,  when  necessary,  to  compute  annual  portfolio 
returns. I  also  require  that  the  firm  have  a  valid  price  and  shares  outstanding  as  of  the  last  day 
of  trading  in  fiscal  year  t-5,  t-l,  t  and  the  last  day  of  trading  in  June  of  year  t+\.  From  this  final 
set,  I  annually  trim  (0.5,  99.5)  the  tangible  return  components  (TACC,  WCACC,  N_TACC,  and 
N_WCACC  variables).  Consistent  with  prior  studies,  scaled  linear  change  variables  used  in  later 
robustness  tests  are  annually  winsorized  (0.5,  99.5).*^  Returns  are  from  CRSR  All  other  variables 
are  from  the  CRSP/Compustat  merged  database. 

A  firm’s  log  book-to-market  ratio  at  time  t  {bm,)  is  computed  as  the  log  book  value  of  the  firm 
at  fiscal  year-end  of  year  t  minus  the  log  market  value  of  the  firm  as  of  the  last  day  of  trading  in 
fiscal  year  t  as  reported  in  CRSP/Compustat  merged  database  (data25  *  data  199).  All  log  change 
variables  are  defined  with  two  subscripts:  the  first  subscript  notes  the  beginning  of  the  change 
period  and  the  second  subscript  notes  the  end  of  the  change  period.  For  example,  Alog(R),_7.,  is 
computed  as  log  book  value  as  of  the  end  of  fiscal  year  t  minus  the  log  book  value  as  of  the  end 
of  fiscal  year  t-T.  Similarly,  log  return  measures  (e.g.,  are  italicized  and  defined  with  two 

subscripts,  with  the  first  subscript  noting  the  beginning  of  the  return  period  and  the  second 
subscript  noting  the  end. 

Data  Summary 

Table  1  reports  summary  statistics.  These  statistics  show  that  accruals  appear  to  be  a  relatively 
symmetric  variable,  although  there  is  some  right  skewness.  The  skewness  is  consistent  with  the 
average  firm  growing  its  balance  sheet  over  time.  Given  the  data  requirements  of  a  trailing 
five-year  return  series,  my  sample  is  skewed  toward  larger,  more  mature  firms.  However,  the 
standard  deviation  of  the  variables  suggests  considerable  variation,  despite  their  relative  homoge¬ 
neous  size  characteristic. 

Table  2  reports  pairwise  correlation  coefficients.  Overall,  the  univariate  correlations  support 
my  hypothesis  that  PPIR  affects  inferences  of  the  relation  between  future  returns  and  current 
period  accruals.  First,  consistent  with  the  prior  accrual  literature,  each  accrual  measure  is  nega¬ 
tively  correlated  with  bm,.  Second,  TACC  is  more  strongly  correlated  with  PPIR  than  working 
capital  accruals  {WCACC).  Further  the  relation  between  accrual  variables  and  PPIR  is  stronger 
than  the  relation  between  the  respective  accrual  variables  and  bm,,  suggesting  that  bm,  may  be  an 
inadequate  control  for  prior  period  intangible  returns.  Finally,  future  returns  (Ret)  are  negatively 
related  to  each  accrual  measure  as  well  as  PPIR  and  both  issuance  variables.  Given  the  strong 
positive  association  among  the  accrual  measures,  PPIR,  and  the  issuance  variables,  the  return 


Size-adjusted  returns  are  computed  as  the  difference  between  the  raw  return  as  reported  on  CRSP  and  the  size-matched 
return  reported  on  the  CRSP  market  capitalization  decile  return  file  (crsp.ermportl). 

^  Consistent  with  DT,  I  begin  the  future  return  series  in  July  of  year  t+ 1  for  all  firms,  regardless  of  the  month  in  year  t  that 
their  fiscal  year-ends.  Results  are  qualitatively  identical  when  only  firms  with  December  fiscal  year-ends  are  included. 
If  a  monthly  return  is  missing  during  a  return  period  because  there  is  no  price  on  CRSP  (a  code  of  “.B”  in  the  WRDS 
SAS  data  set),  the  return  is  set  to  zero. 

Prior  accrual  studies  have  primarily  defined  accruals  as  a  linear  difference,  scaled  by  average  total  assets  (e.g.,  NOA, 
-NOA,_,,  divided  by  average  total  assets).  To  be  consistent  with  these  studies,  1  winsorize  linear  difference  variables, 
although  results  are  not  overly  sensitive  to  this  treatment. 
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TABLE  1 

Summary  Statistics 


BM 

ShrsOut 

Price 

ME 

TACC 

WCACC 

Ret 

For  All  61,241  Firms 
Mean  0.795 

55.35 

25.76 

2,125.7 

0.10 

0.08 

-0.0001 

0.16 

Std.  Dev. 

0.596 

250.80 

27.84 

11,221.3 

0.33 

0.40 

0.1180 

0.57 

Q25 

0.402 

4.05 

11.38 

56.47 

-0.05 

-0.08 

-0.0379 

-0.14 

Q50 

0.644 

11.15 

19.75 

216.41 

0.07 

0.08 

-0.0106 

0.08 

Q75 

1.011 

33.98 

33.00 

905.5 

0.22 

0.24 

0.0122 

0.35 

For  1968: 

Mean 

633  Firms 
0.604 

12.03 

42.75 

691.0 

0.15 

0.16 

-0.0010 

-0.01 

Std.  Dev. 

0.391 

30.39 

29.45 

2,438.4 

0.21 

0.33 

0.0871 

0.33 

Min. 

0.043 

0.11 

5.25 

3.71 

-0.50 

-1.00 

-0.5583 

-0.62 

Max 

3.919 

541.19 

353.75 

39,700.7 

1.35 

2.02 

0.9316 

4.05 

For  1977: 

Mean 

1,179  Firms 
1.142 

14.93 

22.88 

538.6 

0.02 

0.04 

-0.0233 

0.21 

Std.  Dev. 

0.637 

33.78 

\1M 

2,098.6 

0.27 

0.32 

0.0671 

0.35 

Min. 

0.164 

0.31 

5.00 

1.57 

-1.33 

-1.83 

-0.6398 

-1.00 

Max 

5.838 

615.53 

264.0 

39,213.2 

1.47 

1.72 

0.5913 

1.83 

For  1978: 
Mean 

1,457  Firms 
1.168 

13.35 

22.56 

443.4 

0.12 

0.12 

-0.0238 

0.21 

Std.  Dev. 

0.665 

32.13 

16.12 

1,791.6 

0.26 

0.32 

0.0724 

0.39 

Min. 

0.129 

0.25 

5.00 

3.00 

-1.44 

-1.91 

-0.9746 

-1.00 

Max 

8.060 

652.88 

257.25 

39,091.4 

1.35 

2.06 

0.7549 

4.28 

For  2000: 

Mean 

1,811  Firms 
0.654 

113.82 

25.80 

5,130.2 

0.14 

0.10 

0.0043 

0.14 

Std.  Dev. 

0.491 

432.09 

27.79 

24,959.4 

0.38 

0.43 

0.1438 

0.63 

Min. 

0.006 

0.01 

5.00 

0.07 

-1.35 

-2.18 

-1.96 

-1.00 

Max 

3.710 

9,893.43 

520.00 

524,351.6 

2.40 

2.19 

1.51 

6.52 

This  table  reports  the  mean,  standard  deviation,  lower  quartile,  median,  upper  quartile,  minimum,  and  maximum  for  the 
primary  sample  of  firm-year  observations  for  firms  with  fiscal  years  ending  t  (1/31/67  and  12/31/06).  BM  is  the  book-to- 
market  ratio  computed  as  book  value  of  common  equity  as  of  fiscal  year-end  scaled  by  market  value  of  equity  as  of  fiscal 
year-end  (data25  *  data  199).  ShrsOut  is  equal  to  the  number  of  shdres  outstanding  (in  millions).  Price  is  equal  to  the  price 
per  share.  ME  is  equal  to  market  value  of  equity.  TACC  is  equal  to  total  accruals  computed  as  log{NOA,  /  NOA,_d.  WCACC 
is  equal  to  working  capital  accruals  computed  as  log(VkQ/  WC,_{).  WC,  is  computed  as  current  assets  (data4)  minus  cash 
and  short-term  investments  (datal)  minus  current  liabilities  (dataS)  plus  debt  in  current  liabilities  (data34)  at  fiscal  year-end 
t.  Specifically,  working  capital  equals  (data4  —  datal),— (data5  —  data34),.  is  a  composite  equity  issuance  variable 

computed  as  the  difference  between  log  change  in  the  market  value  of  equity  between  fiscal  year-ends  t-l  and  t  and  the 
log  market  return  (inclusive  of  dividends)  between  fiscal  year-ends  t—\  and  t.  Specifically,  isSf_i  [  — log{MEd  MEi_i} 
-  Ret  equals  the  buy-and-hold  return,  inclusive  of  distributions,  between  July  1  of  year  t+l  to  June  30  of  year  t+2. 

See  the  Appendix  for  the  Compustat  data  items  that  comprise  book  value  of  common  equity  and  NOA.  ME,  ShrsOut,  and 
Price  are  per  CRSP  and  as  of  last  day  of  trading  in  June,  f+1.  BM,  TACC,  WCACC,  and  are  of  fiscal  year-end  t  and 
are  per  the  merged  CRSP/Compustat  database. 


reversal  that  characterizes  the  accrual  anomaly  may  be  due  to  investor  reaction  to  information  not 
measured  by  accruals,  but  correlated  with  the  recognition  of  accruals. 

Regression  Results 

The  null  hypothesis  of  this  study  is  that  future  period  returns  are  unrelated  to  realized  ac¬ 
counting  values  such  as  accruals,  consistent  with  the  tenets  of  market  efficiency  and  risk-neutral 
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TABLE  2 

Full  Sample  Correlation  Statistics 


bm, 

PPIR 

N_TACC 

TACC 

WCACC 

Ret 

bm. 

1.000 

-0.587 

-0.146 

-0.155 

-0.003 

-0.123 

-0.106 

0.086 

PPIR 

-0.578 

1.000 

0 

0.052 

0.024 

0.188 

0.151 

-0.027 

-0.209 

0.031 

1.000 

0.208 

-0.002 

0.089 

0.071 

-0.049 

-0.284 

0.107 

0.470 

1.000 

0.149 

0.206 

0.155 

-0.036 

N_TACC 

-0.023 

-0.008 

0.001 

0.140 

1.000 

-0.738 

-0.291 

0.028 

TACC 

-0.160 

0.229 

0.106 

0.159 

-0.703 

1.000 

0.499 

-0.034 

WCACC 

-0.149 

0.189 

0.099 

0.163 

-0.403 

0.641 

1.000 

-0.010 

Ret 

0.102 

-0.023 

-0.100 

-0.096 

0.040 

-0.041 

-0.023 

1.000 

Correlations  significant  at  less  than  0.01  level  are  bolded. 

This  table  reports  full  sample  univariate  correlations  for  log-linear  return  components  of  the  book-to-market  ratio  with 
Pearson  (Spearman)  coefficients  reported  above  (below)  the  diagonal.  Sample  includes  firms  with  fiscal  year-ends  between 
1/31/67  and  12/31/06.  bm,  equals  the  log  book-to-market  ratio  as  of  fiscal  year-end  t  with  book  value  of  common  equity 
and  market  value  of  equity  as  of  fiscal  year-end  t.  PPIR  is  equal  to  prior  period  intangible  (log)  return,  defined  as  the 
residual  from  annual  regressions  of  log  market  return,  on  log(B,_^  I  and  few, .5.  is  a 

composite  equity  issuance  variable  computed  as  the  difference  between  log  change  in  the  market  value  of  equity  between 
fiscal  year-ends  t-rand  t  and  the  log  market  return  (inclusive  of  dividends)  between  fiscal  year-ends  t-rand  t.  N_TACC 
is  equal  to  log{{B,  I NOA,)  /  {B,_^l  NOA,_{)).  TACC  is  equal  to  total  accruals  computed  as  log{NOA,l  NOA,_^).  WCACC  is 
equal  to  working  capital  accruals  computed  as  log{WCACC,l  WCACC, _i).  Ret  equals  the  buy-and-hold  return,  inclusive  of 
distributions,  between  July  1  of  year  t-i-1  to  June  30  of  year  r-i-2.  See  the  Appendix  for  the  Compustat  data  items  that 
comprise  book  value  of  common  equity  and  NOA. 


investors  properly  incorporating  the  pricing  implications  of  accruals  into  price.  However,  if  future 
period  returns  are  significantly  related  to  current  period  accruals,  and  this  relation  is  not  explained 
by  risk  or  other  correlated  corporate  activities  such  as  equity  issuance,  then  the  null  hypothesis  can 
be  rejected.  To  examine  whether  accruals  are  negatively  associated  with  future  returns  after  prior 
period  intangible  returns  are  controlled,  I  perform  incremental  regressions  that  reconcile  my  re¬ 
sults  to  prior  accrual  literature  and  link  the  accrual  anomaly  to  the  value/growth  anomaly.  I  begin 
by  establishing  the  negative  relation  between  current  period  accruals  and  future  returns.  I  then 
introduce  other  tangible  and  intangible  components  of  total  market  return  over  the  current  period. 
These  variables  complete  the  one-year  decomposition  of  the  bm,  and  largely  confirm  inferences 
from  prior  accrual  literature.  Finally,  I  complete  the  five-year  decomposition  of  bm,  and  provide 
evidence  that  future  return  reversal  is  not  associated  with  current  period  accruals,  but  rather  the 
intangible  returns  that  precede  accrual  recognition.  Further,  the  five-year  decomposition  of  bm, 
provides  evidence  that  the  accrual  anomaly  is  a  function  of  the  value/growth  anomaly. 

Table  3  reflects  the  results  of  Fama-MacBeth  regressions  of  raw  monthly  returns  on  compo¬ 
nents  of  log  book-to-market.*^  Regression  1  of  Table  3  shows  that,  in  a  univariate  specification, 
total  accruals  {TACC)  are  strongly  negatively  associated  with  future  returns,  consistent  with  prior 
studies.  Regression  2  shows  that  the  negative  relation  holds  after  controlling  for  non-accrual 
growth  {N_TACC).  Regression  3  shows  that,  even  after  controlling  for  equity  issuance  and  non¬ 
accrual  growth,  there  is  still  a  strong  negative  association  between  TACC  and  future  returns. 

Regression  4  includes  current  period  intangible  returns  {CPIR)  as  an  additional  explanatory 


Inferences  from  empirical  analysis  are  qualitatively  the  same  if  size-adjusted  returns  are  examined. 
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Resutek 


variable.'^  The  significant  negative  relation  between  future  period  returns  and  CPIR  suggests  that, 
in  expectation,  current  period  returns  are  not  a  strict  function  of  current  period  changes  in  book 
value.  This  finding  is  important  because  many  accrual  studies  that  adopt  the  Mishkin  model 
implicitly  assume  that,  in  expectation,  only  earnings  and  earnings  components  explain  future 
returns.  As  Kraft  et  al.  (2007)  show,  to  the  extent  this  assumption  is  incorrect  and  the  omitted 
explanatory  variable  is  mispriced,  incorrect  inferences  can  result  regarding  the  relation  between 
future  period  returns  and  current  period  accruals.'^ 

Subsequent  specifications  (5-7)  introduce  tangible  return  components  from  the  four-year  pe¬ 
riod  prior  to  accrual  recognition.  Specifically,  regressions  5-7  introduce  prior  period  equity  issu¬ 
ance  prior  period  log  book  growth  (PPBG),  and  long-horizon  expectations  {bm,_^). 

Inferences  on  these  additional  explanatory  variables  are  consistent  with  those  of  DT.  More  im¬ 
portantly,  even  with  the  current  and  prior  period  log  returns  controlled,  there  is  still  an  economi¬ 
cally  and  statistically  strong  negative  relation  between  future  returns  and  current  period  accruals. 
However,  once  PPIR  is  introduced  as  an  explanatory  variable  in  regression  8,  completing  the 
log-linear  return  decomposition  of  accruals  are  no  longer  associated  with  future  returns  at 
conventional  levels  of  significance. 

Building  off  regression  4,  which  shows  that  TACC  are  associated  with  future  returns  after 
current  period  tangible  and  intangible  returns  are  controlled,  regression  9  adds  PPIR  as  a  control 
variable.  Regression  9  shows  that,  after  PPIR  is  controlled,  TACC  is  unrelated  with  future  returns. 
This  finding  implies  that  PPIR  is  a  powerful  control  variable  that  can  distort  inferences  of  current 
period  accounting  growth  metrics  and  future  returns  when  omitted. 

IV.  ALTERNATIVE  VARIABLE  SPECIFICATIONS  AND  ROBUSTNESS  TESTS 
Alternative  Accrual  Definitions 

Table  4  examines  other  measures  of  accruals  to  determine  whether  the  exclusion  of  prior 
period  intangible  returns  in  prior  studies  explains  the  relation  between  accruals  and  future  returns. 
The  first  set  of  regressions  in  Table  4,  Panel  A  examines  the  relation  between  the  components  of 
total  accruals  suggested  by  Richardson  et  al.  (2006)  and  future  returns.  Richardson  et  al.  (2006) 
linearly  decompose  total  accruals  into  three  parts:  change  in  sales  {ASales),  change  in  asset 
turnover  {AATO),  and  an  interaction  between  ASales  and  AATO.  Using  a  log-linear  decomposition 
avoids  the  complicating  interaction  term,  yielding  log  total  accruals  {TACC)  as  the  log  sum  of 
A\o%{Sales)i_x  I  and  Alog(ArO),_i 

Consistent  with  the  intuition  of  Richardson  et  al.  (2006),  Table  4,  Panel  A  shows  a  strong 
statistical  and  economic  relation  between  both  A\og{Sales)i_i ,  and  Alog(A7’0),_, ,  and  future 
returns.'^  Subsequent  controls  for  current  and  prior  period  log  return  components  in  regressions  2 
and  3  show  that  Alog(A7’0),_i ,  is  still  significantly  associated  with  future  returns,  although 
Alog{Sales)t^i  t  becomes  insignificant  with  the  introduction  of  However,  once  PPIR  is 

added  as  an  explanatory  variable  in  regression  4,  completing  the  log-linear  decomposition  of  bm„ 
neither  accrual  component  is  associated  with  future  returns. 


Since  some  firms  used  to  compute  CPIR  do  not  have  a  full  series  of  12  future  monthly  returns,  the  coefficients  on  TACC, 
NJTACC,  and  iss,_ij  change  slightly  with  the  introduction  of  CPIR. 

Alternatively,  to  the  extent  that  the  omitted  variable  is  correctly  priced,  but  the  expected  return  model  is  incorrectly 
specified,  incorrect  inferences  can  also  result. 

Richardson  et  al.  (2006)  do  not  directly  examine  the  relation  between  future  returns  and  accruals.  However,  the  negative 
relation  between  future  returns  and  accrual  components  reflected  in  Table  4,  Panel  A  is  consistent  with  the  persistence 
coefficients  reported  in  their  study. 

The  lack  of  a  significant  relation  between  the  “growth”  component  of  accruals,  Alog(Sa/ei),_|  „  and  future  returns  is 
consistent  with  the  results  of  Daniel  and  Titman  (2006). 
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Panel  B  of  Table  4  examines  working  capital  accruals.  Due  to  the  logarithmic  nature  of  this 
variable,  I  omit  depreciation  from  my  working  capital  accrual  variable.  This  omission  will  lead  to 
a  positive  working  capital  accruals  on  average  (see  Table  1),  but  should  not  affect  regression 
inferences.^^  Regression  1  shows  the  telltale  negative  relation  between  VkCACC  and  future  returns. 
However,  this  relation  becomes  insignificant  once  f  is  introduced  into  the  regression  speci¬ 
fication.  Regressions  3-5  reinforce  the  validity  of  the  relations  noted  in  prior  tables;  that  is,  there 
is  a  strong  negative  relation  between  equity  issuance  and  intangible  returns  with  future  returns, 
while  the  accrual  component  of  earnings  is  unrelated  with  future  returns. 

The  relation  between  PPIR  and  current  period  accruals  dovetails  nicely  into  the  intuition  of 
prior  studies  that  explain  the  accrual  anomaly  as  a  function  of  firm  growth  and  investor  misun¬ 
derstanding  of  the  diminishing  returns  on  investment  that  follow  firm  growth.  However,  the 
evidence  in  this  study  suggests  a  different  interpretation  of  the  relation  between  future  returns  and 
current  period  accruals  than  that  proposed  in  prior  studies.  While  investors  may  misunderstand  the 
diminishing  returns  on  investment,  this  misunderstanding  is  not  caused  by  the  information  con¬ 
tained  in  accruals  (or  other  accounting  performance  measures).  Rather,  the  empirical  evidence  of 
this  study  suggests  that  the  return  reversal  that  prior  studies  have  attributed  to  accounting  measures 
of  growth  appears  to  be  due  to  growth  not  captured  by  the  financial  accounting  system,  denoted  as 
intangible  returns. 


Linear  Accrual  Variable 

Prior  literature  has  defined  current  period  accruals  as  a  linear  difference  scaled  by  assets.  For 
example,  total  accruals  have  been  defined  as  the  one-year  change  in  net  operating  assets  scaled  by 
beginning  period  assets. 


{NOA,-NOA,_{) 

TA,^i 


Simple  algebra  shows  that  TACC^'^^^''  can  be  expressed  as  the  product  of  a  beginning  stock 
measure  and  a  one-year  growth  measure: 


TACC 


’linear . 


NOA,_, 

TA,_, 


X 


/  NOA, 

\noa,_^ 


(7) 


Equation  (7)  is  useful  because  it  shows  that  is  similar  to  the  log-linear  accrual  variable 

(TACC)  used  in  this  study.^’  Given  that  is  positive  for  all  observations  in  this  study, 

TACC^'"^"'"  is  a  monotonic  transformation  of  one-year  growth  in  NOA.  Thus,  TACC^'”^"''  should 
capture  the  same  underlying  econonuc  dynamics  as  TACC,  albeit  it  in  a  different  scale.  7ACC'“^“'' 
captures  growth  on  a  linear  basis,  whereas  TACC  captures  growth  on  a  log-linear  basis.  This 
difference  will  cause  TACC^'’^^^’'  values  to  be  more  extreme  than  TACC  values  for  observations 
with  the  most  extreme  positive  NOA  growth.  Accordingly,  differences  in  the  regression  coeffi¬ 
cients  of  TACC‘^"^^''  and  TACC  may  be  due  to  differences  in  their  respective  econometric  proper¬ 
ties,  not  the  economic  dynamics  they  capture. 

Table  5  examines  whether  the  primary  results  of  Table  3  are  sensitive  to  accrual  measurement. 
Consistent  with  the  intuition  used  in  prior  literature  to  explain  the  accrual  anomaly,  regression  1 
establishes  that  the  component  of  TACC''"'”''''  responsible  for  the  strong  negative  relation  with 
future  returns  is  the  one-year  growth  in  NOA  component.  Regression  2  directly  examines  the 
sensitivity  of  the  results  from  Table  3  to  the  accrual  definition  used  by  regressing  future  monthly 


Working  capital  accruals  that  include  depreciation  tend  to  have  negative  values,  which  are  undefined  in  log  form. 
Empirically,  the  Pearson  and  Spearman  correlations  between  TACC  and  TACC"'“‘''  are  (p  =  0.85  and  0.98). 
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returns  on  current  and  prior  period  return  components,  with  substituted  for  TACC.  The 

coefficients  in  regression  2  are  largely  consistent  in  economic  and  statistical  significance  to  those 
reported  in  Table  3,  except  that  TACC^‘"^‘^^  shows  a  significantly  negative  association  with  future 
returns.  Given  that  regression  1  shows  the  negative  relation  between  and  future  period 

returns  is  due  primarily  to  the  one-year  growth  in  NOA,  which  is  insignificant  when  measured 
logarithmically  {TACC),  the  significant  coefficient  on  reflected  in  regression  2  could  be 

due  to  the  way  values  of  are  affected  by  extreme  growth  in  NOA. 

To  examine  whether  the  significant  negative  relation  between  future  returns  and  7ACC^‘”^“''  is 
due  to  economic  differences  or  econometric  differences  between  and  TACC,  1  transform 

TACC^'"^"''  into  percentile  ranks.  Specifically,  1  rank  firms  annually  based  on  7ACC^'”^“''  (0  to  99) 
and  use  these  ranks  as  accrual  explanatory  variables.  This  transformation  reduces  right  skewness 
in  the  accrual  variable,  thereby  reducing  the  leverage  on  the  TACC^'"®"''  regression  coefficients 
caused  by  observations  with  extreme  realizations  of  NOA  growth.  This  transformation  maintains 
the  ordinal  ranking  of  the  variable  and  should  not  affect  inference  differences  caused  by  economic 
differences  between  TACC  and  TACC^'"^'^'".  Regression  3  in  Table  5,  Panel  A  reports  the  results  of 
this  transformation  with  economic  and  statistical  significance  consistent  with  that  reported  in  Table 
3.  For  comparison  to  prior  tables,  regression  4  removes  PPIR  as  an  explanatory  variable  and 
shows  that,  consistent  with  prior  tables,  the  exclusion  of  PPIR  significantly  affects  inferences 
between  accruals  and  future  returns. 

An  alternative  way  to  examine  the  effect  that  extreme  positive  growth  in  NOA  has  on  regres¬ 
sion  inferences  between  future  returns  and  TACC*”^"''  is  to  use  a  least  trimmed  squares  (LTS) 
research  design.  While  OLS  regressions  have  many  desirable  attributes,  they  are  sensitive  to 
leverage  points  (e.g.,  extreme  independent  variable  values).  Given  that  extreme  positive  growth  in 
NOA  will  cause  7ACC^'”®“''  to  lie  disproportionately  on  the  right-hand  side  of  the  x-axis  (relative  to 
TACC),  and  thereby  impart  high  leverage  on  the  regression  coefficients,  I  examine  the  sensitivity 
of  regression  inferences  to  observations  with  extreme  positive  TACC^'"^^''  values.  Specifically,  I 
employ  a  LTS  empirical  design  in  the  spirit  of  Knez  and  Ready  (1997)  and  Kraft  et  al.  (2006)  and 
annually  trim  firms  in  the  100th  percentile  of  one-yeai'  growth  in  NOA.  If  TACC^'”^"''  captures  a 
robust  economic  relation  incremental  to  that  of  TACC,  then  its  relation  with  future  returns  should 
not  be  affected  by  the  omission  of  the  most  extreme  TACC‘‘'^^‘‘''  observations.  However,  if  the 
significant  negative  relation  noted  in  regression  2  of  Table  5  is  econometric  rather  than  economic 
and  caused  by  extreme  values  of  TACC^'”®"'^,  then  removing  extreme  observations  will  have  a 
significant  effect  on  inferences. 

Panels  B  and  C  of  Table  5  report  the  regression  results  based  on  the  LTS  procedure,  with 
Panel  B  trimming  the  most  extreme  TACC^'”^"'’  percentile  (top  1  percent)  and  Panel  C  trimming  the 
top  two  extreme  percentiles  (top  2  percent).  Results  suggest  that  extreme  TACC^'”^"'' 

values  have  a  significant  effect  on  regression  inferences.  Regression  5  shows  only  a  moderately 
significant  relation  between  future  returns  when  only  the  top  percentile  is  trimmed; 

regression  7  shows  no  relation.  Equally  important  are  regressions  6  and  8,  which  show  that  the 
exclusion  of  PPIR  as  a  control  variable  has  a  significant  effect  on  inferences  between  future 
returns  and  current  period  accruals. 

The  evidence  in  Table  5  suggests  that  the  negative  relation  between  current  period  accruals 
and  future  returns  is  sensitive  to  the  manner  in  which  accruals  are  measured.  However,  this 
sensitivity  is  not  caused  by  differences  in  the  economic  fundamentals  captured  by  the  log-linear 
accrual  variable  (TACC)  versus  the  linear  accrual  variable  TACC^'”^'"'.  Rather,  the  sensitivity  ap- 


Inferences  from  regression  specification  (2)  (Panel  A)  are  qualitatively  identical  when  a  LTS  procedure  is  employed  that 
trims  monthly  observations  in  the  highest  squared  residual  percentile  (1  percent)  as  done  in  Kraft  et  al.  (2006). 
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pears  to  be  due  to  differences  in  the  econometric  properties  of  the  two  accrual  variables.  Table  5 
reinforces  the  point  raised  by  ICraft  et  al.  (2006)  regarding  the  influence  of  extreme  observations 
on  retum/accrual  inferences  and  highlights  the  importance  of  using  multiple  variable  definitions 
when  examining  the  properties  of  accruals.  Further,  Table  5  reinforces  that  PPIR  has  a  significant 
effect  on  the  relation  between  current  period  accruals  and  future  returns. 

The  Relation  between  Non-Accrual  Growth  and  Future  Returns 

The  non-accrual  book  return  component,  N_TACC,  is  insignificant  in  regression  specifications 
that  exclude  PPIR,  but  becomes  positive  and  significant  in  specifications  that  include  PPIR.  As 
discussed  above,  N_TACC  effectively  captures  changes  in  book  value  due  to  free  cash  flow. 
However,  unlike  the  traditional  free  cash  flow  variables  defined  in  a  linear  manner  (e.g.,  (Net 
Income  —  Accruals)/ Avg.  Total  Assets),  N_TACC  assumes  that  positive  free  cash  flow  reduces  a 
firm’s  net  debt  position  and  negative  free  cash  flow  increases  a  firm’s  net  debt  position.  In  essence, 
N_TACC  can  be  viewed  as  a  lever  that  amplifies  (attenuates)  log  book  returns.  For  example,  if 
N_TACC  is  positive,  then  this  implies  that  a  firm  was  able  to  reduce  its  net  debt  position  after 
operating  and  investing  needs  were  met,  thereby  amplifying  the  log  book  returns  realized  by 
equity  holders  (where  net  debt  equals  debt  minus  cash).  On  the  other  hand,  if  N_TACC  is  negative, 
it  means  that  some  growth  in  net  operating  assets  was  due  to  an  increase  in  net  debt.  As  such,  the 
log  book  return  is  attenuated  since  part  of  the  book  return  is  funded  by  an  increase  in  net  debt. 

To  examine  the  sensitivity  of  my  results  to  using  NJTACC  as  a  control  variable.  Table  6 
substitutes  two  alternative  variables  that  capture  the  effect  on  log  book  return  due  to  debt  issuance 
activities:  ChgDebt  and  ChgNetDebt. 


(8a) 


(8b) 


The  results  from  Table  6  largely  confirm  inferences  from  prior  tables.  Specifically,  the  accrual 
component  of  log  book  return  is  not  associated  with  future  returns  once  current  period  net  debt 
changes  and  prior  period  intangible  return  components  are  controlled.^^  However,  in  specifications 
that  exclude  PPIR,  there  is  a  significant  negative  relation  between  the  accrual  component  of 
earnings  and  future  returns. 

The  fact  that  NJTACC,  ChgDebtt_i  „  and  ChgNetDebJi ^  are  tangible  return  components  and 
are  significantly  associated  with  future  returns  does  not  change  the  primary  contention  of  this 
study  or  that  of  Daniel  and  Titman  (2006).  Richardson  et  al.  (2005)  classify  the  reliability  of  all 
balance  sheet  accruals  and  suggest  that  accruals  relating  to  debt  instruments  are  the  most  reliable 
and  are  the  least  likely  to  be  mispriced.  This  contention  is  premised  on  the  fact  that  managers  have 
very  little  discretion  in  the  recognition  of  debt-based  accruals,  which  leads  to  little  measurement 
error  in  these  variables.  Accordingly,  if  investors  are  unlikely  to  misprice  the  information  con¬ 
tained  in  NJTACC,  ChgDebJij,  and  ChgNetDebJi  j,  then  a  more  likely  explanation  is  that  these 
variables  have  characteristics  similar  to  As  discussed  in  DT,  managers  are  more  likely  to 

issue  equity  in  response  to  positive  growth  opportunities  or  in  response  to  a  perceived  decline  in 
the  cost  of  equity.  In  either  case,  the  issuance  of  equity  is  triggered  by  intangible  returns.  In  a 
similar  vein,  the  debt  issuance  activity  of  a  firm  appears  also  to  be  triggered  by  realizations  of 


For  brevity,  similar  regression  analyses  examining  working  capital  accruals  are  not  reported,  but  results  are  consistent 
with  those  reflected  in  Table  6  for  total  accruals. 
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short-term  investments  (datal93)  -I-  cash  (datal).  ChgDebt  is  equal  to  Debt,- Debt,_{)  I  where  Debt  is  equal  to  long-term  debt  (data9)  -I-  short-term  debt  (data34) 

-I-  notes  payable  (data206)  -I-  preferred  stock  (shareholder’s  equity  —  book  value  of  equity).  7’ACC^‘“‘"'  is  equal  to  {NOA,-  NOA,^i)  divided  by  total  assets  (datab)  at  t-1.  N_TACC, 
iss,_i  „  CPIR,  and  PPIR  are  the  same  as  defined  in  prior  tables.  Ranked  variables  are  equal  to  the  annual  rank  (0-99)  of  the  respective  variable.  Consistent  with  prior  accrual  literature 
that  adjusts  linear  accounting  variables,  ChgDebt  and  ChgNetDebt  are  annually  winsorized  (0.5,  99.5). 
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intangible  information  in  current  and  prior  periods.  However,  note  that,  even  if  the  relation  be¬ 
tween  the  debt  issuance  variables  and  future  returns  is  symptomatic  of  some  sort  of  behavioral 
pricing  bias,  the  intuition  used  to  explain  this  pricing  bias  must  be  different  than  that  relied  upon 
in  prior  accrual  studies  that  asserts  that  greater  manager  discretion  leads  to  greater  measurement 
error  and  hence,  greater  mispricing. 

Portfolio  Tests 

To  examine  whether  my  results  hold  in  a  nonparametric,  portfolio  setting,  I  employ  a  series  of 
dependent  sorts  that  minimize  the  variation  in  the  control  variables,  while  maximizing  the  varia¬ 
tion  in  the  variable  of  interest.  The  variable  I  examine  is  log  accrual  return  (TACC^-^),  and  the  two 
control  variables  are  total  period  intangible  return  (TPIR)  and  non-accrual  growth  {N_TACC). 
TACC‘^‘^j  represents  growth  in  net  operating  assets  on  a  per-share  basis  (i.e.,  log  accrual  return). 
TPIR  represents  the  component  of  total  log  return  realized  between  t-5  and  t  that  is  orthogonal  to 
TACC^^j,  NJTACC,  and  (where  equals  log(5,_i  /  5;_5)  -  f_i). 

To  examine  how  variation  in  TACC°‘^^  explains  future  returns,  I  sort  all  observations  into 
annual  deciles  ranked  on  TPIR.  Each  of  these  ten  annual  TPIR  deciles  is  then  sorted  into  ten 
annual  sub-deciles  based  on  NJTACC.  Each  of  these  annual  sub-deciles  is  then  sorted  into  annual 
quintiles  based  on  TACCJ^K  I  then  pool  the  sub-deciles  together  by  year,  based  on  their  respective 
TACC^^-'  rank.  The  result  is  five  portfolios  in  each  year,  with  significant  variation  in  accruals  and 
minimal  variation  across  the  control  variables  (e.g.,  TPIR  and  NJTACC).  Values  reflected  in  Table 
7  represent  equal-weighted  annual  averages  of  the  40  years  in  the  sample,  with  t-statistics  consis¬ 
tent  with  Eama  and  MacBeth  (1973).  Delisting  returns  are  incorporated  into  the  computation  of 
annual  returns  as  per  Beaver  et  al.  (2007),  although  results  are  not  sensitive  to  the  inclusion  of 

1  1  •  •  24 

delisting  returns. 

Panel  A  of  Table  7  shows  the  average  hedge  portfolio  return  (size-adjusted  and  raw)  from 
implementing  an  unconditional  portfolio  strategy  of  sorting  firms  each  year  into  accrual  deciles 
based  on  TACC  and  taking  a  long  (short)  position  in  the  low  (high)  accrual  firms.  Size-adjusted 
returns  are  denoted  sRet  and  raw  returns  are  denoted  Ret.  Eor  comparative  purposes,  I  also  reflect 
TACC‘^^^  and  WCACC  to  show  the  close  similarity  between  portfolios  formed  on  TACC  and  those 
formed  on  TAC(J‘^j  and  WCACC  (where  TACCJ^j  =  TACC  - 

Panel  A  of  Table  7  suggests  that  the  hedge  returns  are  statistically  significant,  and  the  eco¬ 
nomic  significance  is  comparable  to  those  reflected  in  prior  accrual  studies.  Panels  B,  C,  and  D  use 
the  same  sample  of  firms,  sorted  into  quintiles  as  described  above,  to  determine  whether  accruals 
explain  future  returns  after  controlling  for  correlated  characteristics. 

Panel  B  of  Table  7  shows  that  once  TPIR  and  NJTACC  are  controlled,  accruals  are  not 
associated  with  future  returns.  Panel  C  in  Table  7  examines  the  abnormal  returns  that  can  be 
earned  on  TPIR,  controlling  for  TAC(J“^^  and  NJTACC.  This  panel  shows  that  economically  and 
statistically  significant  abnormal  returns  can  be  earned  on  portfolios  sorted  on  TPIR,  suggesting 
that  intangible  returns  capture  important  cross-sectional  differences  in  either  risk  or  some  type  of 
behavioral  bias.  Panel  D  in  Table  7  examines  the  abnormal  returns  that  can  be  earned  on  NJTACC 
controlling  for  TPIR  and  TACC^^K  Similar  to  Panel  B,  the  average  hedge  return  across  the  40  years 
of  the  sample  is  only  marginally  statistically  significant,  although  economically  insignificant. 


Firms  with  delisting  codes  between  500  and  599  (liquidations)  receive  a  delisting  return  of  —1.0.  I  thank  Richard  Price 
for  his  delisting  return  code  at  http://richardp.rice.edu/research/delistings.sas. 

Results  are  qualitatively  identical  (and  quantitatively  very  similar)  if  ChgNetDebt  or  ChgDebt  are  substituted  for 
NJTACC. 
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V.  UNTABULATED  ROBUSTNESS  TESTS 

Inferences  are  not  sensitive  to  the  inclusion  of  extreme  observations  or  to  trimming  observa¬ 
tions  at  other  percentiles  (i.e.,  1  percent,  99  percent).  I  also  investigate  the  sensitivity  of  my  results 
to  the  length  of  the  prior  period  intangible  return  period  by  shortening  the  prior  period  from  48 
months  to  24  months.  Results  with  a  shorter  prior  period  are  consistent  with  the  primary  results. 
Results  are  slightly  sensitive  to  the  inclusion  of  stocks  trading  under  $5,  although  PPIR  still  has  a 
material  effect  on  the  economic  and  statistical  significance  of  accruals.^^  Given  that  returns  of 
low-priced  stocks  are  affected  by  bid-ask  bounce  (which  directly  affects  calculation  of  intangible 
returns),  and  low-priced  stocks  have  characteristics  distinct  from  stocks  trading  at  more  than 
$5/share  (e.g.,  institutional  attention,  liquidity,  short-sale  constraints),  it  is  unclear  whether  exclud¬ 
ing  low-priced  stocks  from  the  primary  sample  materially  affects  the  robustness  of  the  main 
results. 

I  also  examine  whether  intangible  returns  proxy  for  investments  in  R&D  and  advertising. 
Prior  studies  suggest  that  investors  are  misled  by  the  conservative  manner  in  which  GAAP  ex¬ 
penses  investments  in  R&D  (Lev  and  Sougiannis  1996;  Eberhart  et  al.  2004;  Penman  and  Zhang 
2002).  These  studies  suggest  that  investors  underestimate  the  implications  to  future  earnings  of 
current  investments  in  R&D,  leading  to  positive  future  returns  in  subsequent  periods.  In  unre¬ 
ported  analysis,  I  find  that  firms  that  invest  heavily  in  R&D  (high  R&D  intensity)  and  firms  that 
are  growing  their  R&D  investment  tend  to  have  larger,  and  more  positive,  intangible  returns. 
However,  if  intangible  returns  proxy  for  unrecognized  investments  in  such  things  as  R&D,  then 
high  R&D  firms  should  have  lower  future  returns.  However,  prior  R&D  studies  show  a  positive 
relation  between  future  returns  and  R&D  activity,  suggesting  that  intangible  returns  are  not  simply 
capturing  a  dynamic  directly  attributable  to  accounting  conservatism — see  Donelson  and  Resutek 
(2010)  for  detailed  analysis  of  R&D  investment  and  intangible  returns. 

VI.  CONCLUDING  REMARKS  AND  IMPLICATIONS  FOR  FUTURE  RESEARCH 

This  study  re-examines  the  evidence  that  accruals  are  negatively  associated  with  future  re¬ 
turns.  Using  a  three-period  empirical  model,  I  examine  the  relation  between  future  returns  in 
period  three,  current  period  accruals  in  period  two,  and  prior  period  firm  growth  and  contraction  in 
period  one.  I  operationalize  the  empirical  tests  through  a  log-linear  return  decomposition  of  the 
book-to-market  ratio.  Using  this  empirical  design,  I  decompose  the  information  investors  use  to 
price  equities  into  two  components:  tangible  and  intangible  information.  Tangible  information 
refers  to  the  value-relevant  information  measured  by  summary  accounting  measures.  Intangible 
information  refers  to  all  other  value  relevant  information  not  captured  by  summary  accounting 
measures. 

The  empirical  evidence  suggests  that  current  period  accruals  are  not  associated  with  future 
returns.  This  finding  holds  for  multiple  definitions  and  decompositions  of  accruals.  Rather,  this 
study  suggests  that  the  negative  relation  between  future  returns  and  current  period  accruals  docu¬ 
mented  in  prior  literature  is  due  to  the  relation  between  prior  period  intangible  returns  and  accru¬ 
als.  Hence,  once  prior  period  intangible  returns  are  controlled,  the  negative  association  between 
future  returns  and  current  period  accruals  disappears.  The  results  are  also  consistent  with  the 
premise  that  the  accrual  anomaly  is  a  derivative  of  the  value/growth  anomaly.  This  finding  builds 
on  the  work  of  Desai  et  al.  (2004)  by  showing  that  a  common  economic  mechanism  drives  the 
negative  future  return  pattern  associated  with  the  accrual  anomaly  and  the  value/growth  anomaly. 


In  unreported  tests,  I  expand  my  sample  to  include  stocks  trading  between  $  1/share  and  $5/share  as  of  June  30,  t+\. 
Inferences  from  the  primary  tables  are  qualitatively  identical  to  those  in  the  paper,  although  a  marginal  negative 
association  exists  between  future  returns  and  total  accruals  in  some  of  the  robustness  tests  specifications. 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1372 


Resutek 


Finally,  this  study  provides  evidence  on  the  relations  among  future  returns,  external  financing 
activities,  and  accruals.  Consistent  with  Daniel  and  Titman  (2006),  I  show  tangible  log  returns  that 
represent  accounting  based  performance  are  not  associated  with  future  returns,  whereas  transac¬ 
tions  with  outside  equity  holders  and  intangible  log  returns  are  negatively  associated  with  future 
returns.  This  finding  builds  on  the  work  of  Dechow  et  al.  (2008)  and  Cohen  and  Lys  (2006)  and 
related  work  examining  the  relations  between  current  period  accruals  and  external  financing  ac¬ 
tivities  with  future  returns.  These  studies  note  that  accruals  are  associated  with  changes  in  invested 
capital  and  these  changes  are  associated  with  diminishing  returns  on  investment.  This  study 
confirms  this  relation,  but  shows  that  a  material  amount  of  the  information  responsible  for  the 
subsequent  return  reversal  is  due  to  pricing  dynamics  occurring  in  the  periods  prior  to  accrual 
recognition  and  orthogonal  to  the  information  captured  by  summary  accounting  measures. 

The  results  from  this  study  provide  a  new  perspective  on  the  relation  between  accruals  and 
future  returns.  From  this  perspective  come  many  new  questions,  particularly  relating  to  the  com¬ 
position  of  intangible  returns  and  the  interpretation  of  how  accounting  properties  and  principles 
affect  intangible  returns.  These  are  not  simple  questions  to  answer  empirically  as  any  definitively 
defined  empirical  construct  for  intangible  information  will  have  to  be  uncorrelated  with  accounting 
information  and  highly  correlated  with  contemporaneous  returns.  Complicating  this  exercise  more 
is  the  fact  that  future  research  needs  to  consider  that  intangible  returns  may  not  (only)  represent 
the  mispricing  of  some  type  of  information.  As  Daniel  and  Titman  (2006)  note,  the  negative 
relation  between  future  returns  and  intangible  returns  is  consistent  with  certain  rational  pricing 
models.  Thus,  future  research  must  be  careful  to  not  overlook  rational  pricing  dynamics  as  an 
explanation  for  the  future  return  patterns  associated  with  intangible  returns. 

Despite  the  nebulous  composition  and  multiple  interpretations  of  intangible  returns,  the  pri¬ 
mary  finding  of  this  study  is  unambiguous.  Consistent  with  the  tenets  of  rational  markets,  the 
empirical  evidence  in  this  study  shows  that  the  publicly  available  infonnation  contained  in  accru¬ 
als  is  not  mispriced  by  investors.  This  finding  does  not  diminish  the  importance  of  accruals  or  that 
of  other  summary  accounting  metrics.  The  significant  abnormal  returns  and  share  volume  occur¬ 
ring  in  the  short  window  around  earnings  announcements  emphasizes  the  importance  of  account¬ 
ing  information  to  investors.  However,  this  study  suggests  that  researchers  and  investors  need  to 
consider  non-accounting  (intangible)  information  from  prior  periods  when  drawing  inferences  of 
how  accounting  information  is  incorporated  into  stock  prices. 


APPENDIX 

All  data  are  from  the  annual  CRSP/Compustat  Merged  Database. 

Book  Value  of  Equity  (Bi) 

To  obtain  shareholder’s  equity  I  use,  when  not  missing,  stockholder’s  equity  (data216).  If 
data216  is  missing,  I  use  total  common  equity  (data60)  plus  preferred  stock  —  par  value  (datal30), 
if  both  items  are  not  missing.  Otherwise,  I  use  total  assets  (data6)  minus  total  liabilities  (datalSl), 
if  both  items  are  not  missing.  If  none  of  these  produce  a  value  total  shareholder’s  equity  value, 
then  it  is  treated  as  missing  for  that  year. 

To  obtain  book  equity,  I  subtract  from  shareholder’s  equity  (as  computed  above)  the  preferred 
stock  value.  Specifically,  I  subtract  from  shareholder’s  equity  preferred  stock  —  redemption  value 
(data56).  If  data56  is  missing,  I  subtract  from  shareholder’s  equity  preferred  stock  —  liquidating 
value  (datalO).  If  both  data56  and  datalO  are  missing,  I  subtract  from  shareholder’s  equity  pre¬ 
ferred  stock  —  par  value  (data  130).  If  data56,  datalO,  and  data  130  are  all  missing,  I  set  book  value 
of  equity  as  missing  for  this  year.  In  addition,  when  riot  missing,  I  add  to  book  value  of  equity 
deferred  taxes  —  balance  sheet  (data35)  and  subtract  the  FASB  No.  106  adjustment  (data330). 
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Net  Operating  Assets  {NOA^ 

To  obtain  operating  assets  {OA),  I  subtract  from  total  assets  (datab)  financial  assets.  Financial 
assets  (M)  are  defined  as  the  sum  of  Investments  and  Advances  —  Other  (data32)  plus  short-term 
investments  (datal93)  plus  cash  and  short-term  investments  (datal). 

To  obtain  operating  liabilities  (OL),  I  subtract  financial  liabilities  from  total  liabilities 
(datal 81).  Financial  liabilities  (FL)  equal  the  sum  of  long-term  debt  (data9)  plus  debt  in  current 
liabilities  (data34)  plus  notes  payable  (data206)  plus  the  amount  of  preferred  stock  in 
shareholder’s  equity  (as  computed  above).  Accordingly,  Net  Operating  Assets  (NOA)  is  computed 
as  OA  —  OL.  Total  accruals  (TACC,)  as  log(A''OA);- log(AOA);_i.  Data  items  32,  34,  and  193  are 
set  to  0  when  missing. 
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ABSTRACT:  Statement  of  Financial  Accounting  Standards  No.  157  (FAS  No.  157),  Fair 
Value  Measurements,  prioritizes  the  source  of  information  used  in  fair  value  measure¬ 
ments  into  three  levels:  (1)  Level  1  (observable  inputs  from  quoted  prices  in  active 
markets),  (2)  Level  2  (indirectly  observable  inputs  from  quoted  prices  of  comparable 
items  in  active  markets,  identical  items  in  inactive  markets,  or  other  market-related 
information),  and  (3)  Level  3  (unobservable,  firm-generated  inputs).  Using  quarterly 
reports  of  banking  firms  in  2008,  we  find  that  the  value  relevance  of  Level  1  and  Level 
2  fair  values  is  greater  than  the  value  relevance  of  Level  3  fair  values.  In  addition,  we 
find  evidence  that  the  value  relevance  of  fair  values  (especially  Level  3  fair  values)  is 
greater  for  firms  with  strong  corporate  governance.  Overall,  our  results  support  the 
relevance  of  fair  value  measurements  under  FAS  No.  157,  but  weaker  corporate  gov¬ 
ernance  mechanisms  may  reduce  the  relevance  of  these  measures. 

Keywords:  FAS  No.  157;  fair  value;  fair  value  hierarchy;  value  relevance;  corporate 
governance. 
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L  INTRODUCTION 

In  response  to  users’  requests  for  a  comprehensive  framework  for  measuring  and  disclosing  fair 
value,  the  Financial  Accounting  Standards  Board  (FASB)  issued  Statement  of  Financial  Ac¬ 
counting  Standards  No.  157,  Fair  Value  Measurements  (FAS  No.  157;  FASB  2006)  in  2006, 
effective  for  fiscal  years  beginning  after  November  15,  2007.  FAS  No.  157  does  not  increase  the 
use  of  fair  value  measurements,  but  instead  provides  a  uniform  definition  of  fair  value,  establishes 
a  framework  for  measuring  fair  value,  and  expands  disclosure  about  fair  value  measurements. 
More  specifically  and  of  particular  interest  to  our  study,  FAS  No.  157  requires  fair  value  assets  and 
liabilities  to  be  disclosed  by  levels,  where  levels  are  based  on  inputs  used  to  measure  fair  values: 
(1)  Level  1  (observable  inputs  from  quoted  prices  in  active  markets),  (2)  Level  2  (indirectly 
observable  inputs  from  quoted  prices  of  comparable  items  in  active  markets,  identical  items  in 
inactive  markets,  or  other  market-related  information),  and  (3)  Level  3  (unobservable,  firm¬ 
generated  inputs).^ 

First,  we  test  the  value  relevance  of  fair  value  measures  for  each  of  the  three  disclosure  levels. 
A  long-standing  debate  of  fair  value  accounting  has  been  centered  on  the  trade-off  between 
relevance  and  reliability.  Proponents  of  fair  value  accounting  argue  that  fair  value  information  has 
greater  relevance,  more  accurately  reflects  real  volatility,  and  simplifies  financial  reporting  (e.g., 
hedge  accounting).  In  contrast,  opponents  of  fair  value  accounting  argue  that  fair  value  measure¬ 
ments  are  less  verifiable  by  investors,  subject  to  greater  estimation  error  by  management,  and 
prone  to  greater  managerial  manipulation.  These  shortcomings  create  information  asymmetry 
between  investors  and  managers  that  can  be  a  serious  threat  to  the  reliability  of  fair  values 
(Landsman  2007;  Penman  2007).  Because  these  problems  are  expected  to  become  more  severe  as 
fair  value  inputs  become  less  observable  by  investors,  we  are  particularly  interested  in  testing  the 
value  relevance  of  Level  1  fair  values  versus  Level  3  fair  values.  The  reliability  and  therefore 
potential  information  asymmetry  problems  associated  with  Level  2  fair  values  potentially  fall 
between  those  of  Level  1  and  Level  3  fair  values. 

As  our  second  test,  we  examine  whether  the  value  relevance  of  fair  values  depends  on  firms’ 
corporate  governance  mechanisms.  The  motivation  for  this  test  arises  from  the  greater  subjectivity 
on  the  part  of  management  in  measuring  and  reporting  fair  values.  Although  in  some  instances 
managers  may  use  their  private  information  to  credibly  report  fair  values  (e.g.,  Barth  et  al.  1998), 
prior  studies  also  provide  evidence  that  managers  may  manipulate  inputs  for  fair  values  for  their 
own  interests  (e.g.,  Aboody  et  al.  2006;  Bartov  et  al.  2007).  For  firms  with  weaker  corporate 
governance  mechanisms,  information  asymmetry  problems  associated  with  fair  values  may  be 
greater,  leading  to  more  severe  moral  hazard  problems,  and  therefore  lower  value  relevance  of 
these  disclosures.  On  the  whole.  Level  1  fair  value  reporting  is  likely  to  suffer  the  least  from 
information  asymmetry,  and  one  might  expect  corporate  governance  mechanisms  to  have  the  least 
impact  on  the  valuation  of  these  fair  values.  Instead,  corporate  governance  should  probably  play  a 
larger  role  in  the  value  relevance  of  Level  3  fair  values  where  information  asymmetry  is  likely  the 
highest. 

Using  a  sample  of  quarterly  reports  by  banking  firms  in  2008,  we  provide  the  following 
results.  First,  we  find  evidence  that  fair  value  measurements  of  Level  1,  Level  2,  and  Level  3 


The  FASB  Staff  Position  (FSP)  FAS  No.  157-4  (FASB  2009)  was  issued  in  April  2009  to  clarify  the  application  of  FAS 
No.  157.  It  provides  additional  guidance  on  estimating  fair  values  when  the  volume  and  level  of  activity  for  asset  or 
liability  have  significantly  decreased.  Specifically,  FSP  FAS  No.  157-4  provides  a  list  of  factors  that  a  company  should 
evaluate  in  order  to  determine  whether  a  significant  decrease  has  occurred  in  the  volume  and  level  of  activity  for  the 
asset  or  liability  in  relation  to  normal  market  activity  for  the  asset  or  liability.  When  estimating  fair  value,  the  company 
should  place  (1)  more  weight  on  transactions  that  the  company  concludes  are  orderly;  (2)  less  weight  on  transactions  for 
which  the  company  does  not  have  sufficient  information  to  conclude  whether  the  transaction  is  orderly;  and  (3)  little,  if 
any,  weight  on  transactions  that  the  company  concludes  are  not  orderly. 
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inputs  are  value-relevant.  We  also  find  that  the  valuation  coefficient  of  Level  3  assets  is  signifi¬ 
cantly  less  positive  than  those  of  Level  1  and  Level  2  assets.  The  lower  valuation  of  Level  3  assets 
is  consistent  with  investors  decreasing  the  weight  they  place  on  less  reliable  fair  value  measure¬ 
ments  (Maines  and  Wahlen  2006).  The  valuation  coefficient  of  Level  3  liabilities  is  significantly 
more  negative  than  those  of  Level  1  and  Level  2  liabilities,  consistent  with  less  reliable  amounts 
for  Level  3  liabilities.  We  additionally  test  whether  the  coefficient  on  each  Level  is  significantly 
different  from  its  theoretically  predicted  value  of  1  for  assets  and  —  1  for  liabilities.^  In  particular, 
if  investors  perceive  reliability  concerns  for  Level  3  assets  (liabilities),  the  coefficient  on  these  fair 
values  could  be  less  than  1  (—1).  Our  results  are  consistent  with  this  prediction.  In  order  to 
mitigate  concerns  that  firm  size  may  be  correlated  with  the  types  of  assets  held  by  banks  and  that 
the  strength  of  a  bank’s  capital  ratio  could  be  correlated  with  managers’  choice  of  valuation  levels, 
we  examine  whether  our  results  hold  in  subsamples  partitioned  by  firm  size  and  Tier  1  capital 
ratio.  We  find  that  our  results  continue  to  hold  in  all  subsamples. 

For  our  second  test,  we  examine  whether  the  value  relevance  of  fair  values  varies  across  six 
individual  governance  mechanisms  (i.e.,  board  independence,  audit  committee  financial  expertise, 
the  frequency  of  annual  audit  committee  meetings,  the  percent  of  shares  held  by  institutional 
investors,  the  auditor’s  office  size,  and  no  material  control  weakness  problem  under  Sections  302 
and  404  of  the  Sarbanes-Oxley  Act),  as  well  as  a  factor  score  based  on  these  mechanisms.  We  find 
that  governance  has  a  significant  impact  on  Level  2  and  Level  3  fair  values.  As  the  strength  of 
corporate  governance  increases,  investors’  valuation  of  these  fair  value  assets  and  liabilities  in¬ 
creases  toward  the  theoretically  predicted  coefficient  values  of  1  and  —  1 .  These  results  highlight 
the  importance  of  corporate  governance  mechanisms  in  mitigating  the  information  asymmetry 
problem  associated  with  Level  2  and  Level  3  inputs.  Consistent  with  the  notion  that  Level  1  fair 
value  information  suffers  the  least  from  information  asymmetry,  we  find  little  impact  of  corporate 
governance  on  these  fair  values. 

Finally,  we  provide  two  additional  analyses.  First,  we  find  that  the  ability  of  Level  information 
provided  by  FAS  No.  157  interacted  with  existing  asset  and  liability  Type  information  has  a 
greater  explanatory  power  for  firm  value  than  does  Type  information  only.  This  difference  sug¬ 
gests  that  valuation  of  Level  information  is  not  simply  a  reordering  of  valuation  by  Type  infor¬ 
mation  that  existed  prior  to  FAS  No.  157  (i.e.,  FAS  No.  157  provides  new  information).  Second, 
we  consider  how  conclusions  may  have  changed  as  the  economic  crisis  worsened  and  market 
illiquidity  became  more  severe  in  2008.  Although  we  find  no  evidence  of  a  decline  in  the  valuation 
of  fair  values  during  the  first  three  quarters  of  2008,  we  do  find  some  evidence  that  the  valuation 
of  Level  3  assets  and  liabilities  actually  moved  closer  to  their  predicted  values  of  1  and  —  1.  In 
addition,  we  find  evidence  that  the  impact  of  corporate  governance  for  valuation  of  fair  value 
assets  remains  significant  over  time. 

Overall,  we  conclude  that  the  fair  value  hierarchy  required  by  FAS  No.  157  provides  useful 
information  to  investors  and  the  strength  of  corporate  governance  appears  to  mitigate  the  infor¬ 
mation  asymmetry  problem  arising  from  relatively  less  reliable  fair  value  inputs.  These  results 
contribute  to  the  literature  on  fair  value  accounting.  We  provide  early  evidence  of  the  value 
relevance  of  new  disclosures  under  FAS  No.  157.  Prior  to  FAS  No.  157,  direct  tests  of  the 
association  between  the  reliability  of  fair  value  information  and  equity  prices  were  more  difficult. 
Using  the  fair  value  hierarchy  under  FAS  No.  157,  we  provide  direct  evidence  of  the  value 
relevance  of  more  reliable  (Level  1)  versus  less  reliable  (Level  3)  information. 


^  These  theoretical  predictions  are  based  on  implicit  assumptions  that  (1)  the  valuation  model  is  properly  specified  and  (2) 
markets  are  efficient.  Therefore,  we  do  not  claim  that  estimated  coefficients  must  be  1  or  -1.  We  merely  use  these 
theoretical  values  as  benchmarks  for  our  statistical  testing. 
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Two  recent  working  papers  (Kolev  2009;  Goh  et  al.  2009)  provide  tests  of  value  relevance 
similar  to  ours.  Although  we  find  valuations  of  Level  1  and  Level  2  assets  and  liabilities  close  to 
1  and  —  1,  these  papers  find  valuations  of  Level  1  and  Level  2  net  assets  significantly  less  than  1. 
Furthermore,  Goh  et  al.  (2009)  document  that  investors  value  Level  2  net  assets  less  than  Level  1 
net  assets  but  do  not  value  Level  2  and  Level  3  net  assets  differently.  In  contrast,  we  show  that 
Level  1  and  Level  2  assets  are  valued  similarly,  while  Level  3  assets  are  valued  the  least.  Goh  et 
al.  (2009)  also  document  that  the  value  relevance  of  net  fair  value  assets  decreases  over  the  first 
three  quarters  of  2008,  whereas  we  find  that  the  value  relevance  of  fair  values  does  not  decrease 
over  this  period.''  Finding  evidence  that  the  value  relevance  of  fair  values  does  not  decrease  as 
markets  become  less  liquid  may  be  particularly  important  to  standard-setters  who  are  interested  in 
the  market’s  perception  of  the  reliability  of  fair  values  during  an  economic  crisis. 

We  also  contribute  to  the  literature  by  examining  directly  the  association  between  the  strength 
of  corporate  governance  and  value  relevance  of  fair  values.  Standard-setters  understand  that  in¬ 
formation  asymmetry  would  be  higher  for  Level  3  assets  and  for  this  reason  they  required  firms  to 
provide  additional  disclosures  for  these  items  (e.g.,  the  inputs  used  to  measure  fair  value  and  the 
effect  of  the  measurements  on  earnings  or  changes  in  net  assets).  Presumably  these  additional 
disclosures  would  reduce  or  even  eliminate  the  information  asymmetry  problem.  We  find  evidence 
consistent  with  the  information  asymmetry  problem  continuing  to  exist,  but  the  strength  of  cor¬ 
porate  governance  appears  to  ameliorate  this  problem.  These  results  highlight  the  importance  of 
corporate  governance  for  the  value  relevance  of  accounting  information,  especially  for  information 
that  is  potentially  less  reliable. 

The  next  section  describes  FAS  No.  157,  reviews  relevant  prior  research,  and  develops  hy¬ 
potheses.  Section  III  outlines  the  sample  selection  process  and  provides  descriptive  statistics. 
Section  IV  presents  empirical  results.  The  study  concludes  with  a  discussion  of  the  results. 

11.  LITERATURE  REVIEW  AND  HYPOTHESES  DEVELOPMENT  FAS  NO.  157 

In  recent  years,  U.S.  Generally  Accepted  Accounting  Principles  (GAAP)  have  evolved  toward 
greater  use  of  fair  values  for  reporting  assets  and  liabilities.  However,  prior  to  FAS  No.  157, 
neither  a  single  coherent  definition  for  fair  value  nor  detailed  guidance  for  applying  the  fair  value 
measurement  existed.  In  June  2003,  the  FASB  added  the  fair  value  measurement  project  to  its 
agenda  to  address  these  issues.  In  September  2006,  the  FASB  released  FAS  No.  157,  Fair  Value 
Measurements.  FAS  No.  157  does  not  expand  the  use  of  fair  value  measurements,  but  provides  a 
coherent  framework  for  applying  fair  value  measurements  and  enhances  disclosures  about  the 
nature  and  source  of  fair  value  measurements  to  increase  consistency  and  comparability. 

The  FASB  has  concluded  that  fair  value  information  is  relevant  (e.g.,  see  Paragraph  217  of 
FAS  No.  133  or  Paragraph  C2  of  FAS  No.  157).^  To  the  extent  that  established,  active  markets  for 


^  We  acknowledge  that  different  samples,  time  periods,  and  variable  definitions  across  these  studies  do  not  allow  us  to 
make  direct  comparisons. 

''  Other  differences  between  our  study  and  Kolev  (2009)  and  Goh  et  al.  (2009)  include:  (1)  we  have  a  larger  sample  size 
and/or  longer  sample  period,  (2)  we  consider  the  impact  of  a  larger  set  of  corporate  governance  mechanisms  on  the 
pricing  of  fair  value  assets  under  FAS  No.  157,  and  (3)  we  consider  the  incremental  pricing  effect  of  expanded 
disclosure  format  under  FAS  No.  157.  However,  it  should  be  noted  these  studies  provide  additional  tests  beyond  ours 
that  will  contribute  to  the  literature.  For  example,  Kolev  (2009)  uses  a  changes  specification,  allowing  tests  of  the  value 
relevance  of  Level  3  realized  and  unrealized  gains/losses  versus  net  purchases  and  transfers. 

^  Claims  have  been  made  that  fair  value  may  not  be  the  most  relevant  measurement  attribute.  For  example,  one  claim  is 
that  fair  value  may  misrepresent  management’s  intent  (e.g.,  intent  to  hold/owe  an  asset/a  liability  to  its  maturity). 
However,  fair  value  measurements  allow  shareholders  to  assess  whether  holding  an  asset  or  owing  a  liability  is  appro¬ 
priate  by  providing  timely  value  change  information.  See  Ryan  (2007,  Ch.  6)  for  other  claims  and  related  discussions. 
Also  see  Ramesh  and  Graziano  (2004)  for  various  discussions  around  the  benefits  and  costs  of  historical  cost  versus  fair 
value  accounting. 
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assets/liabilities  are  in  place,  fair  value  is  likely  to  be  the  most  relevant  measurement  attribute. 
However,  fair  value  measurements  in  the  absence  of  observed  prices  might  be  unreliable  due  to 
intrinsic  measurement  error  (noise)  and  management-induced  error  (bias).  FAS  No.  157  estab¬ 
lishes  a  fair  value  hierarchy  that  prioritizes  the  inputs  used  to  measure  fair  values  into  three  broad 
levels,  considering  the  relative  reliability  of  the  inputs  to  fair  value  measurements.  Level  1  inputs 
are  quoted  prices  in  active  markets,  and  a  quoted  price  is  a  single  primary  basis  for  the  fair  value 
measurement.  Therefore,  the  information  asymmetry  between  preparers  and  users  is  very  low  for 
Level  1  inputs.  Level  2  inputs  include  either  observable  prices  in  active  market  for  comparable 
assets  and  liabilities  or  observable  market  prices  in  inactive  markets  for  identical  assets  and 
liabilities.  Level  2  inputs  could  also  include  prices  corroborated  by  market-based  measures  (e.g., 
correlation  with  the  yield  curve).  Level  3  inputs  are  not  observable  from  the  market  and  reflect 
management’s  assessment  of  the  assumptions  that  market  participants  would  use  in  pricing  the 
asset  or  liability. 

The  key  distinction  between  Level  2  inputs  and  Level  3  inputs  is  whether  inputs  are  observ¬ 
able.  This  distinction  is  important  for  verifiability  of  fair  values.  Therefore,  the  Level  3  inputs  are 
subject  to  the  highest  degree  of  information  asymmetry  between  preparers  and  users.  FAS  No.  157 
enhances  fair  value  disclosure  by  requiring  firms  to  provide  fair  value  measurements  by  input 
levels  in  the  hierarchy,  enabling  users  to  assess  the  relative  reliability  of  fair  value  measurements. 
The  Appendix  provides  an  example  of  FAS  No.  157  fair  value  measurements  disclosures  from  a 
quarterly  report  filed  by  Wells  Fargo  &  Company. 

Value  Relevance  of  Fair  Value  Hierarchy 

Accounting  information  is  considered  to  be  value-relevant  when  it  has  the  predicted  associa¬ 
tion  with  market  value  of  equity  (Barth  et  al.  2001).  If  a  significant  association  is  found,  then  it  is 
assumed  that  the  accounting  information  of  research  interest  is  relevant  to  investors  and  reliable 
enough  to  be  reflected  in  share  prices.  In  this  section,  we  discuss  literature  related  to  the  value 
relevance  of  fair  value  measures.  We  also  detail  why  reliability  concerns  could  affect  the  value 
relevance  of  fair  values.  This  discussion  helps  in  formulating  an  expectation  regarding  the  differ¬ 
ential  value  relevance  across  Levels  information  under  FAS  No.  157. 

Several  studies  document  that  fair  values  of  investment  securities  of  banks  and  property- 
liability  insurers  are  value-relevant  (Barth  1994;  Petroni  and  Wahlen  1995;  Barth  et  al.  1996; 
Eccher  et  al.  1996;  Nelson  1996;  Carroll  et  al.  2003).  Petroni  and  Wahlen  (1995)  find  that  fair 
values  for  equities  and  Treasury  securities  are  value-relevant,  but  fair  values  of  municipal  and 
corporate  bonds  are  not,  suggesting  securities  actively  traded  in  the  market  are  more  reliably 
associated  with  market  value  of  equity.  In  contrast,  using  a  sample  of  closed-end  mutual  funds, 
Carroll  et  al.  (2003)  find  that  fair  values  of  thinly  traded  securities  are  value-relevant. 

When  no  established  market  is  available  (i.e.,  fair  values  are  less  reliable),  the  evidence  of 
value  relevance  of  other  financial  instruments  is  mixed.  For  example.  Nelson  (1996)  finds  that  fair 
values  of  loans,  deposits,  and  long-term  debt  are  not  value-relevant.  In  contrast,  Barth  et  al.  (1996) 
find  that  fair  values  of  loans  are  value-relevant,  whereas  Eccher  et  al.  (1996)  find  the  value 
relevance  of  fair  values  of  loans  only  in  limited  settings.  Finally,  Venkatachalam  (1996)  examines 
the  value  relevance  of  fair  values  of  derivatives  and  finds  that  these  fair  values  are  positively 
associated  with  market  values  of  equity. 

Another  stream  of  fair  value  accounting  research  examines  how  managerial  opportunism 
affects  the  reliability  of  fair  value  measurements.  Beaver  and  Venkatachalam  (2003)  partition  U.S. 
bank  loan  fair  values  into  three  categories  (i.e.,  non-discretionary,  discretionary,  and  noisy  com¬ 
ponents)  and  show  that  the  pricing  coefficient  of  the  discretionary  loan  component  is  negative  if 
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the  managerial  intent  for  the  discretion  is  likely  opportunistic.^  Studies  also  examine  whether 
managers  use  private  information  for  inputs  to  option  valuation  models  to  help  or  hurt  information 
users  in  the  U.S.  setting.  For  example,  in  the  case  of  stock  option  expense  reporting  under  FAS 
No.  123  (FASB  2004),  Aboody  et  al.  (2006)  and  Bartov  et  al.  (2007)  show  that  managerial 
incentives  induce  managers’  discretion  over  model  assumptions  (e.g.,  the  expected  option  life  and 
stock  volatility  input  assumptions),  lowering  option  values. 

In  addition  to  financial  assets,  several  studies  support  the  value  relevance  of  fair  value  non- 
financial  assets  (e.g.,  Barth  and  Clinch  1998;  Aboody  et  al.  1999;  Easton  et  al.  1993;  Dietrich  et 
al.  2000).  Some  of  these  studies  also  examine  whether  the  value  relevance  of  fair  value  measure¬ 
ments  varies  with  the  reliability  of  the  information.  For  example,  Dietrich  et  al.  (2000)  find  the 
reliability  of  fair  value  estimates  is  an  increasing  function  of  the  presence  of  monitoring  of 
external  (as  opposed  to  internal)  appraisers.  Similarly,  Muller  and  Riedl  (2002)  provide  evidence 
that  market  participants  appear  to  find  external  appraisals  to  be  more  reliable.  However,  Barth  and 
Clinch  (1998)  find  no  difference  in  value  relevance  between  internal  and  external  appraisals. 

In  summary,  the  literature  generally  suggests  that  fair  value  measurements  are  value-relevant, 
and  that  the  value  relevance  of  fair  value  measurements  varies  with  the  source  of  information. 
Although  the  source  of  fair  value  measurements  is  assumed  to  be  associated  with  the  reliability  of 
information,  direct  archival  tests  of  the  association  between  the  reliability  of  fair  value  information 
and  equity  prices  were  difficult  prior  to  FAS  No.  157.  FAS  No.  157  now  explicitly  categorizes  the 
inputs  to  fair  value  measurements  and  allows  us  to  assess  whether  the  value  relevance  of  less 
reliable  fair  values  (i.e..  Level  3)  is  different  from  that  of  more  reliable  fair  values  (i.e..  Level  1). 

Regarding  investors  and  how  they  weigh  reported  fair  values  in  their  pricing  of  the  firm,  two 
factors  are  important  to  consider:  the  discount  adjustment  and  the  cash  flow  adjustment.  Investors, 
to  the  extent  that  they  perceive  greater  uncertainty  of  fair  values,  will  adjust  upward  the  discount 
rate  applied  to  those  reported  amounts,  resulting  in  less  than  a  dollar-for-dollar  valuation.  Further¬ 
more,  investors,  to  the  extent  that  they  perceive  reported  assets  (liabilities)  to  be  biased  upward 
(downward),  either  intentionally  or  unintentionally,  will  adjust  downward  for  a  cash  flow  effect. 
Again,  this  adjustment  results  in  less  than  a  dollar-for-dollar  valuation  of  reported  amounts.  Below 
we  discuss  why  investors’  downward  valuation  adjustment  is  likely  to  be  greater  for  reported 
Level  3  fair  values  than  for  reported  Level  1  and  Level  2  fair  values.  Given  that  our  tests  of  value 
relevance  estimate  investors’  adjustment  to  reported  fair  values  (i.e.,  valuation  coefficient  on 
reported  discounted  cash  flows),  we  do  not  specify  whether  the  adjustment  is  more  likely  to  be  the 
consequence  of  a  discount  effect  versus  a  cash  flow  effect.  Rather,  we  contend  that  both  are  likely 
contributing  factors. 

Level  3  fair  values  are  less  observable  and  are  naturally  subject  to  greater  information  asym¬ 
metry  between  investors  and  management.  In  addition,  the  more  subjective  nature  of  Level  3  fair 
values  makes  them  prone  to  greater  estimation  error  by  management  (i.e.,  noise).  Both  information 
asymmetry  and  estimation  error  inherent  to  the  production  of  specific  accounting  information 
increase  investors’  adverse  selection,  liquidity  risk,  and  information-processing  costs,  all  of  which 


^  Nissim  (2003)  examines  whether  the  U.S.  bank  managers  strategically  overstate  bank  loans,  finding  that  the  estimated 
degree  of  overstatement  systematically  varies  with  regulatory  capital  requirements  and  the  change  in  the  rate  of  credit 
losses.  In  contrast,  Bernard  et  al.  (1995)  examine  whether  Danish  bank  managers  manage  fair  value  estimates  to  meet 
the  mark-to-market  based  regulatory  capital  requirement  in  Denmark,  but  find  no  such  evidence. 

^  In  contrast,  Hodder  et  al.  (2006)  find  that  a  large  proportion  of  firms  use  value-increasing  discretion  in  their  valuation 
model  inputs.  Note  that  FAS  No.  157  does  not  apply  to  stock  option  compensation  measurements  in  FAS  No.  123(R). 
However,  these  studies  are  relevant,  showing  managerial  discretion  over  input  values  in  option  value  models. 
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increase  a  firm’s  cost  of  capital  (Diamond  and  Verrecchia  1991;  Baiman  and  Verrecchia  1996).^ 
These  effects  hold  even  without  the  presence  of  moral  hazard.  As  cost  of  capital  increases,  the 
value  of  a  firm’s  assets  decreases.  From  an  investor’s  perspective,  accounting  amounts  that  are  less 
reliable  (e.g..  Level  3  fair  values)  are  assigned  a  higher  cost  of  capital  and,  therefore,  are  valued 
less  than  are  more  reliable  accounting  amounts  (e.g..  Level  1  fair  values).  Thus,  we  expect 
investors’  downward  adjustment  to  Level  3  fair  values  to  be  greater  because  of  a  greater  discount 
effect. 

We  now  turn  our  discussion  to  cash  flow  effects.  By  cash  flow  effect,  we  mean  that  investors 
perceive  management’s  estimate  of  future  cash  flows  to  differ  systematically  from  realized  future 
cash  flows  (i.e.,  bias).  When  certain  accounting  information  is  highly  subjective  in  nature,  and 
managers  are  allowed  to  exercise  a  high  degree  of  discretion  over  it,  managers  may  be  more  likely 
to  generate  intentional  biases  in  their  estimations  (e.g.,  Aboody  et  al.  2006;  Bartov  et  al.  2007).^ 
To  the  extent  that  these  biases  are  expected  on  average,  investors  likely  adjust  such  estimates  in 
valuing  the  firm.  Specifically,  if  investors  are  concerned  about  possible  overstatement  of  Level  3 
fair  value  assets  and  understatement  of  Level  3  fair  value  liabilities,  then  they  will  adjust  their 
valuation  of  management-reported  Level  3  assets  and  liabilities  to  less  than  1  and  —1,  respec¬ 
tively. 

In  addition,  less  reliable  accounting  information  reduces  the  ability  of  investors  to  monitor 
managerial  behavior,  potentially  reducing  the  firm’s  operating  performance  and  future  cash  flows. 
Bushman  and  Smith  (2001)  discuss  the  important  role  of  financial  accounting  information  as  a 
mechanism  to  discipline  managerial  behavior.  As  the  quality  of  financial  information  deteriorates, 
investors  lose  their  ability  to  link  the  activities  of  the  manager  to  firm  performance.  In  other  words, 
without  the  disciplining  mechanism  afforded  by  reliable  financial  accounting  information,  manag¬ 
ers  are  held  less  accountable  for  their  actions  and  therefore  operate  the  firm  less  efficiently  or 
extract  private  benefits  directly,  both  of  which  are  detrimental  to  firm  value. Level  3  fair  values 
are  less  observable,  making  it  more  difficult  (compared  to  Level  1  fair  values)  for  investors  to  link 
their  performance  to  managerial  decisions,  reducing  the  efficiency  of  these  activities. 

Because  of  the  issues  discussed  above,  investors  are  likely  to  decrease  the  weight  they  place 
on  less  reliable  Level  3  fair  value  measurements  in  their  equity-pricing  decisions  (Maines  and 
Wahlen  2006).  Level  1  fair  values  are  expected  to  suffer  less  from  these  issues,  as  these  amounts 
can  be  easily  verified  by  investors.  Thus,  we  expect  the  pricing  coefficient  of  Level  1  fair  value 
measurements  to  be  greater  than  that  of  Level  3  fair  value  measurements.  Regarding  Level  2  fair 
values,  these  measurements  potentially  represent  the  nfiddle  ground  of  reliability.  Level  2  fair 
values  are  based  on  observable  market  inputs,  but  in  some  cases  it  may  be  more  difficult  for 
investors  to  observe  directly  how  bank  managers  adapt  those  inputs  to  generate  reported  fair 
values.  Because  observability  of  market  inputs  for  Level  2  fair  values  is  meant  to  indicate  in¬ 
creased  reliability  of  fair  value  measurements,  we  expect  the  valuation  of  Level  2  fair  values  to  be 
greater  than  that  of  Level  3  fair  values,  but  we  do  not  make  a  prediction  for  the  difference  in 
valuation  of  Level  1  versus  Level  2  fair  values.  This  leads  to  our  first  hypothesis  (stated  in 
alternative  form): 


®  This  concept  relates  to  estimation  risk  (Barry  and  Brown  1985)  and  investors’  assessment  of  the  extent  to  which  a  firm’s 
cash  flows  are  correlated  with  other  firms’  cash  flows  (Lambert  et  al.  2007).  The  extent  to  which  a  firm’s  (assessed)  cash 
flows  are  non-diversifiable  positively  impacts  its  cost  of  capital.  These  conjectures  are  supported  empirically  (e.g.,  Leuz 
and  Verrecchia  2000;  Francis  et  al.  2004;  Hope  et  al.  2009). 

®  Another  type  of  bias  is  unintentional  optimism  by  managers  (e.g.,  psychological  upward  bias)  (Martin  et  al.  2006;  Roll 
1986). 

Several  studies  find  evidence  consistent  with  higher  quality  accounting  information  facilitating  investors’  ability  to 
monitor  managers’  decisions  and  therefore  improving  firm  performance  (Lombardo  and  Pagano  2002;  Bens  and  Mona¬ 
han  2004;  Kanodia  et  al.  2004;  Biddle  and  Hilary  2006;  Hope  and  Thomas  2008). 
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HI:  The  value  relevance  of  Level  1  and  Level  2  fair  values  is  greater  than  the  value  rel¬ 
evance  of  Level  3  fair  values. 

Although  not  explicitly  stated  as  part  of  our  hypothesis,  we  also  test  whether  valuation 
coefficients  differ  from  their  theoretically  predicted  values.  That  is,  if  reported  fair  values  accu¬ 
rately  represent  their  underlying  economic  value,  then  investors  are  expected  to  assign  dollar-for- 
dollar  value  to  these  amounts.  Easily  verifiable  in  most  instances,  one  would  predict  Level  1  and 
Level  2  assets  (liabilities)  to  have  valuation  coefficients  close  to  1  (—1).  Furthermore,  consistent 
with  arguments  above  to  support  HI,  the  more  subjective,  less  reliable  Level  3  assets  (liabilities) 
would  have  a  valuation  coefficient  less  than  1  (—1). 

The  Effect  of  Corporate  Governance  on  the  Value  Relevance  of  Fair  Values 

In  the  previous  section,  we  discuss  how  intrinsic  estimation  error  (noise)  and  management- 
induced  error  (bias)  inherent  in  fair  value  measurements  might  reduce  the  ability  of  investors  to 
rely  on  these  amounts.  In  this  section,  we  consider  the  extent  to  which  stronger  corporate  gover¬ 
nance  mechanisms  can  mitigate  these  problems.  We  do  so  by  testing  whether  the  value  relevance 
of  fair  values  varies  with  the  strength  of  a  firm’s  corporate  governance. 

Given  that  measurement  errors  are  likely  to  be  more  severe  for  inputs  without  observable 
prices  (Level  3  and  perhaps  Level  2)  than  for  inputs  directly  observable  in  active  markets  (Level 
1),  we  expect  corporate  governance  to  be  more  effective  at  mitigating  problems  associated  with 
Level  3  fair  values. 

Bhat  (2009)  provides  evidence  that  over  the  2003-2005  period  market  participants  perceived 
fair  value  gains  and  losses  of  banks  with  strong  corporate  governance  as  more  relevant  and 
reliable.  She  interprets  these  results  to  mean  that  corporate  governance  aids  market  participants  in 
evaluating  the  quality  of  fair  value  estimates.  Aboody  et  al.  (2006)  find  firms  with  stronger 
governance  are  less  likely  to  understate  option  value  estimates.  Penman  (2007)  discusses  the 
importance  of  both  the  competence  and  independence  of  monitors,  as  well  as  the  effectiveness  of 
internal  control  systems  in  minimizing  biases  in  Level  3  fair  value  estimates. 

To  the  extent  that  strong  corporate  governance  mechanisms  reduce  reliability  concerns  relat¬ 
ing  to  estimation  error  and  management  induced  bias  in  Level  3  fair  values,  investors  are  more 
likely  to  view  Level  3  fair  values  as  relevant,  which  leads  to  our  second  hypothesis  (stated  in 
alternative  form): 

H2:  Corporate  governance  has  a  greater  impact  on  the  value  relevance  of  Level  3  fair  values 
than  on  the  value  relevance  of  Level  1  or  Level  2  fair  values. 

III.  SAMPLE  SELECTION  AND  DESCRIPTIVE  STATISTICS 
Sample  Selection 

To  maximize  the  power  of  our  value  relevance  test,  we  focus  on  the  banking  industry  where 
firms  have  significant  amounts  of  fair  value  assets  and  liabilities.  Table  1  delineates  the  sample 
selection  process.  The  sample  firms  were  initially  identified  from  Compustat  Bank  Fundamentals 
Quarterly  Research  File.  To  be  included  in  the  sample,  firms  need  to  provide  the  first  quarter  FAS 
No.  157  fair  value  hierarchy  disclosure  after  November  15,  2007  (n  =  522).  To  avoid  the  con¬ 
founding  effect  of  prices  from  different  macroeconomic  events,  we  focus  on  firms  that  end  the  first 
quarter  on  March  31,  2008,  eliminating  10  non-March  31  quarter  firms  (n  =  512).  We  further 
require  firms  to  have  price  information  in  the  Center  for  Research  in  Security  Prices  (CRSP) 
database  (n  =  452).  To  avoid  the  effect  from  extreme  outliers,  we  follow  Belsley  et  al.  (1980)  and 
Fox  (1991),  eliminating  21  observations  that  have  studentized  residuals  greater  than  2  in  the 
estimation  of  Equation  (1)  below.  This  procedure  yields  an  initial  sample  of  431  firms. 
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TABLE  1 
Sample  Selection 


Process 


Firms  that  report  the  first  quarter  FAS  No.  157 
information  in  Compustat  Bank  Fundamentals 
Quarterly  after  November  15,  2007. 

Less:  Non-March  31  1st  Quarter  Ending  Firms 
Less:  Firms  that  do  not  have  price  information 
in  the  CRSP  database 

Less:  Outliers  that  have  studentized  residual 
greater  than  2  in  the  estimation  of  Equation  (1), 
following  Belsley  et  al.  (1980)  and  Fox  (1991) 

Sample  for  the  Test  of  HI  (Table  3) 

Sample  for  the  Additional  Type  Test  (Table  7) 
Less:  Firms  that  do  not  have  proxy  statements 
or  have  incomplete  information  on  governance 
variables 

Sample  for  the  Test  of  H2  (Table  6) 


#  of  Firms  #  of  Firms  #  of  Firms 
(1st  quarter)  (2nd  quarter)  (3rd  quarter)  Total 

522 


(10) 

(60) 

(21) 


431  419  410  1,260 

431  431 

(26) 


405  398  392  1,195 


This  table  delineates  the  sample  selection  process.  The  sample  firms  were  initially  identified  from  Compustat  Bank 
Fundamentals  Quarterly  Research  File.  To  be  included  in  the  sample,  firms  should  provide  the  first  quarter  FAS  No.  157 
fair  value  hierarchy  disclosure  after  November  15,  2007  (n  =  522).  To  avoid  the  confounding  effect  of  prices  from 
different  macroeconomic  events,  we  focus  on  firms  that  end  the  first  quarter  on  March  31,  2008,  eliminating  10  non-March 
31 -quarter  firms  (n  =  512).  We  further  require  firms  to  have  price  information  in  the  Center  for  Research  in  Security  Prices 
(CRSP)  database  (n  =  452).  To  avoid  the  effect  from  extreme  outliers,  we  follow  Belsley  et  al.  (1980)  and  Fox  (1991), 
eliminating  21  observations  that  have  studentized  residuals  greater  than  2  in  the  estimation  of  Equation  (1).  This  procedure 
yields  an  initial  sample  of  431  firms  for  HI.  Hypothesis  2  requires  the  sample  firms  to  have  valid  governance  variables.  We 
further  eliminate  26  firms  due  to  missing  proxy  statements  or  incomplete  proxy  statements  to  determine  the  value  of 
corporate  governance  variables  of  interest  in  this  study.  An  additional  analysis  (Table  7)  is  performed  based  on  the  first 
quarter  observations  only  (n  =  431)  due  to  manual  data  collection  of  fair  value  asset/liability  Type  information. 


We  test  HI  for  the  first  three  quarters  of  2008.  Some  firms  fall  from  the  sample.  Therefore  the 
number  of  firms  in  the  second  and  third  quarters  is  reduced,  resulting  in  a  total  of  1,260  firm- 
quarter  observations  for  tests  of  HI. As  discussed  in  more  detail  below,  tests  of  H2  require 
hand-collected  corporate  governance  data.  Thus,  we  collect  corporate  governance  data  only  for  the 
first  quarter  of  2008  (i.e.,  using  the  initial  431  firms)  and  assume  that  these  governance  variables 
remain  constant  in  2008.  For  tests  of  H2,  missing  or  incomplete  proxy  statement  data  eliminate  65 
bank-quarter  observations  for  26  firms  from  these  tests  (n  =  1,195). 

Descriptive  Statistics 

Panel  A  of  Table  2  provides  descriptive  statistics  on  the  relative  size  of  fair  value  assets  and 
liabilities  from  1,260  firm-quarters.  Compared  to  total  assets  and  total  liabilities,  the  mean  total 


"  In  sensitivity  tests  discussed  below,  we  rerun  all  tests  using  a  constant  sample  of  firms  (i.e.,  firms  that  have  complete 
data  for  the  first  three  quarters  of  2008).  None  of  our  inferences  are  affected  when  using  this  slightly  reduced  sample. 
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fair  value  assets  and  liabilities  are  about  15  percent  and  0.4  percent,  respectively.'^  The  fair  value 
amounts  under  Level  2  inputs  account  for  most  fair  values. 

Panel  B  of  Table  2  presents  descriptive  statistics  of  variables  used  to  test  for  the  value 
relevance  of  Level  information  under  FAS  No.  157.  All  variables  are  per-share  numbers.  The  mean 
share  price  {PRC)  is  14.024.  The  mean  of  non-fair  value  assets  {NFVA)  is  133.606,  and  the  means 
of  fair  value  assets  using  Level  1  inputs  {FVAl),  Level  2  inputs  {FVA2),  and  Level  3  inputs  {FVA3) 
are  1.842,  21.150,  and  0.889,  respectively.  Similarly,  the  mean  of  non-fair  value  liabilities  (NFVL) 
is  142.619,  whereas  the  means  of  fair  value  liabilities  using  Level  1  inputs  (FVLl),  Level  2  inputs 
{FVL2),  and  Level  3  inputs  {FVL3)  are  0.071,  0.783,  and  0.044,  respectively. 

In  Panel  C  of  Table  2  we  provide  a  matrix  of  the  reporting  frequency  of  fair  value  Levels  by 
hand-collected  Type  information  from  the  first  quarter  10-Qs  of  our  431  sample  firms.  For  Type 
information,  we  classify  fair  value  assets  and  liabilities  into  the  following  categories;  (1)  invest¬ 
ment  securities  {INVSEC)  including  trading  securities  and  available-for-sale  securities;  (2)  deriva¬ 
tive  assets  (ADEV);  (3)  loans  (LOAN);  (4)  other  assets  (AOTHER)  including  mortgage  service 
rights,  federal  funds  sold,  asset-backed  securities,  and  other  investments;  (5)  trading  liabilities 
(TRDL)-,  (6)  long-term  debt  {LTDEBT)\  (7)  derivative  liabilities  (LDEV);  and  (8)  other  liabilities 
{LOTHER).  Panel  C  shows,  for  example,  that  260  out  of  431  firms  (60.32  percent)  report  Level  1 
investment  securities.  Similarly,  390  firms  (90.49  percent)  and  122  firms  (28.31  percent)  report 
Level  2  and  Level  3  investment  securities,  respectively.  Across  the  four  asset  types,  a  reasonable 
number  of  firms  disclose  fair  values  at  all  three  levels,  with  just  a  few  exceptions.  The  average 
frequency  across  the  12  cells  is  21.75  percent.  However,  for  liabilities,  the  average  frequency 
across  the  12  cells  is  only  3.85  percent,  and  only  one  cell  (Level  2  derivative  liabilities,  LDEV)  has 
a  reasonable  number  of  observations.  In  sum,  both  the  amount  and  the  frequency  of  fair  values  are 
much  greater  for  assets  than  for  liabilities.  Nevertheless,  the  fact  that  all  asset/liability  types  are 
represented  in  all  levels  of  FAS  No.  157’s  hierarchy  provides  evidence  of  the  potential  usefulness 
of  these  expanded  disclosures. 

Whereas  Panel  C  of  Table  2  shows  how  many  firms  report  nonzero  values  for  each  type  and 
level.  Panel  D  provides  the  relative  dollar-value  size  of  fair  value  hierarchy  information  within 
each  fair  value  asset  type.  For  example,  out  of  424  nonzero  INVSEC  observations  from  the  first 
quarter  of  2008,  260,  390,  and  122  firms  have  Level  1,  Level  2,  and  Level  3  investment  securities, 
and  the  relative  sizes  of  these  assets  are  15.7  percent,  82.2  percent,  and  2.1  percent  of  total 
investment  securities,  respectively.  The  relative  size  is  qualitatively  similar  to  relative  frequency 
reported  in  Panel  C. 


IV.  RESULTS 

Value  Relevance  of  Fair  Value  Hierarchy  (HI) 

To  test  the  value  relevance  of  FAS  No.  157’s  fair  value  hierarchy,  we  estimate  the 
association  between  share  prices  and  fair  values  of  assets  and  liabilities  per  share  using  a  modified 
Ohlson  (1995)  model,  which  has  been  extensively  employed  in  the  literature.  Barth  and  Clinch 
(2009)  provide  evidence  that  share-deflated  specifications  (as  opposed  to  equity  book  value- 


The  relative  size  of  fair  value  assets  and  non-fair  value  assets  reported  in  this  study  is  comparable  to  those  reported  in 
Bech  and  Rice  (2009).  They  collect  balance  sheet  data  from  the  Consolidated  Reports  of  Condition  and  Income  for 
ensured  domestic  commercial  banks  and  non-deposit  trust  companies.  They  report  that  in  2008  (our  sample  period)  81 
percent  of  banks’  assets  consist  of  the  following:  loans  and  leases  (57  percent),  noninterest-earning  assets  (15  percent), 
federal  funds  transactions  (6  percent),  and  other  (3  percent).  These  amounts  are  typically  carried  at  historical  cost  or 
carrying  amount.  The  remaining  19  percent  of  assets  are  classified  as  investments.  Of  this  amount,  any  securities  not 
classified  as  held-to-maturity  would  be  reported  at  fair  value,  requiring  classification  into  Levels  1-3.  Thus,  the  portion 
of  fair  value  assets  for  our  sample  (15  percent)  is  similar  to  that  of  all  banking  institutions  in  the  U.S. 
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deflated,  lagged  price-deflated,  returns,  or  equity  market  value-deflated  specifications)  perform  the 
best  in  reducing  scale  effects  in  the  modified  Ohlson  (1995)  model.  We  partition  book  value  into 

13 

non-fair  value  assets  and  liabilities  and  each  of  the  fair  value  levels. 


PRCi,  =  ao  +  a^NFVAit  +  a2FVAl  i,  +  a^FYAl^j  +  a^FYAS^  +  a^NFYLi,  +  a^FYL12i, 

+  a2FYL3,,  +  /3,NI,,  +  s,,  (1) 

Due  to  the  low  frequency  of  fair  value  liability  reporting  in  our  sample  (see  Table  2),  we 
combine  Level  1  and  Level  2  liabilities  {FYL12).  The  dependent  variable,  PRC,  is  per  share  price 
measured  on  the  10-Q  filing  month-end  for  firm  i  in  quarter  t.  Other  variables  are  as  defined 
previously  on  a  per  share  basis. 

In  estimating  Equation  (1),  we  pool  observations  from  the  first  three  quarters  of  2008  for  each 
firm.  Residuals  could  be  correlated  across  quarters  or  across  firms,  so  we  correct  standard  errors 
and  related  t-statistics  based  on  two  dimensions  (i.e.,  firms  and  quarters)  following  Petersen 
(2009).'^  Petersen  (2009)  shows  that  this  standard  error  adjustment  (or  clustering  by  two  dimen¬ 
sions)  produces  less-biased  standard  errors. 

Hypothesis  1  examines  whether  the  value  relevance  of  Level  1  and  Level  2  fair  values  is 
greater  than  the  value  relevance  of  Level  3  fair  values.  In  Table  3  we  report  results  of  Equation  (1) 
and  test  whether  the  valuation  coefficient  is  0  (Column  A),  whether  it  is  different  from  1  for  assets 
(Column  B),  and  whether  it  is  different  from  —1  for  liabilities  (Column  C).  Before  reporting  tests 
of  HI,  we  first  note  that  estimated  coefficients  for  both  non-fair  value  assets/liabilities  and  fair 
value  assets/liabilities  are  different  from  0,  indicating  their  value  relevance.  In  addition,  we  find 
that  the  coefficients  on  FYAl  and  FYA2  are  not  different  from  their  theoretically  predicted  value  of 
1.  In  contrast,  the  coefficient  on  FYA3  is  significantly  less  than  1,  meaning  that  investors  place  less 
weight  on  Level  3  fair  value  assets  relative  to  Level  1  and  Level  2.  At  the  bottom  of  Table  3,  we 
provide  tests  of  whether  the  coefficients  across  Levels  are  equal.  We  find  that  the  coefficients  on 
Level  1  and  Level  2  assets  are  not  significantly  different,  and  these  coefficients  are  both  signifi¬ 
cantly  greater  than  the  coefficient  on  Level  3  assets.  These  results  support  HI. 

As  for  liabilities,  the  coefficient  on  FYL12  is  not  statistically  different  from  its  predicted  value 
of  —  1 .  For  FYL3,  we  find  the  valuation  coefficient  to  be  significantly  less  than  —  1  (or  greater  than 
1  in  absolute  terms)  and  less  than  the  coefficient  on  FYL12.  The  coefficient’s  magnitude  is 
consistent  with  investors’  perception  that  Level  3  fair  value  liabilities  are  understated. 

In  summary.  Level  1  and  Level  2  fair  value  estimates  are  value-relevant  and  valuation  coef¬ 
ficients  are  not  statistically  different  from  their  theoretical  values  of  1  or  —  1 .  For  Level  3  fair 


We  believe  that  a  price  model  rather  than  a  returns  model  is  appropriate  for  testing  our  hypotheses  because  our  research 
question  is  to  determine  whether  fair  value  hierarchy  is  reflected  in  firm  value  (i.e.,  value  relevance),  rather  than  testing 
whether  fair  value  hierarchy  is  reflected  in  changes  in  firm  value  over  a  specific  period  of  time  (see  Barth  et  al.  2001). 
As  an  example  of  how  failure  to  adjust  standard  errors  in  this  manner  can  affect  inferences  in  accounting  research,  see 
Gow  et  al.  (2010).  We  used  cluster!  command  in  STATA  provided  by  Mitchell  Petersen.  Our  results  are  robust  to 
alternative  panel  data  model  specifications  such  as  the  firm  fixed  effect  model. 

Based  on  discussion  in  Barth  and  Clinch  (2009),  our  theoretical  prediction  is  that  the  coefficient  on  non-fair  value  assets 
(liabilities)  would  be  between  0  and  1  (—1).  We  note  that  our  observed  coefficients  are  similar  to  those  reported  in 
earlier  research  (e.g.,  Barth  (1994)  who  reports  a  coefficient  of  0.79  for  historical  cost  book  value  in  a  regression  similar 
to  ours)  and  to  those  reported  in  more  recent  studies  (e.g.,  Kolev  (2009)  reports  coefficients  on  book  value  of  equity 
between  0.709  and  0.748). 
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values,  valuation  coefficients  suggest  that  investors  place  less  weight  on  these  less  reliable 
amounts.*® 

As  a  robustness  test,  we  consider  that  results  in  Table  3  could  be  confounded  by  certain  bank 
characteristics.  For  example,  suppose  larger  banks  tend  to  report  Level  1  and  Level  2  fair  values, 
whereas  smaller  banks  tend  to  report  Level  3  fair  values.  The  differences  in  value  relevance 
reported  in  Table  3  could  reflect  differences  in  bank  characteristics  rather  than  differences  in  fair 
value  levels.  For  similar  reasons,  we  may  also  consider  that  differences  in  capital  ratio  are  corre¬ 
lated  with  managers’  choice  of  valuation  levels.  If  this  is  the  case,  and  investors  value  banks’ 
assets  according  to  bank  characteristics,  then  the  results  in  Table  3  could  be  confounded. 

To  examine  these  issues,  we  repeat  the  analyses  with  subsamples  partitioned  by  these  bank 
characteristics  in  Table  4  (i.e.,  large  versus  small  banks  in  Panel  A  and  high  versus  low  Tier  1 
capital  ratio  banks  in  Panel  B).  We  find  that  results  for  HI  generally  hold  for  all  subsamples. 
Specifically,  we  find  across  subsamples  that  (1)  all  of  fair  value  hierarchies  are  value-relevant,  (2) 
Level  3  asset  valuation  is  less  than  that  of  Level  1  or  Level  2  assets  except  for  the  low  capital  ratio 
subsample,  (3)  Level  3  liability  valuation  is  more  negative  than  those  of  Level  1  and  Level  2 
liabilities,  (4)  Level  1  and  Level  2  asset  valuations  are  not  significantly  different  from  1,  (5)  Level 
3  asset  valuation  is  less  than  1,  (6)  Level  1  and  Level  2  liability  valuations  are  not  significantly 
different  from  —1,  and  (7)  Level  3  liability  valuation  is  less  than  —1.*^  We  also  note  that  the 
valuation  coefficients  for  small  banks  tend  to  be  lower  than  those  for  large  banks.  These  results  are 
consistent  with  investors  perceiving  fair  value  estimates  by  managers  at  small  banks  as  being  less 
reliable. 

The  Effect  of  Corporate  Governance  on  the  Value  Relevance  of  Fair  Values  (H2) 

Results  so  far  indicate  that  FAS  No.  157  Level  information  is  value-relevant.  Given  the  high 
level  of  information  asymmetry  between  managers  and  investors,  especially  relating  to  reliability 
of  Level  3  and  perhaps  Level  2  inputs,  H2  tests  whether  investors  place  differential  weights  on  fair 
values  across  Levels,  based  on  the  firm’s  corporate  governance  mechanisms.  Based  on  prior 
literature,  we  use  six  measures  of  corporate  governance  mechanisms  germane  to  the  reliability  of 
Level  3  fair  value  estimates.  The  six  governance  measures  include  (1)  board  independence 
{BDIND)^^  measured  by  the  number  of  independent  board  members  divided  by  the  number  of  total 
board  members,  (2)  audit  committee  financial  expertise  (ACFE)  measured  by  the  number  of 
audit  committee  members  with  financial  expertise  divided  by  the  number  of  total  audit  committee 


Recall  from  Table  1  that  the  number  of  firms  in  the  sample  decreases  from  431  in  the  first  quarter  to  410  by  the  third 
quarter  for  tests  of  HI.  We  repeat  tests  using  only  the  410  firms  that  have  data  for  all  three  quarters  (i.e.,  1,230 
firm-quarter  observations).  We  find  results  nearly  identical  to  those  reported  in  Table  3.  Coefficients  for  FVAl,  FVA2, 
and  FVLI2  (0.958,  0.962,  and  -1.003,  respectively)  are  not  significantly  different  from  1  or  -1,  the  coefficient  on 
FVA3  is  less  than  1  (0.611)  and  significant  at  the  0.01  level,  and  the  coefficient  on  FVLi  is  less  than  -1  (-1.983)  and 
significant  at  the  0.01  level. 

We  also  conduct  tests  by  including  firm  size  and  capital  ratio  as  separate  independent  variables  in  a  pooled  regression 
(untabulated).  These  variables  are  included  in  alternate  forms  (indicator,  deciles,  or  continuous)  and  either  interacted  or 
non-interacted  with  main  test  variables.  Under  all  specifications,  we  continue  to  reach  the  same  conclusion;  levels 
valuations  are  not  driven  by  differences  in  firm  size  or  capital  ratio.  We  also  consider  a  growth  factor  (i.e.,  the  asset 
growth)  and  do  not  find  any  evidence  that  the  control  for  the  asset  growth  alters  our  conclusions. 

Dechow  et  al.  (1996)  and  Beasley  (1996)  show  that  firms  that  are  investigated  by  the  SEC  had  boards  with  significantly 
lower  percentages  of  outside  members.  Similarly,  both  Klein  (2002)  and  Peasnell  et  al.  (2006)  find  a  negative  relation 
between  board  independence  and  earnings  management.  Ajinkya  et  al.  (2005)  provide  evidence  that  the  percentage  of 
outside  directors  on  the  board  is  negatively  associated  with  optimistic  biases  in  management  forecasts. 

Section  407  of  SOX  required  the  SEC  to  issue  a  rule  requiring  public  companies  to  disclose  whether  their  audit 
committee  has  at  least  one  financial  expert.  Additionally,  if  companies  do  not  have  financial  experts  on  their  audit 
committees,  then  they  are  required  to  disclose  that  fact.  Recent  studies  provide  evidence  that  audit  committees  with 
more  financial  expertise  and  activities  are  less  likely  to  manage  earnings  and  to  issue  restatements  (Abbott  and  Parker 
2000;  Agrawal  and  Chadha  2005;  Bedard  et  al.  2004). 
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members,  (3)  the  frequency  of  annual  audit  committee  meetings  (ACMEET),  (4)  total  percent  of 
shares  held  by  institutional  investors  {INSTHOLDPClf^  calculated  from  13-F  filings,  (5)  size  of 
audit  engagement  office  (AUDITOEEICESIZE),^^  and  (6)  no  material  control  weaknesses  problem 
{NOMCWf^  under  Sarbanes-Oxley  Act  (SOX)  302  or  404.  Prior  studies  indicate  that  firms  with 
independent  boards,  highly  financially  literate  audit  committees,  active  audit  committees,  the 
presence  of  institutional  investors,  audits  by  auditors  from  larger  offices,  and  no  material  control 
weaknesses  are  less  likely  to  engage  in  financial  reporting  biases. 

Panel  A  of  Table  5  provides  the  descriptive  statistics  for  the  six  attributes  of  corporate  gov¬ 
ernance.  The  mean  value  of  BDIND  is  0.78,  reflecting  the  improved  board  independence  after 
SOX.  The  mean  of  ACEE  is  0.32,  indicating  that  one-third  of  the  audit  committee  members  are 
classified  as  financial  experts  under  SOX.  Audit  committees  in  our  sample  meet  eight  times 
annually.  On  average,  30  percent  of  shares  of  sample  firms  are  held  by  institutional  investors,  and 
the  average  log  of  audit  engagement  office  audit  fee  revenues  is  15.04.  Finally,  82  percent  of  our 
sample  firms  do  not  report  any  material  control  weakness  problem  (NOMCW)  prior  to  2007.  Panel 
B  of  Table  5  provides  the  correlation  coefficients  among  corporate  governance  variables. 

To  reduce  the  random  measurement  error  of  individual  governance  variables  and  to  parsimo¬ 
niously  summarize  the  underlying  latent  construct  of  governance  quality,  we  create  a  standardized 
governance  score  (GOVSCORE)  based  on  the  principal-component  factor  analysis  of  six  afore¬ 
mentioned  governance  variables.  Archival  accounting  research  often  uses  this  methodology  to 
summarize  investor  characteristics  (e.g.,  Bonner  et  al.  2003)  or  firm  characteristics  (e.g.,  Baik  et 
al.  2009).  The  factor  loadings  for  the  varimax  orthogonal  rotation  are  shown  in  the  first  column  of 
Panel  C  of  Table  5,  representing  how  individual  governance  variables  are  weighted  for  (i.e., 
correlated  with)  GOVSCORE.  Consistent  with  our  intuition,  all  governance  variables  are  posi¬ 
tively  loaded  in  generating  GOVSCORE.  The  factor  analysis  generates  a  factor  with  the  eigenvalue 
of  1.878,  which  accounts  for  about  39  percent  of  the  total  variations  in  the  original  variables.  The 
second  column  provides  the  Kaiser-Meyer-OLkin  measure  of  sampling  adequacy  (Kaiser  1974). 
The  mean  of  this  statistic  (0.62)  is  greater  than  0.5,  indicating  that  GOVSCORE  well  captures  the 
underlying  common  factor  of  six  individual  variables  (Stewart  1981).  Panel  D  of  Table  5  shows 
the  distribution  of  GOVSCORE.  Due  to  this  standardization,  the  mean  and  the  standard  deviation 
of  GOVSCORE  are  0  and  1,  respectively.  Based  on  GOVSCORE,  we  create  the  decile  rank 
(GOVRANK)  from  0  to  9,  and  then  scale  by  9.  To  test  H2,  we  estimate  Equation  (2). 

PRC  it  =aQ  +  a^NFVAit  -l-  a2pVAlit  +  a^EVAlit  -I-  a^FVASi,  +  a2aFVAli,  *  GOVRANKi 
-t  a-ittEVA2it  *  GOVRANKi  +  a^aFVASu  *  GOVRANKi  +  a^NFVLn  +  af^EVLlln 
+  ajFVLSi,  +  ySi  A4  +  /32GOVRANKi  +  e,-,  (2) 

Equation  (2)  is  essentially  identical  to  Equation  (1)  with  the  exception  that  all  levels  of  fair 


Institutional  investors  appear  to  play  a  monitoring  role  over  corporate  financial  reporting.  Chung  et  al.  (2002)  find  a 
negative  association  between  earnings  management  and  the  presence  of  institutional  investors.  Bushee  (1998)  and 
Bange  and  DeBondt  (1998)  find  that  the  presence  of  institutional  investors  with  a  long-term  investment  horizon  reduces 
the  level  of  earnings  management  through  R&D  activities.  Ajinkya  et  al.  (2005)  provide  evidence  that  institutional 
ownership  has  a  favorable  effect  on  the  likelihood  of  forecast  occurrence  as  well  as  the  frequency  of  issuance. 
Previous  studies  use  a  dichotomous  variable  (e.g.,  Big  N  auditors)  to  measure  the  quality  of  the  auditor.  Recent  studies 
(Francis  and  Yu  2009;  Choi  et  al.  2010)  further  show  that  even  within  brand  name  auditors,  the  size  of  a  specific 
engagement  office  matters  for  the  audit  quality  and  audit  fees.  Thus,  following  Francis  and  Yu  (2009),  we  calculate  the 
auditor  office  size  of  our  sample  firms  based  on  their  2007  audit  fee  revenues  to  proxy  for  the  quality  of  audit  provided 
for  our  sample  firms. 

Ashbaugh-Skaife  et  al.  (2008)  and  Doyle  et  al.  (2007)  document  a  negative  association  between  disclosed  control 
weaknesses  and  earnings  quality.  Similarly,  Feng  et  al.  (2009)  show  that  ineffective  internal  controls  are  likely  to  induce 
less  accurate  management  earnings  forecasts. 
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value  assets  are  also  interacted  with  GOVRANK.  Because  the  frequency  and  amount  of  fair  value 
assets  greatly  exceeds  those  of  fair  value  liabilities,  we  limit  our  design  to  fair  value  assets.  If 
investors  consider  that  high-quality  corporate  governance  can  mitigate  the  concerns  of  managers’ 
opportunistic  reporting  or  errors  inherent  to  the  estimation,  then  we  predict  the  coefficients  on  the 
interaction  with  GOVRANK  to  be  positive. 

Panel  A  of  Table  6  provides  the  results  of  the  estimation  of  Equation  (2).  The  coefficient  on 
fair  value  assets  alone  (i.e.,  FVAl,  FVA2,  and  FVA3)  can  be  interpreted  as  the  valuation  of  fair 
value  assets  for  firms  in  the  lowest  governance  decile.  The  coefficients  of  the  interaction  terms  are 
the  incremental  valuations  when  moving  from  the  lowest  decile  to  the  highest  decile.  As  shown  in 
the  table,  the  coefficients  on  all  three  non-interaction  terms  are  positive,  but  only  FVAl  and  FVA2 
are  significantly  different  from  zero.  This  difference  suggests  that  Level  1  and  Level  2  fair  values 
of  low  governance  firms  are  value-relevant,  but  Level  3  fair  values  are  not.  It  is  also  noteworthy 
that  the  coefficients  on  Level  1  and  Level  2  assets  (0.81  and  0.83)  are  less  than  1  at  marginal 
significance  levels  (0.07  level,  one-tailed). 

The  interaction  terms  are  positive  for  all  three  levels,  but  only  Level  2  and  Level  3  are 
significant.^^  By  adding  the  coefficient  of  the  interaction  term  (e.g.,  FVAl  *  GOVRANK)  with  that 
of  the  non-interaction  term  (e.g.,  FVAl),  we  can  determine  the  valuation  coefficient  for  the  highest 
decile  (i.e.,  firms  with  strongest  corporate  governance).  Lor  the  highest  decile,  the  valuations  of 
Level  1  and  Level  2  assets  increase  to  near  1  (1.01  and  0.97).  Lurthermore,  the  level  of  corporate 
governance  has  the  greatest  impact  on  valuation  of  Level  3  fair  value  assets.  The  coefficient  on  the 
lowest  decile  (i.e.,  the  FVA3  term)  is  0.06;  this  coefficient  increases  by  0.76  (i.e.,  the  FVA3  * 
GOVRANK  term)  to  0.82  for  the  highest  decile.  Overall,  these  findings  support  the  importance  of 
corporate  governance  in  valuing  all  levels  of  fair  values,  but  especially  Level  3  fair  values  which 
likely  represent  the  values  with  greatest  information  asymmetry.^'*’^^ 

Lor  completeness,  we  also  examine  each  of  the  six  corporate  governance  measures.  Given  that 
each  measure  in  isolation  provides  a  limited  picture  of  a  firm’s  overall  corporate  governance,  we 
view  these  results  as  exploratory  for  the  interested  reader.  We  find  that  the  coefficient  on  FVA2  * 
GOVRANK  is  positive  for  five  of  our  six  corporate  governance  measures  and  significant  for  three 
(audit  committee  meetings,  institutional  holding,  and  no  material  control  weaknesses  problems). 
The  coefficient  on  FVA3  *  GOVRANK  is  also  positive  for  five  of  the  six  governance  measures  and 
significant  for  three  (board  independence,  auditor  office  size,  and  no  material  control  weaknesses 
problem).  The  coefficient  on  FVAl  *  GOVRANK  is  positive  only  for  four  of  the  governance 
measures  and  significant  for  two  (audit  committee  meetings  and  no  material  control  weaknesses 


It  should  be  noted  that  while  the  coefficient  for  FVAl  *  GOVRANK  is  significant  and  the  coefficient  for  FVAl  * 
GOVRANK  is  not,  the  two  coefficients  have  similar  magnitudes  and  are  not  significantly  different.  Thus,  we  do  not  rule 
out  the  possibility  that  the  insignificance  of  FVAl  *  GOVRANK  is  due  to  lower  power  tests  of  fewer  Level  1  assets  being 
reported  (see  Table  2).  However,  we  believe  the  results  are  primarily  attributable  to  Level  1  assets  being  less  affected  by 
governance. 

When  we  restrict  the  sample  to  the  392  firms  that  have  data  for  all  three  quarters,  we  find  similar  results.  Specifically, 
for  tests  of  H2,  the  coefficients  on  FVAl  *  GOVRANK  and  FVA3  *  GOVRANK  are  positive  (0.140  and  0.441)  and 
significant  at  the  0.05  level,  while  the  coefficient  on  FVAl  *  GOVRANK  is  positive  (0.194)  but  not  significant. 

As  a  sensitivity  test,  we  examine  the  impact  of  corporate  governance  on  the  valuation  of  fair  value  liabilities.  As 
discussed  previously,  results  related  to  fair  value  liabilities  should  be  considered  exploratory.  The  low  frequency  of 
reported  fair  value  liabilities  (see  Table  2)  makes  it  difficult  to  draw  reliable  conclusions.  Nevertheless,  we  interact 
GOVRANK  with  FVLll  and  FVL3  in  Equation  (4)  to  test  H2.  First,  we  note  that  none  of  our  inferences  regarding  the 
impact  of  corporate  governance  on  fair  value  assets  changes.  Second,  we  find  no  impact  of  GOVRANK  on  FVLll.  Third, 
we  find  that  the  coefficient  on  FVL3  *  GOVRANK  is  significantly  negative  at  the  0.10  level.  The  unexpected  sign  on 
FVL3  *  GOVRANK  is  likely  due  to  the  high  correlation  between  FVA3  *  GOVRANK  and  FVL3  *  GOVRANK.  The  large 
number  of  observations  with  zero  FVL3  reduces  the  ability  to  draw  reliable  conclusion  regarding  the  coefficient  on 
FVL3  *  GOVRANK. 
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problem). Overall,  the  analyses  of  individual  measures  continue  to  support  the  greater  impact  of 
governance  on  valuation  of  Level  2  and  Level  3  assets. 

Incremental  Value  Relevance  of  Fair  Value  Hierarchy 

In  this  section,  we  consider  whether  FAS  No.  157’s  Level  information  is  incremental  to 
existing  Type  information  (defined  in  Panel  C  of  Table  2).  In  particular,  we  examine  whether  the 
ability  of  Level  information  provided  by  FAS  No.  157  interacted  with  existing  asset  and  liability 
Type  information  has  a  greater  explanatory  power  for  firm  value  than  does  Type  information 
alone. The  incremental  value  relevance  of  FAS  No.  157  fair  value  hierarchy  disclosures  will 
depend  on  whether  investors  differentially  value  inputs  across  the  three  levels  and  whether  firms 
report  sufficient  fair  value  asset  and  liability  types  (i.e.,  existing  disclosures)  where  investors  are 
not  able  to  easily  discern  the  level  of  input. 

Based  on  manually  collected  fair  value  Type  data  (Panel  C  of  Table  2)  from  the  first  quarter 
of  2008  (n  =  431),  we  assess  the  incremental  value  relevance  of  the  fair  value  hierarchy  by 
comparing  the  explanatory  powers  of  Equation  (3)  (Type  information)  versus  Equation  (4)  (Type 
disaggregated  into  Level  information). 

PRCit  =  <5o  +  SjNFVAit  +  S2lNVSECii  +  S^LOANii  +  84ADEVi,  +  SsAOTHERi,  +  S^EVLi, 

+  SjTRDLi,  +  SgETDEBTi,  +  SgEDEV^  +  SjoLOTHERit  +  /3;A4  +  s,,.  (3) 

3  3 

PRC  it  =  00  +  4>jNPVAit  +  X  Va^8!VSEC  by  Each  Leveli,  +  2  lisLOAN  by  Each  Levelit 

A=I  B=I 

3  3 

+  2  cocADEV  by  Each  Leuelit+  2  ctqAOTHER  by  Each  Leveli,+  aNEVLij 

C=I  D=1 

2  2 

+  2  keTRDL  by  Each  Levels  +  2  dfLTDEBT  by  Each  Levelit 

E=1  F=1 

2  2 

+  2  ‘ic^DEV  by  Each  LevelnA-  ^  ipf^LOTHER  by  Each  Levelit  +  PjNIit  +  Ei,.  (4) 

G=I  H=I 

Table  7  provides  results  of  Equations  (3)  and  (4)  in  the  left  and  the  right  columns,  respec¬ 
tively.  The  from  the  estimation  of  Equation  (3)  is  67.75  percent  and  most  coefficients  have  their 
predicted  sign  and  are  statistically  significant.  The  from  the  estimation  of  Equation  (4)  is  69.6 
percent.  For  this  equation  we  do  not  interpret  coefficients.  Given  the  large  number  of  fair  value 
components,  the  relatively  low  number  of  nonzero  observations  in  each  cell,  and  the  potential  for 
high  collinearity  among  variables,  coefficients  are  likely  unreliable.  Note,  however,  that  the  sign 
and  the  magnitude  of  coefficients  appear  to  be  appropriate  as  long  as  the  coefficients  are  statisti¬ 
cally  significant  at  the  conventional  level.  Our  focus  is  the  explanatory  power  difference  across 
these  models.  When  we  formally  test  whether  these  two  R^s  are  statistically  different,  the  Vuong’s 
Z-statistic  (Vuong  1989)  is  2.22  with  a  p-value  less  than  0.01,  suggesting  that  when  the  Level 


Unlike  Kolev  (2009),  we  find  no  evidence  that  the  value  relevance  of  fair  values  varies  based  on  audit  committee 
financial  expertise.  His  sample  includes  a  small  number  of  banks  (88),  financial  services  firms  (35),  and  insurance 
companies  (54).  Kolev  (2009)  also  finds  that  valuations  of  fair  values  are  lower  for  firms  with  lower  equity  capital  and 
firms  that  develop  their  mark-to-model  internally. 

Clor-Proell  and  Warfield  (2009)  provide  experimental  evidence  that  the  expanded  disclosure  format  on  fair  value  level 
information  may  improve  non-professional  investors’  judgment  performance. 
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information  is  used  together  with  the  Type  information,  FAS  No.  157  Level  information  is  incre¬ 
mentally  value-relevant.  We  caution  that  the  economic  significance  may  be  small  because  the 
increase  in  is  relatively  small.^^  However,  given  that  most  assets  and  liabilities  for  our  banking 
sample  are  not  reported  at  fair  values,  the  small  increase  in  explanatory  power  should  not  be 
surprising. 

Tests  of  HI  and  H2  Over  Time 

Finally,  we  consider  how  conclusions  may  have  changed  as  the  economic  crisis  worsened  and 
market  illiquidity  became  more  severe  in  2008.  We  estimate  Equation  (1)  for  each  of  the  first  three 
quarters.  The  first  (untabulated)  result  to  note  is  the  decrease  in  of  the  model  over  time.  In  the 
first  quarter,  the  R^  equals  67.57  percent  but  falls  to  50.91  percent  by  the  third  quarter.  This  result 
likely  reflects  the  greater  noise  in  stock  prices  as  the  economic  crisis  deepened.  Second,  we  find 
that  the  valuation  coefficients  of  FVAl  and  FVA2  are  not  significantly  different  from  1  for  all  three 
quarters.  FVL12  is  approximately  —1  for  the  first  and  second  quarters  and  then  becomes  slightly 
less  than  —1  (i.e.,  absolute  value  greater  than  1)  and  significant  at  the  0.10  level  in  the  third 
quarter.  Finally,  the  coefficient  on  FVA3  moves  slightly  closer  to  1  as  the  year  proceeds  (from 
0.615  in  the  first  quarter  to  0.766  in  the  third  quarter).  Similarly,  FVL3  moves  closer  to  —  1 
(—2.302  in  the  first  quarter  to  —1.669  in  the  third  quarter).  Thus,  if  anything,  valuations  of  Level 
3  assets  and  liabilities  moved  eloser  to  their  theoretically  predicted  values  of  1  and  —  1  as  the  crisis 
worsened.  Although  these  results  provide  some  initial  evidence  of  investors’  perceptions  of  fair 
values  during  an  economic  crisis,  we  caution  that  these  results  should  be  viewed  as  exploratory  at 
best  because  we  do  not  have  strong  predictions  on  the  outcome  of  this  test. 

For  tests  of  H2,  the  second  quarter  seems  to  be  the  quarter  that  best  distinguishes  the  impact 
of  corporate  governance  on  the  valuation  of  fair  value  assets.  Only  in  this  quarter  is  the  valuation 
of  all  three  Levels  significantly  greater  for  strong  corporate  governance  firms  compared  to  low 
governance  firms.  Given  that  the  economic  crisis  seemed  to  steadily  worsen  as  the  year  pro¬ 
gressed,  it  is  difficult  to  draw  definitive  conclusions  on  the  joint  effect  of  corporate  governance 
and  market  liquidity  on  valuations  of  fair  value  assets.  Clearly,  this  area  promises  to  be  fruitful  for 
future  research  as  more  data  become  available  and  tests  are  refined. 

V.  CONCLUSIONS 

Under  FAS  No.  157,  firms  are  required  to  disclose  fair  values  of  asset  and  liability  types  by 
levels,  where  levels  are  based  on  inputs  used  to  generate  fair  values:  (1)  Level  1  (observable  inputs 
from  quoted  prices  in  active  markets),  (2)  Level  2  (indirectly  observable  inputs  from  quoted  prices 
of  comparable  items  in  active  markets,  identical  items  in  inactive  markets,  or  other  market-related 
information),  and  (3)  Level  3  (unobservable,  firm-generated  inputs).  Thus,  fair  values  are  dis¬ 
closed  from  most  reliable  (Level  1)  to  least  reliable  (Level  3),  with  a  classification  that  potentially 
falls  somewhere  in  the  middle  (Level  2).  Using  banking  firm  data  from  the  first  three  quarters  of 
2008,  we  examine  two  important  research  questions  related  to  fair  value  information  provided  by 
banks  under  FAS  No.  157.  First,  we  compare  the  value  relevance  of  Level  1  and  Level  2  fan- 
values  to  the  value  relevance  of  Level  3  fair  values.  Second,  we  consider  whether  the  impact  of 
corporate  governance  on  the  value  relevance  of  fair  values  is  greater  for  Level  3  assets  compared 
to  Level  1  and  Level  2  assets. 


In  comparison,  the  in  a  Levels-only  model  (i.e.,  Equation  (1))  for  these  observations  is  67.57  percent.  Thus,  Level 
and  Type  information  are  incremental  to  one  another  but  the  increase  in  explanatory  value  is  relatively  small. 

Goh  et  al.  (2009)  conclude  the  opposite.  For  their  smaller  sample  of  banks  and  with  a  different  valuation  model  than 
ours,  they  find  that  the  valuation  of  fair  value  assets  declines  over  the  first  three  quarters  of  2008. 
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We  find  the  following.  First,  all  Level  information  is  value-relevant.  Level  1  and  Level  2 
assets  (liabilities)  have  valuation  coefficients  close  to  their  theoretically  predicted  value  of  1  (—1). 
However,  Level  3  assets  are  valued  less  than  1  and  less  than  Level  1  and  Level  2  assets.  Level  3 
liabilities  have  a  valuation  coefficient  less  than  —1  (i.e.,  absolute  value  greater  than  1)  and  less 
than  those  of  Level  1  and  Level  2  liabilities.  These  results  are  robust  to  potential  confounding  firm 
characteristics  such  as  firm  size  and  Tier  1  capital  ratios.  These  results  suggest  that  investors  are 
likely  to  decrease  the  weight  they  place  on  less  reliable  Level  3  fair  value  measurements  in  their 
equity-pricing  decisions  due  to  the  information  risk,  inherent  estimation  errors,  and  possible  re¬ 
porting  bias. 

Second,  we  find  evidence  that  the  value  relevance  of  fair  value  assets  varies  with  the  strength 
of  a  firm’s  corporate  governance.  For  firms  with  low  corporate  governance.  Level  1  and  Level  2 
assets  are  below  1  (0.81  and  0.83).  These  valuations  are  marginally  different  from  1  at  approxi¬ 
mately  the  0.07  level  (one-tailed).  We  find  that  the  valuation  of  Level  3  assets  of  low  governance 
firms  is  close  to  0  (0.06)  and  not  significant,  suggesting  no  value  relevance.  For  high  governance 
firms,  we  find  that  the  valuations  of  Level  1  and  Level  2  assets  increase  to  near  1  (1.01  and  0.97), 
and  valuation  of  Level  3  assets  increases  to  0.82.  The  increase  in  asset  valuations  for  each  of  the 
levels  is  consistent  with  strong  governance  reducing  information  asymmetry  and  mitigating  esti¬ 
mation  errors  or  reporting  biases,  and  this  is  especially  apparent  for  Level  3  assets  (i.e.,  unobserv¬ 
able,  firm-generated  amounts)  where  information  asymmetry  is  expected  to  be  the  highest. 

As  an  additional  analysis,  we  find  that  FAS  No.  157  fair  value  hierarchy  disclosure  disaggre¬ 
gating  Type  information  to  Level  information  (i.e.,  matrix  format  for  reporting  fair  values  based  on 
asset/liability  type  and  level  of  input)  adds  some  incremental  value  relevance  to  existing  Type 
information.  This  finding  is  consistent  with  the  prediction  that  FAS  No.  157  provides  useful 

30 

incremental  information  for  investors’  equity  valuations. 

Although  we  recognize  that  fair  values  may  affect  financial  statement  users  in  contexts  other 
than  the  pricing  of  equity  securities  (Holthausen  and  Watts  2001),  we  believe  that  our  findings 
should  be  important  to  U.S.  and  international  standard-setters  for  understanding  not  only  the 
effects  of  FAS  No.  157  disclosures,  but  also  how  future  standards  (e.g.,  the  joint  lASB/FASB 
project  on  financial  statement  presentation)  can  enhance  existing  fair  value  disclosures.  For  ex¬ 
ample,  we  find  that  investors  discount  less  reliable  fair  values  possibly  due  to  information  asym¬ 
metry  and  moral  hazard  problems.  However,  to  the  extent  firms  have  strong  governance,  these 
problems  appear  less  severe.  The  impact  of  corporate  governance  on  the  value  relevance  of  fair 
values  has  received  limited  attention  in  academic  research,  especially  in  the  U.S. 

Our  study  is  subject  to  a  few  caveats.  First,  our  sample  firms  are  limited  to  the  banking 
industry  and  represent  observations  during  the  first  three  quarters  of  2008.  Until  more  data  are 
analyzed,  both  in  the  cross-section  and  over  time,  the  results  may  not  be  generalizable.  Second,  we 
note  that  the  current  economic  crisis  existed  and  was  worsening  over  our  sample  period.  It  is  not 
yet  clear  to  what  extent  fair  value  measures  should  be  value-relevant  under  the  framework  of  FAS 
No.  157,  which  requires  prices  to  be  measured  based  on  “orderly  transactions,”  when  market 
liquidity  is  low.  Although  standard- setters  likely  hope  that  fair  values  are  even  more  informative 
during  an  economic  crisis,  the  low  liquidity  in  the  market  makes  reported  fair  values  less  observ¬ 
able  and  more  subjective  to  measure,  especially  for  Level  2  and  Level  3  items.  In  additional 
analysis,  we  find  little  evidence  that  the  value  relevance  of  fair  value  levels  worsened  as  the 


In  the  past,  users  have  not  always  been  granted  their  requests  to  have  information  disclosed  in  a  matrix-type  format.  For 
example,  in  deliberations  of  segment  disclosures  under  FAS  No.  131,  users  lobbied  for  disclosure  of  industry  segment 
information  within  each  geographic  segment.  In  the  end,  the  FASB  did  not  adopt  the  matrix  format  for  segment 
reporting  but  instead  required  separate  disclosure  of  industry  segment  and  geographic  segment  information  (Herrmann 
and  Thomas  2000). 
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economic  crisis  deepened.  In  fact,  we  find  some  preliminary  evidence  that  some  fair  values  gained 
in  value  relevance. 


APPENDIX 

An  Example  of  FAS  No.  157  Disclosure  of  Wells  Fargo  &  Company  from  Q1  2008 
13.  FAIR  VALUES  OF  ASSETS  AND  LIABILITIES 

Fair  Value  Hierarchy 

Under  FAS  157,  we  group  our  assets  and  liabilities  at  fair  value  in  three  levels,  based  on  the 
markets  in  which  the  assets  and  liabilities  are  traded  and  the  reliability  of  the  assumptions  used  to 
determine  fair  value.  These  levels  are; 

•  Level  1 — Valuation  is  based  upon  quoted  prices  for  identical  instruments  traded  in  active 

markets. 

•  Level  2 — Valuation  is  based  upon  quoted  prices  for  similar  instruments  in  active  markets, 

quoted  prices  for  identical  or  similar  instruments  in  markets  that  are  not  active, 
and  model-based  valuation  techniques  for  which  all  significant  assumptions  are 
observable  in  the  market. 

•  Level  3 — Valuation  is  generated  from  model-based  techniques  that  use  significant  assump¬ 

tions  not  observable  in  the  market.  These  unobservable  assumptions  reflect  our 
own  estimates  of  assumptions  that  market  participants  would  use  in  pricing  the 
asset  or  liability.  Valuation  techniques  include  use  of  option  pricing  models, 
discounted  cash  flow  models,  and  similar  techniques. 

The  table  below  presents  the  balances  of  assets  and  liabilities  measured  at  fair  value  on  a  recurring 
basis. 


(in  millions) 

Total 

Level  1 

Level  2 

Level  3 

Balance  at  March  31,  2008 

Trading  assets 

$  8,893 

$  1,124 

$  7,407 

$  362 

Securities  available  for  sale 

81,787 

41,912 

33,191 

6,684(2) 

Mortgages  held  for  sale 

27,927 

— 

26,667 

1,260 

Mortgage  servicing  rights 

14,956 

— 

— 

14,956 

(residential) 

Other  assets  (1) 

3,167 

2,226 

893 

48 

Total 

$136,730 

$45,262 

$68,158 

$23,310 

Other  liabilities  (1) 

$(6,235) 

$(3,597) 

$(2,230) 

$(408) 

Balance  at  March  31,  2007 


Trading  assets 

$  6,525 

$  1,572 

$  4,599 

$  354 

Securities  available  for  sale 

45,443 

32,412 

10,223 

2,808(2) 

Mortgages  held  for  sale 

25,692 

— 

25,692 

— 

Mortgage  servicing  rights 
(residential) 

17,779 

17,779 

Other  assets 

538 

470 

58 

10 

Total 

$  95,977 

$34,454 

$40,572 

$20,951 

(continued  on  next  page) 
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The  table  below  presents  the  balances  of  assets  and  liabilities  measured  at  fair  value  on  a  recurring 
basis. 

(in  millions)  Total  Level  1  Level  2  Level  3 


Other  liabilities  (1) 


$  (3,056)  $  (1,285)  $(1,460)  $(311) 


(1)  Derivatives  are  included  in  this  category. 

(2)  Non-rated  asset-backed  securities  collateralized  by  auto  leases  represent  substantially  all  of  this  balance. 


The  changes  in  Level  3  assets  and  liabiUties  measured  at  fair  value  on  a  recurring  basis  are 
summarized  as  follows: 


Trading 

Mortgage 

Net 

Other 

assets 

Securities 

Mortgage 

servicing 

derivative 

liabilities 

(excluding 

available 

held  for 

rights 

assets  and 

(excluding 

(in  millions) 

derivatives) 

for  sale 

sale 

(residential) 

liabilities 

derivatives) 

Quarter  ended  March  3 1 ,  2008 
Balance,  beginning  of  quarter 
Total  net  gains  (losses)  for  the 

$418 

$5,381 

$  146 

$  16,763 

$  6 

$(280) 

quarter  included  in: 

Net  income 

(68) 

(8) 

(5) 

(2,564) 

(179) 

(66) 

Other  comprehensive  income 

— 

42 

— 

— 

— 

— 

Purchases,  sales,  issuances 
and  settlements,  net 

12 

1,269 

27 

757 

142 

17 

Net  transfer  into/out  of  Level  3 

— 

1,092(3) 

— 

— 

— 

Balance,  end  of  quarter 

$362 

$6,684 

$1,260 

$  14,956 

$  (31) 

$(329) 

Net  unrealized  losses  included 

$40 

$(4) 

$(5) 

$(1,794) 

$(27) 

$(66) 

in  net  income  for  the  quarter 
relating  to  assets  and 
liabilities  held  at  March  31, 
2008  (1) 

(2) 

(4) 

(4)(5) 

(4) 

(4) 

Quarter  ended  March  3 1 ,  2007 

Balance,  beginning  of  quarter 
Total  net  gains  (losses) 

$360 

$3,447 

— 

$17,591 

$  (68) 

$(282) 

included  in: 

Net  income 

(41) 

(799) 

17 

(6) 

Purchase,  sales,  issuances 
and  settlements,  net 

34 

(639) 

— 

987 

— 

39 

Balance,  end  of  quarter 

$353 

$2,808 

$  — 

$  17,779 

$  (51) 

$(249) 

Net  losses  included  in  net 

$(25) 

$  — 

$(10) 

$(43) 

$(6) 

income  relating  to  assets 
held  at  March  31,  2007  (1) 

(2) 

(4)(5) 

(4) 

(4) 

(1)  Represents  only  net  losses  that  are  due  to  changes  in  economic  conditions  and  management’s  estimates  of  fair  value  and 
excludes  changes  due  to  the  collection/realization  of  cash  flows  over  time. 

(2)  Included  in  other  noninterest  income  in  the  income  statement. 


(continued  on  next  page) 
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(3)  Represents  mortgages  held  for  sale  that  were  transferred  from  Level  2  to  Level  3  due  to  reduced  levels  of  market 
liquidity  for  certain  residential  mortgage  loans. 

(4)  Included  in  mortgage  banking  in  the  income  statement. 

(5)  Represents  total  unrealized  losses  of  $1,798  million  and  $11  million,  net  of  losses  of  $4  million  and  $1  million  related 
to  sales,  for  first  quarter  2008  and  2007,  respectively. 

We  may  be  required,  from  time  to  time,  to  measure  certain  assets  at  fair  value  on  a  nonrecurring  basis  in  accordance  with 
GAAR  These  adjustments  to  fair  value  usually  result  from  application  of  lower-of-cost-or-market  accounting  or  write¬ 
downs  of  individual  assets.  For  assets  measured  at  fair  value  on  a  nonrecurring  basis  that  were  still  held  in  the  balance 
sheet  at  quarter  end,  the  following  table  provides  the  level  of  valuation  assumptions  used  to  determine  each  adjustment  and 
the  carrying  value  of  the  related  individual  assets  or  portfolios  at  quarter  end. 


(in  millions) 

Total 

Carrying  value  at  quarter  end 

Level  1  Level  2  Level  3 

Total  losses 
for  quarter 
ended 

March  31,  2008 

Mortgages  held  for  sale 

$1,781 

$  — 

$1,678 

$103 

$  (78) 

Loans  held  for  sale 

360 

— 

360 

— 

(11) 

Loans  (1) 

546 

— 

546 

6 

(1,297) 

Private  equity  investments 

19 

16 

— 

3 

(14) 

Foreclosed  assets  (2) 

384 

— 

384 

— 

(104) 

Operating  lease  assets 

19 

— 

19 

— 

— 

$(1,504) 

March  31,  2007 

Mortgages  held  for  sale 

$5,023 

$  — 

$5,023 

$  — 

$  (66) 

Loans  (1) 

592 

— 

592 

— 

(575) 

Private  equity  investments 

3 

— 

— 

3 

(5) 

Foreclosed  assets  (2) 

225 

— 

225 

— 

(89) 

$  (735) 


(1)  Represents  carrying  value  and  related  write-downs  of  loans  for  which  adjustments  are  predominantly  based  on  the 
appraised  value  of  the  collateral.  The  carrying  value  of  loans  fully  charged-off,  which  includes  unsecured  lines  and 
loans,  is  zero. 

(2)  Represents  the  fair  value  and  related  losses  of  foreclosed  real  estate  and  other  collateral  owned  that  were  measured  at 
fair  value  subsequent  to  their  initial  classification  as  foreclosed  assets. 
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ABSTRACT:  This  study  investigates  whether  MD&A  disclosures  have  predictive  ability 
for  future  firm  performance  in  cases  of  disproportionate  inventory  increases.  Using  a 
sample  of  568  manufacturing  firms  with  disproportionate  inventory  increases,  I  find  that 
the  favorability  of  explanations  for  inventory  changes  in  MD&A  is  positively  associated 
with  a  firm’s  profitability  and  sales  growth  in  the  subsequent  three  years.  I  also  find  that 
future  profitability  and  sales  growth  of  firms  that  do  not  explain  disproportionate  inven¬ 
tory  increases  in  MD&A  fall  between  those  of  firms  with  favorable  explanations  and 
firms  with  unfavorable  explanations.  These  results  suggest  that  the  existence  and  the 
favorability  of  MD&A  inventory  disclosures  help  users  interpret  disproportionate  inven¬ 
tory  increases  and  predict  future  firm  performance. 

Keywords:  MD&A  disclosures;  inventory  increases;  firm  performance;  financial  state¬ 
ment  analysis. 

Data  Availability:  All  data  are  available  from  public  sources. 

I.  INTRODUCTION 

This  study  investigates  whether  disclosures  in  the  Management  Discussion  and  Analysis 
(MD&A)  section  of  Form  10-K  help  users  interpret  disproportionate  inventory  increases. 
This  study  addresses  two  specific  questions.  First,  do  explanations  for  disproportionate 
inventory  increases  in  MD&A  have  predictive  ability  for  a  firm’s  future  performance?  Second, 
what  are  the  implications  of  a  firm’s  decision  to  explain  disproportionate  inventory  increases  in 
MD&A  for  its  prospects?  My  objective  is  to  determine  whether  the  existence,  and  the  favorability, 
of  MD&A  disclosures  on  disproportionate  inventory  increases  contain  useful  information  for 
financial  statement  analysis. 

Studies  find  that  inventory  is  an  important  item  in  financial  statement  analysis.  Disproportion¬ 
ate  inventory  changes  are  negatively  associated  with  a  firm  s  current  and  future  stock  returns  and 
future  earnings  (Lev  and  Thiagarajan  1993;  Abarbanell  and  Bushee  1997,  1998).  Large  inventory 


I  am  grateful  for  guidance  from  my  dissertation  committee  members:  Richard  Frankel  (chair),  Armando  Gomes,  Ronald 
King,  and  Tzachi  Zach.  I  appreciate  comments  and  suggestions  from  the  editor,  two  anonymous  reviewers,  Shuping  Chen, 
Xia  Chen,  Daniel  Collins,  Nicholas  Dopuch,  Thomas  Fields,  Christopher  Jones,  Raj  Mashmwala,  Mikhail  Pevzner,  Frank 
Selto  and  Phil  Shane.  I  benefited  from  the  input  of  workshop  participants  at  Baruch  College,  Boston  University,  Drexel 
University  Saint  Louis  University,  University  of  Colorado  at  Boulder,  University  of  Minnesota,  and  Washington  University 
in  St.  Louis,  and  participants  at  the  2007  PARS  Midyear  Meeting  and  the  2007  AAA  Annual  Meeting. 

Editor’s  note:  Accepted  by  Mark  Trombley. 

Submitted:  June  2008 
Accepted:  December  2009 
Published  Online:  June  2010 


1411 


1412 


Sun 


changes  are  accompanied  by  reversals  of  firm  performance  (Thomas  and  Zhang  2002).  Financial 
statement  analysis  textbooks  suggest  that  a  disproportionate  increase  in  inventory  is  a  red  flag  that 
deserves  careful  examination  (Penman  2001;  Mulford  and  Comiskey  1996,  2002).  However,  dis¬ 
proportionate  inventory  increases  do  not  have  unambiguous  implications  for  a  firm’s  prospects. 
Within  the  group  of  firms  with  disproportionate  inventory  increases,  inventory  changes  can  arise 
under  a  variety  of  situations.  Such  changes  can  be  a  result  of  a  firm’s  decision  to  increase 
inventory  to  prepare  for  future  sales  growth,  inclusion  of  new  products  in  inventory,  a  firm’s 
decision  to  keep  a  higher  inventory  level  (for  example,  to  improve  customer  service),  a  downward 
shift  in  the  demand  for  a  firm’s  products,  or  earnings  management.  Penman  (2001)  recommends 
examining  firm  disclosures  to  better  understand  changes  in  accounting  ratios.  This  study  investi¬ 
gates  whether  MD&A  disclosures  can  help  financial  information  users  interpret  disproportionate 
inventory  increases. 

Narrative  disclosures  in  MD&A  represent  a  potentially  important  channel  for  firms  to  com¬ 
municate  with  investors.  Prior  studies  (e.g.,  Bryan  1997;  Cole  and  Jones  2004)  find  that  MD&A  is 
a  source  of  useful  information  for  financial  statement  analysis.  Because  any  known  trends  or 
events  that  have  affected  or  will  affect  the  results  of  operations,  liquidity,  and  capital  resources 
should  be  identified  and  described  in  MD&A  (Regulation  S-K),  and  because  disproportionate 
inventory  increases  are  a  result  of  operations  and  can  be  related  to  liquidity,  firms  are  likely  to 
discuss  such  inventory  changes  in  MD&A. 

I  identify  568  firm-years  with  disproportionate  inventory  increases  in  the  manufacturing  in¬ 
dustry  between  1998  and  2002. 1  read  the  MD&A  section  of  Form  10-K  for  each  firm  and  code  the 
existence,  and  the  favorability,  of  explanations  for  inventory  changes.  Explanations  are  classified 
as  favorable  if  they  are  associated  with  an  expectation  of  higher  future  sales.  Explanations  are 
unfavorable  if  they  are  related  to  lower  than  expected  sales  in  the  current  period.  To  control  for  the 
possibility  that  the  favorability  of  MD&A  inventory  disclosures  is  correlated  with  the  favorability 
of  general  forward-looking  information  in  Form  10-K,  I  also  identify  statements  on  a  firm’s 
performance  outlook  and  assess  the  overall  favorability  of  these  statements.  A  firm’s  forward- 
looking  statements  are  favorable  if  they  suggest  that  profitability  will  remain  strong  or  even 
increase,  and  unfavorable  if  they  indicate  that  profit  will  be  negative  or  lower.  Using  the  568 
firm-years  with  disproportionate  inventory  increases  and  the  MD&A-inventory-disclosure  vari¬ 
ables  defined  for  each  firm,  1  investigate  the  predictive  ability  of  explanations  for  inventory 
changes  in  MD&A  for  future  financial  performance  and  the  implications  of  a  firm’s  disclosure 
decision  for  its  prospects. 

Among  the  568  manufacturing  firm-years  with  disproportionate  inventory  increases  during 
1998-2002,  282  (50  percent)  provide  an  explanation  in  MD&A.  Among  these  282  firm-years,  119 
(42  percent)  have  favorable  explanations,  105  (37  percent)  provide  neutral  explanations,  and  58 
(21  percent)  have  unfavorable  explanations.  1  find  that  the  favorability  of  the  provided  explana¬ 
tions  is  positively  associated  with  a  firm’s  return  on  assets  and  sales  growth  in  the  subsequent  three 
years,  suggesting  that  MD&A  disclosures  on  disproportionate  inventory  increases,  if  provided, 
supplement  financial  statement  information  and  help  users  predict  a  firm’s  prospects.  I  also  find 
that  firms  not  explaining  the  inventory  changes  in  MD&A  (hereafter  non-disclosure  firms)  do  not 
have  significantly  different  future  performance  from  disclosure  firms,  indicating  that  the  set  of 
non-disclosure  firms  is  a  pool  of  firms  with  favorable  private  information  and  firms  with  negative 
development  in  business  or  low-quality  earnings.  I  further  find  that  future  profitability  and  sales 
growth  of  non-disclosure  firms  are  greater  than  those  of  firms  with  unfavorable  explanations, 
while  they  are  lower  than  those  of  firms  with  favorable  explanations.  Taken  together,  the  results 
suggest  that  the  existence  of,  and  the  favorability  of,  explanations  for  disproportionate  inventory 
increases  in  MD&A  have  predictive  ability  for  future  firm  performance. 
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This  study  contributes  to  the  literature  in  several  ways.  First,  this  study  provides  evidence  that 
MD&A  disclosures  on  inventory  are  useful.  The  disproportionate  inventory  increase  for  a  particu¬ 
lar  firm  does  not  necessarily  indicate  bad  news.  Results  suggest  that  MD&A  inventory  disclosures 
can  help  users  understand  the  inventory  change  and  predict  future  firm  performance.  The  differ¬ 
ence  in  future  financial  performance  between  firms  with  favorable  MD&A  inventory  disclosures 
and  those  with  unfavorable  disclosures  is  significant.  Second,  this  study  confirms  the  existing 
result  that  forward-looking  information  in  MD&A  is  informative  (e.g.,  Bryan  1997),  and  extends 
the  literature  by  documenting  the  magnitude  of  the  association  between  the  favorability  of 
forward-looking  information  and  future  accounting  outcomes.  Third,  this  study  contributes  to  the 
literature  on  financial  statement  analysis  by  providing  evidence  that  context  matters  a  lot  for 
interpreting  the  meaning  of  the  levels  and  changes  of  financial  ratios,  in  particular  those  related  to 
inventory.  In  this  respect,  it  supplements  the  results  by  Bernard  and  Noel  (1991)  that,  in  terms  of 
predicting  future  accounting  outcomes,  it  matters  whether  inventory  increases  relate  to  raw  mate¬ 
rial,  work  in  process,  or  finished  goods.  Overall,  this  study  provides  practical  advice  to  financial 
information  users  seeking  to  investigate  inventory  “red  fiags.”  It  also  provides  insight  into  the 
relevance  and  reliability  of  MD&A  disclosures. 

I  review  prior  studies  and  develop  hypotheses  in  Section  II,  select  the  sample  in  Section  III, 
discuss  research  design  and  results  in  Section  IV,  and  conclude  in  Section  V. 

II.  HYPOTHESES 

This  study  examines  MD&A  disclosures  on  inventory  increases  for  several  reasons.  First, 
inventory  is  an  important  financial  statement  line  item  to  examine  in  predicting  future  firm  per¬ 
formance.  For  example,  Bernard  and  Noel  (1991)  show  that  unexpected  changes  in  inventory  and 
its  components  are  incrementally  useful  in  predicting  future  earnings  and  sales.  Thomas  and 
Zhang  (2002)  find  that  large  inventory  changes  are  accompanied  by  reversals  of  financial  perfor¬ 
mance  measures  and  stock  returns.  They  further  find  that  the  negative  relation  between  accruals 
and  future  abnormal  stock  returns  first  documented  by  Sloan  (1996)  is  primarily  due  to  inventory 
changes.  Prior  research  also  documents  that  disproportionate  inventory  changes,  measured  as  the 
percentage  change  in  inventory  minus  the  percentage  change  in  sales,  have  incremental  value 
relevance  over  earnings  (Lev  and  Thiagarajan  1993)  and  are  negatively  associated  with  one-year- 
ahead  earnings  and  stock  returns  (Abarbanell  and  Bushee  1997,  1998). 

Second,  disproportionate  inventory  increases  arising  under  different  situations  have  different 
implications  for  a  firm’s  prospects.  Although,  on  average,  firms  with  disproportionate  inventory 
increases  have  poorer  future  performance,  such  an  inventory  change  for  a  particular  firm  is  not 
necessarily  a  signal  of  bad  prospects.  If  a  firm  increases  its  inventory  level  in  anticipation  of 
higher  future  sales  or  in  preparation  for  new  product  introduction,  then  the  inventory  change  is 
likely  a  positive  indicator  of  future  performance.  However,  if  an  inventory  increase  is  associated 
with  fundamental  problems  with  the  firm’s  products  or  markets,  delayed  inventory  write-downs, 
overproduction,  or  fictitious  inventory  (Mulford  and  Comiskey  1996,  2002),  then  it  is  an  unfavor¬ 
able  signal  of  future  financial  performance.  To  evaluate  a  firm’s  inventory  increase  and  to  assess 
its  prospects,  investors  can  utilize  information  from  the  firm. 

Third,  MD&A  is  a  primary  vehicle  by  which  firms  can  explain  disproportionate  inventory 
increases.  The  Securities  and  Exchange  Commission  (SEC)  requires  that  publicly  traded  firms 
include  a  narrative  section  called  Management  Discussion  and  Analysis  in  audited  annual  reports 


'  This  study  looks  at  inventory  increases,  instead  of  decreases,  because  studies  show  that  managers  have  a  propensity  to 
manipulate  earnings  upward  (Fields  et  al.  2001)  and  because  manufacturing  firms  can  artificially  inflate  earnings  with 
delayed  inventory  write-downs  or  overproduction  (Revsine  et  al.  2004;  Roychowdhury  2006).  Investors  of  manufactur¬ 
ers  are  particularly  interested  in  getting  information  related  to  inventory  increases. 
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because  the  SEC  believes  that  financial  statements  and  their  footnotes  alone  may  be  insufficient 
for  investors  to  judge  the  quality  of  earnings  and  to  predict  future  firm  performance  (SEC  1987). 
The  SEC  requires  registrants  to  identify  and  describe  in  MD&A  any  known  trends  or  any  known 
demands,  commitments,  events,  or  uncertainties  that  have  affected  or  will  affect  the  results  of 
operations,  liquidity,  or  capital  resources  in  any  material  way  (Regulation  S-K;  SEC  2003).  Dis¬ 
proportionate  inventory  increases  are  directly  related  to  operations  in  the  current  and  the  future 

periods  and  can  affect  a  firm’s  liquidity.  Firms  are  likely  to  explain  such  inventory  increases  in 

2 

MD&A,  even  if  they  make  similar  disclosures  elsewhere. 

Prior  research  finds  that  MD&A  is  a  source  of  useful  information  for  financial  statement 
analysis.  For  example,  Bryan  (1997)  finds  that  MD&A  discussions  of  future  operations  and 
planned  capital  expenditures  are  associated  with  one-year-ahead  financial  performance  and  invest¬ 
ment  decisions.  Rogers  and  Grant  (1997)  report  that  MD&A  is  the  basis  of  31  percent  of  the 
information  in  an  average  analyst  report,  while  financial  statements  provide  only  14  percent  of  the 
information.  Barron  et  al.  (1999)  document  that  the  MD&A  quality  score  measured  by  the  SEC  is 
negatively  related  to  analysts’  forecast  error  and  dispersion  in  the  subsequent  year.  Cole  and  Jones 
(2004)  find  that  MD&A  disclosures  on  revenue  changes  and  future  capital  expenditures  help  users 
predict  future  revenues  and  future  earnings  for  firms  in  the  retail  industry. 

The  purpose  of  this  study  is  to  investigate  whether  and  how  specific  MD&A  disclosures  help 
users  interpret  changes  in  specific  accounting  numbers  or  ratios  where  there  is  particular  demand 
for  additional  information.  Financial  statement  analysis  textbooks  (e.g..  Penman  2001;  Mulford 
and  Comiskey  1996,  2002)  regard  unusual  changes  in  accounting  ratios  as  red  flags  that  deserve 
careful  examination.  Penman  (2001)  recommends  examining  firm  disclosures  to  obtain  a  better 
understanding  of  changes  in  accounting  ratios.  In  particular.  Penman  (2001)  calls  for  investigation 
for  changes  in  individual  asset  turnover  ratios  to  search  for  explanations  for  these  changes.  Inven¬ 
tory  turnover  is  one  such  ratio.  In  this  study,  I  identify  firms  with  disproportionate  increases  in 
inventory  and  examine  whether  related  disclosures  in  MD&A  help  users  understand  these  in¬ 
creases  and  predict  future  performance. 

My  first  objective  is  to  investigate  whether  explanations  for  inventory  increases  in  MD&A,  if 
provided,  have  predictive  ability  for  future  firm  performance.  Essentially,  I  attempt  to  evaluate  the 
credibility  of  inventory  disclosures  in  MD&A.^  If  the  disclosures  are  credible,  then  firms  with 
more  favorable  disclosures  are  likely  to  have  better  future  performance.  MD&A  disclosures  can  be 
credible  because  misleading  information  in  Form  10-K  attracts  litigation.  The  SEC  has  undertaken 
numerous  enforcement  actions  against  firms  with  inadequate  or  misleading  MD&A  disclosures."^ 
MD&A  disclosures  supplement  the  information  contained  in  financial  statements.  If  MD&A  dis¬ 
closures  are  credible,  then  the  provided  explanations  for  inventory  increases  should  help  financial 
information  users  interpret  a  firm’s  inventory  change,  analyze  its  current  performance,  and  assess 


^  Frankel  et  al.  (2009)  find  that  firms  just  missing  earnings  expectations  have  longer  earnings  conference  calls,  suggesting 
that  conference  calls  are  a  channel  through  which  investors  and  analysts  get  the  information  that  they  demand.  To 
examine  whether  firms  explain  inventory  changes  during  conference  calls  or  earnings  announcement  press  releases  and 
thus  choose  not  to  repeat  the  explanations  in  annual  reports,  I  look  at  inventory  disclosures  in  those  channels  for  sample 
firms  in  2002.  I  find  that  the  few  firms  that  explain  inventory  increases  in  conference  calls  or  earnings  announcements 
also  explain  the  changes  in  MD&A,  suggesting  that  conference  calls  or  earnings  announcement  are  not  the  places  where 
firms  choose  to  exclusively  explain  disproportionate  inventory  increases.  It  is  reasonable  to  examine  MD&A  for  relevant 
explanations  for  such  inventory  changes. 

^  Several  recent  studies  examine  credibility  of  narrative  disclosures  in  other  disclosure  channels.  For  example,  Davis  et  al. 
(2008)  find  that  the  optimistic  and  pessimistic  language  that  managers  use  in  earnings  press  releases  contains  credible 
information  about  expected  future  firm  performance.  Concurrent  studies  also  investigate  firms’  strategic  use  of  narrative 
disclosures.  Li  (2008)  finds  that  annual  reports  of  firms  with  lower  earnings  are  more  difficult  to  read. 

See,  e.g.,  In  the  Matter  of  Edison  Schools,  Inc.,  Release  No.  34-45925  (May  14,  2002);  In  the  Matter  of  Caterpillar  Inc., 
Release  No.  34-30532  (March  31,  1992). 
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its  prospects.  Then,  for  the  subsample  of  firms  that  explain  inventory  increases  in  MD&A,  I  expect 
that  the  favorability  of  the  provided  explanations  is  positively  associated  with  a  firm’s  future 
financial  performance. 

However,  just  as  litigation  risks  do  not  preclude  all  firms  from  manipulating  accounting 
numbers,  they  may  be  insufficient  to  force  firms  to  disclose  credible  information  in  MD&A.  Firms 
with  inventory  increases  associated  with  negative  development  in  business  and  managers  of  firms 
with  inventory  increases  due  to  inappropriate  decisions  may  not  want  to  disclose  the  real  reasons 
underlying  the  inventory  changes.  For  example,  the  SEC  filed  charges  against  the  former  CEO  and 
CFO  of  Kmart  in  2005,  accusing  them  of  misleading  investors  about  Kmart’s  financial  condition 
in  the  months  preceding  the  company’s  bankruptcy.  Kmart  had  a  massive  inventory  overbuy  in  the 
summer  of  2001.  However,  in  the  MD&A  section  of  Kmart’s  Form  10-Q  for  the  third  quarter  and 
nine  months  ending  October  31,  2001,  the  managers  attributed  the  increases  in  inventory  to 
seasonal  inventory  fluctuations  and  actions  taken  to  improve  the  company’s  overall  in-stock  po¬ 
sition.  The  SEC  claims  that  these  statements  materially  misrepresented  the  company’s  liquidity 
(SEC  2005).  If  firms  do  not  provide  reliable  explanations  for  inventory  increases,  then  the  relevant 
MD&A  disclosures  are  not  likely  to  help  financial  information  users  in  their  financial  statement 
analysis.  I  test  the  predictive  ability  of  MD&A  inventory  disclosures  by  examining  the  relation 
between  the  favorability  of  inventory  disclosures  and  a  firm’s  future  financial  performance  against 
the  following  null  hypothesis: 

HI:  The  favorability  of  MD&A  disclosures  on  inventory  increases  is  not  associated  with  a 
firm’s  future  financial  performance. 

The  second  objective  of  this  study  is  to  investigate  the  implications  of  a  firm’s  MD&A 
disclosure  decision  for  its  prospects.  Although  firms  can  explain  disproportionate  inventory  in¬ 
creases  in  MD&A,  firms  may  choose  not  to  do  that  for  several  reasons.  First,  the  inventory 
increases  may  be  related  to  earnings  management  for  some  firms.  Earnings  management  refers  to 
a  situation  in  which  “managers  use  judgment  in  financial  reporting  and  in  structuring  transactions 
to  alter  financial  reports  to  either  mislead  some  stakeholders  about  the  underlying  economic 
performance  of  the  company,  or  to  influence  contractual  outcomes  that  depend  on  reported  ac¬ 
counting  numbers’’  (Healy  and  Wahlen  1999,  368).  Information  asymmetry  is  a  necessary  condi¬ 
tion  for  earnings  management  (Dye  1988;  Trueman  and  Titman  1988;  Richardson  (2000).  For 
earnings  management  to  be  successful,  “at  least  some  users  of  accounting  information  must  be 
either  unable  or  unwilling  to  unravel  completely  the  effects  of  the  earnings  management”  (Fields 
et  al.  2001,  260).  Studies  suggest  that  a  firm’s  information  asymmetry  is  negatively  associated 
with  its  disclosure  quality  (Diamond  and  Verrecchia  1991;  Lang  and  Lundholm  1996).  Better 
disclosures  reduce  a  firm’s  flexibility  in  earnings  management  and  increase  the  probability  that 
earnings  management  will  be  detected.^  To  conceal  earnings  management,  firms  that  inflate  the 
inventory  level  are  less  likely  to  disclose  the  underlying  reasons  for  the  inventory  changes.  Be¬ 
cause  earnings  management  reverses  over  time,  ceteris  paribus,  these  firms  are  likely  to  have 
lower  profitability  in  the  future. 

Second,  firms  with  inventory  increases  associated  with  negative  business  development  may 
choose  not  to  disclose  the  reasons  for  the  inventory  changes.  Verrecchia  (1983)  demonstrates 
analytically  that,  in  the  presence  of  disclosure  costs,  only  firms  with  favorable  news  above  a 
certain  threshold  disclose  their  information  to  differentiate  themselves  from  other  firms  and  to 


^  Lobo  and  Zhang  (2001)  find  a  negative  relation  between  analysts’  disclosure  rating  and  the  magnitude  of  earnings 
management.  Jo  and  Kim  (2007)  document  a  negative  relation  between  disclosure  frequency  and  earnings  management. 
Hirst  and  Hopkins  (1998)  and  Hunton  et  al.  (2006)  show  with  experiments  that  financial  reporting  transparency  reduces 
the  prevalence  of  earnings  management  and  increases  the  detection  of  earnings  management. 
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avoid  undervaluation.  If  the  majority  of  firms  that  do  not  explain  inventory  increases  in  MD&A 
are  firms  with  low-quality  earnings  (i.e.,  earnings  manipulators)  or  firms  with  bad  news,  then  the 
existence  of  explanations  can  be  an  indicator  of  superior  future  firm  performance. 

Third,  even  firms  with  favorable  private  information  may  choose  not  to  explain  the  inventory 
changes  if  disclosure  costs  are  greater  than  disclosure  benefits.  Research  shows  that  disclosures 
can  improve  the  liquidity  of  a  firm’s  stock,  increase  analyst  following,  and  reduce  the  cost  of 
capital  (Healy  and  Palepu  2001;  Core  2001;  Lang  and  Lundholm  1996;  Botosan  1997).  But 
disclosures  are  costly.  Besides  the  costs  associated  with  information  dissemination,  firms  face 
proprietary  costs  (Verrecchia  1983;  Dye  1985,  1986).  Release  of  proprietary  information  can 
reduce  a  firm’s  competitive  advantage.  For  example,  in  cases  of  disproportionate  inventory  in¬ 
creases,  a  firm  may  receive  private  information  about  increased  demand  for  a  certain  product  and 
increase  its  stock  of  that  product  accordingly.  Such  a  firm  is  better  prepared  for  the  demand  shock 
than  its  competitors  and  can  increase  its  market  share.  This  advantage  will  be  lost  if  the  firm 
discloses  the  reason  for  its  inventory  buildup.  Verrecchia  (2001)  and  Dye  (2001)  discuss  the 
cost-benefit  trade-offs  associated  with  disclosures.  If  disclosure  costs  are  too  high  or  if  disclosure 
benefits  are  too  low,  then  firms  with  inventory  increases  arising  from  favorable  trends  will  not 
explain  the  inventory  changes.  However,  these  firms  are  likely  to  perform  well  in  the  future. 

Because  the  set  of  firms  that  do  not  explain  disproportionate  inventory  increases  in  MD&A 
can  be  a  pool  of  firms  with  poor  or  good  prospects,  and  because  a  firm’s  future  performance  is 
highly  dependent  on  future  shocks,  it  is  unclear  whether  one  should  expect  a  relation  between  a 
firm’s  MD&A  disclosure  decision  and  its  future  performance.  I  test  the  following  null  hypothesis 
on  the  relation  between  the  existence  of  MD&A  disclosures  on  inventory  increases  and  a  firm’s 
future  financial  performance: 

H2:  The  existence  of  MD&A  disclosures  on  inventory  increases  is  not  associated  with  a 
firm’s  future  financial  performance. 

III.  SAMPLE  SELECTION 

To  address  the  research  objectives,  I  need  a  sample  of  firms  for  which  investors  are  likely  to 
demand  additional  information  on  inventory.  Such  firms  should  have  a  material  level  of  inventory 
and  a  disproportionate  inventory  increase.  I  start  with  all  U.S.  manufacturing  firms  (SIC  2000- 
3999)  traded  on  NYSE/AMEX/NASDAQ  in  years  between  1998  and  2002.  I  select  sample  firms 
from  the  manufacturing  industry  for  two  reasons.  First,  inventory  is  an  important  asset  for  manu¬ 
facturers.  A  manufacturing  firm  procures  materials,  makes  products,  and  sells  finished  goods.  Each 
step  of  its  operations  involves  inventory  management  and  inventory  valuation.  Therefore,  the 
level,  and  the  quality,  of  inventory  can  reflect  a  manufacturing  firm’s  current  operating  results  and 
predict  its  future  performance.  For  example,  higher  than  expected  demand  in  the  current  period 
can  lead  to  a  lower  ending  inventory  level;  an  ending  inventory  with  a  higher  probability  of 
becoming  obsolete  implies  lower  profit  margin(s)  in  future  period(s).  Second,  manufacturing  firms 
can  overproduce  to  artificially  inflate  earnings.  When  the  inventory  level  increases  under  absorp¬ 
tion  costing,  the  amount  of  fixed  production  cost  charged  to  income  decreases.  Firms  can  artifi¬ 
cially  inflate  income  and  gross  margin  by  producing  more  products.  The  favorable  income  effect 
of  large  inventory  increases  can  even  offset  an  unfavorable  income  effect  of  a  sales  decrease 
(Revsine  et  al.  2004).  Roychowdhury  (2006)  finds  that  firms,  especially  manufacturing  firms, 
avoid  reporting  losses  by  overproduction  to  reduce  the  reported  cost  of  goods  sold.  Because  of 
these  two  reasons,  financial  information  users,  such  as  investors,  of  manufacturing  firms  are 
particularly  interested  in  getting  more  information  to  analyze  disproportionate  inventory  increases. 

I  require  that  firms  in  the  sample  satisfy  the  following  criteria: 

(1)  The  firm  has  necessary  Compustat  data  to  compute  the  change  in  days  in  inventory  in 
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year  t  and  performance  measures  (e.g.,  return  on  assets,  sales  growth)  for  year  t+1.  It  also 
has  CRSP  monthly  stock  returns  over  the  12  months  ending  four  months  after  the  fiscal 
year-end  of  year  t  so  that  its  stock  is  actively  traded  before  and  around  the  time  when  its 
Form  10-K  for  year  t  is  filed  with  the  SEC. 

(2)  At  the  end  of  year  t-1,  the  firm  has  a  book  value  of  inventory  at  least  10  percent  of  its 
book  value  of  total  assets,  so  that  inventory  is  a  material  asset  for  the  firm. 

(3)  The  firm  does  not  change  its  fiscal  year-end  or  engage  in  mergers  or  acquisitions  in  year 
t.  The  purpose  of  this  criterion  is  to  eliminate  explanations  for  inventory  changes  unre¬ 
lated  to  a  firm’s  normal  operations.  Following  Hribar  and  Collins  (2002),  mergers  and 
acquisitions  are  identified  using  footnote  #1  of  the  Compustat  annual  file. 

(4)  To  identify  firms  with  disproportionate  inventory  increases,  I  require  that  the  firm  have  a 
percentage  increase  in  days  in  inventory  of  at  least  20  percent  in  year  t.^  Because  an 
increase  in  selling  activities  can  necessitate  a  corresponding  increase  in  inventory,  I 
define  disproportionate  inventory  increases  as  those  that  cannot  be  readily  justified  by  a 
firm’s  change  in  performance.  I  compute  each  firm’s  days  in  inventory,  which  represents 
the  number  of  days  it  takes  to  sell  the  inventory,  as  365/(cost  of  goods  sold/inventory). 
Among  the  5,056  observations  satisfying  criteria  (l)-(3),  835  (16.52  percent)  have  an 
increase  in  days  in  inventory  of  at  least  20  percent. 

(5)  The  firm  has  a  positive  change  in  the  inventory  level.  Days  in  inventory  can  increase  even 
if  the  inventory  level  decreases  if  the  reduction  in  cost  of  goods  sold  is  sufficiently  large. 
Therefore,  I  delete  firm-years  with  a  decrease  in  the  inventory  level. 

(6)  The  Form  10-K  filing  for  the  firm-year  is  available  on  the  SEC’s  EDGAR  database.  In 
addition,  inventory  numbers  on  Compustat  correspond  to  those  in  Eorm  10-K.  This 
sample  selection  process  is  described  in  Table  1. 

There  are  568  firm-years  from  1998  to  2002  that  satisfy  these  sample  selection  criteria.^  Table 
2  reports  descriptive  statistics  for  the  sample.  Current  and  future  performance  measures  display 
considerable  cross-sectional  variation  (Panel  A).  For  example,  although  sales  growth  rate  in  year 
t  (SALESGR)  has  a  mean  of  2.95  percent,  it  has  a  standard  deviation  of  27.40  percent;  the  first 
quartile  of  its  distribution  is  —11.45  percent  and  the  third  quartile  is  11.79  percent.  Similarly,  sales 
growth  rate  in  year  t-i-1  has  a  mean  of  9.83  percent  and  a  standard  deviation  of  34.50  percent.  The 
other  performance  measures — return  on  assets  (ROA),  change  in  ROA  {ROAjCH)  and  stock 
returns  {BHR) — also  have  large  standard  deviations  and  interquartile  ranges.  These  statistics  sug¬ 
gest  that  disproportionate  inventory  increases  arise  under  various  circumstances.  As  a  result, 
investors  want  to  utilize  additional  information  to  better  understand  the  implications  of  the  inven¬ 
tory  changes. 

I  report  the  sample  distribution  by  two-digit  SIC  industry  in  Panel  B  and  by  year  in  Panel  C. 
Table  2  provides  no  evidence  that  a  particular  segment  of  the  manufacturing  industry  is  over¬ 
weighed  in  the  sample.  However,  the  number  of  sample  observations,  by  itself  or  as  a  percentage 
of  all  observations  satisfying  sample  selection  criteria  (l)-(3)  in  a  particular  year,  decreases  in 
2001  and  2002.  There  are  two  possible  explanations  for  this  decrease.  First,  accounting  scandals  in 
2001  and  the  Sarbanes-Oxley  Act  passed  in  2002  reduce  the  prevalence  of  earnings  management 
via  inventory.  Second,  the  economic  recession  starting  in  2001  reduces  realized  and/or  anticipated 
sales  growth  at  firm  level.  As  a  result,  fewer  firms  have  disproportionate  inventory  increases 
associated  with  current  or  expected  future  sales  growth. 


The  results  on  hypotheses  are  robust  to  alternative  definition  of  disproportionate  inventory  increases,  such  as  an  increase 
in  days  in  inventory  of  at  least  30  percent  or  falling  in  the  highest  decile  for  firms  with  the  same  two-digit  SIC  code. 
’  These  568  observations  come  from  487  distinct  firms.  Results  on  hypotheses  are  similar  if  I  only  use  the  first  time  that 
each  firm  has  a  disproportionate  inventory  increase  during  the  sample  period. 
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TABLE  1 
Sample  Selection 


Sun 


Number  of 
Observations 


U.  S.  manufacturing  firms  (SIC  2000-3999)  traded  on  NYSE/AMEX/  13,102 

NASDAQ  in  year  t  between  1998  and  2002 
Less: 

(1)  Firms  with  missing  Compustat  data  to  compute  the  change  in  (4,016) 

days  in  inventory  in  year  t  or  financial  performance  measures 
for  year  t  or  year  r+1,  or  missing  CRSP  monthly  stock  returns 


for  any  of  the  12  months  ending  four  months  after  the  fiscal 
year-end  of  year  t 

(2)  Firms  with  a  book  value  of  inventory  less  than  10  percent  (2,528) 

of  the  book  value  of  total  assets  at  the  end  of  year  t-l 

(3)  Firms  changing  the  fiscal  year-end  or  engaging  in  mergers  (1,502) 

or  acquisitions  in  year  t 

(4)  Firms  with  a  percentage  change  in  days  in  inventory  of  less  (4,221) 

than  20  percent  in  year  t 

(5)  Firms  without  a  positive  change  in  inventory  level  in  year  t  (165) 

(6)  Firms  without  a  Form  10-K  for  year  t  on  the  SEC’s  EDGAR  (102) 

database  or  with  different  inventory  numbers  on  Compustat 

and  in  Form  10-K  _ 

Final  Sample  568 


To  examine  how  firms  with  disproportionate  inventory  increases  differ  from  other  firms,  I 
compare  selected  firm  characteristics  and  performance  measures  between  sample  observations  and 
non-sample  observations  in  Table  3.  Non-sample  observations  refer  to  the  4,386  firm-years  satis¬ 
fying  sample  selection  criteria  (l)-(3),  but  with  an  increase  in  days  in  inventory  of  less  than  20 
percent  or  without  an  increase  in  the  inventory  level.  By  construction,  sample  firms  have  a  larger 
increase  in  days  in  inventory  in  year  t  {1NVDAYS_CH%).  The  average  change  in  days  in  inventory 
is  —7.53  percent  for  non-sample  firms  and  45.62  percent  for  sample  firms.  The  difference  between 
these  two  groups  of  firms  is  53.15  percent.  Sample  firms  also  have  a  higher  variation  in  inventory 
changes  {Std_INVDAYS_CH%)  in  the  past  five  years.  Although  sample  firms  have  a  smaller 
average  change  in  days  in  inventory  in  year  t-l  {INVDAYS_CH%lagl),  the  difference  in  the 
average  change  between  non-sample  firms  and  sample  firms  (8.00  percent)  is  much  smaller  than 
the  difference  in  year  t  (53.15  percent).  Therefore,  the  disproportionate  inventory  increases  for 
sample  firms  in  year  t  cannot  be  explained  by  their  smaller  inventory  changes  in  year  t- 1 . 

Sample  firms  have  a  smaller  market  value  of  common  equity  (MV),  a  greater  variation  in 
profitability  and  stock  returns  from  year  t-4  to  year  t  {StdROA,  StdBHR),  fewer  analysts  following 
(NUMEST),  and  a  smaller  institutional  ownership  (PERCINT).  Investors  for  firms  with  these 
characteristics  are  likely  to  demand  more  information  from  the  firm  even  without  disproportionate 
inventory  increases. 

Looking  at  performance  measures,  I  find  that  in  the  current  year,  sample  firms  have  a  bigger 
decrease  in  ROA  {ROAjCH)  and  a  lower  sales  growth  rate  {SALESGR).  In  the  subsequent  three 
years,  on  average,  sample  firms  have  a  smaller  ROA  {ROAleadl,2,3),  a  larger  decrease  in  ROA 
{ROAjCHleadl ,2,3),  and  lower  buy-and-hold  stock  returns  {BHRleadl,2,3),  consistent  with  the 
finding  in  prior  studies  that  disproportionate  inventory  increases  are  negatively  associated  with 
future  profitability  and  stock  returns  (Abarbanell  and  Bushee  1997,  1998).  However,  sample  firms 
also  have  higher  future  sales  growth  {SALESGRleadl ,2).  Therefore,  it  appears  that  the  group  of 
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TABLE  3 

Comparison  between  Sample  Firm- Years  (n  =  568)  and  Non-Sample  Firm- Years 

(n  =  4,386)** 


Variables 

Inventory  Related 
INVDAYS_CH% 
StdJNVDAYS_CH% 
INVDAYS_CH%lagl 
Selected  Characteristics 
MV 
BM 

StdROA 

StdBHR 

NUMESf’ 

PERCINT 
Current  Performance 
ROA 

ROA_CH 

SALESGR 

BHR 

Future  Performance 
ROAleadl 
ROAlead2 
ROAleadd 
ROA_CHleadl 
ROA_CHlead2 
ROA_CHlead3 
SALESGRleadl 
SALESGRlead2 
SALESGRlead3 
BHRleadl 
BHRlead2 
BHRlead3 


Non- 

Sample 

Mean 

Sample 

Mean 

-0.0753 

0.4562 

0.3295 

0.3678 

0.0789 

-0.0010 

1114.02 

550.22 

0.8147 

0.8072 

0.0831 

0.0986 

0.6004 

0.6861 

5.2169 

4.3333 

0.3596 

0.3246 

0.0256 

0.0251 

-0.0117 

-0.0166 

0.0602 

0.0295 

0.1059 

0.0714 

0.0179 

-0.0066 

0.0219 

-0.0006 

0.0272 

0.0080 

-0.0075 

-0.0321 

-0.0062 

-0.0154 

-0.0043 

-0.0112 

0.0367 

0.0983 

0.0382 

0.0678 

0.0398 

0.0553 

0.2447 

0.1808 

0.0935 

0.0455 

0.0936 

0.0437 

Difference  in 
Mean 

t-stat 

-0.5315*** 

-43.80 

-0.0383** 

-1.98 

0.0800*** 

4.66 

563.80  *** 

4.87 

0.0075 

0.21 

-0.0155*** 

-3.47 

-0.0857*** 

-3.24 

0.8835*** 

3.77 

0.0351*** 

2.95 

0.0004 

0.05 

0.0050 

0.80 

0.0306** 

2.49 

0.0345 

0.83 

0.0245** 

2.58 

0.0225** 

2.44 

0.0193** 

2.32 

0.0246*** 

3.83 

0.0093** 

2.23 

0.0070** 

2.18 

-0.0616*** 

-4.09 

-0.0296*** 

-2.78 

-0.0156 

-1.61 

0.0639 

1.41 

0.0480** 

2.25 

0.0499*** 

2.95 

Difference  in 
Median 

Z-stat 

-0.4095*** 

-37.30 

-0.0352*** 

-4.26 

0.0756*** 

6.95 

18.82 

1.40 

0.0431 

1.29 

-0.0164*** 

-5.97 

-0.0718*** 

-4.70 

0.0000* 

1.80 

0.0611*** 

2.63 

0.0082 

0.77 

0.0145*** 

3.04 

0.0351*** 

3.60 

0.0664* 

1.96 

0.0232*** 

4.18 

0.0207*** 

4.11 

0.0183*** 

4.02 

0.0196*** 

6.22 

0.0053*** 

3.24 

0.0040*** 

3.10 

-0.0319*** 

-3.99 

-0.0142** 

-2.51 

-0.0009 

-0.53 

0.0700*** 

3.16 

0.0571*** 

3.34 

0.0549*** 

3.57 

*,  **,  ***  Indicate  statistical  significance  at  the  10  percent,  the  5  percent,  and  the  1  percent  confidence  levels,  respec¬ 
tively  (two-tailed). 

^  This  table  compares  selected  firm  characteristics  and  performance  measures  between  sample  firm-years  and  non-sample 
firm-years.  Non-sample  firm-years  are  those  satisfying  sample  selection  criteria  (l)-(3),  but  with  an  increase  in  days  in 
inventor  of  less  than  20  percent  or  without  an  increase  in  the  inventory  level.  There  are  4,386  such  observations. 
Average  firm  characteristics  for  sample  firm-years  and  non-sample  firm-years,  differences  in  means,  and  differences  in 
medians  are  reported.  Test  for  differences  in  means  are  based  on  t-statistics  (two-tailed).  Tests  for  differences  in  medians 
are  based  on  Z-statistics  (two-tailed)  from  Wilcoxon  rank  sum  test.  Variables  are  defined  in  Table  2. 

*’  1,578  out  of  4,386  non-sample  firm-years  and  208  out  of  568  sample  firm-years  are  not  followed  by  analysts  as  reported 
on  I/B/E/S. 

41  out  of  4,386  non-sample  firm-years  and  3  out  of  568  sample  firm-years  do  not  have  shares  held  by  institutions  as 
reported  on  CDA  Spectrum. 
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firms  with  disproportionate  inventory  increases  includes  not  only  firms  with  deteriorating  perfor¬ 
mance,  but  also  those  with  growth  potential.  To  differentiate  between  these  two  types  of  firms, 
financial  information  users  demand  additional  information. 

IV.  RESEARCH  DESIGN  AND  RESULTS 
MD&A  Disclosures  on  Inventory  Increases 

I  read  the  MD&A  for  each  of  the  568  firm-year  observations  in  the  sample  and  collect 
disclosures  on  inventory  changes.  Because  assessing  the  quality  of  narrative  disclosures  is  subjec¬ 
tive,  I  only  code  the  existence,  and  the  favorability,  of  MD&A  disclosures  on  inventory  increases. 
The  disclosure-existence  score,  INVDIS_EXS,  takes  a  value  of  1  if  the  firm  explains  inventory 
increases  in  MD&A,  and  0  otherwise.  To  control  for  the  possibility  that  a  firm  can  discuss 
inventory  changes  in  other  part(s)  of  its  Form  10-K,  I  read  all  statements  on  inventory  in  Form 
10-K.  I  find  that  most  inventory-change  disclosures  are  provided  under  “liquidity  and  capital 
resources”  or  “results  of  operations”  in  MD&A.  Among  the  568  sample  firm-years,  only  three 
discuss  inventory  changes  in  footnotes  to  financial  statements  and  three  discuss  them  in  business 
overview.  For  empirical  tests,  I  treat  these  six  observations  as  firms  explaining  inventory  increases 
in  MD&A.  The  results  are  qualitatively  similar  if  I  exclude  these  six  observations. 

I  divide  the  subsample  of  firms  that  explain  inventory  increases  into  three  groups  depending 
on  the  favorability  of  the  provided  explanations.  The  favorability  is  determined  by  the  direction  of 
the  demand  shift  suggested  by  the  explanations.  Appendix  A  describes  the  coding  system  of 
MD&A  disclosures  on  inventory  increases.  Favorable  explanations  include  anticipation  of  high 
future  demand,  expectation  of  sales  growth,  introduction  of  new  product(s)  in  the  current  or  the 
subsequent  year,  receipt  of  new  contract(s),  and  addition  of  new  retail  store(s).  Such  statements 
indicate  that  sales  will  increase  in  the  future  and,  thus,  represent  good  news.  If  a  firm  states  that 
the  inventory  level  increases  to  support  sales  growth  in  the  current  period  and  if  there  is  no 
specific  indication  in  Form  10-K  that  sales  will  decrease  in  the  subsequent  year,  then  this  firm  is 
also  classified  as  having  favorable  explanations  for  inventory  increases.  The  favorable-inventory- 
disclosure  variable,  INVDIS_FAV,  takes  a  value  of  1  for  firm-years  with  favorable  explanations  for 
disproportionate  inventory  increases,  and  0  otherwise. 

Examples  of  unfavorable  disclosures  include  lower  than  expected  sales,  decreases  in  orders  or 
sales  in  the  current  period,  cancellation  of  orders,  and  products  returned  from  certain  customer(s) 
or  distributor(s).  These  explanations  suggest  that  sales  in  the  current  period  are  lower  than  the 
expectation  or  past  sales.  A  downward  demand  shift  and  a  significant  inventory  buildup  increase 
the  likelihood  of  inventory  obsolescence  and  can  lead  to  an  inventory  write-off  that  reduces  profit. 
Therefore,  these  explanations  are  classified  as  unfavorable.  The  unfavorable-inventory-disclosure 
variable,  INVDIS_UNF,  equals  1  for  observations  with  unfavorable  explanations  for  dispropor¬ 
tionate  inventory  increases,  and  0  otherwise. 

Neutral  explanations  for  inventory  increases  include  improvement  in  customer  service,  place¬ 
ment  of  more  trial  and  demonstration  machines  at  customer  locations,  addition  of  new  showrooms, 
an  increase  in  the  amount  of  safety  stock  of  inventory,  a  shift  in  the  timing  of  inventory  purchase 
or  production,  an  increase  in  inventory  to  support  a  higher  level  of  production,  a  volume  purchase 
of  inventory  items  at  a  favorable  price,  a  planned  buildup  in  inventory  in  anticipation  of  price 
changes,  and  a  lower  than  normal  inventory  level  at  the  end  of  the  previous  year.  Such  explana¬ 
tions  are  not  directly  associated  with  a  firm’s  current  expectation  for  future  sales  or  with  the 
relation  between  current  sales  and  past  expectation  (for  current  sales).  Interpretations  of  these 
explanations  are  ambiguous.  For  example,  the  decision  to  improve  customer  service  can  be  related 
to  several  reasons.  It  is  possible  that  a  firm  wants  to  improve  customer  satisfaction  in  order  to  have 
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a  positive  sales  growth  rate  in  the  coming  year.  However,  it  is  also  possible  that  a  firm  foresees 
problems  with  its  products  and  tries  to  reduce  the  sales  decrease  by  improving  customer  service. 
Therefore,  explanations  of  this  type  are  classified  as  neutral. 

Table  4  reported  descriptive  statistics  for  MD&A  disclosures  on  disproportionate  inventory 
increases.  Among  the  568  firm-years  with  disproportionate  inventory  increases,  282  (49.65  per¬ 
cent)  explain  the  changes  in  MD&A  (Panel  A).  Thus,  a  significant  number  of  firms  with  dispro¬ 
portionate  inventory  increases  choose  not  to  explain  inventory  changes  in  MD&A.  Among  the  282 
observations  that  explain  inventory  changes,  119  (42.20  percent)  have  favorable  explanations,  105 
(37.23  percent)  provide  neutral  explanations,  and  58  (20.57  percent)  give  unfavorable  explanations 
(Panel  B).  Untabulated  analyses  present  no  evidence  that  observations  in  a  particular  sub-industry 
are  more  likely  to  explain  inventory  changes  or  are  more  likely  to  provide  explanations  of  a  certain 
type  (favorability).  However,  67  percent  of  sample  firms  in  2002  explain  inventory  increases, 
while  48  percent,  on  average,  explain  the  changes  in  other  years.  The  increase  in  the  percentage  of 
firms  that  explain  inventory  increases  in  2002  can  be  a  result  of  several  reasons.  First,  the 
Sarbanes-Oxley  Act,  passed  on  July  30,  2002,  requires  management  to  review  and  certify  financial 
reports  since  August  29,  2002.  Managers  may  thus  have  an  incentive  to  provide  a  more  complete 
discussion  in  annual  reports.  Second,  the  SEC  announced  in  December  2001  that  it  would  review 
the  annual  reports  filed  in  2002  by  the  Fortune  500  companies  to  identify  inappropriate  or  deficient 
disclosures.  The  SEC  also  issued  a  statement  about  its  view  regarding  the  purpose  and  the  content 
of  MD&A  disclosures  in  January  2002.  The  SEC’s  attempt  to  discipline  MD&A  disclosures  can 
motivate  more  firms  to  explain  disproportionate  inventory  increases  in  MD&A. 

Forward-Looking  Statements  in  Form  10-K  on  a  Firm’s  Performance  Outlook 

Prior  research  (e.g.,  Bryan  1997;  Cole  and  Jones  2004)  finds  that  MD&A  disclosures  contain 
information  associated  with  future  firm  performance.  To  control  for  the  possibility  that  the  infor¬ 
mation  in  MD&A  inventory  disclosures  is  correlated  with  general  forward-looking  information  in 
MD&A  or  in  Form  10-K,  I  identify  forward-looking  statements  in  Form  10-K  on  a  firm’s  perfor¬ 
mance  outlook  and  assess  the  overall  favorability  of  these  statements  for  each  observation  in  the 

o 

sample.  A  firm  can  provide  forward-looking  statements  on  many  aspects  of  its  operations,  such  as 
sales,  costs,  strategy,  and  competition.  To  assess  the  overall  favorability  of  a  firm’s  performance 
outlook,  I  focus  on  future  profitability  or  the  change  in  future  profitability  implied  by  the  forward- 
looking  statements.  If  these  statements  suggest  that  a  firm’s  profit  or  profit  margin  will  be  main¬ 
tained  at  a  similar  level  or  even  increase,  the  favorable-outlook  variable,  FUTURE_FAV,  takes  a 
value  of  1 .  If  a  firm  indicates  that  its  profitability  will  decrease  or  if  negative  income  is  expected, 
the  unfavorable-outlook  variable,  FUTURE_UNF,  equals  1 .  If  the  forward-looking  statements  do 
not  have  consistent  implications  for  a  firm’s  future  profitability  or  if  a  firm  does  not  have  specific 
predictions,  then  the  firm’s  outlook  is  expected  to  be  neutral.  Appendix  B  details  the  classification 
of  forward-looking  statements  on  a  firm’s  performance  outlook.  The  forward-looking  statements 
are  favorable  for  146  (25.70  percent)  of  the  568  firm-years  in  the  sample,  neutral  for  297  (52.29 
percent)  observations,  and  unfavorable  for  125  (22.01  percent)  firm-years  (Table  4,  Panel  C). 

Table  4,  Panel  D  presents  a  contingency  table  of  the  favorability  of  forward-looking  state¬ 
ments  in  Form  10-K  and  the  type  of  inventory  disclosures  in  MD&A.  I  find  that  the  favorability 
of  a  firm’s  forward-looking  statements  and  the  favorability  of  its  explanations  for  the  dispropor¬ 
tionate  inventory  increase  are  correlated.  For  example,  49  percent  of  firm-years  with  favorable 
inventory  disclosures  also  have  a  favorable  performance  outlook.  However,  at  the  same  time,  they 


The  forward-looking  statements  in  10-K  are  identified  by  searching  for  sentences  containing  “believe,”  “anticipate,” 
“expect,”  “estimate,”  “plan,”  “seek,”  “intend,”  “project,”  “outlook,”  or  “goal.” 
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are  likely  to  contain  information  incremental  to  each  other.  Among  the  119  firm-years  with  favor¬ 
able  inventory  disclosures,  41  percent  (10  percent)  have  a  neutral  (unfavorable)  outlook  because 
expected  increases  in  costs  offset  or  more  than  offset  expected  increases  in  sales. 

Tests  of  Hypotheses 

Predictive  Ability  of  MD&A  Inventory  Disclosures  for  Future  Firm  Performance  (HI) 

To  investigate  the  predictive  ability  of  MD&A  explanations  for  disproportionate  inventory 
increases  for  future  firm  performance  (HI),  I  examine  the  association  between  inventory- 
disclosure  variables  and  average  financial  performance  measures — return  on  assets  (ROA)  and 
sales  growth  (SALESGR) — in  the  subsequent  one,  two,  or  three  years  using  Regressions  (1)  and 
(2). 

ROAleadi,  =  a,  -l-  bifNVDIS_EXS  +  baINVDIS_FAV+  bijlNVDIS_UNE  +  bi^FUTURE_EAV 
+  bi5EUTURE_UNE  +  bi(,lNVDAYS_CH%  +  baINV_CH%  +  bi^SALESGR 
+  bjgROA  -I-  bii()ROA_CH  +  bmStdROA  +  bii2LnMV  +  bn^BM  +  CjYearlndicators 
1,2,3);  (1) 


SALESGRleadi,  =  a,  +  bifNVDlS_EXS  +  bi2lNVDIS_EAV  +  bi^lNVDISJJNE 

+  bi^FUTURE_EAV+  bi^EUTURE_UNF  +  bi(,INVDAYS_CH% 

+  bi2lNV_CH%  +  bi^SALESGR  +  bigTA_CH%  +  b^oROA  +  bn^ROA_CH 
+  CiYearlndicators  +  Ciii  =  1,2,3).  (2) 

I  examine  ROA  because  profitability  is  the  focus  of  financial  statement  analysis.  I  examine 
sales  growth  because  sales  are  a  necessary  condition  for  operating  profits  for  manufacturing  firms 
and  because  the  favorability  of  MD&A  inventory  disclosures  is  defined  based  on  the  direction  of 
demand  shifts  suggested  by  the  explanations. 

The  variables  of  interest  in  Regressions  (1)  and  (2)  are  the  variables  related  to  inventory 
disclosures  in  MD&A.  INVDIS_EXS  indicates  the  existence  of  MD&A  explanations  for  inventory 
increases.  It  takes  a  value  of  1  if  a  firm  explains  the  inventory  changes,  and  0  otherwise.  Its 
coefficient  b^  represents  the  performance  difference  between  firms  that  provide  neutral  explana¬ 
tions  and  those  that  do  not  explain  the  inventory  changes.  1NVD1S_EAV  takes  a  value  of  1  if  a  firm 
explains  the  inventory  increase  in  MD&A  and  if  the  provided  explanations  are  favorable,  and  0 
otherwise.  Similarly,  1NVDIS_UNE  equals  1  if  a  firm  provides  unfavorable  explanations  for  its 
inventory  increase,  and  0  otherwise.  The  coefficient  &,2  (^,3)  on  INVDIS_EAV  (1NVDIS_UNF) 
measures  the  performance  difference  between  firms  with  favorable  (unfavorable)  explanations  and 
those  with  neutral  explanations.  To  test  HI,  I  focus  on  the  coefficient  bi2  on  INVDIS_FAV  and  the 
coefficient  bi^  on  INVDISJJNF.  The  null  hypothesis  that  the  favorability  of  MD&A  explanations 
is  not  associated  with  future  firm  performance  implies  bi2  =  ^,-3  =  0.  Evidence  of  >  0,  bi^,  <  0, 
and/or  would  suggest  that  future  firm  performance  is  positively  associated  with  the 

favorability  of  MD&A  inventory  disclosures.^  Although  I  do  not  examine  the  implications  of  a 
firm’s  decision  to  explain  the  inventory  increase  in  MD&A  for  its  prospects  (H2)  with  Regressions 
(1)  and  (2),  the  coefficients  on  inventory  disclosure  variables  in  these  regressions  imply  the 


’  Treating  the  MD&A  inventory-disclosure  variables  as  endogenous  is  inappropriate  because  instead  of  examining 
whether  the  disclosures  have  any  impact  on  future  firm  performance,  I  investigate  whether  firms  that  explain  the 
inventory  changes  or  provide  a  certain  type  of  explanations  differ  in  terms  of  future  financial  performance.  I  examine  the 
association  between  disclosures  and  future  performance,  instead  of  a  causal  relation  between  them. 
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prospects  of  non-disclosure  firms,  as  compared  to  those  of  firms  with  favorable  or  unfavorable 
disclosures.  In  particular,  +  &;2  {bn  +bi^)  measures  the  performance  difference  between  firms 
with  favorable  (unfavorable)  explanations  and  non-disclosure  firms. 

Following  Core  et  al.  (1999)  and  Hanlon  et  al.  (2003),  in  Regression  (1),  average  ROA  in  the 
subsequent  i  years  (ROAleadi)  is  regressed  on  the  inventory  disclosure  variables,  ROA  in  the 
current  year  (ROA),  and  control  variables.  Two  indicator  variables  related  to  statements  in  Form 
10-K  on  a  firm’s  performance  outlook  are  also  included  to  control  for  general  forward-looking 
information  in  Form  10-K.  FUTURE_FAV  {FUTURE_UNF)  takes  a  value  of  1  if  a  firm’s  forward- 
looking  statements  in  Form  10-K  are  favorable  (unfavorable),  and  0  otherwise.  To  examine 
whether  MD&A  disclosures  on  inventory  increases  augment  the  information  on  inventory  changes 
available  in  financial  statements,  I  control  for  the  percentage  change  in  days  in  inventory  {INV- 
DAYS_CH%)  and  the  percentage  change  in  inventory  {INV_CH%).  To  address  the  possibility  that 
the  favorability  of  MD&A  inventory  disclosures  is  correlated  with  current  sales  growth,  I  control 
for  sales  growth  in  year  t  {SALESGR).  I  use  the  level  of  ROA  and  the  change  in  ROA  {ROAjCH) 
in  year  t  as  additional  independent  variables  to  control  for  their  predictive  ability  for  future  ROA. 
To  control  for  the  relation  between  firm  risk  and  future  profitability,  I  also  include  the  standard 
deviation  of  ROA  from  year  t-A  to  year  t  (StdROA),  the  natural  logarithm  of  market  value  of 
common  equity  at  the  end  of  year  t  (LnMV),  and  the  book-to-market  ratio  at  the  end  of  year  t 
{BM).  I  use  year  indicators  to  control  for  the  time  variation  in  ROA. 

In  Regression  (2)  for  future  sales  growth  {SALESGRleadi),  besides  the  disclosure  variables, 
the  two  inventory-change  variables  {INV_CH%,  INVDAYS_CH%),  and  sales  growth  in  year  t 
{SALESGR),  I  include  the  percentage  change  in  total  assets  from  year  t-l  to  year  t  {TA_CH%), 
ROA  in  year  t  {ROA),  and  the  change  in  ROA  in  year  t  {ROA_CH)  as  another  three  independent 
variables.  TA_CH%  controls  for  the  positive  relation  between  the  current  change  in  total  assets  and 
future  sales  growth.  As  the  firm  grows,  sales  are  likely  to  increase.  ROA  and  ROAjCH  control  for 
the  effect  of  current  profitability  or  change  in  profitability  on  future  competition.  A  high  profit¬ 
ability  level  or  an  increase  in  profitability  attracts  competition  and  thus  may  lead  to  a  lower  sales 
growth  rate  in  the  future. 

Estimation  results  for  Regression  (1)  are  reported  in  Table  5,  Panel  A.  The  coefficient  on  the 
indicator  of  favorable  MD&A  inventory  disclosures  {INVDIS_EAV)  is  positive  in  all  three  columns 
and  significant  in  the  regressions  for  average  ROA  in  the  subsequent  two  or  three  years 
{ROAleadi, 3).  The  coefficient  on  the  indicator  of  unfavorable  inventory  disclosures  {INVDIS- 
_UNE)  is  negative  in  all  three  columns  and  significant  in  the  regression  for  ROA  in  year  t+\ 
{ROAleadi).  These  coefficients  suggest  that  firms  with  favorable  (unfavorable)  explanations  for 
disproportionate  inventory  increases  have  significantly  higher  (lower)  future  ROA  than  firms  with 
neutral  explanations.  To  examine  whether  firms  with  favorable  explanations  have  significantly 
higher  future  ROA  than  those  with  unfavorable  explanations,  I  compare  the  coefficients  on  IN- 
VDIS_FAV  and  INVDISJJNF  (Test  1  (null):  1NVDIS_FAV  -  INVDISJJNE  =  0).  I  find  that  firms 
with  favorable  inventory  disclosures  have  significantly  higher  future  ROA  in  all  three  columns. 
For  example,  1NVDIS_FAV  -  INVDISJJNF  equals  0.0790  (p  =  0.001)  in  the  regression  for  ROA 
in  year  t+1  {ROAleadi).  0.0790  represents  58  percent  of  the  interquartile  range  of  ROAleadi  for 
all  manufacturing  firm-years  satisfying  example  selection  criteria  (l)-(3),  suggesting  that  the 
performance  difference  between  firms  with  favorable  inventory  disclosures  and  those  with  unfa¬ 
vorable  inventory  disclosures  is  economically  significant.  These  results  suggest  that  MD&A  ex¬ 
planations  for  disproportionate  inventory  increases,  if  provided,  have  predictive  ability  for  ROA  in 
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Sun 


the  subsequent  three  years,  rejecting  null  hypothesis  of  Hl.**^ 

For  control  variables,  the  coefficient  on  the  indicator  of  favorable  forward-looking  informa¬ 
tion  in  Form  10-K  (FUTURE^FAV)  is  significantly  positive  and  the  coefficient  on  the  indicator  of 
unfavorable  forward-looking  statements  in  Form  10-K  {FUTURE_UNF)  is  significantly  negative, 
confirming  the  results  in  prior  studies  (e.g.,  Bryan  1997)  on  the  predictive  ability  of  narrative 
disclosures  in  MD&A  by  using  data  in  a  more  recent  period,  although  FUTURE_FAV  and  FU- 
TUREJJNF  are  defined  using  forward-looking  statements  in  Form  10-K.  One  difference  in  re¬ 
search  design  between  Bryan  (1997)  and  this  study  is  that  Bryan  (1997)  uses  the  direction  of 
change  in  future  sales  or  earnings  per  share  {EPS)  as  the  dependent  variable,  while  this  study  uses 
the  level  of  future  ROA  or  sales  growth  directly.  The  later  approach  enables  an  examination  of  the 
magnitude  of  the  association  between  current  disclosures  and  future  performance.  For  example, 
results  in  Table  5,  Panel  A  indicate  that  average  ROA  in  year  t+\  {ROAleadl)  of  firms  with 
favorable  (unfavorable)  forward-looking  disclosures  in  Form  10-K  is  higher  by  0.0510  (lower  by 
0.0640)  than  firms  with  neutral  disclosures.  These  performance  differences  are  economically  sig¬ 
nificant  because  they  represent  37  percent  and  47  percent  of  the  interquartile  range  of  ROAleadl 
for  all  manufacturing  firm-years  satisfying  example  selection  criteria  (l)-(3). 

The  coefficients  on  the  percentage  change  of  days  in  inventory  or  inventory  {INVDAYS_CH% 
and  INV_CH%)  are  not  significant  in  any  column,  suggesting  that,  for  firms  with  disproportionate 
inventory  increases,  inventory  changes  have  varying  implications  for  future  profitability.  The 
coefficient  on  ROA  is  significant  and  between  0  and  1,  consistent  with  mean  reversion  of  account¬ 
ing  performance  measures.  I  also  find  that  firms  with  a  greater  variation  in  ROA  over  time 
(StdROA)  have  a  lower  future  ROA,  while  firms  with  a  larger  size  {FnMV)  have  a  higher  future 
ROA. 

Table  5,  Panel  B  reports  the  estimation  results  for  Regression  (2).  For  HI,  the  coefficient  on 
the  indicator  of  favorable  inventory  disclosures  {JNVDIS_FAV)  is  significantly  positive  in  the 
regressions  for  average  sales  growth  in  the  subsequent  one  or  two  years  {SALESGRleadl  ,2)-,  the 
coefficient  on  the  indicator  of  unfavorable  inventory  disclosures  {INVDISJUNF)  is  negative  but 
insignificant  in  all  three  columns.  The  test  on  the  performance  difference  between  firms  with 
favorable  explanations  and  those  with  unfavorable  explanations  (Test  1)  suggests  that  favorable- 
disclosure  firms  have  significantly  higher  sales  growth  in  subsequent  years.  For  example,  INVD- 
IS_FAV  -  INVDIS_UNF  is  0.1590  in  the  regression  for  SALESGRleadl,  and  0.1590  is  69  percent 
of  the  interquartile  range  of  SALESGRleadl  for  all  manufacturing  firm-years  satisfying  sample 
selection  criteria  (l)-(3).  These  results  suggest  that  MD&A  explanations  for  disproportionate 
inventory  increases  have  predictive  ability  for  a  firm’s  future  sales  growth,  again  rejecting  the  null 
of  HI. 

The  coefficient  on  the  indicator  of  favorable  forward-looking  statements  {FUTURE_FAV)  is 
significantly  positive  in  all  three  columns,  and  the  coefficient  on  the  indicator  of  unfavorable 


Bernard  and  Noel  (1991)  show  that  changes  in  different  inventory  components  have  different  implications  for  future 
earnings  and  sales.  In  particular,  unexpected  changes  in  raw  materials  and  work  in  process  are  positive  predictors  for 
future  sales,  while  unexpected  changes  in  finished  goods  are  negative  predictors  for  future  earnings  for  manufacturers. 
To  examine  whether  MD&A  inventory  disclosures  have  predictive  ability  for  future  ROA  and  sales  growth  after 
controlling  for  changes  in  inventory  components,  I  use  the  381  firm-years  that  disclose  inventory  components  in  both  the 
current  year  and  the  previous  year  and  replace  inventory  change  variables  in  Regressions  (1)  and  (2)  by  changes  in 
inventory  components.  The  results  for  hypotheses  are  qualitatively  similar. 

"  Although  results  in  Table  5  and  statistics  in  Table  4,  Panel  C  both  suggest  that  inventory  disclosures  in  MD&A  and 
forward-looking  statements  in  10-K  contain  incremental  information,  I  conduct  additional  analysis  to  examine  whether 
MD&A  inventory  disclosures  unrelated  to  predictions  for  the  future  have  predictive  ability  for  firm  prospects.  After 
removing  32  observations  that  attribute  inventory  increases  to  forecasted  future  sales  change,  I  still  find  that  firms  with 
favorable  MD&A  inventory  disclosures  have  better  accounting  outcomes  in  the  subsequent  years  than  those  with 
unfavorable  disclosures,  suggesting  that  MD&A  inventory  disclosures  related  to  past  events  are  informative. 
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forward-looking  information  {FUTURE_UNF)  is  significantly  negative  in  the  regression  for 
SALESGRleadl ,  suggesting  that  forward-looking  statements,  though  generally  qualitative,  contain 
information  associated  with  future  sales  growth.  The  two  inventory-change  variables,  INVDAY- 
S_CH%  and  INV__CH%,  do  not  have  a  significant  coefficient  in  any  column,  suggesting  that 
disproportionate  inventory  increases  do  not  have  uniform  implications  for  future  sales  growth. 
Consistent  with  my  expectation,  the  coefficient  on  the  change  in  total  assets  {TA_CH%)  is  positive 
and  the  coefficient  on  ROA  is  negative,  indicating  that  sales  increase  as  the  firm  size  grows  and 
decrease  as  competition  intensifies. 

In  summary,  results  in  Table  5  show  that  the  favorability  of  MD&A  disclosures  on  inventory 
increases  has  predictive  ability  for  a  firm’s  profitability  and  sales  growth  in  the  subsequent  three 
years,  implying  that  MD&A  explanations  for  inventory  increases  are  reliable  on  average  and 
contain  relevant  information  for  financial  statement  analysis.  Untabulated  analyses  show  that  97 
(82  percent)  out  of  the  119  firm-years  with  favorable  explanations  have  positive  sales  growth  in 
the  subsequent  year.  Among  the  105  and  the  58  firm-years  with  neutral  or  unfavorable  explana¬ 
tions,  only  58  (55  percent)  and  15  (26  percent)  have  positive  sales  growth  in  the  subsequent  year, 
respectively.  This  evidence  provides  further  support  that  MD&A  disclosures  on  inventory  in¬ 
creases  are  not  cheap  talk. 

Prior  research  finds  that  disproportionate  inventory  changes  are  negatively  associated  with  a 
firm’s  current  and  future  stock  returns  and  future  earnings  (Lev  and  Thiagarajan  1993;  Abarbanell 
and  Bushee  1997,  1998).  However,  the  results  from  testing  HI  suggest  that  the  disproportionate 
inventory  increase  is  not  necessarily  a  negative  signal  for  a  specific  firm.  The  group  of  firms  with 
disproportionate  inventory  increases  consists  of  firms  with  good  prospects  and  firms  with  poor 
prospects.  For  example,  firms  with  favorable  (unfavorable)  MD&A  inventory  disclosures  have  an 
average  ROA  of  5.90  percent  (—11.61  percent)  in  the  following  year,  well  above  (below)  the 
industry  average  of  1.51  percent.  The  HI  results  imply  that  investors  can  potentially  differentiate 
between  these  two  types  of  firms  and  make  better  financial  statement  analyses  by  looking  at 
relevant  disclosures  in  MD&A.  Such  results  reinforce  the  recommendation  by  Penman  (2001)  to 
examine  firm  disclosures  to  better  understand  changes  in  accounting  ratios.'^ 

Implications  of  the  Decision  to  Explain  Inventory  Increases  for  Future  Performance  (H2) 

I  investigate  the  implications  of  a  firm’s  decision  to  explain  its  disproportionate  inventory 
increase  in  MD&A  for  its  future  performance  (H2)  by  examining  the  association  between  the 
existence  of  MD&A  inventory  disclosures  and  average  return  on  assets  {ROA)  or  sales  growth 
(SALESGR)  in  the  subsequent  one,  two,  or  three  years  using  Regressions  (3)  and  (4). 


ROAleadi,  =  a,  -l-  hifNVDIS_EXS  +  baFUTURE^FAV  +  bi-iFUTURE_UNF 

+  bi4lNVDAYS_CH%  +  bi5lNV_CH%  +  hi(,SALESGR  +  hpROA  +  hi^ROA_CH 
+  bi()StdROA  +  biioEnMY -i-  b^BM  +  c lY earindicators  +  ei{i  =1,2,3).  (3) 


To  better  understand  the  link  between  current  inventory  changes  and  future  earnings  and  sales  growth,  in  untabulated 
analyses,  I  compare  several  other  performance  outcomes  between  firms  with  different  types  of  MD&A  inventory 
disclosures.  I  find  that  firms  with  favorable  explanations  have  an  increase  in  gross  margin  in  year  t,  an  increase  in  labor 
force  starting  in  year  t  and  a  smaller  decrease  in  days  in  inventory  in  the  following  years.  These  results  indicate  that 
because  current  inventory  increases  for  firms  with  favorable  explanations  are  related  to  good  reasons  (e.g.,  expectation 
of  sales  growth),  these  firms  can  charge  a  higher  sales  price,  hire  more  employees,  and  maintain  an  inventory  level 
suggested  by  its  performance  expectation.  These  results  also  indirectly  suggest  that  the  classification  of  MD&A  inven¬ 
tory  disclosures  in  this  study  is  reasonable. 
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SALESGRleadit  =  a;  +  hixINVDIS_EXS  +  bi2FUTURE_EAV  +  bijFUTUREJJNF 

+  bi4lNVDAYS_CH%  +  hiiINV_CH%  +  hi(,SALESGR  +  haTA_CH% 

+  hi^ROA  +  bigROAjCH  +  CiYear Indicators  +  Ciii  =1,2,3)-  (4) 

Regressions  (3)  and  (4)  are  similar  to  Regressions  (1)  ana  (2),  except  that  the  existence  of 
MD&A  inventory  disclosures  {INVDIS_EXS)  is  the  only  MD&A-inventory-disclosure  variable 
included.  Evidence  of  a  significant  coefficient  bn  on  INVDIS_EXS  would  reject  the  null  of  H2 
(i.e.,  bn  =  0).  A  significantly  positive  coefficient  would  suggest  that,  on  average,  firms  that  explain 
inventory  increases  in  MD&A  have  better  future  performance  than  those  that  do  not,  consistent 
with  the  conjecture  that  non-disclosure  firms  are  more  likely  to  be  earnings  manipulators  or  firms 
with  negative  business  development. 

The  estimated  coefficients  for  disclosure  variables  in  Regressions  (3)  and  (4)  are  presented  in 
Table  6,  Panel  A  and  Panel  B,  respectively.  The  results  for  other  variables  are  similar  to  those  in 
Table  5  and  are  omitted  from  Table  6.  For  H2,  the  coefficient  on  the  existence  of  MD&A  inventory 
disclosures  ilNVDIS_EXS)  is  not  statistically  significant  in  any  column  of  either  panel,  indicating 
that  future  profitability  or  sales  growth  of  firms  that  explain  disproportionate  inventory  increases 
in  MD&A  is  not  significantly  different  from  that  of  firms  that  do  not  explain  the  changes.  Thus,  a 
firm’s  decision  not  to  explain  disproportionate  inventory  increases  in  MD&A  does  not  necessarily 
indicate  bad  news.  This  result  suggests  that  a  significant  number  of  firms  with  favorable  private 
information  may  choose  to  pool  with  firms  with  low-quality  earnings  or  bad  news  and  choose  not 
to  explain  inventory  increases. 

Although  non-disclosure  firms,  as  a  group,  do  not  have  significantly  different  future  perfor¬ 
mance  from  disclosure  firms,  they  may  have  different  prospects  from  firms  with  favorable  or 
unfavorable  explanations.  The  coefficients  on  MD&A  disclosure  variables  in  Regressions  (1)  and 
(2)  provide  a  way  to  test  this  conjecture  statistically.  The  sum  of  the  coefficients  on  the  indicator 
of  favorable  (unfavorable)  inventory  disclosures,  INVDIS_FAV  {INVDIS_UNF),  and  on  the  exis¬ 
tence  of  inventory  disclosures,  INVDIS_EXS,  represents  the  performance  difference  between  firms 
with  favorable  (unfavorable)  explanations  and  non-disclosure  firms.  In  Table  5,  Panel  A,  the  sum 
of  the  coefficients  on  INVDIS_FAV  and  INVDIS^EXS  is  significantly  positive  in  the  regression  for 
average  ROA  in  the  subsequent  three  years,  ROAleadS  (Test  2),  and  the  sum  of  the  coefficients  on 
INVDISJJNF  and  INVDIS_EXS  is  significantly  negative  for  ROAleadl  and  ROAlead2  (Test  3). 
For  example,  INVDIS_FAV  +  INVDIS_EXS  is  0.0190  in  the  regression  for  ROAleadl,  suggesting 
that  average  ROA  in  year  t+\  for  firms  with  favorable  inventory  disclosures  is  higher  than  that  for 
non-disclosure  firms  by  0.0190.  In  Table  5,  Panel  B,  the  sum  of  the  coefficients  on  INVDIS_FAV 
and  INVDIS_EXS  is  significantly  positive  in  the  regressions  for  average  sales  growth  in  the 
following  one  or  two  years,  SALESGRleadl  and  SALESGRleadl  (Test  2),  and  the  sum  of  the 
coefficients  on  INVDISJJNF  and  INVDIS_EXS  is  significantly  negative  for  SALESGRleadl  (Test 
3).  These  results  suggest  that,  although  a  firm’s  decision  not  to  explain  the  disproportionate 
inventory  increase  in  MD&A  does  not  necessarily  indicate  bad  news,  future  financial  performance 
of  non-disclosure  firms  falls  between  that  of  firms  with  favorable  explanations  and  firms  with 
unfavorable  explanations. 


The  insignificant  coefficient  on  the  existence  of  MD&A  inventory  disclosures  {INVD1S_EXS)  can  arise  because  the 
group  of  non-disclosure  firms  contain  both  firms  with  bad  news  and  firms  with  good  news,  as  discussed  in  H2.  However, 
such  an  insignificant  result  is  also  consistent  with  the  possibility  that  non-disclosure  firms  have  little  news.  Under  this 
conjecture,  non-disclosure  firms  are  expected  to  experience  smaller  changes  in  future  performance  than  disclosure  firms. 
However,  the  data  suggest  that  non-disclosure  firms  and  disclosure  firms  have  similar  means  or  standard  deviations  for 
absolute  change  in  ROA  or  sales  growth  in  the  subsequent  years. 


The  Accounting  Review 
American  Accounting  Association 


July  2010 


Do  MD&A  Disclosures  Help  Users  Interpret  Disproportionate  Inventory  Increases? 


1435 


V 

u 

fi 

a 

S 

Ui 

1-1 

u 

Cm 

s 

>. 

b 

Si 

s 

S 

-o 

c 

(Q 

OJ 

V 
V! 

C3 

<U 

u 

c 

s»> 

Mm 

O  fS 

fl  .2 

>■  a> 

^  a -B 

g  a 

o 

</!  S 

^  “t! 

a  o 

Vi 

O 

— "  Vi 
O  aj 
_v  ^ 


U 

h.) 

ca 


Q 

aa 

Q 


Cm 

O 

a> 

u 

c 

V 


U 

c 

u 

"S 

.c 

s 

o 

•  PN 

.2 

*3 

o 

c/3 


C/l 

0/ 

S 

2 

‘C 

•M 

C 

V 

■a 

B 

V 

a 

a> 

Q 


ftS 


* 

*  * 

*  * 

O  m 

ro  , — V  , — ,10,^ — , 

O  >0  ID  0^  >o 

p  CNj  p  ^  O  m 

d  o  O  O  (N 

^  ^11 


CD 

CD 

o 

o 


* 

* 

* 

a\ 


* 

* 

* 


I  I 


00  ID  CD  IT)  CD 

cN  p  p  p 

d  d  d  CD 

I  I 


* 
* 
* 
■o 
, — ,  oo 
>D 

^  o 
d  d 

I 


* 

* 

* 

>D 

- — . 

o  o 
^  d 
^  I 


I 


as. 


.Q 

5 

CB 

O 

u 

Clm 


V 

B 

B 

Ph 


A 

.2 

'u 

C 

4> 

c 

0/ 

a 

a> 

'O 

e 


P 

1 

d 

1 

d 

d' 

22 

d 

d 

Q 

d 

d 

d 

d 

d 

d 

d 

cn 

'O 


VO 


cs 

in 


On 

VO 

in 


00 

VO 

in 


<u 


c  <; 


CQ 

‘3 

■M 

B 

0) 

■O 

B 

a> 

a 

u 

a 


o 

cn 


o 

Ml 

o 

</] 

05 

B 

Rt 

CM 


§ 


</) 

3 

.2 

*3 

c 

0/ 

-O 

c 

a 

'O 

C 


o 


* 

* 

* 

CN 

o 


in 

r- 


00  00  ov 

o  r--  o  r-  o  VO 


VO 


O  O  O  fO 

I  I 


o 

I 


cs 

VO 


* 

* 

* 

* 

00 

1-H 

c— V  r- 

^  in 

00 

o 

r-  ^ 

Os 

1—1 

o 

o  ^ 

P  p 

OO 

d 

d  d 

in  d 

d 

1 

1 

1 

) 

CN 

in 

* 

* 

* 

^  VO 

^  o 
O  in  in 

O  -H  ^ 

d  d  CD 


* 

* 

* 

o 

00 

CS  00 

^  vq 
d  d 
I  I 


o 

o 

CN 

00  d 

VO 

in 


ti) 

cd' 


lil' 

§ 

t-M 

d 

tM 


I 

d' 

§ 

d 

d 


d 

TD 

O 

'W 

c/5 

C  < 


_o 

"o 


& 

o 

c 

<L> 

> 

•S 

< 

Q 

s 


0) 

u 

c 

<u 


T3 

<U 


O 


c3 

e 

o 

c 

C3 


0) 

o. 


<1^ 

-S2  ^ 
"S  <u 
>  h 
w  2 

3 

m 

s 

o  c 
■o  R 
^  o 
=  - 
o 

U  Cm 


c 

o 

o 

Mh 

<u 

Q. 


^  21 
§  O 


c 

o 

O 

Mh 

OJ 

Oc 


0) 

.C 


c  V3 

o  ^ 

<D  2 

a  c 
o  > 

<D 


(U 

x: 


•u 

CJ 

c 

cd 

o 

CO 


"o 


c  c 

W)  qj 


O 

o 

(U 

•s 

c/5 

c 

•U 

c/) 

(U 

t-< 

Om 

mD 

c« 


CiJ 

550 

C3 

s; 

c 

o 

Ci; 

a 

•S 

'•N.a 

o 

cj 


The  Accounting  Review 


July  2010 

American  Accounting  Association 


1436 


Sun 


o 


u  c 

-S  '3 

t2  a. 
geo. 
> 


*0 

c 

(O 

CL 

V 

T3 

<D 

x: 

H 


[3 


D 

c/5 

0 

c 

0 

■5 

td 

0 

t 

<u 

c/5 

X) 

0 

1 

CN 

00 

c/5 

ra 

0^ 

c3 

c/5 

OJ 

c/5 

53 

<D 

>s 

3 

cd 

tG 

> 

00 

c  u 

o  ^ 

o 

c 

o 


'O 

ID 

c/5 

(U 


§  §5 


•o 

c  . 

c3  0) 
».  c/5 

^  *> 

c/5  1> 

^  J 

U<  -S 

^  o 

o  <=> 

'O 

C  ^ 
cd  ■‘^ 

■f-T  ^ 

B  >> 

^,s 

S  0) 


(O 


-O 
cd 

x> 

cd  5 
1-1  ^ 
O  ^ 

>  ^ 
cd  5 
C4-1  3 

<0  <u 
c9  ^ 

o  o 


V 

-G 


o  o 

pH  PP 

G  G 


000 

^  w  w 
c  3  3 
O  O 


cd 


•^00 

H  O  CJ 

^  G  G 
(U  cd  cd 

•S  i  J 

<r‘  ^  ^ 
^  a>  0) 
ex  CU 


Q 

s 


"*H 

(U 

bi) 

c 

.s 

0 

0 

0 

(D 

> 

c/5 

c/5 

C/) 

cd 

•G 

•S 

<D 

C 

c 

(D 

> 

§ 

0) 

c/5 

Cd 

<D 

D 

a 

0) 

a 

cd 

G 

Vh 

<u 

(U 

ry) 

0 

3  cd  cd 


C*  oi)  w) 

o  G  C 

« '-S  '-S 

1)0  0 

>00 


■"^  TO  t3 

c  i  § 

3  g  t 

g-c2  C4-. 
D  C/5  C/5 


cG  U=  X2 


I  vvv 

C!  CO  >  fe 
^  ^  § 
^  P  S  ^ 

Q  I  r  I  r  I 
U  £2  ttl  IJJ 
OS  ct: 


■§ 

I 


hi  hi 

h,  h. 


The  Accounting  Review  My  2010 

American  Accounting  Association 


Do  MD&A  Disclosures  Help  Users  Interpret  Disproportionate  Inventory  Increases? 


1437 


Determinants  of  the  Decision  to  Explain  Disproportionate  Inventory  Increases  in  MD&A 

This  study’s  findings  imply  that  MD&A  inventory  disclosures  help  users  understand  the 
implications  of  disproportionate  inventory  increases  for  future  firm  performance.  However,  not  all 
firms  with  disproportionate  inventory  increases  choose  to  explain  such  inventory  changes  in 
MD&A.  A  comprehensive  investigation  of  the  determinants  of  a  firm’s  disclosure  decision  is 
beyond  the  scope  of  this  study.  Preliminary  tests  suggest  that  a  firm’s  disclosure  decision  is 
positively  associated  with  the  percentage  change  in  inventory  and  product  differentiation,  while 
negatively  associated  with  performance  variability  and  sales  growth.  I  also  find  that  the  propensity 
to  explain  inventory  changes  in  MD&A  is  greater  in  2002,  probably  because  of  the  Sarbanes- 
Oxley  Act  passed  in  2002  and  the  SEC’s  effort  in  2001  and  2002  to  discipline  MD&A  disclosures. 
These  results  suggest  that  a  firm’s  disclosure  decision  is  made  after  considering  both  disclosure 
benefits  and  disclosure  costs  and  can  potentially  explain  why  financial  performance  of  non- 
disclosers  is  no  worse  than  that  of  disclosers  and  why  the  existence  of  MD&A  inventory  disclo¬ 
sures  is  not  necessarily  an  indicator  of  better  future  firm  performance. 

V.  CONCLUSION 

This  study  examines  the  association  between  the  existence,  and  the  favorability,  of  MD&A 
disclosures  on  disproportionate  inventory  increases  and  a  firm’s  future  financial  performance. 
Disproportionate  inventory  increases  can  arise  for  a  variety  of  reasons.  To  understand  the  inven¬ 
tory  changes  and  to  evaluate  firm  performance,  investors  can  utilize  additional  information  from 
the  firm.  This  study  investigates  whether  MD&A  inventory  disclosures  can  help  users  interpret 
disproportionate  inventory  increases. 

I  find  that  the  favorability  of  explanations  for  disproportionate  inventory  increases  in  MD&A 
is  positively  associated  with  a  firm’s  profitability  and  sales  growth  in  the  subsequent  three  years. 
However,  286  (50.35  percent)  of  the  568  manufacturing  firm-years  with  disproportionate  inven¬ 
tory  increases  choose  not  to  explain  the  changes  in  MD&A.  I  find  that,  although  non-disclosure 
firms  do  not  have  significantly  poorer  future  financial  performance  than  disclosure  firms,  non¬ 
disclosure  firms’  future  profitability  and  sales  growth  fall  between  those  of  favorable-disclosure 
firms  and  unfavorable-disclosure  firms.  These  results  suggest  that  the  existence,  and  the  favorabil¬ 
ity,  of  MD&A  inventory  disclosures  have  predictive  ability  for  future  financial  performance  for 
firms  with  disproportionate  inventory  increases. 

By  examining  the  association  between  MD&A  disclosures  on  changes  in  a  line  item  directly 
reported  on  the  balance  sheet  and  a  firm’s  future  financial  performance,  this  study  contributes  to 
the  literature  on  disclosures  and  shows  that  MD&A  disclosures  on  inventory  increases  help  users 
interpret  the  inventory  changes  and  have  predictive  ability  for  future  firm  performance.  It  adds  to 
the  literature  on  inventory  by  showing  that  disproportionate  inventory  increases  can  have  different 
implications  for  a  firm’s  prospects.  This  study  also  contributes  to  the  literature  on  financial  state¬ 
ment  analysis  by  showing  that  after  disproportionate  inventory  increases  are  identified,  financial 
information  users  can  use  MD&A  disclosures  to  further  identify  the  reasons  and  the  implications. 

The  Sarbanes-Oxley  Act  was  passed  in  2002  to  improve  the  quality  of  financial  disclosures. 
As  this  study  examines  MD&A  disclosures  on  inventory  increases  during  1998-2002,  future  study 
could  examine  MD&A  inventory  disclosures  in  recent  years  to  provide  further  evidence  on 
whether  firms’  propensity  to  explain  disproportionate  inventory  increases  in  MD&A  is  greater 
after  the  passage  of  the  Sarbanes-Oxley  Act  and  whether  these  disclosures  do  a  better  job  in 
helping  users  interpret  inventory  changes.  Future  studies  could  also  examine  MD&A  disclosures 
on  other  financial  statement  accounts  (e.g.,  accounts  receivable,  property,  plant,  and  equipment 
[PPE])  to  gain  more  insights  into  the  incentives  behind  MD&A  disclosures  and  the  usefulness  of 
these  disclosures. 
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I.  The  Existence  of  MD&A  Disclosures  on  Inventory  Increases 

Description 

Existent  There  are  narrative  explanations  for  inventory  increases  in  MD&A. 

Absent  There  is  no  narrative  explanation  for  inventory  increases  in  MD&A. 


II.  The  Favorability  of  MD&A  Disclosures  on  Inventory  Increases  for  the  Subsample  of  Firms  that 


Explain  the  Inventory  Changes 


Description 


Favorable 


Neutral 


Unfavorable 


Anticipation  of  high  future  demand,  expectation  of  sales  growth,  new  product 
introduction  in  the  current  or  the  subsequent  year,  receipt  of  new  contract(s), 
addition  of  new  retail  store(s),  sales  growth  in  the  current  period.  Combination  of 
favorable  and  neutral  statements. 

Improvement  in  customer  service,  placement  of  more  trial  and  demonstration 
machines  at  customer  locations,  addition  of  new  showrooms,  an  increase  in  the 
amount  of  safety  stock  of  inventory,  a  shift  in  the  timing  of  inventory  purchase  or 
production,  an  increase  in  inventory  to  support  a  higher  level  of  production,  a 
volume  purchase  of  inventory  items  at  a  favorable  price,  a  planned  buildup  in 
inventory  in  anticipation  of  price  changes,  lower  than  normal  inventory  level  at  the 
end  of  the  previous  year. 

Lower  than  expected  sales,  a  decrease  in  orders,  a  decrease  in  sales,  cancellation  of 
orders,  products  returned  from  certain  customer(s)  or  distributor(s).  Combination  of 
unfavorable  and  neutral  statements. 


APPENDIX  B 

The  Classification  of  Forward-Looking  Statements  in  Form  10-K  on  a  Firm’s 

Performance  Outlook 


Favorable 


Neutral 

Unfavorable 


Description 

An  expected  increase  in  demand/product  price/sales/margins,  an  expected  decrease 
in  cost/expense,  new  product  introduction,  an  expected  increase  in  profitability, 
expected  positive  consequences  of  litigation  resolution  or  regulation  change,  etc. 

No  specific  prediction  or  a  combination  of  favorable  and  unfavorable  statements. 
An  expected  decrease  in  demand/product  price/sales/  margins,  an  expected  increase 
in  cost/expense,  an  expected  decrease  in  profitability,  expected  negative 
consequences  of  litigation  resolution  or  regulation  change,  etc. 
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ABSTRACT:  This  study  examines  how  firms’  asymmetric  cost  behavior  influences  ana¬ 
lysts’  earnings  forecasts,  primarily  the  accuracy  of  analysts’  consensus  earnings  fore¬ 
casts.  Results  indicate  that  firms  with  stickier  cost  behavior  have  less  accurate  analysts’ 
earnings  forecasts  than  firms  with  less  sticky  cost  behavior.  Furthermore,  findings  show 
that  cost  stickiness  influences  analysts’  coverage  priorities  and  investors  appear  to 
consider  sticky  cost  behavior  in  forming  their  beliefs  about  the  value  of  firms.  This  study 
integrates  a  typical  management  accounting  research  topic,  cost  behavior,  with  three 
standard  financial  accounting  topics  (namely,  accuracy  of  analysts’  earnings  forecasts, 
analysts’  coverage,  and  market  response  to  earnings  surprises). 

Keywords:  analysts;  earnings  forecasts;  sticky  costs;  asymmetric  cost  behavior. 

JEL  Classifications:  M41;  G12. 

I.  INTRODUCTION 

Management  accountants  have  traditionally  focused  on  cost  behavior  as  an  important 
aspect  of  profit  analysis  for  managers.  Financial  analysts,  however,  estimate  firms’ 
future  costs  in  the  process  of  forecasting  future  earnings.  Predicting  cost  behavior  is, 
therefore,  an  essential  part  of  earnings  prediction.  Yet,  a  potential  relationship  between  firms’  cost 
behavior  and  properties  of  analysts’  earnings  forecasts  has  not  yet  been  explored.  This  study 
integrates  the  management  and  financial  accounting  disciplines  by  showing  effects  of  cost  behav¬ 
ior  on:  (1)  the  accuracy  of  analysts’  consensus  earnings  forecasts,  (2)  the  extent  of  analyst  cover¬ 
age,  and  (3)  the  market  response  to  earnings  announcements. 

Focusing  on  cost  behavior,  I  build  on  the  concept  of  sticky  costs  (Anderson  et  al.  2003).  Costs 
are  termed  sticky  if  they  increase  more  when  activity  rises  than  they  decrease  when  activity  falls 
by  an  equivalent  amount.  A  firm  with  stickier  costs  shows  a  greater  decline  in  earnings  when  the 
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activity  level  falls  than  a  firm  with  less  sticky  costs.  The  reason  is  that  stickier  costs  result  in  a 
smaller  cost  adjustment  when  activity  level  declines  and,  therefore,  lower  cost  savings.  Lower  cost 
savings  result  in  a  greater  decrease  in  earnings.  This  greater  decrease  in  earnings  when  the  activity 
levels  fall  increases  the  variability  of  the  earnings  distribution,  resulting  in  less  accurate  earnings 
predictions. 

Results,  based  on  a  sample  of  44,931  industrial  firm  quarters  for  2,520  firms  from  1986 
through  2005,  indieate  that  sticky  cost  behavior  reduces  the  accuracy  of  analysts’  consensus 
earnings  forecasts  after  controlling  for  environmental  uncertainty,  the  amount  of  available  firm- 
specific  information,  the  forecast  horizon,  and  industry  effects. 

Classifying  costs  into  sticky  and  anti-sticky  costs,  ^  findings  show  that  analysts  absolute 
consensus  earnings  forecasts  for  firms  with  sticky  cost  behavior  are,  on  average,  25  percent  less 
accurate  than  those  for  firms  with  anti-sticky  cost  behavior.  Evidently,  cost  behavior  is  an  influ¬ 
ential  deternfinant  of  analysts’  forecast  accuracy.  The  results  are  robust  to  potential  managerial 
discretion  that  might  bias  the  cost  stickiness  measure  and  to  estimating  cost  stickiness  over  a  long 
time  window.  The  findings  extend  Banker  and  Chen  (2006),  who  show  that  cost  behavior  explains 
a  considerable  part  of  analysts’  advantage  over  time-series  models.  These  findings  are  also  useful 
for  investors  who  use  consensus  earnings  forecasts  to  value  firms,  as  they  suggest  that  stickier 
eosts  indicate  more  volatile  future  earnings. 

Addressing  the  extent  of  analyst  coverage,  I  examine  the  relationship  between  the  accuracy  of 
earnings  forecasts  and  the  extent  of  analyst  coverage.  While  Alford  and  Berger  (1999)  and  Weiss 
et  al.  (2008)  document  a  positive  relationship,  Barth  et  al.  (2001)  report  that  analysts  tend  to  prefer 
covering  firms  with  intangible  assets  characterized  by  volatile  performance.  Thus,  prior  evidence 
is  mixed  and  this  relationship  is  an  open  empirieal  issue.  I  find  that  firms  with  stickier  costs  (and 
less  accurate  earnings  forecasts)  have  lower  analyst  coverage  after  controlling  for  the  amount  of 
available  information,  environmental  uncertainty,  intensity  of  R&D  expenditures,  and  additional 
determinants  of  supply  and  demand  for  analysts’  forecasts  reported  in  the  literature  (e.g.,  Bhushan 
1989;  Lang  and  Lundholm  1996).  Findings  indicate  that  firms’  cost  behavior  affects  analysts’ 
coverage  priorities. 

Finally,  I  examine  whether  investors  behave  as  if  they  understand  cost  stickiness  in  respond¬ 
ing  to  earnings  announcements.  As  earnings  predictability  decreases,  reported  earnings  provide 
less  useful  information  for  the  prediction  of  future  earnings,  such  that  the  earnings  response 
coefficient  decreases  (e.g.,  Lipe  1990).  If  investors  recognize  cost  stickiness  to  some  extent,  being 
aware  that  cost  stickiness  diminishes  the  accuracy  of  the  analysts’  earnings  forecasts,  then  stickier 
cost  behavior  causes  investors  to  rely  less  on  realized  earnings  information  because  of  its  lower 
predictive  power.  Similarly,  I  find  a  weaker  market  response  to  earnings  surprises  for  firms  with 
stickier  cost  behavior.  Overall,  findings  indicate  that  cost  behavior  matters  in  forming  investors’ 
beliefs  regarding  the  value  of  the  firm. 

This  empirical  examination  is  facilitated  by  a  new  measure  of  cost  stickiness  at  the  firm  level 
developed  in  this  study.  I  estimate  the  difference  in  cost  function  slopes  between  upward  and 
downward  activity  adjustments.  While  Anderson  et  al.  (2003)  and  subsequent  studies  use  cross- 
sectional  and  time-series  regressions  to  estimate  cost  stickiness,^  the  new  measure  developed  here 
puts  less  demand  on  the  data  and  allows  for  testing  the  sensitivity  of  results  to  key  cost  model 
assumptions.  The  new  measure  eorroborates  prior  evidence  on  variation  among  firms’  cost  sticki- 


'  Costs  are  termed  anti-sticky  if  they  increase  less  when  activity  rises  than  they  decrease  when  activity  falls  by  an 
equivalent  amount.  See  examples  in  Balakrishnan  et  al.  (2004)  and  the  discussion  in  Section  II. 

^  See,  for  instance,  Banker  et  al.  (2008)  and  Anderson  and  Lanen  (2007). 
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ness  and  provides  room  for  estimating  cost  stickiness  of  firms  operating  in  industries  with  a  small 
number  of  firms,  which  limits  a  meaningful  estimation  of  regression  models.^ 

This  study  expands  the  scope  of  cost  behavior  research.  Traditionally,  cost  behavior  has 
attracted  the  attention  of  management  accountants  interested  in  decision-making  and  control.  The 
current  results  show  that  financial  analysts  benefit  from  understanding  cost  behavior  as  well. 
Further,  these  findings  contribute  to  our  understanding  of  how  analysts  use  public  information 
reported  in  financial  statements  to  recognize  cost  behavior  (e.g.,  Abarbanell  and  Bushee  1997; 
Brown  et  al.  1987). 

In  sum,  this  study  integrates  a  typical  management  accounting  research  topic,  cost  behavior, 
with  three  standard  financial  accounting  topics.  The  importance  of  integrating  the  two  streams  of 
research  has  long  been  recognized  (Hemmer  and  Labro  2008). 

Hypotheses  are  developed  in  Section  II,  the  research  design  is  described  in  Section  III,  and  the 
empirical  results  are  in  Section  IV.  Section  V  offers  a  concluding  remark  on  the  prospects  of 
integrating  management  and  financial  accounting  research. 

II.  DEVELOPMENT  OF  HYPOTHESES 

Despite  the  wide  interest  in  analysts’  earnings  forecasts,  prior  research  has  not  yet  investigated 
the  relationship  between  firms’  cost  behavior  and  properties  of  analysts’  earnings  forecasts,  not¬ 
withstanding  the  essential  part  that  cost  prediction  plays  in  the  process  of  earnings  prediction. 
Prior  empirical  studies  provide  evidence  that  the  accuracy  of  analysts’  earnings  forecasts  increases 
in  the  amount  of  information  available  regarding  the  firm  (Atiase  1985;  Lang  and  Lundholm 
1996),  increases  in  firm  size  but  not  in  firm  complexity  (Brown  et  al.  1987),  and  decreases  in  the 
level  of  uncertainty  in  the  firm’s  production  environment  (Parkash  et  al.  1995).  Later  work  by 
Banker  and  Chen  (2006)  reports  that  cost  behavior  explains  a  considerable  portion  of  the  analysts’ 
advantage  in  earnings  prediction  over  various  time-series  models. 

The  recently  developed  concept  of  sticky  costs  provides  a  compelling  setting  for  exploring 
why  and  how  cost  behavior  affects  the  accuracy  of  analysts’  earnings  forecasts.  Sticky  costs 
indicate  that  costs  tend  to  “stick”  and  hence  do  not  go  away  when  activity  levels  decline.  Bal- 
akrishnan  and  Gruca  (2008)  report  that  hospital  administrators  find  it  hard  and  expensive  to  adjust 
capacity  level  of  core  activities  downward  resulting  in  sticky  costs.  Banker  and  Chen  (2006) 
improve  earnings  predictions  by  estimating  the  excessive  costs  incurred  due  to  costs  being  sticky 
when  sales  decrease.  Anderson  et  al.  (2007)  report  that  cost  stickiness  causes  the  ratio  of  SG&A 
costs  to  sales  to  increase,  rather  than  decrease  proportionally  with  sales,  when  revenue  declines. 
Overall,  prior  studies  perceive  costs  as  sticky  if  firms  incur  disproportionate  costs  when  activity 
levels  decline. 

I  build  on  Balakrishnan  et  al.  (2004)  to  illustrate  the  intuition  underlying  the  relationship 
between  the  extent  of  cost  stickiness  and  the  accuracy  of  analysts’  earnings  forecasts.  Balakrishnan 
et  al.  (2004)  argue  that  the  level  of  capacity  utilization  affects  managers’  response  to  a  change  in 
activity  level.  Thus,  if  capacity  utilization  is  high,  the  firm’s  managers  are  not  likely  to  immedi¬ 
ately  cut  resources  in  response  to  a  decrease  in  activity  level  because  the  decrease  may  be  tem¬ 
porary.  However,  an  increase  in  activity  level  under  high-capacity  utilization  is  likely  to  cross  the 
available  resource  threshold  and  trigger  an  increase  in  resources  supplied.  Assuming  high-capacity 
utilization,  the  response  to  a  decrease  in  activity  level  will  be  smaller  than  the  response  to  a  similar 
increase  in  activity  level,  resulting  in  sticky  costs — depicted  by  the  bold  line  in  Figure  1 . 

By  contrast,  suppose  the  same  firm  experiences  excess  capacity.  Its  managers  are  likely  to  use 
the  slack  to  absorb  the  demand  from  an  increase  in  activity  level.  However,  an  additional  decrease 


^  For  instance,  Banker  and  Chen  (2006)  exclude  from  their  sample  four-digit  SIC  code  industries  with  less  than  20  firms. 
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FIGURE  1 
Cost  Asymmetry 


The  figure  depicts  sticky  and  anti-sticky  cost  functions  based  on  Balakrishnan  et  al.’s  (2004)  example.  The  bold 
cost  function  illustrates  sticky  costs  assuming  that  activity  level  Yq  is  high-capacity  utilization.  The  dashed  cost 
function  illustrates  anti-sticky  costs  assuming  excess  capacity  for  activity  level  Yq. 


in  activity  level  is  interpreted  as  confirming  a  permanent  reduction  in  demand  and  triggers  a 
response.  Assuming  excess  capacity,  the  cost  response  to  an  activity  level  decrease  exceeds  the 
cost  response  to  a  similar  increase  in  activity  level,  resulting  in  anti-sticky  costs— depicted  by  the 
dashed  line  in  Figure  1. 

Next,  I  build  on  Balakrishnan  et  al.  (2004)  to  illustrate  that  stickier  costs  result  in  greater 
earnings  variability.  Higher  capacity  utilization  yields  stickier  costs,  resulting  in  lower  cost  savings 
and  a  greater  decrease  in  profits  when  sales  decrease.'^  Assuming  that  the  rest  of  the  distribution  of 
profits  is  unchanged,  this  greater  decrease  in  profits  increases  the  variability  of  the  ex  ante  profit 
distribution. 

Now,  suppose  an  analyst  predicts  future  profits.  For  simplicity,  I  assume  that  future  activity 
level  will  either  increase  or  decrease  by  an  equivalent  amount  with  equal  probability.  I  further 
suppose  that  the  analyst  recognizes  cost  behavior  to  a  reasonable  extent  and  assume  that  the 
analyst  forecasts  expected  profit  (e.g.,  Ottaviani  and  Sorensen  2006).  Other  things  equal,  sticky 
costs  result  in  lower  profits  when  the  activity  level  declines  than  anti-sticky  costs  do.  Assuming 
equal  profits  when  activity  levels  rise,  the  analyst’s  profit  forecast  is  lower  under  sticky  costs  than 


The  terms  profits  and  earnings  are  used  interchangeably  in  this  study. 
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under  anti-sticky  costs.  For  that  reason,  the  absolute  forecast  error  when  activity  levels  decline  as 
well  as  when  activity  levels  rise  is  greater  under  sticky  costs  than  under  anti-sticky  costs.  Figure 
2  depicts  lower  profits  under  sticky  costs  (assuming  high-capacity  utilization)  than  under  anti- 
sticky  costs  (assuming  excess  capacity)  when  activity  levels  decline:  point  G  is  below  point  E. 
Lower  profits  are  expected  and  forecasted  under  sticky  costs  than  under  anti-sticky  costs;  see  profit 
levels  FC  and  DB,  respectively.  The  absolute  forecast  errors  under  sticky  costs  are  larger  than 
under  anti-sticky  costs:  AC  >  AB  when  the  activity  level  increases  and  FG  >  DE  when  the 
activity  level  decreases.  The  example  demonstrates  that  the  absolute  forecast  errors  increase  with 
the  extent  of  cost  stickiness. 


FIGURE  2 

Absolute  Forecast  Errors  are  Greater  in  the  Presence  of  Sticky  Costs  Than  in  the  Presence  of 

Anti-Sticky  Costs 


Profit 


The  figure  depicts  two  profit  functions  following  Balakrishnan  et  al.  (2004).  The  bold  profit  function  assumes 
sticky  costs  and  the  dashed  profit  function  assumes  anti-sticky  costs.  When  the  activity  level  declines  from  Yq  to 
Yl,  profits  are  lower  under  sticky  costs  (assuming  high-capacity  utilization)  than  under  anti-sticky  costs  (as¬ 
suming  excess  capacity):  point  G  is  below  point  E.  Expected  profits  are  lower  under  sticky  costs  than  under 
anti-sticky  costs:  see  profit  levels  FC  and  DB,  respectively.  The  absolute  forecast  errors  under  sticky  costs  are 
larger  than  under  anti-sticky  costs:  AC  >  AB  on  activity  level  increases  and  FG  >  DE  on  activity  level  decreases. 
Overall,  the  absolute  forecast  errors  when  activity  levels  decline,  as  well  as  when  activity  levels  increase,  are 
greater  under  sticky  costs  than  under  anti-sticky  costs. 
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The  above  example  is  used  to  illustrate  the  intuition  of  the  relationship  between  the  extent  of 
cost  stickiness  and  the  accuracy  of  earnings  forecasts.  This  relationship  between  the  extent  of  cost 
stickiness  and  the  absolute  forecast  errors  is  modeled  in  the  Appendix  (consistent  with  Equation 
(5)  in  Banker  and  Chen  [2006]).  The  Proposition  presented  in  the  Appendix  indicates  a  positive 
relationship  between  the  extent  of  cost  stickiness  and  the  absolute  forecast  errors.  Accordingly,  my 
first  hypothesis  is:^ 

HI:  Increased  cost  stickiness  reduces  the  accuracy  of  analysts’  consensus  earnings  forecasts. 

Prior  literature  documents  a  relationship  between  the  accuracy  of  analysts’  earnings  forecasts 
and  the  extent  of  analyst  coverage  (e.g.,  Alford  and  Berger  1999).  Recently,  Weiss  et  al.  (2008, 
Table  7)  report  that  firms  with  high  analyst  coverage  have  more  accurate  earnings  forecasts  than 
firms  with  low  analyst  coverage.  Stickel  (1992)  reports  that  members  of  the  Investor  All-American 
Research  Team  have  more  accurate  forecasts  than  non-members.  Analysts  who  find  this  compe¬ 
tition  to  be  of  major  importance  are  likely  to  prefer  covering  firms  with  less  sticky  cost  behavior 
to  achieve  greater  expected  accuracy.^ 

However,  Barth  et  al.  (2001)  report  high  coverage  of  firms  with  intangible  assets,  character¬ 
ized  by  low  earnings  predictability  and  high  earnings  forecasts  errors.  While  analysts  are  moti¬ 
vated  to  provide  investors  with  more  accurate  earnings  forecasts,  they  may  not  shy  away  from 
following  a  firm  with  low  earnings  predictability  if  they  have  an  information  advantage  with 
respect  to  that  firm  or  if  the  demand  for  forecasts  is  higher  for  that  firm.  In  sum,  the  empirical 
evidence  on  the  relationship  between  the  accuracy  of  analysts’  earnings  forecasts  and  the  extent  of 
analyst  coverage  is  mixed,  and  it  remains  an  open  empirical  issue. 

I  examine  whether  sticky  cost  behavior  influences  the  extent  of  analyst  coverage.  Sticky  cost 
behavior  will  influence  analysts’  coverage  priorities  if  they  recognize  the  relationship  between  cost 
stickiness  and  accuracy  of  earnings  forecasts  hypothesized  above.  I  test  a  potential  relationship 
between  sticky  cost  behavior  and  the  extent  of  analyst  coverage  after  controlling  for  the  intensity 
of  research  and  development,  the  amount  of  available  information,  firm  size,  environmental  un¬ 
certainty,  and  for  additional  determinants  of  supply  and  demand  for  analysts’  forecasts  reported  in 
the  literature.  Because  there  is  no  ex  ante  basis  for  a  prediction,  the  corresponding  hypothesis  is 
stated  in  the  null  form: 

H2:  Sticky  cost  behavior  does  not  affect  analyst  coverage. 

Finally,  I  examine  whether  investors  recognize  cost  behavior.  If  investors  have  some  under¬ 
standing  that  firms  with  stickier  costs  tend  to  have  less  accurate  earnings  forecasts,  then  cost 
behavior  is  likely  to  influence  their  response  to  surprises  in  earnings  announcements.  As  earnings 
predictability  decreases,  reported  earnings  provide  less  useful  information  for  valuation  and  pre¬ 
diction  of  future  earnings,  resulting  in  a  lower  earnings  response  coefficient  (e.g.,  Lipe  1990). 
Abarbanell  et  al.  (1995)  show  that  the  eamings-price  response  coefficient  increases  in  the  forecast 
precision.  If  investors  recognize  that  cost  stickiness  diminishes  the  accuracy  of  the  analyst’s 


^  To  clarify,  several  studies  report  a  variation  in  the  level  of  cost  stickiness — see  Balakrishnan  and  Gruca  (2008),  Banker 
et  al.  (2008),  Anderson  and  Lanen  (2007),  and  Balakiishnan  and  Soderstrom  (2009).  These  studies  offer  explanations  for 
both  sticky  and  anti-sticky  cost  behavior  (e.g.,  ownership,  pessimism/optimism  with  respect  to  future  demand,  core 
activity).  In  this  study,  I  build  on  this  variation  and  show  a  relationship  between  the  level  of  cost  stickiness  and 
properties  of  analysts’  earnings  forecasts. 

®  Examining  factors  associated  with  the  extent  of  analyst  coverage,  Bhushan  (1989)  finds  that  the  number  of  analysts 
covering  a  firm  increases  in  firm  size,  O’Brien  and  Bhushan  (1990)  report  greater  coverage  in  industries  with  stringent 
disclosure  requirements,  and  Lang  and  Lundholm  (1996)  claim  greater  coverage  of  firms  with  more  informative  dis¬ 
closure  policies.  Frankel  et  al.  (2006),  however,  find  no  relation  between  the  informativeness  of  the  analysts’  forecasts 
and  the  size  of  the  analyst  following. 
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earnings  forecasts,  then  stickier  cost  behavior  causes  investors  to  rely  less  on  realized  earnings 
information  because  of  its  low  predictive  power.  The  third  hypothesis  summarizes  the  argument: 

H3:  The  market  response  to  earnings  surprises  is  weaker  for  firms  with  stickier  cost  behavior. 

If  H3  holds,  then  it  suggests  that  investors  have  some  appreciation  of  the  role  of  cost  behavior 
in  determining  earnings  surprises.  In  other  words,  this  hypothesis  predicts  that  cost  behavior 
matters  in  forming  investors’  beliefs  regarding  the  value  of  firms. 

III.  RESEARCH  DESIGN 

Focusing  on  asymmetric  cost  behavior,  this  study  proposes  a  new  measure  of  cost  stickiness 
at  the  firm  level.  Prior  studies  use  a  cross-sectional  regression  model  to  estimate  cost  stickiness  at 
the  industry  level  or  a  time-series  regression  model  to  estimate  it  at  the  firm  level  (e.g.,  Anderson 
et  al.  2003).  Taking  a  different  path,  this  study  introduces  a  direct  measure  of  cost  stickiness  at  the 
firm  level.  I  estimate  the  difference  between  the  rate  of  cost  decrease  for  recent  quarters  with 
decreasing  sales  and  the  corresponding  rate  of  cost  increase  for  recent  quarters  with  increasing 
sales: 


STICKY  =  log 


^COST\ 


^COST\ 

^SALEj^-^ 


r,re{t,  .  .  ,t-3}, 


where  r  is  the  most  recent  of  the  last  four  quarters  with  a  decrease  in  sales  and  r  is  the  most  recent 
of  the  last  four  quarters  with  an  increase  in  sales,  iSSALEn^  SALEu-  SALEu_i,  (Compustat  #2), 
ACOSTi,  =  (SALEit  -  EARNINGStt)  -  (SALE;  ,_i  -  LAEMAGS,- ,_i),  and  EARNINGS  is  income  be¬ 
fore  extraordinary  items  (Compustat  #8). 

STICKY  is  defined  as  the  difference  in  the  cost  function  slope  between  the  two  most  recent 
quarters  from  quarter  t-3  through  quarter  t,  such  that  sales  decrease  in  one  quarter  and  increase  in 
the  other.  If  costs  are  sticky,  meaning  that  they  increase  more  when  activity  rises  than  they 
decrease  when  activity  falls  by  an  equivalent  amount,  then  the  proposed  measure  has  a  negative 
value.  A  lower  value  of  STICKY  expresses  more  sticky  cost  behavior.^  That  is,  a  negative  (posi¬ 
tive)  value  of  STICKY  indicates  that  managers  are  less  (more)  inclined  to  respond  to  sales  drops  by 
reducing  costs  than  they  are  to  increase  costs  when  sales  rise. 

Following  prior  sticky  costs  studies,  STICKY  uses  a  change  in  sales  as  an  imperfect  proxy  for 
the  actual  activity  change  because  changes  in  activity  level  are  not  observable.  Employing  sales  as 
a  fundamental  stochastic  variable  is  in  line  with  Dechow  et  al.  (1998),  who  suggest  a  model  of 
earnings,  cash  flow,  and  accruals,  assuming  a  random  walk  sales  process.  Banker  and  Chen  (2006) 
also  use  sales  as  a  fundamental  stochastic  variable  for  predicting  future  earnings. 

Since  analysts  estimate  total  costs  in  the  process  of  earnings  prediction,  the  stickiness  measure 
concentrates  on  total  costs  to  gain  insights  into  a  potential  relationship  between  stickiness  of  total 
costs  and  the  accuracy  of  analysts’  earnings  predictions.  Investigating  how  cost  stickiness  affects 
analysts’  earnings  forecasts,  I  use  sales  minus  earnings.  Employing  total  costs  for  the  analysis  also 
eliminates  managerial  discretion  in  cost  classifications  (Anderson  and  Lanen  2007).  I  also  assume 
that  costs  increase  in  activity  level  (as  in  the  adjustment  costs  model  presented  in  the  Appendix). 
This  assumption  means  that  costs  move  in  the  same  direction  as  activity  and  precludes  cost 
increases  when  activity  falls  and  cost  decreases  when  activity  increases  (Anderson  and  Lanen 
2007).  Eor  this  reason,  I  do  not  use  observations  with  costs  that  move  in  opposite  directions  in 
estimating  STICKY.  The  ratio  form  and  logarithmic  specification  make  it  easier  to  compare  vari- 


’  The  estimate  of  STICKY  is  consistent  with  the  sign  of  the  parameter  a,  as  defined  in  the  model  presented  in  the 
Appendix.  The  sign  of  STICKY  is  also  consistent  with  the  stickiness  measure,  I32<  in  Anderson  et  al.  (2003,  Model  I). 
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ables  across  firms,  as  well  as  alleviating  potential  heteroscedasticity  (Anderson  et  al.  2003). 

The  proposed  measure  has  several  advantages.  First,  and  most  important  for  this  study, 
STICKY  estimates  cost  asymmetry  at  the  firm  level.  Thus,  it  provides  means  of  investigating  how 
cost  behavior  influences  analysts’  earnings  forecasts.  Moreover,  it  allows  for  a  large-scale  study 
without  restricting  the  analysis  to  firms  with  at  least  ten  valid  observations  and  at  least  three  sales 
reductions  during  the  sample  period  (see  Anderson  et  al.  2003,  56). 

Second,  by  design,  the  stickiness  of  a  linear  cost  function  is  zero,  i.e.,  STICKY  =  0,  for  a 
traditional  fixed-variable  cost  model  with  a  constant  slope  for  all  activity  levels  within  a  relevant 
range.  Thus,  a  zero  value  indicates  that  managers  change  costs  symmetrically  in  response  to  sales 
increases  and  declines. 

Third,  the  proposed  cost  stickiness  measure  has  a  wider  scope  than  that  used  by  Anderson  et 
al.  (2003)  because  it  accounts  for  a  difference  between  proportions  of  cost  changes  and  allows  for 
cost  friction  with  respect  to  sales  increases.  For  instance,  Chen  et  al.  (2008,  2)  argue  that  empire¬ 
building  incentives  are  “likely  to  lead  managers  to  increase  SG&A  costs  too  rapidly  when  demand 
increases.”  They  report  a  positive  association  between  managerial  empire  building  incentives  and 
the  degree  of  cost  asymmetry.  STICKY  affords  an  examination  of  how  cost  asymmetry  affects  the 
forecast  accuracy  in  the  presence  of  decreases  in  sales  (i.e.,  as  presented  by  Anderson  et  al. 
[2003])  and  in  the  presence  of  increases  in  sales. 

Nonetheless,  there  are  potential  measurement  errors  in  the  suggested  cost  stickiness  metric. 
First,  the  model  assumes  a  piecewise  linear  specification  of  the  cost  function  within  the  relevant 
range  of  activity,  which  simplifies  the  analysis  and  allows  for  measuring  cost  stickiness  when  the 
upward  and  downward  activity  changes  do  not  have  the  same  magnitude.  This  approximation  is 
consistent  with  prior  studies  on  sticky  costs  and  is  reasonable  in  the  context  of  investigating  a 
relationship  between  attributes  of  cost  behavior  and  properties  of  analysts’  forecasts. 

Second,  the  model  assumes  a  realization  of  an  exogenous  state  of  the  world  that  determines 
activity  level.  However,  growth  or  reduction  in  activity  can  occur  not  only  because  of  changes  in 
activity  level,  but  also  because  of  changes  in  prices  of  products  or  resources  or  other  managerial 
choices  (Anderson  and  Lanen  2007).  I  restrict  the  sample  to  competitive  industrial  firms  to  mini¬ 
mize  this  problem,  and  later  test  the  sensitivity  of  results  to  potential  managerial  discretion. 

To  check  consistency  with  prior  literature,  I  compute  the  suggested  measure  for  two  major 
cost  categories  investigated  in  prior  literature.  Specifically,  COGS-STICKY  and  SGA-STICKY  sub¬ 
stitute  changes  in  total  costs  with  changes  in  cost  of  goods  sold,  hereafter  COGS  (Compustat  #30) 
and  SGA  costs  (Compustat  #1),  respectively.  The  median  proportion  of  COGS  and  SGA  to  sales  in 
my  sample  is  64.7  percent  and  23.1  percent,  respectively.  However,  the  accounting  classification 
of  COGS  and  SGA  is  open  to  managerial  judgment,  which  may  introduce  bias  into  the  cost 
stickiness  estimate  of  specific  cost  components.  The  results  should  be  interpreted  in  light  of  this 
limitation.  Taken  as  a  whole,  the  stickiness  measure  is  expected  to  provide  broad  insights  into  the 
relationship  between  cost  behavior  and  properties  of  analysts’  earnings  forecasts. 

Measuring  the  accuracy  of  the  analyst  consensus  forecast,  I  employ  the  mean  absolute  earn¬ 
ings  forecast  errors  as  an  inverse  accuracy  measure.  This  accuracy  gauge  has  been  extensively 
used  in  the  accounting  literature  (e.g.,  Lang  and  Lundholm  1996).  Thus,  the  forecast  error  is 
defined  as: 

actual  EPSii  —  analyst  consensus  forecastii 


and  the  absolute  forecast  error  is  ABS-FEii=  where  the  analyst  consensus  forecast  is  the 


In  measuring  skewness  of  firm-specific  earnings  distributions,  Gu  and  Wu  (2003)  require  each  firm  to  have  at  least  16 
quarterly  observations. 
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mean  of  analyst  forecasts  for  firm  i  and  quarter  t  announced  in  the  month  immediately  preceding 
that  of  the  earnings  announcement.  The  relatively  narrow  time  window  and  the  short  forecast 
horizon  control  for  the  timeliness  of  the  forecasts  and  mitigate  a  potential  trade-off  between  timing 
and  accuracy  (Clement  and  Tse  2003). 

Testing  HI 

In  testing  whether  stickier  cost  behavior  results  in  greater  mean  absolute  analyst  consensus 
earnings  forecast  error,  I  control  for  the  amount  of  available  firm-specific  information,  the  inherent 
uncertainty  in  the  operations  environment,  and  the  forecast  horizon.  The  literature  reports  that  an 
increased  amount  of  available  firm-specific  information  reduces  the  forecast  error.  The  amount  of 
information  acquired  by  analysts  is  positively  related  to  firm  size  (Atiase  1985;  Collins  et  al.  1987; 
Bhushan  1989).  Accordingly,  I  use  firm  size  as  a  control  variable  and  expect  a  negative  coefficient. 
Brown  (2001)  reports  a  disparity  between  the  magnitude  of  earnings  surprises  of  profits  and 
losses.  I  use  an  indicator  variable  to  control  for  losses  because  they  reflect  more  timely  informa¬ 
tion  and  are  associated  with  larger  absolute  forecast  errors  than  are  profits.  I  also  follow  Matsu- 
moto  (2002)  and  control  for  potential  earnings  guidance,  which  is  likely  to  reduce  the  forecast 
error  if  it  results  in  meeting  or  slightly  beating  the  consensus  earnings  forecast. 

Environmental  uncertainty  is  likely  to  influence  the  forecast  accuracy.  If  the  business  envi¬ 
ronment  is  highly  volatile,  then  one  will  expect  larger  forecast  errors.  I  use  two  proxies  for  the 
level  of  environmental  uncertainty.  The  first  is  the  coefficient  of  variation  in  sales,  which  directly 
captures  sales  volatility.  The  second  is  analyst  forecast  dispersion,  which  measures  complementary 
uncertainty  aspects  of  firms’  earnings  (Barron  et  al.  1998).  Brown  et  al.  (1987)  and  Wiedman 
(1996)  report  that  the  accuracy  of  analysts’  forecasts  decreases  in  the  dispersion  of  analysts’ 
forecasts,  which  is  used  to  proxy  for  the  variance  of  information  observations. 

In  addition,  management  accounting  textbooks  (e.g.,  Maher  et  al.  2006)  present  cost-volume- 
profit  analysis  and  suggest  that  firms  with  a  high  operating  leverage  are  likely  to  exhibit  high 
earnings  volatility.  Lev  and  Thiagarajan  (1993)  employ  profit  margin  as  a  proxy  for  operating 
leverage.  In  an  early  study,  Adar  et  al.  (1977)  present  a  positive  relationship  between  profit  margin 
and  forecast  error  in  a  cost-volume-profit  under  uncertainty  setting.  Operating  leverage  varies 
across  firms  and  is  likely  to  depend  on  the  firm-specific  business  environment,  as  well  as  current 
macroeconomic  conditions.  The  higher  the  operating  leverage  of  the  firm,  the  higher  is  the  ex¬ 
pected  error  in  the  analysts’  earnings  forecast.  Therefore,  I  predict  a  positive  relationship  between 
operating  leverage  and  the  magnitude  of  analysts’  earnings  forecast  errors. 

I  also  control  for  unexpected  contemporaneous  seasonal  shocks  to  earnings.  An  indicator 
variable,  SEASON,  indicates  firm  quarters  with  a  positive  change  in  earnings  from  the  same 
quarter  in  the  prior  year.  This  variable  controls  for  the  relation  between  the  change  in  earnings  and 
the  forecast  error  (Matsumoto  2002).  A  positive  coefficient  estimate  is  predicted. 

I  estimate  the  following  three  cross-sectional  regression  models  with  two-digit  SIC  code 
industry  effects; 

Model  1(a) 

ABS-FEit  =  (3o  +  p^STICKYit  +  +  fS^LOSSi,  +  jS^DOWNi,  +  jlsVSALEi,  +  ^^DISPu 

+  jSjOPLEVit  -I-  /S^SEASONii  -t- 


Model  1(b) 


ABS-FEi,  =  ySo  +  P^COGS-STICKYij  +  +  IB^LOSSu  +  +  PsVSALEu 


+  IS(,DISPit  +  /SjOPLEVi,  +  /SsSEASONi,  +  s^, 
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Model  1(c) 


ABS-FEi,  =  /3q  +  /3iSGA-STICKYn  +  +  p^WSSit  +  (i^DOWNi^  +  p^VSALE^,  +  /S^DISPn 


+  PjOPLEVi,  +  ^^SEASONi,  +  e,-,; 


where: 

log  of  market  value  of  equity  (Compustat  #61  X  #14)  at  quarter  end; 
indicator  variable  that  equals  1  if  the  reported  earnings  (Compustat  #8)  are 
negative,  and  0  otherwise; 

as  defined  in  Matsumoto  (2002)  and  equals  1  if  unexpected  earnings  forecasts 
are  negative,  and  0  otherwise; 

coefficient  of  variation  of  sales  measured  over  four  quarters  from  t-3  through  t; 
standard  deviation  of  the  analysts’  forecasts  announced  for  firm  i  and  quarter  t  in 
the  month  immediately  preceding  that  of  the  earnings  announcement,  deflated  by 
stock  price  at  the  end  of  quarter  t- 1 ; 

ratio  between  SALE,,,  minus  COGS  (Compustat  #30)  and  SALEi,;  values  below  0 
or  above  1  are  winsorized;  and 

indicator  variable  that  equals  1  if  the  change  in  earnings  from  the  same  quarter 
in  the  prior  year  (Compustat  #8)  is  positive,  and  0  otherwise. 

If  the  above  metric  captures  cost  stickiness,  then  HI  predicts  /3i  <  0  in  all  three  models, 
where  lower  values  of  STICKY  (COGS-STICKY,  SGA-STICKY)  indicate  stickier  cost  behavior. 

I  further  test  the  sensitivity  of  the  results  to  the  model’s  assumptions  and  potential  measure¬ 
ment  errors  in  three  ways.  First,  I  test  the  sensitivity  of  the  cost  stickiness  measure  to  a  longer  time 
window.  I  compute  the  ratio  of  change  in  total  costs  to  change  in  sales  using  data  from  the  most 
recent  eight  quarters,  t-1  through  1. 1  then  estimate  M-STICKY,  that  is,  the  difference  between  the 
mean  slope  under  downward  adjustments  and  the  mean  slope  under  upward  adjustments.  Thus, 
M-STICKY  accounts  for  downward  adjustments  and  upward  adjustments  made  over  eight  quarters. 
Comparing  M-STICKY  with  STICKY  provides  insights  into  the  perseverance  of  firms’  cost  behav¬ 
ior  over  a  longer  window.  To  check  the  robustness  of  the  coefficient  estimates,  1  estimate  the 
following  regression  model: 

Model  1(d) 


MV,,  = 
LOSS,,  = 

DOWN,,  = 

VSALE,,  = 
DISP„  = 

OPLEV,,  = 
SEASON,,  = 


ABS-FE,,  =  l3o  +  p, M-STICKY,,  +  ^82^^, 7  +  P^LOSS,,  +  (S^DOWN,,  -h  (S^VSALE,,  +  IS(,DISP„ 

+  fijOPLEV,,  +  P^SEASON,,  -l-  s„. 

Again,  if  the  above  metric  captures  the  cost  stickiness,  then  HI  predicts  ySj  <  0  in  model  1(d). 

Second,  I  conduct  a  limited  examination  of  the  effects  of  cost  stickiness  generated  by  past 
decisions,  such  as  technology  choice  and  labor  compensation  contracts,  on  absolute  forecast  er¬ 
rors.  Specifically,  I  consider  two  forms  of  managerial  discretion:  current  decisions  made  in  re¬ 
sponse  to  realized  market  conditions  in  the  current  quarter  t,  and  past  decisions  made  over  quarters 
prior  to  quarter  t.  I  view  adjustments  of  activity  levels  as  responses  in  reaction  to  realized  market 
conditions,  in  contrast  to  prior  decisions.  Substituting  STICKY,  , _i  for  STICKY,  ,  allows  for  esti¬ 
mating  the  impact  of  past  decisions  only.  In  other  words,  STICKY,  , proxies  for  the  extent  of  cost 
stickiness  in  an  earlier  quarter,  excluding  all  managerial  discretionary  choices  made  in  quarter  t, 
such  as  price  discounts  or  accrual  manipulations. 
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Model  1(e) 


ABS-FEii  =  /3o  +  (3iSTICKYi,_i  +  p2MVit  +  P^LOSSi^  +  jA^DOWNi,  +  td^VSALEi,  +  /3eDISPi, 
+  (djOPLEVi,  +  /3^SEAS0Ni,  +  e,,. 


As  before,  the  hypothesis  predicts  ySj  <  0  in  model  1(e). 

Third,  I  collect  evidence  concerning  the  assumption  that  analysts  understand  cost  behavior  to 
some  extent.  If  analysts  ignore  cost  stickiness  when  it  exists,  then  their  earnings  forecasts  will  be 
upward  biased.  Similarly,  if  analysts  ignore  anti-sticky  costs  when  they  exist,  then  their  earnings 
forecasts  will  be  downward  biased.  However,  if  analysts  understand  cost  behavior,  then  the  mean 
forecast  error  (not  absolute  error)  for  firms  with  sticky  costs  as  well  as  for  firms  with  anti-sticky 
costs  should  not  be  affected  by  the  level  of  cost  stickiness.^  In  other  words,  if  analysts  recognize 
cost  behavior,  then  the  extent  of  cost  stickiness  will  not  influence  the  mean  signed  forecast  error. 

Testing  H2 

To  test  the  association  between  cost  stickiness  and  analyst  coverage,  I  regress  the  number  of 
analysts  following  a  firm  on  its  cost  stickiness  and  control  variables.  The  analyses  include  inde¬ 
pendent  variables  to  control  for  the  amount  of  available  information,  environmental  uncertainty, 
the  intensity  of  research  and  development  expenditures,  additional  determinants  of  supply  and 
demand  for  analysts’  forecasts  reported  in  the  literature,  and  year  effects  and  two-digit  SIC  code 
industry  effects. 

Prior  literature  reports  that  firm  size  is  a  primary  determinant  of  analyst  coverage  (Bhushan 
1989;  Hong  et  al.  2000;  Das  et  al.  2006),  perhaps  because  large  firms  have  more  available 
firm-specific  information  than  small  firms  (Collins  et  al.  1987).  The  extent  of  information  asym¬ 
metry  between  managers  and  investors  is  likely  to  enhance  the  demand  for  earnings  forecasts,  but 
analysts  are  required  to  invest  more  resources  in  acquiring  information.  I  use  research  and  devel¬ 
opment  expenditures  as  a  proxy  for  information  asymmetry  because  firms  with  more  intangible 
assets  exhibit  greater  information  asymmetry  (Barth  et  al.  2001;  Barron  et  al.  2002). 

Controlling  for  uncertainty  in  the  forecasting  environment,  I  employ  the  coefficient  of  varia¬ 
tion  in  sales  as  a  direct  measure  for  shocks  in  demand.  In  addition,  analyst  forecast  dispersion  and 
the  absolute  forecast  error  in  the  prior  quarter  are  included  to  measure  other  aspects  of  the 
uncertainty  in  firms’  earnings  (Brown  et  al.  [1987]  and  Matsumoto  [2002],  respectively).  Das  et  al. 
(1998)  argue  that  analysts  extract  higher  rents  by  following  less  predictable  firms,  because  demand 
for  private  information  is  the  highest  for  these  firms,  but  the  accuracy  of  the  forecasts  is  expected 
to  be  lower.  Thus,  the  net  effect  of  uncertainty  in  the  forecasting  environment  on  an  increase  in  the 
extent  of  analyst  following  is  ambiguous. 

I  also  control  for  growth  and  trading  volume  (I.ang  and  Lundholm  1996),  which  provide 
analysts  with  greater  incentives  to  cover  firms.  Finally,  Baik  (2006)  argues  that  firms  experiencing 
financial  distress  appear  to  suffer  from  self-selection  by  analysts.  Accordingly,  I  also  control  for 
losses. 

Using  count-data  in  the  dependent  variable,  I  follow  Rock  et  al.  (2000)  and  use  the  standard 
negative  binomial  distribution  to  estimate  regression  models  2(a)  through  2(c): 


®  This  argument  recognizes  that  analysts  announce  expected  earnings  as  their  forecast.  Even  if  analysts’  forecasts  are 
biased  (say,  optimistically),  there  is  no  reason  to  believe  that  their  bias  depends  on  the  level  of  cost  stickiness. 
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Model  2(a) 


FLLWii  =  /3o  +  /S^STlCKYi,  +  +  l34VSALEi,  +  fi^DISPi,  +  pf^BS-FEi, 


+  13-jGROWTHi,  +  /B^TVi,  +  /3gL0SSi,  +  s,-,; 


Model  2(b) 


FLLWij  =  /3o  +  IB^M-STlCKYi,  +  +  P^RDh  +  P^VSALEi,  +  Ps^lSPu  +  PeABS-FEi, 


+  fijGROWTHii  +  ySgTV/,  +  ft^EOSSu  +  e,,; 


Morfe/  2(c) 

EELWit  =  /3o  +  l5^STICKYi,_^  +  /32MV,,  +  p^RDi,  +  P^YSALEn  +  PsDISPit  +  Pf^BS-EEi, 

+  /3jGROWTHi,  +  (3^TVit  +  /3gLOSSi,  +  e,,; 

where  year  effects  and  two-digit  SIC  code  industry  effects  are  added  to  all  models,  and; 

FLLWii  =  number  of  analysts’  earnings  forecasts  announced  for  firm  i  and  quarter  t  in  the 
month  immediately  preceding  that  of  the  earnings  announcement; 

GROWTH^,  =  (SALEiil  SALEi^,_4)^-^^  -  1; 

RDj,  =  Compustat  #4  for  firm  i  in  quarter  t  divided  by  SALEn]  observations  with  no 
values  are  set  equal  to  0  and  values  are  winsorized  at  1;  and 
TVi,  =  quarterly  trading  volume  in  millions  of  shares. 

Model  2(b)  measures  cost  stickiness  based  on  data  from  eight  preceding  quarters  and  model  2(c) 
uses  a  lagged  measure  of  cost  stickiness  estimated  on  a  prior  quarter  to  strengthen  the  causality 
argument. 

Market  Tests  of  H3 

The  third  hypothesis  predicts  that  the  market  response  to  earnings  surprises  is  weaker  for 
firms  with  stickier  costs  than  for  firms  with  less  sticky  costs.  To  test  this  hypothesis,  I  estimate  a 
valuation  model  that  regresses  the  cumulative  abnormal  return  on  the  magnitude  of  earnings 
surprise  and  the  interaction  between  earnings  surprise  and  cost  stickiness,  while  controlling  for 
environmental  uncertainty.  I  use  contemporaneous  estimates  of  cost  stickiness  and  an  earlier 
quarter  estimate  in  pooled  cross-sectional  regression  models.  Additionally: 

Model  3(a) 

CARit  =  ySo  +  P^EE^t  +  ^2FEi, STICKY i,  +  P^DlSPi,  -t-  /3^VSALEi,  -l-  e,,; 


Model  3(b) 


CARit  =  I3q  +  /3iFEi,  +  /32FEi,COGS-STICKYi,  +  fi^DlSPi,  +  (S^VSALEi,  +  e,,; 


Model  3(c) 


CAR,,  =  ySo  +  P,FE„  +  ^2FE„SGA-STICKY„  +  (3,DISP„  +  P,VSALE„  +  e„- 


Model  3(d) 


CAR,,  =  (3q  +  (3\FEi,  +  j32FEi,M -STICKY  ii  +  /S^DISP,,  +  /S^VSALE,,  +  e,,; 
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Model  3(e) 


CAR,,  =  /3o  +  (3,FEu  +  fi^FE^D-STICKY,,  +  (3^018?  „  +  fi^VSALE,,  +  8,,; 

Model  3(f) 


CAR,,  =  ^0  +  PiFE„  +  f2FE„STICKY„_i  +  f,DISP„  +  /3,VSALE„  +  s„- 

where  CAR,,  (cumulative  abnormal  return)  is  the  three-trading-day  cumulative  value-weighted 
market-adjusted  abnormal  return  surrounding  the  earnings  announcement  for  firm  i  in  quarter  t, 
and,  D-STICKY , ,  equals  —  1  if  STICKY,  ,  <  0,  and  0  otherwise. 

To  control  for  environmental  uncertainty,  Imhoff  and  Lobo  (1992)  use  dispersion  in  analysts’ 
earnings  forecasts  and  show  that  firms  with  higher  ex  ante  earnings  uncertainty  exhibit  smaller 
price  changes  in  response  to  earnings  announcements  than  firms  with  lower  ex  ante  earnings 
uncertainty.  Dispersion  in  analysts’  earnings  forecasts  is  likely  to  capture  additional  aspects  of 
earnings  predictability  other  than  those  related  to  cost  behavior.  To  control  for  broad  aspects  of 
earnings  predictability,  I  employ  both  the  dispersion  in  analysts’  forecasts,  DISP,  and  VSALE  as 
control  variables  in  the  above  models.  If  the  coefficient  estimate  for  the  interaction  variable 
becomes  statistically  significant  after  controlling  by  DISP  and  VSALE,  then  this  suggests  that  cost 
behavior  matters  in  forming  investors’  beliefs  regarding  the  value  of  the  firm.  The  earnings  re¬ 
sponse  coefficient  is  predicted  to  be  weaker  for  firms  with  stickier  cost  behavior  (i.e.,  /32  >  0). 

Sample  Selection 

The  sample  includes  all  industrial  firms  (SIC  codes  2000-3999)  from  1986  to  2005.  The  study 
is  limited  to  industrial  firms  for  two  reasons.  First,  it  allows  examination  of  the  effects  of  a 
potential  variation  in  cost  stickiness  of  the  COGS  and  SGA  cost  components  on  the  accuracy  of  the 
earnings  forecasts.  The  homogenous  structure  of  the  profit  and  loss  statement  among  industrial 
firms  allows  insights  into  the  effects  of  sticky  cost  behavior  among  major  cost  components  on  the 
accuracy  of  analysts’  earnings  forecasts.  Second,  industrial  firms  (in  contrast  to  utilities  and  other 
regulated  industries)  generally  operate  in  competitive  markets,  which  partially  mitigates  the  mea¬ 
surement  error  due  to  a  potential  pricing  effect,  rather  than  to  a  volume  effect. 

The  data  are  obtained  from  Compustat,  I/B/E/S,  and  CRSP.  For  each  firm  quarter,  I  use  the 
consensus  forecast  calculated  as  the  average  of  all  forecasts  announced  in  the  month  preceding 
that  of  the  earnings  announcement.  Actual  earnings  are  taken  from  I/B/E/S,  as  they  are  more  likely 
to  be  consistent  with  the  forecast  in  treating  extraordinary  items  and  some  special  items  (Philbrick 
and  Ricks  1991).  Following  Gu  and  Wu  (2003),  I  require  stock  prices  to  be  at  least  $3  to  avoid  the 
small  deflator  problem.  Announcement  dates  are  taken  from  Compustat  rather  than  I/B/E/S,  which 
has  more  firm  quarters  with  missing  announcement  dates.  In  line  with  the  model  assumption,  I 
limit  the  sample  to  firm-year  observations,  in  which  costs  and  sales  change  in  the  same  direction. 
This  reduces  the  sample  size  by  14.1  percent,  resulting  in  a  final  sample  that  consists  of  44,931 
firm  quarters  for  2,520  firms. 

Descriptive  Statistics  and  Consistency  with  an  Earlier  Cost  Stickiness  Measure 

Table  1  presents  summary  statistics  for  the  relevant  variables.  The  mean  (median)  value  of 
STICKY  is  -0.0174  (-0.0111).  Consistent  with  prior  literature,  the  mean  (median)  value  of 
SGA-STICKY  is  -0.0306  (-0.0326).  Both  means  are  negative  and  significant  (p  <  0.01).  On 
average,  total  costs  and  SGA  costs  exhibit  cost  stickiness.  The  mean  (median)  value  of  COGS  is 
positive,  0.0187  (0.0063).  Thus,  on  average,  SGA  costs  exhibit  sticky  cost  behavior,  while  COGS 
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TABLE  1 

Descriptive  Statistics,  Pooled  over  Time 


Variables 

n 

Mean 

1986-2005 

Std. 

Dev. 

Ql 

Median 

Q3 

% 

Negative 

STICKY 

44,931 

-0.0174 

0.4897 

■0.1551 

-0.0111 

0.1205 

53.2 

COGS-STICKY 

37,521 

0.0187 

0.4707 

0.1564 

0.0063 

0.1823 

48.7 

SGA-STICKY 

23,809 

-0.0306 

0.6944 

■0.3870 

-0.0326 

0.3304 

55.1 

M-STICKY 

44,931 

-0.0117 

0.2398 

■0.0633 

-0.0094 

0.0501 

54.9 

FE 

44,931 

-0.0014 

0.0382 

0.0110 

0 

0.0011 

38.3 

ABS-FE 

44,931 

0.0071 

0.0118 

0.0003 

0.0011 

0.0034 

NA 

MV 

44,926 

4.7037 

2.1314 

3.1508 

4.6328 

6.1997 

NA 

LOSS 

44,931 

0.1628 

0.3692 

0 

0 

0 

NA 

FLEW 

44,931 

5.5356 

5.1759 

2 

4 

8 

NA 

DOWN 

39,415 

0.5702 

0.4828 

0 

1 

1 

NA 

VSALE 

44,926 

0.1480 

0.1478 

0.0611 

0.1026 

0.1752 

NA 

DISP 

44,931 

0.0018 

0.0328 

0.0001 

0.0004 

0.0012 

NA 

OPLEV 

44,559 

0.3670 

0.1819 

0.2408 

0.3530 

0.4874 

NA 

SEASON 

44,626 

0.6111 

0.4875 

0 

1 

1 

NA 

GROWTH 

35,864 

0.0418 

0.1310 

0.0142 

0.0247 

0.0422 

NA 

RD 

44,931 

0.0482 

0.0987 

0 

0 

0.0620 

NA 

TV 

42,029 

5.3360 

8.2114 

1.8441 

2.2254 

6.1813 

NA 

Variable  definitions  for  each  firm  i 

on  quarter  f. 

FE, 

=  difference  between  reported  earnings  and  the  mean  (consensus)  forecasts  announced  in  the  month 

immediately  preceding  that  of  the  earnings  announcement,  deflated  by  the  price  at  the  end  of  the 

prior  quarter.  ABS-FE,  =  \  FEi\\ 

ACOST] 

ACOST] 

STICKY, 

~  i  ASALf/j 

,  r-3},  where  r  is  the  most  recent  quarter  with  sales 

decrease  and  ' 

T  is  the  most  recent  quarter  with  sales  increase; 

COST, 

=  sales  (Compustat  #2)  minus 

net  earnings  (Compustat  #8)  for  firm  i  in  quarter  t; 

SALE, 

=  Compustat  #2  for  firm  i  in  quarter  t; 

ACOGS] 

ACOGS] 

COGS-STICKY, 

~  \  ASALE  j 

,  -“’M  ASALE 

.  ,  t-3},  where  x  is  the  most  recent  quarter  with  sales 

decrease  and  t  is  the  most  recent  quarter  with  sales  increase; 

COGS, 

=  Compustat  #30  for  firm  i  in 

quarter  t; 

/  ASGA  \ 

/  ASGA  \ 

SGA-STICKY, 

.-'“MasaleL’ - 

,  t-3},  where  x  is  the  most  recent  quarter  with  sales 

decrease  and  r  is  the  most  recent  quarter  with  sales  increase; 

SGA, 

=  Compustat  #1  for  firm  i  in  quarter  t\ 

M-STICKY, 

=  difference  between  the  mean  cost  function  slope  under  upward  adjustments  made  on  quarters  from 

t-1  through  t  and  the  mean 

cost  function  slope  under  downward  adjustments  made  on  quarters 

from  t-1  through  r; 

MV, 

=  log  of  market  value  of  equity  (Compustat  #61 

X  #14)  on 

quarter  end; 

LOSSii  =  indicator  variable  that  equals  1  if  the  reported  earnings  (Compustat  #8)  are  negative,  and  0 
otherwise; 

FLLWj,  =  number  of  analysts’  earnings  forecasts  announced  for  firm  i  and  quarter  t  in  the  month  immediately 
preceding  that  of  the  earnings  announcement; 

DOWNj,  =  defined  in  Matsumoto  (2002)  and  equals  1  if  unexpected  earnings  forecasts  are  negative,  and  0 
otherwise; 

VSALEi,  =  coefficient  of  variation  of  sales  measured  over  four  quarters  from  t-3  through  f; 

DISPi,  =  standard  deviation  of  the  analysts’  forecasts  announced  for  firm  i  and  quarter  t  during  the  30  days 
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TABLE  1  (continued) 


prior  to  the  earnings  announcement,  deflated  by  the  stock  price  at  the  end  of  quarter  t- 1 ; 

SALE;,  -  COGSi,(Compustat  #  30) 

OPLEVii  = -  and  values  below  0  or  above  1  are  winsorized; 

SALEj( 

SEASON^,  =  indicator  variable  that  equals  1  if  the  change  in  earnings  from  the  same  quarter  in  the  prior  year 
(Compustat  #8)  is  positive,  and  0  otherwise; 

GROWTH,,  =  [sALEu  /  SALE,^,_^f  '^^  -  1 ; 

RD„  =  Compustat  #4  for  firm  i  in  quarter  t  divided  by  SALE,,.  Observations  with  no  values  are  taken  at  0 
and  values  are  winsorized  at  1 ;  and 
7y„  =  trading  volume  in  millions  of  shares. 


exhibit  anti-sticky  cost  behavior.'®  The  linear  nature  of  raw  materials  consumption  may  partially 
explain  this  disparity  in  cost  behavior.  Another  potential  explanation  for  this  finding  is  that  salaries 
and  advertising  expenses  are  likely  to  be  classified  as  SGA.  The  cost  stickiness  of  total  costs  is  also 
in  line  with  the  negative  skewness  of  the  earnings  distribution  reported  by  Givoly  and  Hayn 
(2000)  and  Gu  and  Wu  (2003).  The  standard  deviation  of  STICKY,  SGA-STICKY,  and  COGS- 
STICKY  is  0.4897,  0.6944,  and  0.4707,  respectively,  indicating  considerable  variation  among 
firms’  cost  behavior. 

Exanuning  whether  the  classification  of  per  firm  cost  stickiness  tends  to  remain  persistent 
over  time,  the  likelihood  of  keeping  the  same  cost  classification  (either  sticky  or  anti-sticky)  over 
two  consecutive  quarters  is  72.5  percent  (not  tabulated).  The  Spearman  (Pearson)  correlation 
between  STICKY  and  M-STICKY  reported  in  Table  2  is  0.48  (0.45),  indicating  reasonable  perse¬ 
verance  over  eight  quarters.  Additionally,  the  Pearson  (Spearman)  coefficient  between  the 
STICKYi  ,_i  and  STICKY^  estimates  is  0.43  (0.44),  both  significant  at  a;  =  1  percent  (not  tabu¬ 
lated),  indicating  that  firms’  cost  behavior  is  reasonably  stable  over  quarters. 

As  expected,  STICKY  is  significantly  and  positively  correlated  with  both  COGS-STICKY  and 
SGA-STICKY.  The  correlation  between  COGS-STICKY  and  SGA-STICKY  is  also  positive  and 
significant,  indicating  a  pattern  in  firms’  cost  behavior  with  respect  to  total  costs  and  to  the  two 
cost  constituents.  The  correlation  coefficient  between  STICKY  and  ABS-FE  is  negative  and  sig¬ 
nificant,  suggesting  a  negative  relation  between  the  cost  stickiness  and  the  absolute  analysts’ 
earnings  forecast  errors. 

I  concentrate  on  SGA  costs  to  check  the  consistency  of  the  proposed  measure  with  the 
stickiness  measure  and  results  reported  by  Anderson  et  al.  (2003).  I  estimate  the  stickiness  of  SGA 
costs  using  the  following  cross-sectional  regression  model  for  two-digit  SIC  code  industries  with 
at  least  25  observations: 


Model  4 


log 


SGA,, 

SGAi,_, 


=  \o  +  ^i  log 


SALE,, 
_SALEi^,_ 


+  X2SALEDECi,  log 


SALE;, 


SALE, 


.t-i 


+  E; 


where  SALEDEC,,  equals  1  if  SALE,,  <  SALE,j_u  and  0  otherwise. 

Anderson  et  al.  (2003)  suggest  the  regression  coefficient  estimate  ^2  as  a  cost  stickiness 
measure.  I  compute  mean  SGA-STICKY  for  two-digit  SIC  code  industries  and  examine  the  corre- 


Similarly,  Anderson  and  Lanen  (2007,  Table  7)  report  that,  on  average,  the  number  of  employees,  labor  costs,  and  PPE 
costs  are  anti-sticky. 
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lation  with  the  estimated  X2.  In  addition,  I  also  estimate  the  correlation  with  industry-level  coef¬ 
ficient  estimates  reported  by  Anderson  and  Lanen  (2007,  Table  6,  Panel  B).  I  note  that  Anderson 
et  al.  (2003)  and  Anderson  and  Lanen  (2007)  use  a  larger  sample  comprised  of  firms  with  and 
without  analyst  coverage  and  employ  annual  data.  All  correlation  coefficients  reported  in  Table  3 
are  positive  and  significant,  indicating  consistency  between  the  proposed  cost  stickiness  measure 
and  the  earlier  evidence  on  the  stickiness  of  SGA  costs. 

IV.  RESULTS 

HI  Results 

To  test  whether  stickier  cost  behavior  results  in  less  accurate  analysts’  earnings  forecasts. 
Table  4  presents  the  mean  and  median  absolute  analysts’  earnings  forecast  errors  contingent  on 
sticky  {STICKY  <  0)  versus  anti-sticky  {STICKY  >  0)  cost  classification.  The  mean  absolute  error 
for  firms  with  sticky  cost  behavior  is  0.0080,  whereas  that  for  firms  with  anti-sticky  cost  behavior 
is  0.0060.  Thus,  forecasts  for  firms  with  anti-sticky  cost  behavior  are,  on  average,  more  accurate 
by  25  percent  =  (0.0080  —  0.0060)70.0080  than  forecasts  for  firms  with  sticky  cost  behavior.  The 
difference  is  statistically  significant  (p  <  0.05).  If  accurate  earnings  forecasts  are  valuable  for 
capital  market  participants,  then  the  difference  is  economically  meaningful. 

Table  5  presents  coefficient  estimates  for  the  regression  models.  The  coefficient  on  STICKY  in 
model  1(a)  is  —0.0108,  and  is  statistically  significant  (p  <  0.001).'^  The  coefficient  on  COGS- 


TABLE  3 


Correlation  Coefficients  between  Industry  Estimates  of  Cost  Stickiness 


Variables 

Mean 

SGA-STICKY  J 

^2j 

Anderson-Lanen 

Coefficient 

Mean  SGA-STICKY  j 

0.562** 

0.345** 

^2  i 

0.485** 

0.467** 

Anderson-Lanen  Coefficient 

0.463** 

0.365** 

**  Significant  at  the  5  percent  level. 

The  table  presents  Spearman  (Pearson)  coefficients  above  (below)  the  diagonal  line  between  three  estimates  of  cost 
stickiness  measured  at  the  two-digit  SIC  code  level:  SGA-STICKY,  an  estimate  based  on  a  measure  suggested  by  Anderson 
et  al.  (2003)  and  estimates  reported  by  Anderson  and  Lanen  (2007). 

Mean  SGA-STICKY j  is  the  mean  value  of  SGA-STICKY  across  all  sample  observations  at  the  two-digit  SIC  code  level,  j  = 
20  to  39. 

k2j  is  the  coefficient  estimate  from  estimating  the  regression  of  the  following  model  using  all  sample  observations  at  the 
two-digit  SIC  code  level,  j  =  20  to  39. 

Model  4: 


log 


SGAjj 

SGAi,_i 


^Xo  +  Xi  log 


SALEij 
SALEi  ,_i  _ 


■+■  X2  SALEDECj ,  log 


SALEjj 

SALE; 


where  SALEDECj  ,  equals  1  if  SALE,  ,  <  SALE;  ,_i,  and  0  otherwise.  SGA,  ,  is  Compustat  #1  and  SALE  is  Compustat  #2. 
Anderson-Lanen  Coefficients  are  taken  for  the  respective  two-digit  SIC  code  industries  from  Anderson  and  Lanen  (2007, 
Table  6,  Panel  B). 


”  Consistent  with  the  perception  of  costs  as  sticky  if  firms  incur  disproportionate  costs  when  activity  levels  decrease, 
results  from  an  additional  regression  analysis  (untabulated)  indicate  that  cost  stickiness  boosts  absolute  earnings  forecast 
errors  more  when  activity  levels  decrease  than  when  they  increase. 
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TABLE  4 

Absolute  Forecast  Errors  (ABS-FE)  for  Firms  with  Sticky 

Behavior 

versus  Anti-Sticky  Cost 

Cost  Behavior 

Mean 

Median 

n 

Sticky  Costs:  STICKY i,  <  0 

Anti-Sticky  Costs:  STICKY >  0 
Difference 

0.0080 

0.0060 

0.0020** 

0.0012 

0.0010 

0.0002^ 

23,915 

21,016 

**  Significant  at  the  5  percent  level. 

“  Mann- Whitney  test  indicates  a  significant  difference  between  the  medians  at  the  5  percent  level. 

STICKY  in  model  1(b)  is  —0.0100,  and  is  statistically  significant  (p  <  0.001).  The  coefficient  on 
SGA-STICKY  in  model  1(c)  is  —0.0055,  statistically  significant  at  p  <  0.002.  Adjusted  values 
for  the  regressions  vary  from  7.5  percent  to  17.6  percent.  The  results  support  HI,  indicating  that 
stickier  cost  behavior  is  associated  with  lower  accuracy  of  analysts’  earnings  forecasts. 

As  for  the  control  variables,  results  for  MV  and  LOSS  are  generally  consistent  with  expecta¬ 
tions,  indicating  a  positive  and  significant  relationship  between  the  amount  of  available  firm- 
specific  information  and  forecast  error.  The  coefficient  estimate  on  DOWN  is  insignificant  across 
the  regression  models,  possibly  due  to  differences  among  analysts  in  the  underlying  costs,  earn¬ 
ings  models,  and  access  to  management  information:  a  large  number  of  analysts  covering  a  firm 
can  proxy  variation  in  the  underlying  costs  and  profits  models,  resulting  in  considerable  noise.  As 
expected,  the  findings  for  DISP  and  to  a  limited  extent  for  VSALE  indicate  a  positive  and  signifi¬ 
cant  association  between  the  absolute  magnitude  of  the  forecast  errors  and  the  uncertainty  in  the 
firm’s  environment  of  operations  and  earnings  predictability. 

OPT.F.V  is  positively  associated  with  ABS-FE,  indicating  that  operating  leverage  increases  the 
analysts’  earnings  forecast  errors.  The  seasonal  effect,  SEASON,  is  insignificant  across  the  regres¬ 
sion  models,  indicating  that  analysts  recognize  the  seasonal  effect  and  adjust  their  forecasts  ac¬ 
cordingly. 

Results  for  two  sensitivity  models  1(d)  and  1(e)  are  also  reported  in  Table  5  and  provide 
further  insights  into  additional  aspects  of  the  relationship  between  cost  behavior  and  the  accuracy 
of  analyst  earnings  forecasts.  First,  I  examine  the  sensitivity  of  the  results  to  estimating  cost 
stickiness  over  a  longer  time  period.  Accordingly,  M-STICKY  measures  cost  stickiness  based  on 
cost  responses  over  eight  quarters.  Regression  results  for  model  1(d)  indicate  a  statistically  sig¬ 
nificant  negative  coefficient  on  M-STICKY,  -0.0073  (p  =  0.019).  The  result  supports  HI. 

Second,  I  examine  whether  past  (rather  than  current)  managerial  discretion  affects  the  hypoth¬ 
esized  relationship.  I  check  whether  the  regression  coefficient  estimates  are  sensitive  to  discre¬ 
tionary  choices  made  by  managers  in  quarter  t-l  or  earlier  by  replacing  STICKY in  model  1(a) 
with  the  cost  stickiness  measure  estimated  on  quarter  t-l,  STICKY i  which  excludes  all  mana¬ 
gerial  choices  made  in  quarter  t. 

Estimating  regression  model  1(e),  the  coefficient  estimate  on  STICKY i  f_i  is  —0.0040  (p  = 
0.030).  The  negative  and  significant  coefficient  estimate  indicates  that  stickier  cost  behavior  ob¬ 
served  in  a  preceding  quarter  is  associated  with  higher  absolute  analysts  forecast  errors.  I  con¬ 
clude  that  cost  stickiness  estimated  by  analysts  on  a  preceding  quarter  affects  the  accuracy  of  the 
earnings  prediction. 

Additionally,  1  examine  the  incremental  effect  of  STICKY  over  earnings  volatility,  which  is 
likely  to  be  an  all-inclusive  noisy  variable  that  incorporates  many  types  of  uncertainties  (e.g.. 
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demand  uncertainty  and  operating  leverage).  Results  (not  tabulated)  indicate  that  STICKY  has  an 
incremental  effect  on  the  accuracy  of  analysts’  earnings  forecasts  above  and  beyond  earnings 
volatility  and  the  control  variables.  Overall,  the  evidence  supports  the  hypothesis. 

Furthermore,  evidence  on  the  mean  forecast  error  (as  opposed  to  the  absolute  forecast  error) 
reported  in  Table  6  offers  insights  into  the  validity  of  the  assumption  that  analysts  recognize  cost 
behavior.  Results  show  that  the  mean  forecast  error  of  firms  with  sticky  costs  is  insignificantly 
different  from  that  of  firms  with  anti-sticky  costs.  Thus,  the  evidence  supports  the  assumption 
that  analysts  have  at  least  some  understanding  of  firms’  cost  behavior. 

A  final  important  consideration  is  that  an  analyst  does  not  have  the  ability  to  reduce  forecast 
errors  caused  by  a  dispersion  of  a  firm’s  ex  ante  earnings  distribution.  In  other  words,  an  analyst 
cannot  influence  accuracy  implied  by  cost  stickiness  because  it  is  a  firm-specific  feature.  There¬ 
fore,  an  analyst  cannot  reduce  the  dispersion  of  the  ex  ante  earnings  distribution  implied  by  cost 
stickiness  even  if  she  is  aware  of  it  in  advance. 

H2  Results 

Results  showing  that  firms  with  stickier  cost  behavior  have  lower  analyst  coverage  are  pre¬ 
sented  in  Table  7.  Findings  in  Panel  A  indicate  that,  on  average,  5.459  analysts  follow  firms  with 
sticky  cost  behavior  while  5.622  analysts  follow  firms  with  anti-sticky  cost  behavior.  The  differ¬ 
ence  of  about  3  percent  is  statistically  significant  (p  <  0.05).  Panel  B  reports  the  results  of  three 
regression  models,  2(a),  2(b),  and  2(c).  The  coefficients  on  STICKYi^,  M-STICKY^  and 


TABLE  6 

Forecast  Errors  (FE)  for  Firms  with  Sticky  versus  Anti-Sticky  Cost  Behavior 

Cost  Behavior  Mean  Median  n 

Sticky  Costs:  STICKY i,  <  0 
Anti-Sticky  Costs:  STICKY i,  >  0 
Difference 


-0.0016  0  23,915 

-0.0012  0  21,016 

-0.0004^  0 


Insignificant  at  the  10  percent  level. 


Results  of  further  analyses  also  support  HI.  First,  findings  from  estimating  model  1  with  a  differential  slope  coefficient 
on  negative  stickiness  (i.e.,  sticky  costs)  indicate  a  minor  and  marginally  significant  difference  between  the  coefficients 
of  negative  and  positive  values  of  STICKY  on  ABS-FE.  Second,  checking  for  a  potential  seasonality  effect,  I  also 
computed  the  stickiness  measure  using  cost  responses  relative  to  the  same  quarter  of  the  preceding  year.  These  findings 
support  H 1 . 

To  see  the  intuition,  suppose,  on  the  contrary,  that  an  analyst  ignores  cost  stickiness.  Consequently,  her  forecast  will  be 
upward  biased  in  case  of  sticky  costs  (forecast  error  =  reported  earnings  -  forecast  <  0)  because  she  under-estimates 
costs  on  demand  falls.  In  a  similar  vein,  her  forecast  will  be  downward  biased  in  case  of  anti-sticky  costs  (forecast  error 
=  reported  earnings  -  forecast  >  0)  because  she  over-estimates  costs  on  demand  falls.  Thus,  sticky  costs  trigger  a 
negative  mean  forecast  error  and  anti-sticky  costs  trigger  a  positive  mean  forecast  error  (i.e  ,  bias,  not  absolute  forecast 
error). 

However,  results  reported  in  Table  6  indicate  that  the  mean  forecast  error  is  not  significantly  different  for  observations 
with  sticky  versus  anti-sticky  costs.  Therefore,  the  data  support  the  assumption  that  analysts  recognize  cost  stickiness  to 
some  extent. 

Lys  and  Soo  (1995)  demonstrate  that  the  inherent  difficulty  in  predicting  earnings  is  associated  with  large  forecast  errors 
(see  also  Kross  et  al.  1990).  Alford  and  Berger  (1999,  219)  suggest  a  proxy  for  “analysts’  ability  to  predict  company’s 
earnings”  (emphasis  added).  In  contrast,  firm-specific  sticky  costs  increase  the  ex  ante  dispersion  of  the  firm’s  earnings 
distribution.  The  correlation  between  STICKY  {M-STICKY)  and  this  proxy  (using  Equation  (1)  in  Alford  and  Berger 
1999,  223)  is  —0.07  (0.04),  suggesting  that  the  two  variables  do  not  pick  up  the  same  phenomena. 
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TABLE  7 

Association  of  Cost  Behavior  with  Analyst  Coverage 

Panel  A:  Mean  Number  of  Analysts  Following  Firms  with  Sticky  versus  Anti-Sticky  Cost  Behavior. 

Mean  Number  of  Analyst 

Cost  Behavior  Coverage  n 

Sticky  Costs:  Sr/CA:y,;  <  0  5.459  23,915 

Anti-Sticky  Costs:  >  0  5.622  21,016 

Difference  0.163** 


Panel  B:  Regression  of  the  Number  of  Analysts  Following  Firms  on  Cost  Stickiness,  Control 
Variables,  Year  Effects  and  Two-Digit  SIC-Code  Industry  Effects 

Model  2(a) 

FLLWit  =  ySo  +  PiSTICKYu  +  p2MVu  +  /S3RD,,  +  p^VSALEu  +  PsDISPu  +  p^BS-FE^ 

+  PjGROWTHu  +  PsTVi,  +  P^OSSu  + 


Model  2(b) 

FLEW  it  =  )8o  +  PiM-STICKYit  +  p2MVit  +  P^RDu  +  P^VSALEit  +  PsOISPu  +  p^BS-FEn 

+  pjGROWTHit  +  p^TVit  +  p^LOSSit  +  £it 


Model  2(c) 

FLLWu  =  Po  +  PiSTICKYit^i  +  p^MV^  +  p^RDi,  +  p^YSALEu  +  p^DISPu  +  P^BS-FEi, 

+  PjGROWTHit  +  p^TVit  +  p^OSSit  + 


Variables 

Model 

2(a) 

Intercept 

0.0500 

STICKYit 

(<0.001) 

0.0211 

M-STICKYit 

(0.031) 

STICKY  it-i 

MVit 

0.3174 

RDit 

(<0.001) 

0.2196 

VSALEit 

(<0.001) 

-0.4133 

DISPit 

(<0.001) 

0.6735 

ABS-FEit 

(<0.001) 

-0.0854 

GROWTHit 

(0.023) 

-0.0067 

TVit 

(0.786) 

0.1551 

(<0.001) 

Model 

Model 

2(b) 

2(c) 

0.0448 

0.0586 

(<0.001) 

(<0.001) 

0.0144 

(0.042) 

0.0216 

(0.033) 

0.3188 

0.3171 

(<0.001) 

(<0.001) 

0.2977 

0.2633 

(<0.001) 

(<0.001) 

-0.5001 

-0.4702 

(<0.001) 

(<0.001) 

0.6448 

0.7116 

(<0.001) 

(<0.001) 

-0.0998 

-0.0796 

(0.046) 

(0.045) 

-0.0444 

0.0360 

(0.171) 

(0.219) 

0.1881 

0.1776 

(<0.001) 

(<0.001) 

(continued  on  next  page) 
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Panel  B:  Regression  of  the  Number  of  Analysts  Following  Firms  on  Cost  Stickiness,  Control 
Variables,  Year  Effects  and  Two-Digit  SIC-Code  Industry  Effects 

Model  2(a) 

FLLWu  =  /3q  +  p^STICKYit  +  +  PiRDi,  +  p^VSALEi,  +  PsDISPu  +  P^BS-FEu 

+  PjGROWTHi,  +  PsTVit  +  P^LOSSu  +  £,« 


Model  2(b) 

FLLWu  =  /So  +  PiM-STICKYu  +  PiMV^  +  P^D^  +  p^VSALEu  +  PsDISPu  +  p^BS-^FEu 
+  pyGROWTHit  +  PsTVii  +  P^LOSSu  +  £„ 

Model  2(c) 

FLLWi,  =  pQ  +  PiSTICKYit_i  +  PiMV^  +  p^n  +  p^VSALEu  +  PsDISPu  +  P^BS-EEu 
+  p-,GROWTHu  +  PiiTVit  +  p^LOSSu  + 


Model 

Model 

Model 

Variables 

2(a) 

2(b) 

2(c) 

LOSSi, 

-0.0059 

-0.0055 

-0.0023 

(0.094) 

(0.111) 

(0.188) 

n 

35,857 

31,532 

31,662 

Pseudo-R^ 

43.18% 

45.81% 

44.07% 

**  Significant  at  the  5  percent  level. 

The  regression  model  was  estimated  using  a  standard  negative  binomial  distribution  because  the  dependent  variable 
(FLLW)  is  count-data.  The  dispersion  parameter  was  estimated  by  maximum  likelihood,  p-values  are  reported  in  paren¬ 
theses.  The  pseudo-R^  also  named  McFadden’s  R^,  is  the  log-likelihood  value  on  a  scale  from  0  to  1,  where  0  corresponds 
to  the  constant-only  model  and  1  corresponds  to  perfect  prediction  (a  log-likelihood  of  0). 

Variable  definitions  are  in  Table  1 . 


STICKY are  positive  and  highly  significant.  Keeping  in  mind  that  lower  values  of  STICKY 
indicate  stickier  cost  behavior,  the  findings  reject  the  null  H2.^^ 

As  for  the  control  variables,  the  coefficient  estimates  of  MV  and  TV  are  positive  and  signifi¬ 
cant,  in  line  with  prior  research.  The  coefficient  estimates  of  the  proxies  for  environmental  uncer¬ 
tainty  show  mixed  results.  The  coefficients  of  VSALE  and  ABS-FE  are  negatively  and  significantly 
associated  with  the  analyst  following,  while  the  coefficient  of  DISP  is  positive  and  significant.  The 
coefficients  of  GROWTH  and  LOSS  are  insignificant. 

The  coefficient  estimate  of  RD  is  also  positive  and  highly  significant,  consistent  with  Barth  et 
al.  (2001).  To  further  check  the  robustness  of  the  cost  behavior  effect,  I  separately  examine  the 
cost  stickiness  effect  on  analyst  coverage  for  firms  with  and  without  R&D  expenditures.  Results 
(not  tabulated)  indicate  that  cost  stickiness  is  significantly  associated  with  analyst  coverage  for 
firms  with  and  without  R&D  expenditures.  In  sum,  the  evidence  indicates  that  firms  with  stickier 
cost  behavior  have  lower  analyst  coverage. 

Lower  coverage  for  firms  with  stickier  costs  and  more  volatile  earnings  may  seem  counter¬ 
intuitive  if  analysts  strive  to  meet  a  high  demand  for  earnings  forecasts  for  firms  that  have  less 
predictable  earnings.  However,  the  analysts’  attitude  toward  large  negative  forecast  errors  can 


The  analysis  implicitly  assumes  that  an  equivalent  effort  is  expended  for  estimating  sticky  and  anti-sticky  costs.  This 
assumption  is  sensible  in  this  context  because  cost  stickiness  is  estimated  from  public  information  reported  in  financial 
statements. 
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partially  explain  their  coverage  preferences.  Ample  evidence  shows  substantial  declines  in  share 
price  following  a  negative  forecast  error  (e.g.,  Bartov  et  al.  2002).'^  To  some  extent,  analysts’ 
short-  and  long-term  benefits  are  affected  by  their  relationships  with  managers  of  covered  firms 
(Lim  2001).  Therefore,  all  else  being  equal,  analysts  are  likely  to  prefer  covering  firms  with  low 
ex  ante  probability  of  large  negative  forecast  errors.  Risk  aversion  reflected  in  a  conventional 
concave  loss-utility  function  captures  these  preferences.  This  interpretation  implicitly  assumes 
some  disparity  in  risk  attitude  to  large  negative  forecast  errors  between  investors  and  analysts  or, 
alternatively,  that  investors  recognize  cost  stickiness  to  a  limited  extent. 

H3  Results 

Table  8  presents  results  from  testing  whether  the  market  response  to  earnings  surprises  is 
weaker  for  firms  with  stickier  cost  behavior.  In  line  with  the  prior  literature,  coefficient  estimates 
Px  in  all  regression  models  are  positive  and  highly  significant,  indicating  a  positive  market  re¬ 
sponse  to  earnings  surprises.  The  estimated  coefficients  for  the  interaction  variable  are  positive 
and  significant  when  cost  stickiness  relates  to  total  costs  (models  3(a)  and  3(d)),  but  only  margin¬ 
ally  significant  when  cost  stickiness  relates  to  SGA  costs  (model  3(c)),  and  insignificant  with 
respect  to  stickiness  of  COGS  (model  3(b)).  Additionally,  results  from  estimating  model  3(e), 
which  uses  an  indicator  variable  for  the  classification  of  costs  as  sticky  versus  anti-sticky,  support 
H3. 

The  findings  suggest  that  investors  recognize  and  consider  cost  stickiness  with  respect  to  total 
costs,  but  not  the  stickiness  of  cost  components.  The  explanatory  power  in  the  models  ranges 
between  1.8  percent  and  2.9  percent,  which  is  in  line  with  prior  literature  (e.g.,  Gu  and  Wu  2003). 
To  strengthen  the  evidence,  I  take  a  predictive  rather  than  contemporaneous  approach  in  estimat¬ 
ing  cost  stickiness.  Model  3(f)  shows  a  lower  market  reaction  to  earnings  surprises  for  firms  with 
less  sticky  costs  estimated  on  the  preceding  quarter  (note  that  STICKY  <  0  indicates  sticky  costs). 
Taken  as  a  whole,  the  findings  corroborate  Banker  and  Chen  (2006)  and  indicate  a  weaker  market 
response  to  earnings  surprises  for  firms  with  stickier  cost  behavior,  supporting  H3. 

These  results  contribute  to  the  ongoing  debate  on  investor  rationality  by  documenting  that 
investors  are  able  to  process  accounting  information  and  partially  infer  cost  behavior  in  a  rational 
manner.  With  respect  to  the  control  variables,  coefficient  estimates  for  DISP  are  generally  insig¬ 
nificant  and  coefficient  estimates  for  VSALE  are  only  marginally  signihcant.  Thus,  dispersion  of 
analysts’  forecasts  and  variation  of  sales  may  not  serve  as  appropriate  proxies  for  ex  ante  earnings 
predictability  as  perceived  by  investors.  This  argument  is  supported  by  Diether  et  al.  (2002),  who 
interpret  dispersion  in  analysts’  earnings  forecasts  as  a  proxy  for  differences  in  opinion  about  the 
stock  (e.g.,  due  to  the  employment  of  different  valuation  models).  While  forecast  dispersion  may 
indicate  different  opinions  or  the  use  of  different  forecasting  models,  cost  stickiness  serves  as  a 
proxy  for  more  volatile  earnings  due  to  firm-specific  cost  structures.  Thus,  the  two  proxies  capture 
different  aspects  of  earnings  predictability.^^  Overall,  findings  indicate  that  investors  have  at  least 
some  understanding  of  firms’  cost  behavior  in  responding  to  earnings  surprises. 

V.  A  CONCLUDING  REMARK 

The  study  utilizes  a  managerial  accounting  concept,  sticky  costs,  to  gain  insights  into  how 
firms’  cost  behavior  affects  (1)  the  accuracy  of  analysts’  earnings  forecasts,  (2)  analysts’  selection 
of  covered  firms,  and  (3)  the  market  response  to  earnings  announcements.  While  implications  of 
cost  behavior  are  of  primary  interest  to  management  accountants,  this  study  employs  a  manage- 


Kinney  et  al.  (2002)  provide  a  different  view,  which  finds  considerable  variation  in  returns  for  firms  reporting  positive 
or  negative  surprises. 

See  Dichev  and  Tang  (2009)  and  Frankel  and  Litov  (2009)  for  additional  aspects  of  earnings  predictability. 
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TABLE  8 

Effect  of  Sticky  Cost  Behavior  on  Stock  Market’s  Reaction  to  Earnings  Surprises 


Model  3(a) 

CARi,  =  ySo  +  +  ^2FEi^TICKYi,  +  p^DISPu  +  +  e„ 

Model  3(b) 

CARi,  =  /3o  +  +  ^2EEuCOGS-STlCKYi,  +  +  fi^VSALE^,  +  Si, 

Model  3(c) 

CARi,  =  /3o  +  +  ^2FEi,SGA-STICKYi,  +  P^DlSPu  +  AVSALEi,  +  8„ 

Model  3(d) 

CARit  =  ^o  +  PiFEit  +  (SjEEi^M-STICKYi,  +  p^DISPi,  +  P^VSALEu  + 
Model  3(e) 

CARu  =  /So  +  PiFEu  +  PjFEiP-STlCKY,,  +  P^DISP^  +  P^VSALEi,  +  e„ 
Model  3(f) 

CAR,,  =  ySo  +  I3,PE„  +  /32FE,,STICKYi,_,  +  fi^DISP,,  +  (S^VSALE,,  +  s„ 


Variable 

Model  3(a) 

Model  3(b) 

Model  3(c) 

Model  3(d) 

Model  3(e) 

Model  3(f) 

Intercept 

0.0208 

0.0197 

0.0256 

0.0355 

0.0011 

0.0244 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

(0.066) 

(<0.001) 

FEu 

0.2929 

0.3326 

0.3636 

0.3467 

0.4377 

0.3745 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

(<0.001) 

FEi^TlCKYi, 

EEuCOGS-STICKYi, 

0.0166 

(0.033) 

0.0238 

(0.141) 

0.0089 

(0.077) 

EEi,SGA-STICKY„ 

FEi,M -STICKY,, 

0.0202 

FE„D-STlCKYi, 

(0.038) 

0.0366 

(0.040) 

FEi,STICKYi,_x 

0.0244 

(0.048) 


(continued  on  next  page) 
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TABLE  8  (continued) 


Variable 

Model  3(a) 

Model  3(b) 

Model  3(c) 

Model  3(d) 

Model  3(e) 

Model  3(f) 

DlSPii 

0.9202 

0.8333 

1.0284 

1.0196 

1.1280 

1.1017 

(0.151) 

(0.095) 

(0.156) 

(0.640) 

(0.251) 

(0.233) 

VSALEit 

0.0120 

0.0377 

0.0095 

0.0111 

0.0267 

0.0212 

(0.048) 

(0.089) 

(0.090) 

(0.050) 

(0.055) 

(0.075) 

Adj.  R2 

0.029 

0.018 

0.019 

0.027 

0.028 

0.019 

n 

43, m 

37,389 

23,666 

38,121 

43,777 

37,119 

p-values  based  on  two-tailed  tests  are  in  parentheses. 

Variable  Definitions: 

CARi,  =  cumulative  market-adjusted  returns  (raw  return  minus  value-weighted  CRSP  return)  measured  over 
three  trading  days  surrounding  earnings  announcement,  from  the  day  before  to  the  day  after;  and 
D-STICKYj,  =  —  1  if  STICKY i,  <  0,  and  0  otherwise. 


Definitions  of  the  other  variables  are  in  Table  1. 


ment  accounting  concept  for  addressing  research  questions  usually  raised  by  financial  accountants. 
Although  such  a  multi-disciplinary  approach  has  not  been  common  in  the  prior  literature,  the 
findings  of  this  study  suggest  that  combining  the  perspectives  of  management  and  financial  ac¬ 
counting  can  be  fruitful.  Further  research  is  expected  to  build  on  this  approach  in  exploring 
multi-disciplinary  accountings  topics.  Integrating  management  and  financial  accounting  research  is 
likely  to  benefit  both  disciplines. 


APPENDIX 

Employing  cost  stickiness  as  a  yardstick,  I  develop  a  simple  two-period  model  to  predict  a 
relationship  between  the  level  of  cost  stickiness  and  the  accuracy  of  analysts’  earnings  forecasts. 
The  primitive  model  input  in  the  first  period  is  a  set  of  prior  beliefs  on  the  state  of  the  world,  say 
demand  y,  which  is  a  realization  of  a  random  variable  y,  drawn  from  a  distribution  function,  <F(y), 
with  a  strictly  positive  and  symmetric  density,  </>(y),  over  the  support  [y  -  X,,  y  -t-  \],  y  >  X  >  0.  The 
second-period  revenue  function,  R(y),  is  assumed  to  be  differentiable,  increasing  and  concave.'* 
Costs  in  the  second  period  are  modeled  by: 


C(y,y,a:)  = ' 


/-f  (u  -f-  a;)y 
f+vy  +  ay 


ify  <y 
if  y  >y 


/  =  a  fixed  cost  of  production,  /  >  0; 

V  =  a  variable  cost  per  product  unit,  v  >  0;  and 


The  revenue  function  depends  on  previously  made  managerial  choices,  like  product  price.  See  also  Balakrishnan  and 
Sivaramakrishnan  (2002). 
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a  =  level  of  cost  stickiness  per  product  unit,  -u  <  a<  R'iy)  -  V. 

The  parameter  a  captures  the  level  of  cost  stickiness  in  adjusting  resources  to  changes  in  the 
activity  level.  If  cr  <  0,  then  costs  increase  more  when  activity  rises  than  they  decrease  when 
activity  falls  by  an  equivalent  amount;  that  is,  a  negative  value  of  a  indicates  sticky  costs.  If  a  > 
0,  then  costs  increase  less  when  activity  rises  than  they  decrease  when  activity  falls  by  an  equiva¬ 
lent  amount;  that  is,  a  positive  value  of  a  indicates  anti-sticky  costs.  The  difference  between  the 
cost  of  an  upward  activity  adjustment  and  the  cost  of  an  equivalent  downward  activity  adjustment 
depends  only  on  cr:'® 


[C(y -i-  1)  -  C(y)]  -  [C(y)  -  C(y  -  1)]  =  [t;(y  +  1)  +  cry  -  uy  -  ay]  -  [ny  4-  ay  -  (n  +  a)(y  -  1)] 

=  -  a. 

I  use  cost  of  adjustments  to  expand  the  conventional  fixed-variable  cost  model  and  estimate 
stickiness  of  firms’  cost  functions.  My  approach  follows  Wernerfelt  (1997),  who  shows  that  the 
magnitude  of  an  adjustment  cost  drives  the  form  of  the  organization,  and  Balakrishnan  and  Gruca 
(2008),  who  show  that  cost  stickiness  is  greater  for  cost  functions  that  relate  to  an  organization’s 
core  competency.  Rothschild  (1971)  models  properties  of  convex  (concave)  adjustment  cost  struc¬ 
tures  that  result  in  asymmetric  cost  functions  due  to  the  cost  of  producing  marginal  unit  increases 
(decreases)  in  the  activity  level. 

In  my  model,  the  earnings  function,  n(y,a)  =  R{y)  -  C(y,a),  is  strictly  increasing  in  y  and 
transforms  demand  y  realized  in  the  second  period  into  earnings.  The  ex  ante  earnings  expecta¬ 
tions  in  the  first  period  are  denoted  n(y  ,  a).  In  the  second  period,  the  firm  truly  reports  its  realized 
earnings,  11  (y,  a). 

An  analyst  is  delegated  the  task  of  producing  accurate  estimates  of  a  firm’s  earnings  expec¬ 
tations  and  the  forecast  is  honest,  as  in,  for  example,  Ottaviani  and  Sorensen  (2006).  In  the  first 
period,  the  analyst  announces  n(y ,  a)  as  her  most  accurate  forecast,  if  her  error  loss  function  is 
symmetric  and  concave  (e.g.,  a  quadratic  loss  function).^®  Focusing  on  the  absolute  earnings 
forecast  error  as  an  accuracy  gauge,  the  proposition  below  proves  a  negative  relationship  between 
the  level  of  cost  stickiness  and  the  mean  absolute  eaniings  forecast  error.  That  is,  higher  values  of 
a,  i.e.,  less  sticky  cost  behavior,  result  in  lower  mean  absolute  analyst  forecast  errors. 

Proposition 

£’[|n(y  ,  O')  -  fl(y  ,  a)|]  decreases  in  a. 

Proof 

The  proof  is  based  on  Jensen’s  inequality: 


Let  FE(y,a)  =  n(y,a)  -  n(y,a)  = 


Riy)  -/-  +  o')y  - 

^  R  (y)  -  /  -  vy  -  ay  -  n  (y ,  a) 


if  y  <y, 
otherwise. 


Define  y^(a):  n(y^(a) ,  a)  =  n(y  ,  a).  Thus,  FE{y  ,  a)  <  0  for  all  y  e[y  -  X. ,  y^(a)]. 


I  note  that  cost  stickiness  does  not  depend  on  the  operating  leverage  of  the  firm  because  the  fixed  cost  component,  /, 
does  not  influence  the  level  of  cost  stickiness. 

A  discussion  on  the  properties  of  a  symmetric  error  loss  function  appears  in  Beja  and  Weiss  (2006). 
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/(«)  y+\  /(o') 

£[|F£'(y,a)l]  =  j  -FE(y,a)^(y)dy+  j  FE(y,a)^(y)dy  =  -2  j  FE{y, 


a)(f>{y)dy 


y-'^  /(a) 

>+\  /(«) 

+  j  FE{y,  a)  (f>iy)dy  =  -  2  j  FE{y,  a)  (f){y)dy  >  0 


y-X 


y-X 


y-X 


Jensen’s  inequality  implies  a^>  a2^y^{ai)>y*{a^,  and,  FE{y ,  aj)  <FE{y ,  a^)  for  all  y 
Suppose  >  0^2.  Hence: 


y^(«2)  /(«i) 

E[\EEiy,a2)\]-E[\EE{y,ax)\]  =  -2  j  FE(y,a2)(f>(y)dy  +  2  J  FE(y,ai)(f>(y)dy 

y-X  y-X 


=  2 


y*(a7) 

j  (FE(y,  tti)  -  EE(y,  0:2))  </>(y)dy 

y-X 


/(«l) 

2  j  FE(y,ai)(^(y)dy>0. 

y*(o‘i) 


Based  on  the  proposition,  I  argue  that  cost  stickiness  is  a  determinant  of  the  accuracy  of 
analysts’  earnings  forecasts.  Specifically,  more  sticky  costs  increase  the  spread  of  the  ex  ante 
distribution  of  earnings,  which  increases  the  ex  ante  volatility  of  reported  earnings.  For  that  reason 
the  proposition  motivates  HI:  Increased  cost  stickiness  reduces  the  accuracy  of  analysts’  consen¬ 
sus  earnings  forecasts. 
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ROBERT  BLOOMFIELD  and  KRISTINA  RENNEKAMP,  Experimental  Research  in  Fi¬ 
nancial  Reporting:  From  the  Faboratory  to  the  Virtual  World,  Foundations  and  Trends® 
in  Accounting  (Boston,  MA:  now  Publishers,  Inc.,  2009,  ISBN  978-1-60198-214-8,  vol. 
3,  no.  1,  pp.  ix,  89). 


In  this  monograph,  Robert  Bloomfield  and  Kristina  Rennekamp  make  the  case  that  accounting  researchers 
can  benefit  from  using  virtual  worlds  as  a  laboratory  to  study  the  impacts  of  complex  institutions  on  aspects  of  the 
financial  reporting  process.  After  a  brief  introductory  chapter,  the  monograph  proceeds  with  a  description  of 
research  in  financial  accounting  and  a  discussion  of  the  benefits  of  experiments  for  learning  about  behavior  in 
financial  reporting  contexts,  and  then  moves  in  Chapter  2  to  a  discussion  of  costs  and  benefits  of  including 
institutional  complexity  in  accounting  experiments.  Chapters  3  and  4  describe  two  important  virtual  worlds  and 
how  their  economies  work.  Chapters  5  through  7  discuss  the  various  ways  of  using  virtual  worlds  to  conduct 
experiments,  methods  for  implementing  some  key  institutions,  and  potential  problems  with  conducting  experi¬ 
ments  embedded  in  virtual  worlds  and  ways  of  mitigating  these  problems.  Chapter  8  contains  a  discussion  of  the 
costs  and  benefits  of  using  virtual  worlds  to  conduct  research  across  research  methodologies.  Chapter  9  contains 
concluding  remarks. 

A  variety  of  audiences  will  find  aspects  of  the  monograph  to  be  interesting  and  informative.  As  a  nongamer, 
I  was  surprised  to  learn  that  Second  Life  has  dance  parties  and  concerts.  I  was  even  more  surprised  to  leam  that 
it  has  an  active  financial  sector,  including  several  stock  exchanges,  that  there  are  external  markets  for  its  currency, 
and  that  its  owners  had  to  ban  banking  activities  after  allegations  of  fraud.  These  facts  confirmed  for  me  the 
“reality”  of  virtual  worlds  and  piqued  my  interest  in  using  therp  as  a  research  tool. 

As  a  researcher  interested  in  financial  reporting  issues,  I  found  the  authors’  discussion  of  the  benefits  of 
adding  complexity  to  experiments  (Chapter  2)  compelling.  The  authors  point  out  that  adding  complex  institutions 
(defined  broadly  to  include  laws  and  regulations,  common  types  of  commercial  and  social  contracts,  and  interac¬ 
tions,  etc.)  to  experiments  allows  examination  of  a  broader  set  of  questions.  They  provide  several  examples  of 
research  questions  that  can  be  studied  in  this  sort  of  experiment.  I  found  the  list,  which  includes  examining  the 
effects  of  institutional  changes  on  the  reliability  of  financial  reports,  on  how  information  intermediaries  do  their 
work,  on  commercial  arrangements,  on  real  investment  and  production  decisions,  along  with  the  authors’  ex¬ 
amples,  to  be  interesting  and  thought  provoking.  It  prompted  me  to  consider  how  I  could  use  virtual  worlds  to 
address  issues  of  particular  interest  to  me. 

Specifically,  while  the  authors  do  not  explicitly  make  this  point,  their  list  makes  clear  that  the  set  of 
researchable  questions  is  expanded  when  complex  institutions  are  incorporated  into  experiments,  because  inter¬ 
actions  among  individuals  and  between  individuals  and  institutions  are  allowed.  Thus,  the  set  of  possible  depen¬ 
dent  measures  is  expanded  to  include  joint  outcomes,  and  multiple  levels  of  “play”  allow  examination  of  aggregate 
(e.g.,  economy-level)  outcomes. 

For  example,  consider  controversial  policy  issues  in  accounting,  such  as  the  likely  impact  of  adopting  less 
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precise  accounting  standards  on  auditor  liability  (e.g.,  Schipper  2003;  Benston  et  al.  2006).  While  careful  thinking 
can  identify  several  of  the  critical  causal  factors  (e.g.,  Hail  et  al.  2009)  and  laboratory  experimentation  can  identify 
additional  factors  and  test  theoretical  relationships  (Kadous  and  Mercer  2010),  it  is  probably  impossible  to 
anticipate  all  the  effects  of  introducing  a  change  in  institution  into  a  complex  environment,  such  as  the  U.S. 
economy,  and  it  is  unlikely  that  researchers  working  in  this  fashion  can  fully  anticipate  how  various  factors  will 
play  against  each  other.  This  leads  to  what  I  see  as  the  biggest  advantage  of  adding  complexity  to  experiments  in 
a  virtual  world  environment — the  opportunity  for  relatively  low-cost,  relatively  efficient  theory  development.  That 
is,  in  addition  to  allowing  testing  of  hypotheses  about  complex  institutions,  virtual  world  experimentation  can 
serve  accounting  research  in  the  scientific  role  that  is  traditionally  reserved  for  case  studies  and  field 
experiments — that  of  identifying  new  variables  and  relationships  for  development  of  theory,  which  can  then  be 
further  tested  in  controlled  experimental  settings  or  in  archival  data  if  the  policy  change  is  implemented.  And  they 
can  do  so  without  the  cost  and  time  lags  that  would  occur  if  the  institutional  changes  were  actually  implemented 
in  the  real  (as  opposed  to  virtual)  world. 

While  the  early  chapters  inspire  the  reader  to  rush  out  to  virtual  worlds  to  conduct  experiments,  the  later 
chapters  counteract  this  effect,  to  some  extent.  Specifically,  the  authors’  discussion  of  the  various  ways  of  con¬ 
ducting  studies  in  virtual  worlds  (Chapters  5  and  8)  leads  one  to  quickly  conclude  that  the  broadest  research 
questions  are  out  of  reach  unless  one  is  able  to  corroborate  with  developers  of  virtual  worlds.  Also,  there  is  a 
disconnect  in  the  monograph  in  that  the  discussion  of  how  some  key  institutional  details  are  implemented  in 
virtual  worlds  (Chapter  6)  focuses  on  how  reporting  and  enforcement  are  automatic  and  can  be  perfectly  enforced 
in  virtual  worlds,  which  seems  to  rule  out  investigating  more  interesting  questions  that  rely  on  discretion  in 
reporting  and  enforcement.  On  the  other  hand,  the  discussion  in  Chapter  7  about  difficulties  of  conducting 
experiments  in  virtual  worlds  and  potentially  mitigating  factors  is  quite  practical  and  is  of  immediate  use  to  new 
researchers  or  those  just  beginning  to  conduct  experiments  online,  whether  in  virtual  worlds  or  not. 

Another  positive  aspect  of  the  monograph  is  that  it  discusses  of  the  types  of  studies  that  are  well-suited  and 
ill-suited  for  virtual  worlds.  The  reader  gets  a  concentrated  dose  of  this  material  in  Chapter  8,  but  it  is  also 
sprinkled  throughout  the  rest  of  monograph.  Thus,  the  monograph  is  not  an  indiscriminate  booster  for  the  virtual 
worlds  technology,  but  instead  it  provides  a  balanced  discussion  of  the  types  of  research  questions  that  would  and 
would  not  benefit  from  the  virtual  worlds  technology. 

In  sum,  Bloomfield  and  Rennekamp’s  monograph  provides  an  introduction  to  the  use  of  virtual  worlds  in 
accounting  research.  In  doing  so,  it  provides  new  researchers  with  good  coverage  of  some  basic  research  design 
issues,  “old”  researchers  with  some  fresh  new  inspiration,  and  perhaps  all  of  us  with  a  glimpse  into  the  future  of 
accounting  research. 
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ROBERT  S.  KAPLAN  and  DAVID  P.  NORTON  ,  The  Execution  Premium:  Linking  Strategy 
to  Operations  for  Competitive  Advantage  (Boston,  MA:  Harvard  Business  Press,  2008, 
ISBN  13:  978-I-422I-2II6-0,  pp.  xiii,  320). 


If  you  are  an  admirer  of  Kaplan  and  Norton’s  earlier  work  on  the  balanced  scorecard,  then  you  will  find  much 
to  be  pleased  with  in  their  most  recent  book,  The  Execution  Premium.  If  you  found  the  balanced  scorecard 
framework  too  narrow,  you  will  also  be  pleased  with  their  new  book,  because  it  contextualizes  the  balanced 
scorecard  (BSC)  within  the  larger  framework  of  strategy  development  and  execution.  If  you  are  a  practicing 
manager  looking  for  sound  advice  about  strategy  execution,  you  will  find  much  that  is  helpful.  If  you  are  a  pure 
academician  looking  for  a  book  driven  by  empirical  data  that  has  been  subjected  to  rigorous  statistical  analysis, 
then  you  will  be  disappointed  by  the  nonscientifically  selected  sample  of  cases  which  form  the  body  of  the  data  for 
this  book.  However,  I  would  still  advise  the  purists  to  read  the  book,  because  they  can  use  the  practical  examples 
and  case  studies  to  enrich  their  classes  and  discover  rich  research  opportunities  to  extend  our  understanding  of 
strategy  and  strategy  execution  in  organizations. 

For  admirers  of  Kaplan  and  Norton’s  prior  works,  this  book  continues  the  tradition  of  providing  easy  to 
understand  and  execute  tools  and  techniques  for  strategy  development  and  execution.  There  is  a  simple  framework 
that  ties  together  work  on  strategy  development,  strategy  mapping,  the  balanced  scorecard,  and  activity-based 
costing. 

The  book  begins  by  providing  an  overview  of  strategy  development.  This  is  a  review  of  the  traditional 
frameworks  for  strategy  formulation,  such  as  SWOT  and  PESTEL  analysis.  Chapter  3  provides  a  useful  discussion 
about  planning  the  strategy  execution.  It  is  a  useful  reminder  that  “planning  the  plan”  is  quite  separate  from  the 
planning  effort  itself.  Chapter  4  continues  the  theme  of  planning  the  strategy  by  discussing  the  strategic  initiatives 
that  need  to  be  launched  to  execute  the  strategy.  This  is  a  crucial  chapter  which  links  strategy,  operations,  and 
budgets.  It  forces  a  manager  to  ask  what  changes  and  operations  are  necessary  to  execute  the  strategy  and  what 
resources  he  or  she  will  need  to  drive  the  operational  changes.  Chapter  5  discusses  the  all-important  structural  and 
behavioral  issues  involved  in  aligning  organizational  units  with  the  strategy.  This  is  probably  one  of  the  weakest 
chapters  in  the  book,  because  it  does  not  deal  with  the  messy  problems  of  change  management,  leadership,  and 
motivation.  In  my  opinion,  the  authors  rely  too  heavily  on  structural  solutions  such  as  cascading  the  BSC  or 
creating  a  communications  plan.  In  reality,  alignment  requires  a  shared  frame  of  reference  that  cannot  be  achieved 
through  structural  mechanisms  alone.  Chapters  6  and  7  cover  operational  improvements,  using  tools  such  as 
traditional  budgeting,  activity-based  costing,  and  capacity  analysis.  Most  of  the  material  in  these  chapters  would 
not  be  new  for  accountants.  However,  they  are  extremely  useful  for  managers  who  may  not  appreciate  the  strategic 
use  of  activity-based  costing.  The  next  two  chapters  deal  with  operational  and  strategic  review  meetings  to  adapt 
and  change  the  strategy  in  the  light  of  changed  circumstances.  The  last  chapter  concludes  by  arguing  effectively 
for  the  creation  of  an  office  of  strategy  management  so  that  there  is  a  clear  owner  of  strategy  and  strategy 
execution. 

For  those  of  the  belief  that  the  balanced  scorecard  framework  was  too  narrow,  Kaplan  and  Norton  have 
admirably  redressed  this  imbalance  by  framing  the  BSC  as  part  of  a  larger  framework  of  strategy  development  and 
execution.  I  happen  to  be  one  of  those  who  always  felt  that  measurement  without  a  sense  of  strategy  would 
probably  do  more  harm  than  good  in  an  organization.  My  belief  is  based  on  a  half  century  of  research  in  the 
behavioral  sciences  that  shows  that  when  people  do  not  understand  or  subscribe  to  the  purpose  of  the  measure¬ 
ment,  they  are  likely  to  undertake  dysfunctional  behaviors.  I  am  therefore  delighted  to  see  that  the  current  book 
recognizes  the  importance  of  communicating  with,  and  aligning,  people  and  strategy  before  putting  in  a  scorecard. 

For  practicing  managers,  the  book  provides  a  lot  of  advice  that  will  be  helpful  as  they  embark  on  their  own 
exercise  of  strategy  execution.  In  particular,  the  concept  of  an  office  of  strategy  execution  is  an  important  and 
underappreciated  concept.  This  is  probably  because  of  the  bad  experience  most  corporations  had  with  the  strategic 
planning  office,  which  often  became  the  place  for  unsuccessful  line  managers. 

If  you  are  an  academician  who  believes  in  empirical  data  and  rigorous  statistical  analysis,  you  will  find  very 
little  of  it  in  this  book.  Most  of  the  data  in  this  book  comes  from  Harvard  Business  School  teaching  cases  or  from 
the  consulting  practice  of  Kaplan  and  Norton.  From  an  empirical  perspective,  the  flaws  in  the  data  are  obvious. 
The  sample  is  nonscientiflc;  it  comes  mostly  from  opportunistic  interventions.  It  is  a  bit  paradoxical  that  a  book 
which  is  selling  a  rational-scientific  methodology  for  strategy  development  and  execution  uses  cases  as  opposed  to 
a  matched  or  paired  sample  methodology  to  show  that  the  group  with  tight  linkage  between  strategy  execution  and 
operational  improvement  has  better  results  than  one  that  does  not.  Even  the  data  for  firms  that  have  performed  well 
with  a  balanced  scorecard  and  other  mechanisms  for  sound  strategy  execution  must  be  taken  with  a  grain  of  salt. 
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This  is  because  the  study  covers  a  period  from  2000  to  2006,  one  in  which  the  stock  market  rose  because  of 
reasons  that  have  come  back  to  bite  us  in  the  year  2008.  For  example,  consider  the  case  of  KeyCorp  one  of  the 
success  stories  cited  in  the  book.  All  of  its  financial  gains  have  been  nearly  wiped  out,  and  I  presume  that  the 
company  still  uses  the  balanced  scorecard. 

Paradoxically,  for  future  doctoral  students  and  researchers  looking  for  topics  in  managerial  accounting,  this 
book’s  weaknesses  are  a  treasure  trove  of  research  opportunities.  While  Kaplan  and  Norton  have  started  to  enlarge 
the  context  in  which  strategy  is  developed  and  executed,  a  large  part  of  the  domain  of  strategy  execution  remains 
unmapped.  The  world  of  Kaplan  and  Norton  is  a  simple,  rational,  ordered  place.  They  see  the  problem  primarily 
in  terms  of  mapping  relationships  between  organizational  structures  and  processes.  Messy  things  such  as  organi¬ 
zational  power  and  politics,  behavioral  and  motivational  issues,  and  organizational  culture  are  neatly  sidestepped. 
They  are  simply  lumped  into  the  “learning  and  growth”  category  which  turns  into  the  black  box  that  holds  all  the 
misbehaving  children  that  bedevil  strategy  executors  in  the  real  world. 

Most  practicing  managers,  however,  will  tell  you  that  strategy  execution  is  a  very  messy  process,  because  it 
has  to  negotiate  the  tricky  waters  of  organizational  power  and  politics,  organizational  culture,  and  employee 
motivation.  These  are  not  only  significant  challenges,  but  they  require  very  different  tools.  While  the  balanced 
scorecard  and  activity-based  costing  can  help  to  understand  weakness  in  performance,  aligning  employees  requires 
engagement  and  dialog.  Leadership  in  these  situations  is  not  just  about  communications  that  inform  or  tell 
employees,  but  requires  engaging  employees.  Leadership  becomes  a  matter  of  teaching  and  learning  and  not 
simply  communicating. 

A  great  opportunity  for  future  research  comes  from  critically  examining  the  process  of  strategy  formulation 
and  execution  from  a  different  perspective.  Kaplan  and  Norton’s  work  is  rooted  in  the  rational-predictive  model  of 
managerial  behavior.  Their  work  assumes  that  there  is  a  knowable  but  uncertain  environment,  and  a  manager’s 
task  is  to  identify  opportunities  through  SWOT  analysis  and  then  exploit  them  through  strategy  execution.  This 
may  work  for  organizations  that  operate  under  conditions  of  uncertainty  or  which  can  control  the  uncertainty 
through  predictive  methods.  But  what  happens  when  organizations  face  not  just  uncertainty  but  an  unknowable 
environment?  How  does  strategy  formulation  and  execution  proceed  in  such  situations? 

One  answer  to  this  question  comes  from  the  work  of  Professor  Saras  D.  Sarasvathy  of  the  University  of 
Virginia,  who  has  studied  the  behavior  of  entrepreneurs.  Future  scholars  interested  in  the  study  of  strategy  should 
read  Sarasvathy’s  book.  Effectuation  (2008).  She  argues  that  entrepreneurs  do  not  use  managerial  logic  because 
they  operate  in  unknowable  environments.  Rather  than  trying  to  predict  the  future,  entrepreneurs  work  to  control 
the  future  by  co-creating  it  with  self-selected  stakeholders.  They  recognize  the  futility  of  predicting  the  future 
when  the  future  is  unknowable.  For  entrepreneurs,  clarity  comes  through  action  and  not  analysis.  They  do  not 
discover  opportunities;  they  create  opportunities.  Saras  calls  this  “effectual  logic.” 

The  notion  that  entrepreneurs  get  clarity  through  action  turns  the  logic  of  strategy  on  its  head.  In  Saras’ 
world,  execution  would  come  before  strategy.  This  is  because  an  explicit  recognition  that  the  world  is  unknowable 
and  that  opportunities  are  made,  not  found,  means  that  strategy  does  not  follow  vision;  rather,  action  follows  a 
vision  that  serves  as  a  “soft  goal”  that  will  be  shaped  and  molded  into  a  strategy.  Saras  uses  the  analogy  of  cooking 
to  make  her  point.  One  way  to  cook  is  to  start  with  a  clear  goal  of  what  one  wants  to  eat.  With  a  clear  goal, 
execution  means  finding  the  right  recipe  for  what  we  want  to  eat  and  then  following  it  by  making  sure  all  the 
ingredients  are  at  hand.  The  other  way  to  cook  is  to  start  with  a  soft  goal,  “taking  care  of  being  hungry.”  There  is, 
however,  no  particular  dish  or  recipe  in  mind.  We  simply  throw  open  the  refrigerator  and  the  spice  cabinet,  see 
what  is  at  hand,  and  use  these  ingredients  to  throw  together  a  dish.  Note  that  both  methods  ultimately  lead  to  the 
larger  goal  of  nourishing  one’s  body.  The  former  begins  with  a  goal  and  arrays  the  means  to  achieve  the  goal;  the 
latter  begins  with  the  means  at  hand  to  achieve,  and  in  the  process  shape  and  mold  a  larger  vision. 

The  question  of  whether  to  use  means-ends  or  ends-means  logic  to  study  strategy  is  a  significant  question  for 
scholars  of  strategy  and  strategy  execution.  It  raises  the  question  of  whether  an  organization  that  strictly  follows 
the  traditional  causal  logic  for  formulating  and  executing  strategy  is  missing  out  on  the  possibility  of  innovation 
and  creativity  that  is  embedded  in  the  logic  of  co-creation.  With  the  massive  failure  of  many  of  our  major  financial 
institutions,  the  validity  of  our  traditional  business  models  has  acquired  an  added  sense  of  urgency.  It  would  be 
great  for  scholars  of  strategy  to  consider  whether  organizations  that  have  successful  strategy  execution  mecha¬ 
nisms  of  the  type  which  Kaplan  and  Norton  present  in  their  book  are  also  characterized  by  higher  levels  of 
creativity  and  innovation.  This  is  an  empirical  question,  and  I  do  not  have  an  answer,  but  1  am  convinced  that  it 
is  a  question  worth  studying.  To  the  extent  that  this  book  brings  the  issue  of  organizational  performance,  creativity, 
opportunity  creation,  and  its  relationship  to  internal  organizational  structures  and  processes  to  the  fore,  it  has 
accomplished  an  important  purpose.  For  this  we  can  thank  Kaplan  and  Norton  even  if  we  do  not  find  their  entire 
message  persuasive. 
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STEPHEN  ZEEF  (Editor),  Principles  before  Standards:  The  ICAEW’s  ‘N  Series’  of  Recom¬ 
mendations  on  Accounting  Principles  1942-1969  (London,  U.K.:  ICAEW  Financial  Report¬ 
ing  Faculty,  2009,  pp.  189).*  No  ISBN.  Available  in  hard  copy.  For  an  electronic  copy,  go  to 
www.icaew.com  and  search  for  Principles  before  standards. 

The  Institute  of  Chartered  Accountants  in  England  and  Wales  (ICAEW)  was  second  only  to  the  American 
Institute  of  Accountants  in  introducing  a  series  of  statements  providing  authoritative  guidance  on  best  accounting 
practice.  Studying  to  become  a  member  of  the  ICAEW  in  the  late  1970s,  I  had  to  familiarize  myself  not  only  with 
the  relatively  new  Statements  of  Standard  Accounting  Practice  (SSAPs)  being  issued  by  the  Accounting  Standards 
Committee,  but  also  with  the  “N  Series,”  so-called  after  the  section  of  the  ICAEW’s  Members’  Handbook  in  which 
the  Recommendations  on  Accounting  Principles  were  reproduced.  The  Recommendations  were  precisely  as  their 
name  implied,  guidance  that  the  ICAEW,  as  it  stated  in  preambles  to  various  Recommendations,  “hoped  will  be 
helpful  to  members  when  advising  as  to  best  practice.”  They  were  not  mandatory,  and  hence  preparers  and  auditors 
of  financial  statements  could  not  be  penalized  if  any  Recommendations  were  not  followed.  Despite  this,  they 
provided  an  authoritative  basis  for  financial  reporting  in  the  U.K.,  were  used  by  the  writers  of  textbooks  as  the 
basis  for  teaching  accounting,  and  influenced  both  the  SSAPs,  which  began  to  appear  in  1971,  and  the  require¬ 
ments  of  U.K.  company  law  relating  to  corporate  financial  statements. 

The  Recommendations  form  a  central  element  of  the  history  of  standard  setting  for  financial  reporting,  but 
because  they  are  now  superseded,  it  has  been  difficult  for  modern  researchers  and  regulators  to  compare  their  early 
messages  with  more  recent  pronouncements.  This  collection  has  been  published  in  the  belief  that  the  Recommen¬ 
dations  are  of  interest  in  their  own  right  as  evidence  of  how  professional  accountants  thought  about  financial 
reporting  issues  at  a  time  when  formal  regulation  was  not  a  significant  constraint  on  accounting  policy  choice. 
Readers  are  able  to  study  not  only  the  substantive  recommendations  on  specific  accounting  practices,  but  also  the 
quality  of  reasoning  applied  in  developing  the  Recommendations.  Over  a  period  of  some  three  decades,  from  1942 
to  1969,  29  Recommendations  were  published.  The  early  Recommendations  dealt  with  accounting  for  taxation,  a 
matter  made  particularly  complicated  by  almost  confiscatory  levels  of  business  tax  in  the  U.K.  during  World  War 
II.  To  make  matters  worse,  businesses  paid  income  tax  not  on  the  profits  of  their  current  financial  period,  but  rather 
on  the  profits  of  the  period  that  finished  in  the  preceding  tax  year  (which  for  historical  reasons  ended  on  April  5). 
However,  against  the  background  of  impending  reform  of  U.K.  company  law  in  the  mid- 1940s,  the  Recommen¬ 
dations  began  to  address  more  general  issues,  such  as  the  presentation  of  the  financial  statements,  reserves  and 
provisions,  accounting  for  business  combinations,  depreciation,  and  accounting  for  stock-in-trade  (inventories). 
Many  of  the  early  Recommendations  were  incorporated  into  the  Companies  Act  1948. 

The  Recommendations  went  beyond  annual  financial  statements  to  deal  with  prospectuses,  estates  and  trasts, 
and  retirement  benefits.  Several  Recommendations  were  superseded  by  new  pronouncements.  For  example,  there 
were  three  substantially  revised  Recommendations  on  accounting  for  tax,  stimulated  by  major  reforms  to  the  U.K. 
tax  system.  The  Recommendations  also  touched  on  fundamental  issues,  with  N12  (1949)  and  N15  (1952)  both 
dealing  with  price  change  accounting.  They  recommended  the  continued  use  of  historical  cost,  with  any  revalu¬ 
ations  to  take  account  of  increased  replacement  costs  being  accounted  for  as  reserves  (equity)  rather  than  as 
provisions  (liabilities).  Recommendations  would  respond  to  external  events,  with  the  ICAEW  publishing  a  Rec¬ 
ommendation  on  foreign  currency  translation  (N25)  in  February  1968,  just  three  months  after  the  devaluation  of 
Sterling  in  November  1967. 

Stephen  Zeff,  who  edited  the  collection,  has  supplemented  the  Recommendations  themselves  by  providing  a 
general  introduction  and  a  lengthy  extract  from  his  book,  Forging  Accounting  Principles  in  Five  Countries  (1972). 
This  sets  out  the  complex  and  secretive  process  by  which  the  ICAEW  developed  and  approved  the  Recommen¬ 
dations.  An  important  element  of  this  was  the  involvement  of  ICAEW  members  who  were  not  in  public  practice. 


*The  arrangements  for  this  review  were  made  by  Christopher  W.  Nobes. 
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The  proportion  of  members  who  were  neither  principals  of  nor  employed  by  accountancy  firms  had  been  growing 
in  the  1930s,  and  by  the  early  1940s  over  20  percent  of  ICAEW  members  worked  in  business  rather  than  practice. 
The  Taxation  and  Financial  Relations  Committee,  responsible  for  developing  the  Recommendations,  was  the  first 
ICAEW  committee  to  include  members  outside  practice,  and  provided  an  impetus  to  greater  involvement  of  such 
members  in  the  management  of  the  ICAEW,  including  eventually  membership  of  Council.  Zeff  has  also  included 
an  article,  originally  published  in  the  weekly  magazine  The  Accountant  in  1953,  in  which  Professor  W.  T.  Baxter 
of  the  London  School  of  Economics  questioned  whether  professional  accountancy  bodies  such  as  the  ICAEW 
should  make  official  pronouncements  on  “accounting  theory”  (Baxter  objected  to  the  support  for  historical  cost 
given  in  N12  and  N15,  even  though  the  documents  themselves  provided  arguments  for  alternative  accounting 
systems).  Baxter  was  not  a  member  of  ICAEW,  but  rather  a  Scottish  chartered  accountant,  and  his  own  institute 
rejected  the  publication  of  similar  recommendations.  Baxter’s  opinion  was  that  technical  guidance  could  be  useful, 
but  that  it  should  not  replace  professional  judgment.  He  feared  that  the  existence  of  formal  statements  of  “ap¬ 
proved”  accounting  theory  would  deter  accounting  students  from  exploring  alternatives,  and  would  ultimately  lead 
to  stagnation,  as  new  ideas  would  have  difficulty  in  challenging  the  “conventional  wisdom.” 

Zeff  also  reproduces  a  hitherto  unpublished  interview  with  Michael  Renshall,  who  had  worked  for  the 
ICAEW  as  an  administrator  involved  in  the  development  of  Recommendations  since  1960,  became  the  ICAEW’s 
first  Technical  Director  in  1969,  and  was  active  in  the  transition  to  SSAPs.  Renshall  provides  insights  into  why  the 
ICAEW  considered  it  necessary  to  move  beyond  nonmandatory  Recommendations,  and  notes  the  impact  of 
financial  scandals  in  the  late  1960s.  In  his  insider’s  account,  Renshall  celebrates  the  influence  of  a  small  number 
of  key  individuals  (Ronald  Leach  of  Peat,  Marwick,  Henry  Benson  of  Coopers,  and  Professor  Edward  Stamp  of 
Edinburgh  University),  suggesting  that  the  creation  of  the  Accounting  Standards  Steering  Committee  in  1969  owed 
a  great  deal  to  the  personal  charm  and  persuasiveness  of  Leach. 

The  re-publication  of  the  Recommendations  not  only  makes  these  important  documents  more  accessible  to 
students  and  researchers  of  accounting  regulation  but  also  provides  examples  of  different  styles  of  presenting 
guidance  on  financial  reporting.  The  Recommendations  range  from  a  few  paragraphs,  presenting  conclusions  with 
little  discussion,  to  lengthy  explanations  revealing  the  thought  processes  of  their  authors.  Some  of  the  Recom¬ 
mendations  (for  example,  the  two  documents  on  accounting  in  conditions  of  changing  prices)  could  be  utilized  in 
developing  students’  powers  of  reflection  and  critique,  as  they  are  at  the  same  time  concise  and  contentious.  The 
supplementary  items  provide  different  angles  on  the  standard-setting  process.  Does  the  emphasis  on  “due  process,” 
with  standards  being  developed  in  full  sight  of  the  public,  necessarily  lead  to  higher  quality  standards  than  the 
secret  processes  of  the  1950s  and  1960s?  Do  standard  setters  have  any  legitimate  authority  to  make  pronounce¬ 
ments  on  accounting  theory,  or  is  their  role  at  best  one  of  applying  principles  developed  by  others  to  give  guidance 
on  technical  accounting  problems?  Although  the  publication  of  Principles  before  Standards  gives  the  ICAEW  the 
opportunity  of  celebrating  its  contribution  to  the  development  of  accounting  standards,  it  also  provides  a  valuable 
resource  that  will  ensure  that  the  “N  Series”  is  appreciated  by  a  wider  readership. 
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In  the  March  2010  issue  of  The  Accounting  Review  article  titled  “Reporting  Bias  with  an 
Audit  Committee”  by  Judson  Caskey,  Venky  Nagar,  and  Paolo  Petacchi,  the  authors  published  an 
incomplete  proof  of  Proposition  1  in  the  Appendix  on  page  469.'  The  proof  of  Proposition  1  of 
Caskey  et  al.  (2010)  was  incomplete  in  its  demonstration  of  the  uniqueness  of  the  linear  equilib¬ 
rium.  The  authors  regret  the  oversight. 

We  first  restate  the  equations  that  identify  the  equilibrium,  which  are  listed  as  Equations  (A8) 
and  (A9)  in  the  Appendix: 

gd  =  pj  +  clTy\ariE[v\sa,rJ)pf-c]Ty\ar{E[u\s^,rJ)  =  0  (1) 

gr„  =  rl+  T^yaiiE[v\sJ\sa)r^  -  ^T^cov{v,E[v\sJ\sa)  =  0.  (2) 


The  Appendix  shows  that  a  unique  pf  &  (0 , 1)  solves  (1)  given  r^.  When  G>  0  the  Pf  that  solves 
(1)  is  decreasing  in  r/. 


7 - I  11  "n - -PfHx 

dpf  _dgjdr^  _  (var(E[u|^J [5^)  +  ) _ 

dr^  dgjdpf  3pj  +  c^TyVar(E[v  [5^,  r J) 


Our  assumption  that  corr(e^  ,  e^)  <  implies  that  cov(f; ,  E[i;  |  s^]  \  >  0.  As  a  result,  the 

unique  Pf  that  solves  (2)  given  is  increasing  in  r^,  crosses  zero  when  =  0,  and  approaches 
infinity  as  approaches  infinity.  Thus,  the  pf  that  solves  (2)  crosses  the  pf  that  solves  (1)  at  a 
unique  point,  giving  the  solution  to  Equations  (1)  and  (2). 


'  Caskey,  J.,  V.  Nagar,  and  P.  Petracchi.  2010.  Reporting  bias  with  an  audit  committee.  The  Accounting  Review  85  (2): 
447-481. 
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EDITORIAL  POLICY  AND  STYLE  INFORMATION 

EDITORIAL  POLICY 

According  to  the  policies  set  by  the  Publications  Committee  (which  were  endorsed  by  the  Executive 
Committee  and  were  published  in  the  Accounting  Education  News,  June  1987),  The  Accounting  Review 
“should  be  viewed  as  the  premier  journal  for  publishing  articles  reporting  the  results  of  accounting  research 
and  explaining  and  illustrating  related  research  methodology.  The  scope  of  acceptable  articles  should  embrace 
any  research  methodology  and  any  accounting-related  subject,  as  long  as  the  articles  meet  the  standards 
established  for  publication  in  the  journal  ...  No  special  sections  should  be  necessary.  The  primary,  but  not 
exclusive,  audience  should  be — as  it  is  now — academicians,  graduate  students,  and  others  interested  in 
accounting  research.” 

The  primary  criterion  for  publication  in  The  Accounting  Review  is  the  significance  of  the  contribution  an 
article  makes  to  the  literature.  Topical  areas  of  interest  to  the  journal  include  accounting  information  systems, 
auditing  and  assurance  services,  financial  accounting,  management  accounting,  taxation,  and  all  other  areas  of 
accounting,  broadly  defined.  The  journal  is  also  open  to  all  rigorous  research  methods. 

The  efficiency  and  effectiveness  of  the  editorial  review  process  is  critically  dependent  upon  the  actions 
of  both  authors  submitting  papers  and  the  reviewers.  Authors  accept  the  responsibility  of  preparing  research 
papers  at  a  level  suitable  for  evaluation  by  independent  reviewers.  Such  preparation,  therefore,  should  include 
subjecting  the  manuscript  to  critique  by  colleagues  and  others  and  revising  it  accordingly  prior  to  submission. 
The  review  process  is  not  to  be  used  as  a  means  of  obtaining  feedback  at  early  stages  of  developing  the 
research. 

Reviewers  and  editors  are  responsible  for  providing  constructive  and  prompt  evaluations  of  submitted 
research  papers  based  on  the  significance  of  their  contribution  and  on  the  rigor  of  analysis  and  presentation. 

MANUSCRIPT  PREPARATION  AND  STYLE 

The  Accounting  Review’s  manuscript  preparation  guidelines  follow  The  Chicago  Manual  of  Style  (15th  ed.; 
University  of  Chicago  Press).  Another  helpful  guide  to  usage  and  style  is  The  Elements  of  Style,  by  William 
Strunk,  Jr.,  and  E.  B.  White  (Macmillan).  Spelling  follows  Webster’s  Collegiate  Dictionary. 

FORMAT 

1.  All  manuscripts  should  be  formatted  in  12-point  font  on  8  1/2  x  11"  paper  and  should  be  double-spaced, 
except  for  indented  quotations. 

2.  Manuscripts  should  be  as  concise  as  the  subject  and  research  method  permit,  generally  not  to  exceed  7,000 
words. 

3.  Margins  should  be  at  least  one  inch  from  top,  bottom,  and  sides. 

4.  To  promote  anonymous  review,  authors  should  not  identify  themselves  directly  or  indirectly  in  their 
papers  or  in  experimental  test  instruments  included  with  the  submission.  Single  authors  should  not  use  the 
editorial  “we”. 

5.  A  cover  page  should  show  the  title  of  the  paper,  all  authors’  names,  titles  and  affiliations,  email  addresses, 
and  any  acknowledgments. 

6.  The  American  Accounting  Association  encourages  use  of  gender-neutral  language  in  its  publications. 

Pagination:  All  pages,  including  tables,  appendices  and  references,  should  be  serially  numbered.  Major 
sections  should  be  numbered  in  Roman  numerals.  Subsections  should  not  be  numbered. 

Numbers:  Spell  out  numbers  from  one  to  ten,  except  when  used  in  tables  and  lists,  and  when  used  with 
mathematical,  statistical,  scientific,  or  technical  units  and  quantities,  such  as  distances,  weights  and  measures. 
For  example:  three  days;  3  kilometers;  30  years.  All  other  numbers  are  expressed  numerically. 

Percentages  and  Decimal  Fractions:  In  nontechnical  copy  use  the  word  percent  in  the  text. 

Hyphens:  Use  a  hyphen  to  join  unit  modifiers  or  to  clarify  usage.  For  example:  a  cross-sectional  equation; 
re-form.  See  Webster’s  for  correct  usage. 

Keywords:  The  abstract  must  be  followed  by  at  least  three  keywords  to  assist  in  indexing  the  paper  and 
identifying  qualified  reviewers. 
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ABSTRACT/INTRODUCTION 

An  Abstract  of  about  100  words  (150  maximum)  should  be  presented  on  a  separate  page  immediately 
preceeding  the  text.  The  Abstract  should  concisely  inform  the  reader  of  the  manuscript’s  topic,  its  methods, 
and  its  findings.  Keywords  and  the  Data  Availability  statements  should  follow  the  Abstract.  The  text  of  the 
paper  should  start  with  a  section  labeled  “L  Introduction,”  which  provides  more  details  about  the  paper’s 
purpose,  motivation,  methodology,  and  findings.  Both  the  Abstract  and  the  Introduction  should  be  relatively 
nontechnical,  yet  clear  enough  for  an  informed  reader  to  understand  the  manuscript’s  contribution.  The 
manuscript’s  title,  but  neither  the  author’s  name  nor  other  identification  designations,  should  appear  on  the 
Abstract  page. 

TABLES  AND  FIGURES 

The  author  should  note  the  following  general  requirements: 

1.  Each  table  and  figure  (graphic)  should  appear  on  a  separate  page  and  should  be  placed  at  the  end  of  the 
text.  Each  should  bear  an  Arabic  number  and  a  complete  title  indicating  the  exact  contents  of  the  table  or 
figure.  Tables  and  figures  should  define  each  variable.  The  titles  and  definitions  should  be  sufficiently 
detailed  to  enable  the  reader  to  interpret  the  tables  and  figures  without  reference  to  the  text. 

2.  A  reference  to  each  graphic  should  be  made  in  the  text. 

3.  The  author  should  indicate  where  each  graphic  should  be  inserted  in  the  text. 

4.  Graphics  should  be  reasonably  interpreted  without  reference  to  the  text. 

5.  Source  lines  and  notes  should  be  included  as  necessary. 

Equations:  Equations  should  be  numbered  in  parentheses  flush  with  the  right-hand  margin. 

DOCUMENTATION 

Citations:  Within-text  citations  are  made  using  an  author-year  format.  Cited  works  must  correspond  to  the 
list  of  works  listed  in  the  “References”  section.  Authors  should  make  an  effort  to  include  the  relevant  page 
numbers  in  the  within-text  citations. 

1.  In  the  text,  works  are  cited  as  follows:  author’s  last  name  and  year,  without  comma,  in  parentheses.  For 
example:  one  author,  (Berry  2003);  two  authors,  (Fehr  and  Schmidt  2003);  three  or  more  authors, 
(Dechow  et  al.  1998);  more  than  one  work  cited,  (Cole  and  Yakushiji  1984;  Dechow  et  al.  1995;  Levitt 
1998);  with  two  works  by  the  same  author(s),  (Nelson  2003,  2005). 

2.  Unless  confusion  would  result,  do  not  use  “p.”  or  “pp.”  before  page  numbers.  For  example,  (Dechow  and 
Dichev  2002,  41-42). 

3.  When  the  reference  list  contains  two  or  more  works  by  the  same  author  (as  the  only  author  or  first  of  two 
or  more  authors)  published  in  the  same  year,  the  suffix  a,  b,  etc.,  is  appended  to  the  date  in  the  within-text 
eitations  and  in  the  “References”  section.  For  example,  (Johansson  2004a,  2004b,  2004c;  Baiman  and 
Rajan  2002a,  2002b;  Dhaliwal  et  al.  2005a,  2005b). 

4.  When  the  author’s  name  is  mentioned  in  the  text,  it  need  not  be  repeated  in  the  citation.  For  example: 
“Cohen  et  al.  (2005)  provide  ...” 

5.  Citations  to  institutional  works  should  use  acronyms  or  short  titles  where  practicable.  For  example: 
(NCFFR,  The  Treadway  Commission  1987). 

6.  If  the  paper  refers  to  statutes,  legal  treatises,  or  court  cases,  citations  acceptable  in  law  reviews  should  be 
used. 

Reference  List:  Every  manuseript  must  include  a  “References”  section  that  contains  only  those  works  cited 
within  the  text.  Each  entry  should  contain  all  information  necessary  for  unambiguous  identification  of  the 
published  work.  Use  the  following  formats  (which  follow  The  Chicago  Manual  of  Style): 

1.  Arrange  citations  in  alphabetical  order  according  to  the  surname  of  the  first  author  or  the  name  of  the 
institution  or  body  responsible  for  the  published  work. 

2.  Use  authors’  initials  instead  of  proper  names. 

3.  For  two  or  more  authors,  separate  authors  with  a  comma,  including  a  comma  before  “and”  (Dechow,  R 
M.,  R.  Sloan,  and  A.  Sweeney). 

4.  Date  of  publication  follows  the  author’s  (authors’)  name(s). 

5.  Titles  of  journals  or  newspapers  should  not  be  abbreviated. 
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Sample  entries  are  as  follows: 
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relationships.  Journal  of  Accounting  Research  40  (2):  247-278. 
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Footnotes:  Footnotes  are  not  used  for  documentation.  Textual  footnotes  should  be  used  only  for  extensions 
and  useful  excursions  of  information  that  if  included  in  the  body  of  the  text  might  disrupt  its  continuity. 
Footnotes  should  be  inserted  using  the  “footnote”  or  “endnote”  feature  of  the  word  processing  software  which 
will  automatically  number  the  footnotes  throughout  the  manuscript  with  superscript  Arabic  numerals. 

SUBMISSION  OF  MANUSCRIPTS 

Authors  should  note  the  following  guidelines  for  submitting  manuscripts: 

1 .  Manuscripts  currently  under  consideration  by  another  journal  or  publisher  should  not  be  submitted.  The 
author  must  state  upon  submission  that  the  work  is  not  submitted  or  published  elsewhere. 

2.  For  manuscripts  reporting  on  field  surveys  or  experiments:  If  the  additional  documentation  (e.g.,  ques¬ 
tionnaire,  case,  interview  schedule)  is  sent  as  a  separate  file,  then  all  information  that  might  identify  the 
author(s)  must  be  deleted  from  the  instrument. 

3.  Manuscripts  should  be  submitted  via  email  as  Microsoft  Word  or  PDF  file  to  the  Senior  Editor,  Steven 
Kachelmeier,  at  email  address  tar@mccombs.utexas.edu.  Please  submit  separate  files  for  (1)  the  manu¬ 
script’s  title  page  with  identifying  information  (not  forwarded  to  reviewers),  (2)  the  manuscript  with  title 
page  and  all  other  identifying  information  removed,  and  (3)  any  necessary  supplemental  files,  such  as 
experimental  instructions  and/or  response  memoranda  on  invited  revisions.  The  body  or  subject  line  of  the 
email  should  also  indicate  the  submission  fee  receipt  code  from  AAA  (see  step  4  below). 

4.  A  nonrefundable  submission  fee  is  required  unless  explicitly  waived  by  the  editor  for  invited  revisions  of 
previous  submissions.  The  submission  fee  in  U.S.  funds  is  $200.00  for  members  and  $400.00  for  non¬ 
members  of  the  AAA  payable  by  credit  card  (VISA  or  MasterCard  only).  The  payment  form  is  available 
online  at:  https://aaahq.org/AAAforms/journals/tarsubmit.cfm.  If  you  are  unable  to  pay  by  credit  card  or 
have  any  questions,  please  contact  the  AAA  Member  Services  Team  at  (941)  921-7747  or  info@aaahq.org. 

5.  Revisions  must  be  submitted  within  12  months  from  the  decision  letter  inviting  a  revision. 

COMMENTS 

The  Accounting  Review  welcomes  submissions  of  comments  on  previous  TAR  articles.  Comments  on  articles 
previously  published  in  The  Accounting  Review  will  generally  be  reviewed  by  two  reviewers,  usually  includ¬ 
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ABSTRACT:  This  study  investigates  whether  managers  use  classification  shifting  to 
manage  earnings  when  reporting  discontinued  operations.  Using  a  methodology  similar 
to  McVay  (2006),  we  find  evidence  consistent  with  the  hypothesis  that  firms  shift  oper¬ 
ating  expenses  to  income-decreasing  discontinued  operations  to  increase  core  earn¬ 
ings.  Our  findings  also  indicate  that  managers  use  classification  shifting  to  meet  or  beat 
analysts’  forecasts.  Finally,  we  find  that,  since  the  introduction  of  SFAS  No.  144,  the 
reporting  frequency  of  discontinued  operations  has  increased;  however,  the  magnitude 
of  classification  shifting  has  decreased.  We  provide  potential  explanations  for  this  find¬ 
ing. 

Keywords:  earnings  management;  classification  shifting;  earnings  components;  dis¬ 
continued  operations. 

Data  Availability:  Data  are  available  from  public  sources  identified  in  this  study. 

1.  INTRODUCTION 

Classification  shifting  involves  reporting  revenues,  expenses,  gains,  and  losses  on  different 
lines  on  the  income  statement  other  than  those  on  which  they  should  properly  appear  under 
GAAP.  In  particular,  this  study  focuses  on  whether  managers  achieve  classification  shift¬ 
ing  by  allocating  operating  expenses  to  discontinued  operations.  Managers  could  engage  in  this 
form  of  earnings  management  if  they  perceive  that  investors  value  recurring  earnings  more  than 
nonrecurring  earnings.  It  is  difficult  for  investors  to  detect  this  type  of  earnings  management 
because  discontinued  operations  are  not  usually  disclosed  in  detail  in  the  financial  statements. 
Unlike  other  forms  of  earnings  management,  such  as  accrual  management  or  real  activity  manipu¬ 
lation,  classification  shifting  does  not  change  net  income  and  may  be  a  less  costly  form  of  earnings 
management. 
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We  believe  this  is  an  interesting  question  for  the  following  reasons.  First,  previous  studies 
have  provided  evidence  of  classification  shifting  using  extraordinary  items  (Ronen  and  Sadan 
1975;  Barnea  et  al.  1976)  and  special  items  (McVay  2006;  Fan  et  al.  2010),  while  discontinued 
operiions  have  not  been  studied.  It  could  be  more  beneficial  for  managers  to  use  discontinued 
operations  for  classification  shifting  than  special  items,  because  shifting  operating  expenses  to 
discontinued  operations  also  increases  operating  income,  income  from  continuing  operations,  and 
core  earnings.^  Increasing  all  three  subtotals  could  lead  to  higher  stock  valuation  than  increasing 
core  earnings  alone.^  Second,  there  is  no  detailed  disclosure  of  discontinued  operations  in  the 
financial  statements.  This  lack  of  detail  creates  information  asymmetry  between  managers  and 
investors,  as  investors  do  not  know  the  nature  of  the  expenses  allocated  to  discontinued  operations. 
Third,  the  adoption  of  SFAS  No.  144,  Accounting  for  the  Impairment  or  Disposal  of  Long-Lived 
Assets  (FASB  2001a),  has  increased  the  reporting  frequency  of  discontinued  operations  by  reduc¬ 
ing  the  recognition  threshold  for  asset  disposals  to  be  classified  as  discontinued  operations.  Con¬ 
sequently,  the  FASB  may  have  inadvertently  allowed  managers  more  discretion  to  manage  earn¬ 
ings,  potentially  reducing  the  quality  of  reported  earnings.  Finally,  in  recent  years,  special  items 
have  been  scrutinized  by  researchers,  regulators,  and  auditors,  while  discontinued  operations  have 
been  largely  ignored.  This  increased  scrutiny  may  have  increased  the  costs  and  reduced  the 
benefits  of  classification  shifting  using  special  items.  It  could  now  be  more  advantageous  for 
managers  to  use  discontinued  operations,  especially  since  the  recognition  threshold  has  been 
reduced  since  SFAS  No.  144.  To  our  knowledge,  ours  is  the  first  study  to  investigate  the  impli¬ 
cation  of  increasing  recognition  of  discontinued  operations  for  earnings  management  after  the 
introduction  of  SFAS  No.  144.  This  study  benefits  investors,  analysts,  auditors,  and  regulators  by 
alerting  them  to  the  potential  earnings  management  using  classification  shifting  when  a  firm 
reports  discontinued  operations. 

Using  a  research  methodology  similar  to  McVay  (2006),  we  decompose  core  earnings  into  its 
expected  and  unexpected  components.  We  find  a  positive  association  between  income-decreasing 
discontinued  operations  and  unexpected  core  earnings  in  the  year  a  firm  reports  discontinued 
operations,  a  result  consistent  with  managers  shifting  operating  expenses  to  discontinued 
operations.^  We  also  find  a  negative  association  between  discontinued  operations  and  the  change 
in  unexpected  core  earnings  in  the  year  after  a  firm  reports  discontinued  operations,  a  result 
consistent  with  operating  expenses  returning  to  core  earnings.  Together,  these  results  provide 
evidence  that  managers  shift  core  expenses  to  discontinued  operations  to  inflate  core  earnings.  We 
also  investigate  the  motivations  for  this  form  of  classification  shifting,  finding  evidence  consistent 
with  managers  using  classification  shifting  to  meet  or  beat  analysts  forecasts. 

Our  contribution  to  the  literature  is  threefold.  First,  we  extend  the  classification  shifting 
literature  by  providing  evidence  that  discontinued  operations  are  used  to  manage  earnings.  Second, 
we  extend  the  benchmark-beating  literature  by  providing  evidence  that  managers  engage  in  clas¬ 
sification  shifting  using  discontinued  operations  to  meet  or  beat  analyst  forecasts.  Finally,  we 


‘  For  ease  of  exposition,  we  characterize  classification  shifting  as  the  shifting  of  core  expenses  to  discontinued  operations. 
It  is  also  possible  that  managers  shift  revenues  from  discontinued  operations  to  core  earnings;  both  of  these  shifting 
mechanisms  are  captured  by  our  examination  of  classification  shifting. 

^  While  using  extraordinary  items  for  classification  shifting  yields  the  same  benefits  as  using  discontinued  operations,  they 
are  difficult  to  recognize.  Over  time,  the  FASB  has  increasingly  tightened  the  standards  for  reporting  extraordinary 
items.  Now,  even  catastrophic  events  such  as  the  attack  on  September  11,  2001  and  Hurricane  Katrina  are  not  considered 
extraordinary  events  for  financial  reporting  purposes  (EITF  01-10,  FASB  2001c).  In  2003,  only  2  percent  of  firms 
included  in  Accounting  Trends  and  Techniques,  60th  edition,  reported  extraordinary  items. 

^  The  classification  shifting  increases  unexpected  core  earnings  and  reduces  the  value  of  the  discontinued  operations 
creating  a  negative  relationship.  We  multiply  the  value  of  discontinued  operations  by  negative  one  and  predict  a  positive 
relationship. 
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document  an  increase  in  the  reporting  frequency  of  discontinued  operations  and  a  decrease  in  the 
magnitude  of  earnings  management  since  the  introduction  of  SFAS  No.  144.  There  are  two 
potential  reasons  for  the  last  finding.  First,  SFAS  No.  144  broadened  the  definition  of  discontinued 
operations  by  replacing  the  “business  segment”  requirement  under  APB  No.  30  with  the  “compo¬ 
nent  of  an  entity”  concept.  This  allowed  for  more  asset  dispositions  to  be  accounted  for  as 
discontinued  operations,  increasing  the  reporting  frequency.  In  addition,  the  broadened  definition 
also  allowed  smaller  asset  dispositions  to  qualify  as  discontinued  operations,  potentially  reducing 
the  amount  of  operating  expenses  that  could  be  allocated  to  the  disposition.  Second,  the  Sarbanes- 
Oxley  Act  was  enacted  around  the  same  time  as  SFAS  No.  144,  and  could  have  led  to  a  decrease 
in  classification  shifting  due  to  increased  regulation  and  oversight.'^ 

Section  II  discusses  discontinued  operations  and  classification  shifting.  Section  III  reviews 
prior  literature  and  develops  testable  hypotheses.  Section  IV  describes  the  sample  selection  crite¬ 
ria,  data  collection  method,  and  variable  measurement.  Section  V  reports  the  results,  and  Section 
VI  concludes. 

11.  DISCONTINUED  OPERATIONS  AND  CLASSIFICATION  SHIFTING 

When  making  firm-valuation  decisions,  investors  place  a  higher  value  on  items  of  income 
expected  to  be  persistent  in  the  future.  Presumably  to  aid  users’  valuation  decisions,  GAAP 
generally  requires  that  material  nonrecurring  items  be  separately  disclosed  in  the  financial  state¬ 
ments.  However,  the  separation  of  net  income  into  recurring  and  nonrecurring  components  also 
gives  managers  the  opportunity  to  mislead  investors,  by  misclassifying  income  and  expense  items. 
For  example,  a  manager  wanting  to  increase  firm  valuation  can  misclassify  recurring  expenses  as 
nonrecurring,  misleading  investors  as  to  the  persistence  of  the  income  increase.  In  the  accounting 
literature,  this  type  of  earnings  management  is  called  classification  shifting. 

McVay  (2006)  discusses  two  reasons  why  classification  shifting  can  be  a  relatively  low-cost 
method  for  managing  earnings.  First,  unlike  accrual  management  or  real  activity  manipulation, 
there  is  no  “settling-up”  in  the  future  for  past  earnings  management.  If  a  manager  decides  to 
increase  earnings  using  income-increasing  accruals,  then,  at  some  point,  these  accruals  must 
reverse.  The  reversal  of  these  accruals  reduces  future  reported  earnings.  If  a  manager  decides  to 
increase  earnings  by  managing  real  activities,  such  as  reducing  research  and  development  expen¬ 
ditures,  then  this  likely  leads  to  fewer  income-producing  projects  and  reduced  earnings  in  the 
future.  In  contrast,  classification  shifting  involves  simply  reporting  recurring  expenses  in  a  non¬ 
recurring  classification  on  the  income  statement,  having  no  implications  for  future  earnings.  Sec¬ 
ond,  because  classification  shifting  does  not  change  net  income,  it  is  potentially  subject  to  less 
scrutiny  by  auditors  and  regulators  than  forms  of  earnings  management  that  change  net  income. 

Discontinued  operations  represent  the  income  and  cash  flows  of  a  portion  of  a  company  that 
has  been  (or  will  be)  discontinued  from  the  continuing  operations  of  the  company.  Under  SFAS 
No.  144,  the  gain  or  loss  for  discontinued  operations  is  comprised  of  three  amounts.  The  first 
amount  is  the  operating  income  or  loss  from  the  operations  of  the  component  being  discontinued 
for  the  entire  year  in  which  the  decision  to  discontinue  is  made.  The  second  amount  is  the  gain  or 
loss  from  disposal,  which  is  the  net  amount  realized  over  the  carrying  value  of  net  assets  of  the 
component.  The  third  amount  is  an  impairment  loss  on  the  “assets  held  for  sale”  if  the  component 
is  not  disposed  in  the  same  year  as  the  decision  to  discontinue.^  The  results  of  discontinued 


Lobo  and  Zhou  (2006)  find  that  accounting  conservatism  has  increased  in  the  post-Sarbanes-Oxley  period.  In  addition, 
Cohen  et  al.  (2008)  find  that  accrual-based  earnings  management  has  decreased  following  SOX,  while  real  activity 
management  has  increased. 

^  The  aggregate  gain  or  loss  from  discontinued  operations  was  computed  differently  under  APB  No.  30.  The  income  or 
loss  from  operations  of  the  segment  being  disposed  was  measured  as  the  income  or  loss  from  the  operation  from  the 
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operations  are  reported  on  the  income  statement  as  a  separate  item  (net  of  tax  effects)  after  income 
from  continuing  operations.  The  “below  the  line”  reporting  of  disposals  as  discontinued  operations 
is  likely  perceived  to  be  desirable  for  investors,  because  it  communicates  to  them  the  results  of  a 
firm’s  operations  on  a  “with  and  without”  basis.  However,  it  could  be  detrimental  to  investors  if 
managers  use  their  reporting  discretion  to  manage  earnings  when  reporting  discontinued  opera¬ 
tions.  For  example,  managers  could  allocate  normal  operating  expenses  to  discontinued  operations 
in  order  to  report  increased  income  from  continuing  operations.  Managers  have  discretion  over  the 
allocation  of  joint  costs  between  continuing  and  discontinued  operations,  and  this  information  is 
not  normally  available  to  external  investors.  It  is  doubtful  that  auditors  would  detect  this  manipu¬ 
lation  during  their  analytical  review  because  the  company’s  financial  ratios  would  be  similar  to 
what  they  were  previously.^ 

The  ability  of  asset  disposals  to  be  classified  as  discontinued  operations  has  varied  throughout 
time.  Under  APB  Opinion  No.  30  (APB  1973),  only  dispositions  of  business  “segments”  were  able 
to  qualify  for  reporting  as  discontinued  operations.  In  general,  business  segments  were  defined  as 
a  major  line  of  business  or  a  customer  class.  The  FASB  believed  that  investors  would  benefit  from 
expanded  application  of  the  accounting  and  disclosure  rules  of  discontinued  operations  and  issued 
SFAS  No.  144,  Accounting  for  the  Impairment  or  Disposal  of  Long-Lived  Assets  (FASB  2001a). 
SFAS  No.  144  reduced  the  threshold  for  recognition  of  discontinued  operations  treatment  by 
introducing  the  “component  of  an  entity”  concept.  A  component  of  an  entity  is  distinguishable 
from  the  rest  of  the  entity  because  it  has  its  own  clearly  defined  operations  and  cash  flows.  A 
component  of  an  entity  can  be  a  reportable  segment,  an  operating  segment  (as  defined  by  SFAS 
No.  131,  FASB  1997),  a  reporting  unit  (as  defined  by  SFAS  No.  142,  FASB  2001b),  a  subsidiary, 
or  an  asset  group.  This  reduction  in  the  threshold  for  discontinued  operations  has  allowed  for  more 
asset  disposals  to  be  classified  as  discontinued  operations.  Consequently,  the  FASB  may  have 
inadvertently  given  managers  more  discretion  to  manage  earnings,  potentially  reducing  the  quality 
of  reported  earnings. 

In  addition  to  the  “component  of  an  entity”  requirement,  SFAS  No.  144  mandated  two  other 
requirements  for  an  asset  disposal  to  qualify  for  treatment  as  a  discontinued  operation.  The  first 
requirement  is  that  the  operations  and  cash  flows  of  the  component  being  disposed  are  completely 
separable  from  the  seller’s  continuing  operations.  The  second  requirement  is  that  the  seller  has  no 
significant  involvement  with  the  component  after  the  sale.  In  theory,  the  application  of  these 
requirements  appears  straightforward;  however,  SFAS  No.  144  did  not  clearly  define  the  meaning 
of  significant  involvement.  In  practice,  many  asset  disposals  involve  contractual  terms:  seller 
financing,  retained  equity  by  the  seller,  royalties  paid  to  the  seller  by  the  buyer,  service  agree¬ 
ments,  etc.  The  inclusion  of  these  terms  in  the  asset-disposal  contract  could  be  interpreted  as 
significant  involvement  by  the  seller.  In  order  to  provide  better  guidance  on  the  meaning  of  the 
term  “significant  involvement,”  the  FASB  issued  EITF  03-13  (FASB  2004)  as  a  guide  for  deter¬ 
mining  whether  an  entity  retains  significant  involvement  or  receives  cash  flows.  In  short,  if  a  firm 
receives  material  direct  cash  flows  from  the  discontinued  operations  or  maintains  significant  con- 


beginning  of  the  year  until  the  “measurement  date.”  The  measurement  date  was  the  date  the  plan  to  dispose  of  the 
segment  was  approved  by  the  company.  If  the  disposal  took  place  in  the  same  year  as  the  measurement  date,  then  the 
gain  or  loss  on  disposal  was  comprised  of  two  items.  The  first  was  the  gain  or  loss  from  operations  from  the  measure¬ 
ment  date  to  the  end  of  the  year  and  the  second  was  the  difference  between  the  selling  price  of  the  segment  and  its  book 
value.  If  the  segment  was  not  disposed  in  the  same  year,  then  the  gain  or  loss  on  disposal  also  included  an  estimate  of 
future  losses. 

®  For  example,  assume  a  retailer  is  closing  a  number  of  individual  stores  and  accounts  for  this  as  a  discontinued  operation. 
Further  assume  the  retailer  always  allocated  a  percentage  of  advertising  expenses  to  these  stores  and  plans  no  reduction 
in  advertising  expense  after  the  stores  are  closed.  In  this  case,  the  advertising  expense  for  the  continuing  operations  of 
the  retailer  will  be  the  same  on  a  per-store  basis  as  in  the  past. 
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tinuing  involvement,  then  the  disposal  does  not  qualify  as  a  discontinued  operation.  EITF  03-13 
took  effect  in  2005  but  could  be  adopted  as  early  as  2004.  It  should  be  noted  that,  while  the 
recognition  criteria  for  discontinued  operations  are  well  defined  in  GAAP,  the  costs  to  be  allocated 
to  the  discontinued  operations  are  not. 

Accounting  treatment  for  discontinued  operations  is  also  a  global  accounting  issue.  Similar  to 
APB  No.  30,  IFRS  5,  Non-Current  Assets  Held-for-Sale  and  Discontinued  Operations  (lASB 
2004)  defines  a  discontinued  operation  as  a  separate  major  line  of  business  or  geographical  area  of 
operations.  IFRS  5  also  requires  detailed  disclosure  of  revenue,  expenses,  pre-tax  profit  or  loss, 
and  the  related  income  tax  expense,  either  in  the  notes  or  on  the  face  of  the  income  statement. 
Currently,  both  the  FASB  and  the  lASB  are  working  toward  a  converged  accounting  definition  and 
treatment  for  discontinued  operations,  with  both  boards  issuing  proposals  amending  SFAS  No.  144 
and  IFRS  5,  respectively. 

In  closing,  material  asset  disposals  are  reported  on  the  income  statement  as  discontinued 
operations  if  they  meet  the  requirements  of  SFAS  No.  144  (or  APB  No.  30  for  disposals  prior  to 
2002)  and  likely  reported  as  special  items  if  they  do  not.  Either  income  statement  classification 
gives  managers  the  opportunity  to  engage  in  classification  shifting.  Prior  research  provides  evi¬ 
dence  that  special  items  are  used  for  classification  shifting.  Since  the  recognition  criteria  for 
discontinued  operations  are  better  defined  in  the  accounting  standards  than  for  special  items,  this 
may  give  managers  less  flexibility  to  manage  earnings  using  classification  shifting.  For  example, 
it  may  not  be  possible  for  managers  to  time  the  recognition  of  discontinued  operations  to  coincide 
with  a  period  during  which  earnings  management  is  desired.  Consequently,  whether  managers  use 
discontinued  operations  for  earnings  management  purposes  is  an  empirical  question  that  our  re¬ 
search  addresses 

III.  LITERATURE  REVIEW  AND  HYPOTHESES  DEVELOPMENT 

Earnings  management  is  one  of  the  most  studied  areas  in  financial  accounting  research.^ 
Previous  studies  document  that  earnings  management  can  be  carried  out  through  accrual  manage¬ 
ment  (e.g..  Dechow  et  al.l995;  Payne  and  Robb  2000),  real  activity  management  (e.g..  Dechow 
and  Sloan  1991;  Bushee  1998;  Roychowdhury  2006),* *  or  classification  shifting  (Ronen  and  Sadan 
1975;  Bamea  et  al.  1976;  McVay  2006;  Fan  et  al.  2010).  However,  increasing  current  earnings 
using  the  former  two  methods  has  the  potential  of  reducing  future  earnings.  Since  classification 
shifting  simply  moves  certain  revenues,  expenses,  gains,  and  losses  to  different  line  items  on  the 
income  statement,  it  does  not  actually  change  net  income.  As  a  result,  classification  shifting  is 
likely  to  be  less  costly  and  less  scrutinized  by  auditors  and  regulators  (Nelson  et  al.  2002).^ 
Previous  studies  have  largely  focused  on  earnings  rnanagernent  using  accruals  and  real  activity 
management.  Relatively  few  studies  have  examined  earnings  management  through  classification 
shifting. 


’  A  review  of  the  accrual-based  earnings  management  literature  can  be  found  in  Healy  and  Wahlen  (1999),  Dechow  and 
Skinner  (2000),  and  Kothari  (2001).  Xu  et  al.  (2007)  review  the  real  activity-based  earnings  management  literature. 

*  Discretionary  expenses  may  include  research  and  development  (Dechow  and  Sloan  1991;  Bushee  1998),  and  advertis¬ 
ing,  maintenance,  and  training  expenses  (Roychowdhury  2006). 

®  Nelson  et  al.  (2002)  investigate  auditors’  responses  to  earnings  management  attempts.  Their  survey  evidence  suggests 
that  auditors  are  less  likely  to  require  audit  adjustments  under  the  following  circumstances:  first,  when  GAAP  is  specific 
and  the  firm  structures  the  transaction  to  comply  with  GAAP;  second,  when  GAAP  is  not  specific  and  the  firm  engages 
in  an  unstructured  transaction;  third,  when  the  transaction  does  not  increase  current  income;  fourth,  when  the  client  is 
large;  and  fifth,  when  the  amount  of  the  earnings  management  is  immaterial.  For  a  transaction  to  be  categorized  as 
discontinued  operation,  the  firm  needs  to  structure  the  transaction  to  comply  with  specific  accounting  standards,  which 
would  presumably  reduce  the  chance  of  an  audit  adjustment.  The  calculation  of  the  income  from  the  discontinued 
operations  is  rather  unstructured,  which  would  reduce  the  probability  of  an  audit  adjustment.  Because  classification 
shifting  does  not  increase  net  income,  their  results  would  again  suggest  the  reduced  probability  of  an  audit  adjustment. 
Finally,  the  large  size  of  our  sample  firms  would  also  reduce  the  likelihood  of  an  audit  adjustment. 
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Ronen  and  Sadan  (1975)  contend  that  the  dimensions  and  objects  of  income  smoothing  are 
closely  associated.  They  also  reason  that,  if  the  smoothing  object  is  net  income,  then  classification 
shifting  is  irrelevant.  However,  if  the  smoothing  object  is  any  income  subtotal  other  than  “bottom- 
line”  net  income,  then  managers  have  an  incentive  to  engage  in  classification  shifting.  This  begs 
the  question  of  which  income  subtotals  interest  investors.  Lipe  (1986)  documents  that  investors 
understand  the  future  earnings  implications  between  the  different  earnings  components  reported 
on  the  income  statement.  Bradshaw  and  Sloan  (2002)  provide  evidence  of  “street  earnings”  (i.e., 
modified-GAAP  earnings  with  noncash  and  nonrecurring  items  excluded)  replacing  GAAP  earn¬ 
ings  as  one  of  the  primary  determinants  of  stock  price.  Together,  these  studies  provide  evidence  of 
investors’  interest  in  recurring  income  subtotals  rather  than  “bottom  line”  net  income  that  includes 
nonrecurring  items.  If  investors  value  recurring  earnings  higher  than  nonrecurring  earnings,  then 
managers  have  an  incentive  to  misclassify  operating  expenses  as  nonrecurring  expenses,  to  in¬ 
crease  recurring  income  subtotals.  Although  one  cannot  observe  which  income  statement  subtotals 
investors  use,  we  follow  McVay  (2006)  in  assuming  that  core  earnings  is  the  managed  subtotal. 
However,  using  discontinued  operations  to  increase  core  earnings  also  increases  income  from 
continuing  operations  and  other  “above-the-line”  earnings  subtotals. 

One  of  the  earliest  studies  in  the  area  of  classification  shifting  is  Ronen  and  Sadan  (1975). 
They  find  evidence  consistent  with  managers  using  extraordinary  items  to  smooth  earnings  before 
extraordinary  items.  Bamea  et  al.  (1976)  extend  this  line  of  research  by  providing  evidence  that 
managers  also  use  extraordinary  items  to  smooth  operating  income.  Taken  together,  these  studies 
provide  evidence  that  managers  are  interested  in  managing  multiple  income  statement  subtotals, 
presumably  because  investors  value  various  income  statement  subtotals  differently.  While  the 
above  studies  provide  evidence  of  earnings  management  using  “below  the  line”  income  statement 
items,  they  do  not  examine  the  motivations  behind  classification  shifting. 

Kinney  and  Trezevant  (1997)  investigate  whether  managers  use  special  items  opportunisti¬ 
cally  to  manage  both  earnings  and  investors’  perceptions.  They  find  that  income-decreasing  special 
items  are  more  likely  to  be  reported  as  line  items  on  the  income  statement,  presumably  to  call 
attention  to  the  transitory  nature  of  the  earnings  decrease  caused  by  these  items.  They  also  find 
that  income-increasing  special  items  are  more  likely  to  be  reported  in  the  footnotes  to  the  financial 
statements,  presumably  to  shift  attention  away  from  the  transitory  nature  of  the  earnings  increase 
caused  by  these  items.  Their  findings  suggest  that  managers  behave  opportunistically  when  dis¬ 
closing  special  items.  However,  Riedl  and  Srinivasan  (2010)  further  examine  managers’  reporting 
behavior  regarding  special  items.  They  find  that  special  items  separately  disclosed  on  the  income 
statement  have  a  lower  persistence  than  special  items  disclosed  in  the  financial  statement  foot¬ 
notes.  Their  results  are  consistent  with  managers  using  financial  statement  presentation  to  assist 
investors  in  evaluating  the  persistence  of  special  items.  This  evidence  offers  an  alternative  expla¬ 
nation  for  the  managerial  opportunism  detected  by  Kinney  and  Trezevant  (1997). 

McVay  (2006)  investigates  whether  managers  engage  in  classification  shifting,  by  reporting 
core  expenses  in  income-decreasing  special  items.  She  uses  an  expectation  model  to  separate  core 
earnings,  defined  as  operating  income  before  depreciation  and  amortization,  into  expected  and 
unexpected  components.  She  finds  special  items  are  positively  associated  with  contemporaneous 
unexpected  core  earnings  and  negatively  associated  with  the  unexpected  change  in  future  core 
earnings.  These  results  are  consistent  with  her  hypothesis  that  managers  shift  operating  expenses 
to  special  items.  She  also  provides  evidence  suggesting  that  managers’  motivation  for  classifica¬ 
tion  shifting  is  to  meet  or  beat  analysts’  forecasts.  In  addition,  she  finds  a  negative  stock  price 
reaction  to  the  reversal  of  unexpected  core  earnings,  indicating  that  investors  do  not  understand  (or 
are  not  able  to  fully  undo)  the  earnings  management.  One  drawback  to  the  core  earnings  expec¬ 
tation  model  used  by  McVay  (2006)  is  the  use  of  contemporaneous  accruals  (including  accruals 
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related  to  special  items)  as  a  control  for  firm  performance.  The  inclusion  of  special  item  accruals 
in  the  expectation  model  creates  a  potential  bias  in  favor  of  her  hypotheses.  She  acknowledges  the 
reliance  on  an  imperfect  model  is  a  limitation  of  her  study. 

Fan  et  al.  (2010)  extends  McVay’s  (2006)  core  earnings  expectation  models  by  controlling  for 
performance  using  returns  and  lagged  returns  rather  than  contemporaneous  accruals,  thus  elimi¬ 
nating  any  potential  bias.  They  confirm  McVay’s  (2006)  findings  that  managers  shift  core  expenses 
to  income-decreasing  special  items.  Using  quarterly  data,  they  find  that  classification  shifting 
using  income-decreasing  special  items  is  more  prevalent  in  the  fourth  quarter  than  in  other  quar¬ 
ters.  They  also  find  that  classification  shifting  is  more  prevalent  when  managers  are  constrained 
from  using  income-increasing  accruals. 

Classification  Shifting  Using  Discontinued  Operations 

This  study  investigates  whether  discontinued  operations  are  used  for  classification  shifting. 
Our  study  is  motivated  by  prior  research  suggesting  that  managers  are  opportunistic  when  report¬ 
ing  “below  the  line”  income  statement  items.  Results  from  discontinued  operations  are  reported  at 
a  lower  point  on  the  income  statement  than  special  items,  but  are  arguably  subject  to  less  man¬ 
agement  discretion.  Thus,  while  managers  have  a  greater  incentive  to  shift  expenses  to  this  clas¬ 
sification,  it  is  an  empirical  question  as  to  whether  they  have  the  opportunity  to  do  so.  In  addition, 
shifting  operating  expenses  to  discontinued  operations  also  increases  core  earnings,  operating 
income,  and  income  from  continuing  operations.  These  subtotals  are  widely  believed  to  be  good 
indicators  of  firm  performance  and  to  be  of  primary  interest  to  investors.  Discontinued  operations 
could  presumably  attract  less  attention  from  investors,  auditors,  and  regulators  than  special  items. 

Discontinued  operations  are  reported  on  the  income  statement  in  an  aggregated  form,  creating 
an  information  asymmetry  between  managers  and  investors.  It  is  unclear  to  investors  exactly  what 
revenues,  expenses,  gains,  and  losses  are  allocated  to  the  discontinued  operations.  Because  of  this, 
we  predict  a  shifting  of  operating  expenses  to  discontinued  operations.  This  leads  to  our  first 
hypothesis,  stated  in  alternative  form; 

HI:  Managers  engage  in  classification  shifting  using  discontinued  operations  to  increase  core 
earnings. 

Classification  Shifting  to  Meet  or  Beat  Earnings  Benchmarks 

The  literature  provides  evidence  of  firms  managing  earnings  to  meet  or  beat  three  earnings 
benchmarks:  zero  earnings,  prior  year’s  earnings,  and  analysts’  forecasts.  Prior  research  also 
shows  that  firms  have  several  methods  of  managing  earnings  to  meet  these  benchmarks.  For 
example,  prior  research  documents  a  discontinuity  in  the  earnings  distribution,  with  more  firms 
reporting  income  just  above  these  benchmarks  than  firms  reporting  income  below  these  bench¬ 
marks  (Hayn  1995;  Burgstahler  and  Dichev  1997;  Degeorge  et  al.  1999).  Degeorge  et  al.  (1999) 
reason  this  is  due  to  certain  investors  relying  on  simple  heuristics  when  evaluating  firm  perfor¬ 
mance.  They  further  reason  that  because  investors  are  risk-averse,  they  have  a  tendency  to  be  more 
sensitive  toward  losses  than  toward  gains.  The  literature  has  also  documented  market  rewards  for 
firms  exceeding  financial  benchmarks.  Barth  et  al.  (1999)  find  firms  that  consistently  exceed  prior 
years’  earnings  have  higher  price-earnings  multiples  than  firms  that  do  not.  Brown  and  Cay  lor 
(2005)  find  that,  even  after  controlling  for  the  earnings  surprise,  firms  experience  positive  abnor¬ 
mal  returns  when  reporting  positive  earnings,  earnings  increases,  or  exceeding  analysts’  forecasts. 
Together,  these  studies  suggest  that  firms  exceeding  benchmarks  have  a  lower  cost  of  equity.  Jiang 
(2008)  documents  that  firms  exceeding  benchmarks  also  have  a  lower  cost  of  debt,  especially  for 
firms  exceeding  the  zero  earnings  benchmark.  Prior  literature  has  also  provided  evidence  that 
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managers  receive  rewards  for  exceeding  benchmarks.  For  example,  Matsunaga  and  Park  (2001) 
find  evidence  consistent  with  CEOs  receiving  a  lower  cash  bonus  when  their  firms  report  an 
earnings  decline  or  miss  analysts’  forecasts. 

We  have  no  prediction  as  to  which  benchmark  is  the  most  important  to  firms  reporting 
discontinued  operations.  Following  Degeorge  et  al.  (1999),  Brown  and  Caylor  (2005),  and  Jiang 
(2008),  we  conjecture  that  managers  use  classification  shifting  to  meet  or  beat  the  following 
earnings  benchmarks:  zero  earnings  (avoiding  a  loss),  prior  year’s  earnings  (avoiding  an  earnings 
decline),  and  exceeding  the  consensus  analyst  forecast  (avoiding  a  negative  earnings  surprise). 
More  formally,  we  predict  the  following: 

H2a:  Managers  engage  in  classification  shifting  using  discontinued  operations  to  increase 
core  earnings  in  order  to  avoid  reporting  negative  core  earnings. 

H2b:  Managers  engage  in  classification  shifting  using  discontinued  operations  to  increase 
core  earnings  in  order  to  meet  or  beat  prior  year’s  core  earnings. 

H2c:  Managers  engage  in  classification  shifting  using  discontinued  operations  to  increase 
core  earnings  in  order  to  meet  or  beat  analyst  forecasts. 

Classification  shifting  using  discontinued  operations  involves  two  managerial  decisions.  The 
first  is  the  decision  to  discontinue  the  operations,  while  the  second  is  to  shift  operating  expenses 
to  this  classification.  We  argue  that  managers  have  already  made  the  decision  to  discontinue 
certain  operations  before  deciding  to  engage  in  classification  shifting.  In  other  words,  we  believe 
the  two  decisions  are  independent  of  each  other.  We  base  our  belief  on  the  following  reasoning. 
First,  discontinued  operations  are  an  important  real  activity  decision  requiring  board  of  director 
approval.  Consequently,  the  manager  needs  compelling  economic  reasons  to  make  a  case  to  the 
board.  Second,  because  discontinued  operations  are  a  real  activity  decision,  the  company  will 
incur  real  expenses  disposing  of  the  operation,  paying  expenses  such  as  severance  pay,  appraisal 
fees,  legal  fees,  etc.  from  the  cash  flows  of  continuing  operations.  Third,  there  is  likely  to  be  a 
possibility  of  improvement  in  the  operating  results  of  the  discontinued  operation.  If  so,  the  man¬ 
ager  will  be  trading  future  earnings  to  increase  current  period  earnings.  Finally,  we  believe  that  the 
decision  to  discontinue  an  operation  is  a  lengthy  process,  requiring  board  approval,  asset  apprais¬ 
als,  identification  of  potential  purchasers,  etc. 

In  sum  the  decision  to  discontinue  an  operation  is  both  an  expensive  and  time-consuming 
decision.  If  made  solely  for  the  purpose  of  earnings  management,  then  it  would  reduce  the  main 
advantages  of  classification  shifting,  namely  being  relatively  simple  and  inexpensive.  This  study 
focuses  on  the  financial  reporting  decision  rather  than  on  the  decision  to  discontinue. 

IV.  DATA,  SAMPLE,  AND  VARIABLE  MEASUREMENT 

The  sample  used  in  this  study  is  comprised  of  data  from  the  2007  Annual  Compustat  and 
I/B/E/S  Summary  files.  The  initial  sample  period  covers  the  years  1988  to  2006.  We  perform  the 
empirical  analyses  using  observations  from  1989  to  2005  because  we  need  one  year  of  lead  and 
lag  data.  For  inclusion  in  the  final  sample,  each  firm-year  needs  to  have  all  variables  required  to 
estimate  unexpected  core  earnings  and  at  least  1 5  observations  per  industry-year.  Because  we  use 
sales  as  a  scalar,  we  also  eliminate  observations  with  sales  less  than  $1  million  to  avoid  outliers. 

Table  1  presents  a  comparison  between  all  Compustat  firms  and  the  firms  included  in  our 
sample.  Overall,  9  percent  of  the  firms  included  in  Compustat  report  discontinued  operations  in  the 
test  period.  The  percentage  of  firms  reporting  discontinued  operations  varies  by  year,  with  a  low  of 
5  percent  in  1994  to  a  high  of  19  percent  in  2006.  The  final  sample  used  for  our  empirical  analyses 
contains  79,643  firm-year  observations,  with  6,262  observations  (8  percent)  reporting  discontinued 
operations.  The  pattern  of  annual  percentages  of  firms  reporting  discontinued  operations  in  the 
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final  sample  is  similar  to  that  of  the  entire  population.  There  is  a  significant  increase  in  the 
percentage  of  firms  reporting  discontinued  operations  after  SPAS  No.  144  became  effective,  in¬ 
creasing  from  8  percent  in  2001  to  13  percent  in  2002.  Untabulated  results  show  a  significant 
difference  between  the  mean  percentage  of  firms  reporting  discontinued  operations  in  the  pre- 
SFAS  No.  144  and  post-SFAS  No.  144  periods. 

We  note  a  gradual  decrease  in  the  magnitude  of  discontinued  operations  in  the  post-SFAS  No. 
144  period,  with  the  absolute  value  of  discontinued  operations  decreasing  from  7.5  percent  in  2001 
to  2.8  percent  in  2006,  consistent  with  the  broader  scope  of  SFAS  No.  144.  Untabulated  results 
also  show  that  the  mean  unexpected  core  earnings  of  firms  reporting  discontinued  operations  in  the 
post-SFAS  No.  144  period  are  lower  than  those  in  the  pre-SFAS  No.  144  period,'^  as  we  inves¬ 
tigate  in  the  following  section. 

The  empirical  analyses  test  whether  firms  increase  core  earnings  by  shifting  operating  ex¬ 
penses  to  discontinued  operations.  We  follow  McVay  (2006)  to  measure  core  earnings,  expected 
core  earnings,  unexpected  core  earnings,  and  the  unexpected  change  in  core  earnings.  Core 
earnings  is  defined  as  operating  earnings  before  depreciation  and  special  items,  scaled  by  revenue. 
To  estimate  unexpected  core  earnings  and  the  unexpected  change  in  core  earnings,  we  use  the 
following  expectation  models: 


CE,  =  fio  +  +  I32ATO,  +  p^ACCRUAESt.^  +  P^ACCRUAES,  +  /B^ASALESi 

-t-  /3(,NEG_^SALESl  +  s,  ( 1 ) 


/S,CEi  =(po  +  (piCE^-i  +  <P2^CEt_i  +  ip^AATO,  +  (p4ACCRUALSi_i  -l-  (p^ACCRUALSj 

+  cp(,ASALES,  +  (pjNEG_ASALESt  +  ,  (2) 

where: 

CE,  =  core  earnings,  calculated  as  [Sales  —  Cost  of  Goods  Sold  —  Selling,  General  and 
Administrative  Expenses  (data  item  #13)]/Sales  (data  item  #12)  for  firms  not  reporting 
discontinued  operations;  for  firms  reporting  discontinued  operations  we  use  restated 
data:  Data  (#117  -  #131  -  #132)/#!  17; 

ACE;  =  change  in  core  earnings,  calculated  as  CE^- 

ATOi  =  asset  turnover  ratio,  defined  as  Sales^  (  #  12)  /  {NOA,  +  NOA,_i)  /  2; 

NOA  =  Net  Operating  Assets 

=  Operating  Assets  —  Operating  Liabilities 

=  [Total  Assets  (#6)  —  Cash  (#1)  and  Short-Term  Investments  (#32)]  —  [Total 
Assets  (#6)  —  Total  Debt  (#9  +  #34)  —  Book  Value  of  Common  (#60)  and 
Preferred  Equity  (#130)  —  Minority  Interest  (#38)]; 


For  Compustat  firms,  the  mean  reporting  percentage  increases  from  7.03  percent  in  the  pre-SFAS  No.  144  period  to 
15.70  percent  in  the  post-SFAS  No.  144  period  (t-stat  =  25.12,  p  <  0.01).  For  our  sample  firms,  the  mean  reporting 
percentage  increased  from  7.70  percent  in  the  pre-SFAS  No.  144  period  to  16.05  percent  in  the  post-SFAS  No.  144 
period  (t-stat  =  21.21,  p  <  0.01). 

"  For  firms  reporting  discontinued  operations,  the  mean  unexpected  core  earnings  in  the  pre-  and  post-SFAS  No.  144 
period  are  3.74  percent  and  0.92  percent,  respectively  (t-stat  =  3.73,  p  <  0,01). 

Fan  et  al.  (2010)  observe  that  the  association  between  unexpected  core  earnings  and  the  magnitude  of  negative  special 
items  may  be  biased  because  noncash  special  items  are  included  in  accruals  that  are  used  as  an  independent  variable  in 
the  expectation  model.  Since  this  study  focuses  on  discontinued  operations,  the  association  between  core  earnings  and 
discontinued  operations  is  not  affected  by  the  design-driven  bias.  We  also  re-estimate  the  expectation  models  without 
using  contemporaneous  accruals  and  re-run  our  main  analysis.  The  results  are  reported  in  the  robustness  section. 
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iMKTOt  =  change  in  asset  turnover,  calculated  as  ATOf-ATOj_i 

ACCRUALSi  ~  operating  accruals,  calculated  as  [Net  Income  before  Extraordinary  Items 
(#123)  -  Cash  from  Operations  (#308  -  #124)]/Sales  (#12); 

ASALESf  =  percentage  change  in  sales,  calculated  as  [SALES X  #12) 

-  SALES, /  SALES, and 

NEG_ASALES,  =  percentage  change  in  sales  (ASALES,),  if  ASALES,  is  less  than  zero,  and  0 
otherwise. 

To  provide  evidence  of  classification  shifting,  we  test  for  a  positive  association  between 
unexpected  core  earnings  and  discontinued  operations  in  year  t.  However,  a  positive  association 
can  arise  mechanically  due  to  the  removal  of  a  loss-making  operation,  because  the  expectation 
model  for  CE,  includes  prior  year  core  earnings  {CE,_])  as  an  independent  variable.  To  address 
this  possibility,  we  use  restated  data  to  calculate  the  variables  included  in  the  expectation  models 
for  firms  with  discontinued  operations.  The  restated  data  exclude  the  operating  income  (or  loss)  of 
the  discontinued  operations  from  the  reported  operating  income  for  year  t-\.  The  Compustat 
database  includes  restated  data  for  the  previous  ten  years  on  the  basis  comparable  to  the  current 
year  for  firms  with  discontinued  operations,  acquisitions  or  accounting  changes. We  delete  firm- 
year  observations  reporting  acquisitions  or  accounting  changes.  We  report  the  empirical  results 
using  restated  data. 

We  measure  the  expected  value  of  core  earnings  and  the  change  in  core  earnings  for  firm  i 
using  the  predicted  values  from  Equations  (1)  and  (2),  respectively.  We  estimate  each  equation  by 
industry-year,  excluding  firm  i  from  the  estimation.  Unexpected  core  earnings  and  the  unexpected 
change  in  core  earnings  are  the  difference  between  the  actual  and  predicted  values  of  Equations 
(1)  and  (2),  respectively.  The  data  on  discontinued  operations  are  collected  from  the  Compustat 
annual  database  (item  #66). 


V.  RESULTS 

Descriptive  Statistics 

We  multiply  %DO,  %DO_NEG,  and  %DO_POS  by  —  1  to  capture  the  positive  association 
between  discontinued  operations  and  unexpected  core  earnings.  Table  2  provides  descriptive  sta¬ 
tistics  for  the  variables  used  in  the  empirical  analyses.  Panel  A  presents  the  descriptive  statistics 
for  the  entire  sample.  The  mean  (median)  core  earnings  for  all  firms  are  0.024  (0.103).  The  mean 
(median)  annual  sales  of  all  firms  in  the  full  sample  qre  $1,660M  ($142M). 

Table  2,  Panel  B  presents  the  descriptive  statistics  for  the  subsample  of  firms  reporting 
discontinued  operations.  Firms  with  discontinued  operations  are  relatively  larger  in  size,  have 
higher  average  sales  volume,  lower  sales  growth,  and  lower  core  earnings  than  other  firms.  The 
mean  (median)  unexpected  core  earnings  is  0.028  (0.011)  for  firms  reporting  discontinued  opera¬ 
tions,  which  is  much  higher  than  0.002  (0.004)  for  the  entire  sample.  This  finding  is  consistent 
with  our  prediction  that  firms  reporting  discontinued  operations  shift  recurring  operating  expenses 
to  discontinued  operations  in  order  to  increase  core  earnings. 

In  order  to  control  for  performance,  we  compare  firm  characteristics,  such  as  firm  size  {SIZE), 
book-to-market  ratio  {BM),  total  accruals  {ACCRUALS),  operating  cash  flow  {OCE),  and  return  on 
assets  {ROA),  between  firm-year  observations  with  and  without  discontinued  operations.  We  find 


To  ensure  the  quality  of  the  restated  data,  we  randomly  selected  50  observations  reporting  discontinued  operations  and 
checked  for  data  consistency  between  the  SEC  filings  and  Compustat  database.  We  find  that  the  restated  data  in  the 
Compustat  database  are  consistent  with  the  corresponding  data  in  Form  10-K  for  the  selected  observations. 
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significant  differences  in  all  these  variables;  consequently,  we  control  for  these  variables  in  our 
empirical  tests 

Classification  Shifting  using  Discontinued  Operations 

We  follow  McVay  (2006)  in  designing  our  regression  models  to  test  whether  firms  increase 
core  earnings  by  using  classification  shifting  when  reporting  discontinued  operations.  We  modify 
her  equations  by  substituting  discontinued  operations  {%DO)  for  special  items  (%5/)  and  adding 
control  variables,  as  shown  in  the  following  equations: 

UE_CEf  =  ao  +  +  a2SIZE,  +  a^BM^  +  a^ACCRUALS,  +  a^OCFf  +  a^^ROA^  +  s,  (3a) 

UE_^CE,+x  =  Vo^V\%DO,+  ?72%D6>,+i  +  T]'iSIZEj+  7)^BMt+  p^ACCRUALSt+  v^OCEf 

+  ViROA,  +  .  (3b) 

UE_CEi  is  unexpected  core  earnings  in  year  t.  UE_iisCEf^i  is  the  unexpected  change  in  core 
earnings  in  year  t+l.  The  variable  of  interest,  %DOj,  is  the  value  of  the  aggregate  results,  losses, 
and  gains  from  discontinued  operations  (scaled  by  sales).  To  test  HI,  we  predict  a  positive  asso¬ 
ciation  between  unexpected  core  earnings  and  discontinued  operations  (i.e.,  ai  >  0).  However,  a 
positive  association  could  also  arise  from  an  improvement  in  operations,  due  to  the  discontinuation 
of  a  loss-making  operation.  If  this  is  the  case,  then  the  improvement  in  core  earnings  should  persist 
in  future  years.  However,  if  the  improvement  is  due  to  classification  shifting,  we  predict  a  negative 
association  between  the  unexpected  change  in  core  earnings  in  year  t+\  and  the  discontinued 
operations  in  year  t  (i.e.,  771  <  0).  This  reversal  is  due  to  operating  expenses  reappearing  again  as 
a  part  of  core  earnings.  To  control  for  discontinued  operations  in  year  t+l,  we  add  %DOt+i  to 
Equation  (3b).  We  predict  a  positive  coefficient  on  %DOt^i  (i.e.,  772  >  0).  Finally,  we  add  the 
following  control  variables  to  Equations  (3a)  and  (3b):  firm  size  {SIZE^,  book-to-market  ratio 
(5Mj),  accruals  {ACCRUALS^,  operating  cash  flow  {OCF^  and  return  on  assets  {ROA^.  We  make 
no  prediction  as  to  the  sign  of  these  control  variables. 

A  firm’s  classification  shifting  behavior  could  vary  depending  on  whether  firms  report  overall 
gains  or  losses  from  discontinued  operations.  For  example,  managers  might  have  more  incentive 
or  greater  discretion  to  shift  more  operating  expenses  when  discontinued  operations  already  have 
losses. To  test  this  conjecture,  we  replace  the  discontinued  operations  variable  %DO  with  two 
variables,  %DO_NEG  for  discontinued  operations  with  overall  losses  and  %DO_POS  for  discon¬ 
tinued  operations  with  overall  gains,  in  the  following  equations: 

UE_CEt  =770+77,  %DO_NEGt  +  tt2%DO_POSi  +  'Tt^SIZE^  +  tt^BM,  +  TT^ACCRUALSt 

+  tT(,OCF j  +  ttjROAi  +  ipj,  (4a) 

f/E_ACE,+i  =  <Po  +  (Pi%DO_NEGi+  (p2%DO_POS,+  (p^%DO_NEG,+^  +  ip^%DO_POSt^^ 

+  (p^SIZEj  +  (p(yBMf  +  (pqACCRU AES  f  +  tp^OCF ^  +  (pgROAj  +  .  (4b) 

If  firms  engage  in  classification  shifting  using  discontinued  operations  irrespective  of  overall  gains 
or  losses,  we  predict  both  tt,  and  772  (^1  and  ^2)  ^  be  positive  (negative). 


The  differences  between  the  means  of  the  variables  SIZE,  BM,  ACCRUALS,  and  ROA  are  significant  at  better  than  the 
1  percent  level.  The  difference  between  the  means  of  the  variable  OCF  is  significant  at  the  10  percent  level. 

For  example,  a  manager  may  have  more  of  an  incentive  to  blame  poor  past  results  of  the  firm  as  a  whole,  on 
discontinued  operations  sold  at  a  loss.  One  way  of  doing  this  is  to  increase  the  operating  loss  of  the  discontinued 
operations.  In  addition,  the  big  bath  hypothesis  predicts  that  companies  may  deliberately  take  large  losses  in  a  single 
year  through  write-offs  and  write-downs  in  order  to  increase  accounting  flexibility  in  future  years. 
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TABLE  3 

Regressions  of  Unexpected  Core  Earnings  and  Unexpected  Change  in  Core  Earnings  on 

Discontinued  Operations 

Panel  A:  Dependent  Variable 

=  UEJCE, 

Independent  Variables 

Coefficient 

t-stat 

Coefficient 

t-stat 

Intercept 

-0.022 

-8.70*** 

-0.023 

_9  J3*** 

%DOt 

0.451 

%DO  NEG, 

0.705 

10.68*** 

%DO_POSt 

-0.480 

-3.76*** 

SIZE, 

0.000 

-0.32 

0.000 

-0.43 

BM, 

0.000 

-0.24 

0.000 

-0.14 

ACCRUALS, 

-0.036 

-3.16*** 

-0.037 

—3  24*** 

OCF, 

0.373 

34  72*** 

0.373 

34.75*** 

ROA, 

-0.015 

-1.21 

-0.012 

-0.99 

Adjusted 

7.0% 

7.1% 

Panel  B:  Dependent  Variable 

=  UE_^CE,^^ 

Independent  Variables 

Coefficient 

t-stat 

Coefficient 

t-stat 

Intercept 

-0.027 

-11.65*** 

-0.027 

—  11  92*** 

%DO, 

-0.124 

-2.31** 

%DO,+i 

0.419 

8.28*** 

%DO_NEG, 

-0.164 

-2.67*** 

%DO_POS, 

-0.090 

-0.76 

%DOJlEG,^x 

0.650 

10  97*** 

%DO_POS,+i 

-0.245 

—2  39*** 

SIZE, 

0.004 

10.83*** 

0.004 

10.66*** 

BM, 

0.004 

5.46*** 

0.004 

5.50*** 

ACCRUALS, 

0.104 

9  93*** 

0.105 

9  9-7*** 

OCE, 

0.164 

16.71*** 

0.164 

16.79*** 

ROA, 

-0.132 

-12.06*** 

-0.131 

-11.93*** 

Adjusted 

0.75% 

0.82% 

No.  of  Observations 

72,372 

72,372 

*,  **,  ***  Indicate  significance  at  the  10,  5,  and  1  percent  levels,  respectively.  All  the  test  results  use  a  two-tailed  t-test 

except  %DO„  %DO_NEG„  %DO_ 

POS„  %DO,^u  %DO_NEG,^x,  and  %DO_ 

_P05,+i  (use  a  one-tailed  t-test) 

Discontinued  operations  {%DO)  are 

scaled  by  sales  multiplied  by  (—1);  [Discontinued  Operations  (#66)  X  (- 

-1)]  /SALES 

(#12).  Discontinued  operations  (Compustat  data  item  #66)  scaled  by  sales  and  multiplied  by  (—1),  when  reported  discon- 

tinned  operations  are  income-decreasing,  and  0  otherwise.  Discontinued  operations  (Compustat  data  item  #66)  are  scaled 

by  sales  and  multiplied  by  (—1)  when  reported  discontinued  operations  are  income-increasing,  and  0  otherwise.  All  other 

variables  are  as  defined  in  Table  2. 

Table  3,  Panel  A  reports  regression  results  for  Equations  (3a)  and  (4a).  For  Equation  (3a),  the 
coefficient  on  is  both  positive  and  significant  (a,  =0.451,  p  <  0.01),  a  result  consistent 

with  Hl.’^  For  Equation  (4a),  the  coefficient  on  %DO_NEGt  is  both  positive  and  significant 
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All  the  test  results  reported  in  this  section  are  based  on  a  one-tailed  test  of  significance,  except  where  otherwise  noted. 
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(771  =  0.705,  p  <  0.01);  however  the  coefficient  on  %DO_POSf  is  negative  and  significant  (772 
=  -0.480,  p  <  0.01),  opposite  of  the  predicted  sign.  These  findings  suggest  that  classification 
shifting  takes  place  when  firms  report  losses  from  discontinued  operations.  Economically,  one 
standard  deviation  increase  in  the  loss  from  discontinued  operations  is  expected  to  increase  unex¬ 
pected  core  earnings  (scaled  by  sales)  by  approximately  92  basis  points  (0.705  [the  coefficient  on 
%DO_NEGi  from  Table  3]  multiplied  by  0.013  [the  standard  deviation  of  income-decreasing 
discontinued  operations  for  this  sample  from  Table  2]). 

Table  3,  Panel  B  reports  regression  results  of  Equation  (3a)  and  (4b).  For  Equation  (3b),  the 
coefficient  on  %DOf  is  both  negative  and  significant  (?7i  =  -0.124,  p  <  0.01),  a  result  consistent 
with  operating  expenses  reappearing  as  a  component  of  core  earnings.  For  Equation  (4b),  only  the 
coefficient  on  %DO_NEGt  is  both  negative  and  significant  (^i  =  -0.164,  p  <  0.01).  The  coeffi¬ 
cient  on  %DO_POSj  is  negative  and  insignificant  {(p2  =  -0.090,  p  =  0.45).  These  findings  suggest 
the  reversals  of  the  improvements  in  core  earnings  are  associated  with  the  reporting  of  losses  from 
discontinued  operations  in  year  t.  Consistent  with  Panel  A,  there  is  a  contemporaneous  positive 
association  between  UE_/^CEi^i  and  each  of  the  following  variables  %DOi+i  (772  =  0.419,  p  < 
0.01)  and  %DO_NEGt+[  (<^3  -  0.650,  p  <  0.01).  Consistent  with  Panel  A,  but  inconsistent  with  our 
prediction  %DO_POSj+i  is  significantly  negative  ((^4  = -0.245,  p  <  0.01). 

Overall,  findings  in  Table  3  provide  evidence  consistent  with  classification  shifting  using 
discontinued  operations.  However,  our  predictions  only  hold  for  firms  reporting  income- 
decreasing  discontinued  operations.  We  do  not  find  consistent  evidence  for  firms  reporting 
income-increasing  discontinued  operations.^’  Hence,  we  use  %DO_NEGi  as  our  variable  of  inter¬ 
est  in  all  subsequent  analyses. 

Meeting  or  Beating  Earnings  Benchmarks 

We  use  the  following  equations  to  examine  whether  classification  shifting  using  discontinued 
operations  is  used  to  meet  or  beat  certain  earnings  benchmarks: 

UE_CE,  =  7o  +  yi%DO_NEG,  +  y2MEET&BEATt  +  y^MEET&BEATt  *  %DO_NEGt 

+  y^SIZEt  +  y^BMt  +  yf^CCRUALS,  +  yjOCF^  +  ygROA^  +  (5a) 

UE_A,CEt+i  =  -I-  8^%DO_NEGj  +  S2MEET&BEAT,  +  S^MEET&BEATt  *  %DO_NEGt 

+  S^%DO_NEG,+i  +  S5SIZE,  +  S(,BM,  +  SjACCRUALSt  +  S^OCE,  +  S^ROA^ 

+  '  (5b) 

MEET&BEATf  is  an  indicator  variable  taking  on  a  value  of  1  if  the  firm-year  observation 
meets  or  beats  one  of  the  following  three  earnings  benchmarks,  and  0  otherwise.  The  first  earnings 
benchmark  is  zero  earnings.  This  variable  proxies  for  earnings  management  to  avoid  losses,  where 
MEET&BEATi=  1  if  the  observation  has  earnings  per  share  greater  than  or  equal  to  zero,  and  0 
otherwise.  The  second  earnings  benchmark  is  prior  year’s  earnings.  This  variable  proxies  for 
earnings  management  to  avoid  a  decline  in  earnings  from  the  previous  year,  where 
MEET&BEATj  =  1  if  the  observation  has  earnings  per  share  changes  greater  than  or  equal  to  zero. 
The  earnings  subtotal  used  for  benchmarks  1  and  2  are  earnings  before  discontinued  operations 
and  extraordinary  items  because  they  have  been  widely  used  as  performance  measures  in  account¬ 
ing  literature.  The  third  benchmark  is  analysts’  forecasts.  Following  Matsumoto  (2002), 


’’  We  believe  that  classification  shifting  can  take  place  when  firms  report  positive  discontinued  operations,  although  we  do 
not  find  the  predicted  results.  We  leave  this  issue  to  future  research. 
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MEET&BEATf  =  1  if  firm-years  have  positive  analysts’  forecast  errors/*  and  0  otherwise.  We  use 
the  median  consensus  forecast  available  before  the  earnings  announcement  as  obtained  from  the 
adjusted  I/B/E/S  summary  database. 

Table  4,  Panel  A  reports  the  regression  results  for  Equation  (5  a).  The  coefficients  on 
%DO_NEGj  are  consistently  positive  and  significant  for  benchmarks  1  and  2.  The  coefficient  on 
benchmark  3  is  positive  but  insignificant.  The  coefficients  on  the  variable  of  interest, 
MEET&BEATt  interacted  with  %DO_NEGt  are  positive  and  significant.  These  findings  are  con¬ 
sistent  with  H2a,  H2b,  and  H2c.  Table  4,  Panel  B  reports  the  regression  results  for  Equation  (5b). 
The  coefficient  on  %DO_NEGi  is  negative  and  significant  only  for  benchmark  1 .  The  coefficient 
on  benchmark  2  is  negative  but  insignificant.  The  coefficient  on  benchmark  3  is  positive  and 
significant.  The  coefficient  on  the  variable  of  interest,  MEET&BEATf  interacted  with  %DO_NEGt, 
is  only  significant  for  benchmark  3.  These  findings  are  consistent  with  H2c. 

Overall,  the  results  suggest  that  companies  reporting  income-decreasing  discontinued  opera¬ 
tions  use  classification  shifting  to  increase  core  earnings  in  order  to  meet  or  beat  the  analysts’ 
forecast  benchmark. 

Effect  of  SEAS  No.  144 

As  shown  in  Table  1,  the  recognition  of  discontinued  operations  significantly  increased  after 
SEAS  No.  144  became  effective  in  2002.  This  section  further  examines  the  impact  of  the  change 
in  accounting  standards  on  the  association  between  discontinued  operations  and  unexpected  core 
earnings  presented  in  Table  3.  We  modify  Equations  (5a)  and  (5b)  by  replacing  MEET&BEAT,, 
with  POST_2002,  which  takes  a  value  of  1  if  the  discontinued  operations  occur  in  fiscal  year  2002 
or  later,  and  0  otherwise.  Table  5  presents  the  results  for  both  of  these  models. 

The  second  and  third  columns  of  Table  5  report  the  results  of  the  modified  Equation  (5a). 
Consistent  with  the  findings  reported  in  previous  tables,  the  discontinued  operations  variable, 
%DO_NEG,  (0.774,  p  <  0.01),  is  positively  associated  with  unexpected  core  earnings.  The  coef¬ 
ficient  on  the  interaction  term  is  both  negative  and  significant  (—0.411,  p  <  0.01).  The  fourth  and 
fifth  columns  of  Table  5  report  the  results  of  the  modified  Equation  (5b).  Consistent  with  the 
findings  reported  in  previous  tables,  the  discontinued  operations  variable,  %DO_NEG,  (—0.211,  p 
<  0.01),  is  negatively  associated  with  the  t+l  change  in  unexpected  core  earnings.  The  coefficient 
on  the  interaction  term  is  both  positive  and  significant  (0.329,  p  =  0.02).  Taken  together,  these 
results  suggest  a  reduction  in  the  magnitude  of  classification  shifting  using  discontinued  operations 
in  the  post-SEAS  No.  144  period. 

There  are  two  potential  reasons  for  this  finding.  Eirst,  SEAS  No.  144  broadened  the  definition 
of  discontinued  operations  by  replacing  the  “business  segment”  requirement  under  APB  No.  30 
with  the  “component  of  an  entity  concept.”  This  change  allowed  smaller  asset  dispositions  to  be 
reported  as  discontinued  operations,  increasing  the  reporting  frequency.  In  addition,  allowing 
smaller  asset  dispositions  to  qualify  as  discontinued  operations  potentially  reduces  the  amount  of 
operating  expenses  that  can  be  shifted  to  the  disposition.  Second,  the  Sarbanes-Oxley  Act  was 
enacted  around  the  same  time  as  SEAS  No.  144,  and  may  have  led  to  a  decrease  in  classification 
shifting  due  to  increased  regulation  and  oversight. 

Robustness  Tests 

To  investigate  whether  classification  shifting  using  income-decreasing  discontinued  opera¬ 
tions  is  concentrated  among  firms  frequently  reporting  discontinued  operations,  we  perform  the 
following  analysis.  Table  6,  Panel  A  reports  that  approximately  26  percent  (3,105/11,895)  of  our 
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Forecast  errors  are  defined  as  actual  earnings  per  share  less  the  last-issued  median  consensus  analyst  forecast. 
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TABLE  5 

Regressions  of  Unexpected  Core  Earnings  and  Unexpected  Change  in  Core  Earnings 

Pre  and  Post  SEAS  No.  144  Period 

Dependent  Variables 


VE_CEt _  UE_ACEt^i 


Independent  Variables 

Coefficient 

t-stat 

Coefficient 

t-stat 

Intercept 

-0.022 

—  8.91*** 

-0.026 

-11.50*** 

%DO_NEGi 

0.774 

10.63*** 

-0.211 

—3  14*** 

POST_2002 

-0.005 

-2.18** 

-0.008 

—3  92*** 

%DO  NEC,  * 

-0.411 

-2.38*** 

0.329 

2.09** 

POST_2002 

%DO_NEGt^i 

0.646 

10  91*** 

SIZE, 

0.000 

0.03 

0.005 

BM, 

0.000 

-0.23 

0.004 

5.30*** 

ACCRUALS, 

-0.036 

-3.17*** 

0.105 

1002*** 

OCF, 

0.373 

34.76*** 

0.166 

16.89*** 

ROA, 

-0.013 

-1.12 

-0.132 

-12  08*** 

Adjusted 

7.1% 

0.84% 

No.  of  Observations 

72,372 

72,372 

*,  ***  Indicate  significance  at  the  10,  5,  and  1  percent  levels,  respectively.  All  the  test  results  use  a  two-tailed  t-test 

except  %DO_NEGi,  %DO_NEGi  *  POSTJ2002,  and  %DO_NEG,^i  (use  a  one-tailed  t-test). 

POST_  2002  is  an  indicator  variable  that  takes  a  value  of  1  if  an  observation  is  from  fiscal  year  2002  and  afterwards,  and 
0  otherwise. 

All  other  variables  are  as  defined  in  Table  2. 


sample  firms  have  reported  discontinued  operations  at  least  once  during  the  sample  period.  How¬ 
ever,  only  1.6  percent  (191/11,895)  of  our  sample  companies  recognized  discontinued  operations 
more  than  four  times  during  the  sample  period.  Consequently,  we  do  not  believe  that  classification 
shifting  using  income-decreasing  discontinued  operations  is  concentrated  among  serial  recognizers 
of  discontinued  operations.  In  order  to  confirm  this  belief,  we  re-estimate  Equations  (3a)  and  (3b) 
(replacing  %DO(  with  %DO_NEG^  on  a  sample  of  firms  that  recognized  discontinued  operations 
only  once  during  the  sample  period.  Table  6,  Panel  B  reports  the  results  of  the  re-estimation. 
Overall,  the  results  are  consistent  with  the  results  reported  in  Table  3. 

As  a  further  robustness  check,  we  re-estimated  Equations  (3a)  and  (3b),  (replacing  %DOf  with 
%DO_NEGi)  after  deleting  those  companies  recognizing  more  than  four  discontinued  operations 
during  the  sample  period.  These  results  (untabulated)  are  also  consistent  with  the  main  results. 
Overall,  our  results  are  consistent  with  classification  shifting  taking  place  when  a  firm  reports 
income-decreasing  discontinued  operations. 

To  control  for  the  effect  of  special  items,  we  repeat  all  our  analyses  adding  a  continuous 
variable  for  income-decreasing  special  items  (scaled  by  sales)  as  an  independent  variable  in 
Equations  (3a),  (3b),  (4a),  and  (4b).  The  results  (not  tabulated)  are  similar  both  qualitatively  and 
quantitatively  to  what  is  reported  in  our  tables. 

Ean  et  al.  (2010)  observe  that  the  classification  shifting  results  documented  in  McVay  (2006) 
could  be  subject  to  a  research  design  problem.  They  assert  that  the  associations  between  income- 
decreasing  special  items  and  unexpected  core  earnings  and  unexpected  change  in  future  core 
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earnings  may  be  biased  because  noncash  special  items  are  included  in  total  accruals  that  are  used 
as  an  independent  variable  in  the  expectation  model.  We  do  not  believe  our  study  suffers  from  this 
potential  bias  because  the  results  from  discontinued  operations  are  reported  separately  from  the 
results  of  continuing  operations,  which  are  used  to  estimate  unexpected  core  earnings  and  accru¬ 
als.  Nevertheless,  as  a  robustness  check,  we  remove  contemporaneous  accruals  from  the  expecta¬ 
tion  models  and  re-estimate  unexpected  core  earnings  and  unexpected  changes  in  core  earnings. 
We  then  re-estimate  Equations  (3a),  (3b),  (4a),  and  (4b)  reported  in  Table  3.  The  results  (untabu¬ 
lated)  are  qualitatively  similar  to  those  in  Table  3.  For  Equations  (3a)  and  (4a),  the  coefficients  on 
%DOi  (q;i  =  0.436,  p  <  0.01)  and  %DO_NEGi  (ttj  =  0.752,  p  <  0.01)  are  both  positive  and 
significant.  For  Equations  (3b)  and  (4b),  the  coefficients  on  %DOt  {pi  =-0.080,  p  =  0.06)  and 
%DO_NEGt  ((pi  = -0.103,  p  =  0.05)  are  both  negative  and  significant.  Overall,  these  results 
suggest  that  our  findings  are  robust  to  the  potential  model  specification  error  observed  by  Fan  et  al. 
(2010). 


VI.  CONCLUSIONS 

This  study  examines  whether  managers  use  discontinued  operations  to  increase  core  earnings 
by  engaging  in  classification  shifting.  Following  a  methodology  similar  to  that  employed  by 
McVay  (2006),  we  find  a  positive  association  between  unexpected  core  earnings  and  discontinued 
operations.  Our  analysis  shows  that  the  positive  and  significant  association  between  unexpected 
core  earnings  and  discontinued  operations  is  driven  by  firms  with  losses  from  discontinued  op¬ 
erations.  We  interpret  these  results  as  evidence  that  firms  engage  in  classification  shifting  using 
discontinued  operations.  We  do  not  find  consistent  evidence  for  firms  reporting  income-increasing 
discontinued  operations,  although  classification  shifting  can  also  take  place  in  those  situations.  We 
leave  further  investigation  of  this  issue  to  future  research.  We  also  investigate  the  motivations  to 
manage  earnings  using  discontinued  operations,  finding  that  firms  reporting  income-decreasing 
discontinued  operations  use  classification  shifting  to  meet  or  beat  analyst  forecasts. 

Our  results  also  show  that  the  magnitude  of  classification  shifting  using  discontinued  opera¬ 
tions  has  declined  since  the  introduction  of  SFAS  No.  144.  Two  explanations  potentially  account 
for  this  finding.  First,  SFAS  No.  144  broadened  the  definition  of  discontinued  operations  by 
replacing  the  business  segment  requirement  under  APB  No.  30  with  the  component  of  an  entity 
concept.  This  change  allowed  firms  to  report  smaller  asset  dispositions  as  discontinued  operations, 
increasing  the  reporting  frequency.  In  addition,  allowing  smaller  asset  dispositions  to  qualify  as 
discontinued  operations  potentially  reduced  the  amount  of  operating  expenses  that  could  be  shifted 
to  the  disposition.  Second,  the  Sarbanes-Oxley  Act  was  enacted  around  the  same  time  as  SFAS 
No.  144,  and  could  have  led  to  a  decrease  in  classification  shifting  due  to  increased  regulation  and 
oversight. 

Our  results  are  robust  to  examining  a  sample  of  firms  reporting  discontinued  operations  only 
once  during  our  sample  period  and  controlling  for  income-decreasing  special  items.  We  also 
re-estimate  the  expectation  models  without  using  contemporaneous  accruals  and  find  consistent 
results. 

We  believe  the  findings  of  this  study  could  have  implications  for  the  convergence  project  on 
discontinued  operations  between  the  FASB  and  the  lASB.  We  also  believe  the  study  would  benefit 
investors  by  alerting  them  to  the  potential  use  of  income-decreasing  discontinued  operations  for 
earnings  management  purposes. 
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ABSTRACT:  We  document  that  firms  included  in  the  ExecuComp  database  tend  to  be 
larger,  more  complex,  followed  by  more  analysts,  have  greater  stock  liquidity  levels, 
and  have  higher  total,  but  less  concentrated,  institutional  ownership  than  other  firms. 
Based  on  these  differences,  we  test  and  find  support  for  three  predictions.  First,  Ex¬ 
ecuComp  firms  rely  more  heavily  on  earnings  and  stock  returns  in  determining  CEO 
cash  compensation.  Second,  the  weight  on  earnings  is  more  sensitive  to  differences  in 
the  extent  of  growth  opportunities  for  ExecuComp  firms.  Third,  the  positive  relation 
between  institutional  ownership  concentration  and  the  value  of  stock  option  grants  is 
stronger  for  ExecuComp  firms.  Overall,  our  results  suggest  that  ExecuComp  and  non- 
ExecuComp  firms  operate  in  different  contracting  environments  that  lead  to  differences 
in  the  design  of  their  executive  compensation  contracts.  As  a  result,  care  should  be 
taken  in  extending  results  based  on  ExecuComp  samples  to  non-ExecuComp  firms. 

Keywords:  ExecuComp  database;  executive  compensation;  stock  option  grants;  con¬ 
tracting  environment. 
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I.  INTRODUCTION 

In  recent  years,  researchers  have  relied  extensively  on  Standard  &  Poor’s  (S&P)  ExecuComp 
database,  which  covers  firms  in  the  S&P  500,  S&P  400  mid-cap,  and  S&P  600  small-cap 
indices,  to  investigate  a  wide  variety  of  corporate  governance  and  executive  compensation 
issues/  The  database  includes  most  of  the  large  publicly  traded  firms  in  the  United  States,  as  well 
as  medium  and  smaller  size  firms  that  provide  a  degree  of  heterogeneity.  Although  the  firms  in 
ExecuComp  comprise  approximately  88  percent  of  the  market  capitalization  of  publicly  traded 
firms  in  the  United  States,  they  only  represent  25  percent  of  the  firms  in  the  Compustat  database. 
Thus,  firms  excluded  from  the  ExecuComp  database  comprise  a  substantial  portion  of  samples  for 
empirical  studies  that  do  not  require  compensation  data.  Yet,  little  is  known  about  their  compen¬ 
sation  structures  or  how  the  determinants  of  their  CEO  incentive  contracts  differ  from  those  of 
ExecuComp  firms.  In  this  study,  we  document  differences  between  ExecuComp  and  non- 
ExecuComp  firms  across  multiple  dimensions  that  are  likely  to  affect  a  firm’s  contracting  envi¬ 
ronment  and  provide  evidence  regarding  how  compensation  functions  (determinants  of  CEO  cash 
compensation  and  option  grants)  differ  across  the  two  sets  of  firms. 

Because  the  ExecuComp  database  only  includes  firms  from  the  three  major  S&P  indices,  the 
firms  in  the  database  have  a  common  and  distinct  set  of  characteristics  that  are  likely  to  impact  the 
firm’s  contracting  environment.  Eor  example,  ExecuComp  firms  tend  to  be  more  complex,  with 
operations  in  more  business  and  geographic  segments,  and  higher  values  for  Herfindahl- 
Hirschman  concentration  indices  measuring  business  and  geographic  diversification.  On  average, 
ExecuComp  firms  also  have  a  greater  degree  of  analyst  following  and  stock  liquidity.  In  addition, 
ExecuComp  firms  have  a  larger  proportion  of  institutional  ownership  and  less  concentrated  insti¬ 
tutional  ownership  (i.e.,  ownership  is  distributed  across  a  greater  number  of  institutions).  Collec¬ 
tively,  we  expect  these  differences  to  influence  the  contracting  environment  and,  as  a  result,  the 
use  and  type  of  performance  metrics  in  contract  design.  To  provide  evidence  on  how  the  differ¬ 
ences  in  the  contracting  environment  can  lead  to  systematic  differences  in  the  determinants  of 
CEO  compensation,  we  test  three  empirical  predictions. 

Our  first  prediction  is  that  firms  in  the  ExecuComp  database  rely  more  heavily  on  earnings 
and  stock  returns  as  performance  measures  in  determining  CEO  cash  compensation.  As  shown  by 
Holmstrom  (1979)  and  Banker  and  Datar  (1989),  the  weight  placed  on  a  given  performance 
measure  in  an  optimal  incentive  contract  is  proportional  to  the  signal-to-noise  ratio  of  the  measure 
(i.e.,  the  extent  to  which  it  reflects  the  agent’s  actions,  as  opposed  to  external  factors  beyond  the 
agent’s  control).  In  our  setting,  we  expect  the  relatively  high  level  of  analyst  following  associated 
with  ExecuComp  firms  to  reflect  greater  public  scrutiny  over  reported  earnings,  thereby  leading  to 
a  higher  degree  of  earnings  quality  and  lower  noise  in  earnings  as  a  performance  measure.  In 
addition,  we  expect  the  higher  degree  of  stock  liquidity  for  ExecuComp  firms,  combined  with  the 
higher  degree  of  analyst  following,  to  increase  the  efficiency  with  which  market  prices  impound 
the  manager’s  actions  into  firm  value,  thereby  reducing  the  noise  in  stock  returns  as  a  measure  of 
performance.  Einally,  we  expect  the  costs  of  identifying,  defining,  and  weighting  individual  per¬ 
formance  measures  to  be  greater  for  the  more  complex,  diverse  firms  that  are  included  in  the 
ExecuComp  database,  thereby  leading  ExecuComp  firms  to  rely  more  heavily  on  aggregate  finan¬ 
cial  performance  measures.  The  combination  of  having  less  noise  in  earnings  and  stock  returns. 


From  1999-2006,  we  found  28  articles  published  in  The  Accounting  Review,  Journal  of  Accounting  and  Economics,  and 
the  Journal  of  Accounting  Research,  and  34  articles  published  in  the  Journal  of  Financial  Economics,  Journal  of 
Finance,  and  the  Review  of  Financial  Studies,  that  used  the  ExecuComp  database.  Although  we  restricted  our  search  to 
accounting  and  finance  journals,  papers  that  investigate  corporate  governance  and  executive  compensation  issues  pub¬ 
lished  in  journals  in  other  fields,  such  as  economics  and  management,  are  also  likely  to  rely  on  ExecuComp  data. 
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along  with  a  higher  cost  in  defining  individual  measures,  should  lead  ExecuComp  firms  to  place  a 
greater  weight  on  both  earnings  and  stock  returns  in  determining  CEO  compensation. 

We  test  this  prediction  by  using  the  standard  empirical  model  that  relates  cash  compensation 
to  accounting  and  stock  return  measures  of  performance  (i.e.,  Lambert  and  Larcker  1987;  Sloan 
1993;  Baber  et  al.  1996;  Baber  et  al.  1999;  Matsunaga  and  Park  2001).  Consistent  with  our 
prediction,  we  find  the  coefficients  on  both  accounting  and  stock  returns  are  significantly  larger  for 
ExecuComp  firms.  These  results  are  robust  to  the  exclusion  of  loss  firms,  controlling  for  differ¬ 
ences  in  CEO  ownership,  and  using  a  predicted  classification  based  on  characteristics  associated 
with  the  firm’s  contracting  environment.  The  latter  result  provides  evidence  that  our  findings 
reflect  differences  in  the  firm’s  contracting  environment,  as  opposed  to  the  classification  as  an 
ExecuComp  firm. 

While  our  first  prediction  relates  to  differences  in  the  average  weight  on  earnings  and  stock 
returns  between  the  ExecuComp  and  non-ExecuComp  samples,  our  second  prediction  relates  to 
differences  in  the  weight  on  earnings  across  firms  within  each  sample.  Baber  et  al.  (1996)  and 
Ittner  et  al.  (1997)  note  that  the  conservatism  in  the  accounting  system  reduces  the  ability  of 
accounting  earnings  to  properly  reflect  CEO  performance  for  growth  firms,  positing  that  the 
relative  weight  placed  on  earnings  is  negatively  related  to  the  firm’s  investment  opportunity  set 
(lOS).  Their  theory  implicitly  assumes  that,  in  the  absence  of  growth  opportunities,  firms  use 
earnings  as  a  primary  performance  measure.  While  this  assumption  may  be  appropriate  for  Ex¬ 
ecuComp  firms,  it  is  less  likely  to  be  valid  for  non-ExecuComp  firms.  As  noted  above,  indepen¬ 
dent  of  growth  opportunities,  we  expect  non-ExecuComp  firms  to  substitute  individual,  nonfinan- 
cial  measures  for  earnings  in  their  CEO  bonus  plans.  Thus,  while  high-growth  and  low-growth 
non-ExecuComp  firms  are  likely  to  choose  different  individual  performance  measures,  we  would 
expect  both  high-  and  low-growth  non-ExecuComp  firms  to  place  similarly  low  weights  on  earn¬ 
ings.  This  leads  to  our  prediction  that  the  weight  on  earnings  should  be  less  sensitive  to  the  firm’s 
lOS  for  non-ExecuComp  firms  than  for  ExecuComp  firms.^ 

Supporting  this  prediction,  we  find  the  predicted  negative  association  between  the  extent  of  a 
firm’s  investment  opportunity  set  and  the  weight  placed  on  earnings  in  our  sample  of  ExecuComp 
firms,  but  not  in  our  sample  of  non-ExecuComp  firms.  In  addition,  we  document  that  the  shift  in 
weights  from  earnings  toward  stock  returns  as  the  lOS  increases  is  significantly  larger  for  Execu¬ 
Comp  firms.  This  result  is  robust  to  adjustments  for  loss  firms,  CEO  ownership,  and  using  pre¬ 
dicted  classifications. 

Although  our  first  two  predictions  relate  to  cash  compensation,  the  differences  in  the  con¬ 
tracting  environment  also  have  implications  for  the  determinants  of  CEO  stock  option  grant 
incentives.  Specifically,  we  predict  that  the  positive  association  between  institutional  ownership 
concentration  and  the  extent  of  incentives  in  the  CEO’s  annual  stock  option  grant  documented  by 
Hartzell  and  Starks  (2003)  for  ExecuComp  firms  will  be  less  likely  to  hold  for  non-ExecuComp 
firms.  Hartzell  and  Starks  (2003)  argue  that  concentrated  ownership  reflects  the  incentives  of 
institutional  owners  to  monitor  executive  compensation  and  that,  as  institutional  ownership  be¬ 
comes  more  concentrated,  institutions  use  their  influence  to  increase  the  level  of  CEO  incentives. 


^  We  consider  a  firm  to  be  predicted  to  be  an  ExecuComp  firm  if  the  probability  generated  by  the  model  is  greater  than 
50  percent. 

^  Note  that  a  lower  average  weight  on  earnings  in  the  non-ExecuComp  sample  does  not  mechanically  lead  to  a  smaller 
shift  in  the  weight  for  the  non-ExecuComp  sample.  To  see  this,  consider  the  following  example:  Sample  A  contains  a 
high-growth  firm  that  places  a  10  percent  weight  on  earnings  and  a  low-growth  firm  that  places  a  90  percent  weight  on 
earnings.  Sample  B  contains  a  high-growth  firm  that  places  a  60  percent  weight  on  earnings  and  a  low-growth  firm  that 
places  an  80  percent  weight  on  earnings.  In  this  example,  the  difference  in  the  weights  between  high-  and  low-growth 
firms  for  Sample  A  is  considerably  larger  than  the  difference  in  weights  for  Sample  B  even  though  the  average  weight 
on  earnings  for  Sample  A  (50  percent)  is  lower  than  the  average  weight  on  earnings  for  Sample  B  (70  percent). 
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For  non-ExecuComp  firms,  the  lower  degree  of  firm  complexity  and  the  greater  noise  in  stock 
prices  leads  institutional  shareholders  to  use  alternative  performance  measures  to  evaluate  man¬ 
agement.  We  therefore  predict  that  the  positive  relation  between  concentrated  institutional  owner¬ 
ship  and  CEO  stock  option  grant  incentives  is  stronger  for  ExecuComp  firms  than  for  non- 
ExecuComp  firms. 

Our  results  support  this  prediction.  We  find  that  the  relation  between  institutional  ownership 
concentration  and  the  extent  of  CEO  stock  option  grant  incentives  is  significantly  more  positive 
for  ExecuComp  firms  than  it  is  for  non-ExecuComp  firms.  We  also  find  that  the  positive  associa¬ 
tion  between  institutional  ownership  concentration  and  the  extent  of  CEO  stock  option  grant 
incentives  is  only  present  in  the  ExecuComp  sample.  Our  findings  are  robust  to  adjustments  for 
differences  in  CEO  ownership  and  to  using  predicted  classifications. 

Finally,  we  examine  whether  the  systematic  differences  in  the  functions  that  determine  CEO 
pay  that  we  document  between  ExecuComp  and  non-ExecuComp  firms  are  also  present  within  the 
ExecuComp  sample  by  comparing  S&P  500  firms  to  non-S&P  500  firms  included  in  the  Execu¬ 
Comp  database.  We  choose  this  separation  because  many  of  the  differences  in  the  contractual 
environment  between  ExecuComp  and  non-ExecuComp  firms  are  also  present  between  S&P  500 
firms  and  non-S&P  500  ExecuComp  firms.  Given  the  cost  of  acquiring  compensation  data  from 
other  sources,  these  tests  provide  guidance  to  researchers  regarding  whether  they  can  use  Execu¬ 
Comp  data  to  assess  the  sensitivity  of  their  results  to  differences  in  the  contracting  environment. 

We  find  limited  evidence  that  the  differences  we  document  across  ExecuComp  and  non- 
ExecuComp  firms  can  be  found  within  the  ExecuComp  database.  Specifically,  with  respect  to  our 
first  prediction,  we  find  that  S&P  500  firms  place  a  larger  weight  on  stock  returns,  but  we  do  not 
find  a  significant  difference  in  the  weight  placed  on  accounting  earnings  by  S&P  500  and  non-S&P 
500  firms  within  the  ExecuComp  sample.  Also,  we  do  not  find  significant  differences  between  the 
two  groups  of  firms  with  respect  to  the  predicted  negative  association  between  the  firm’s  invest¬ 
ment  opportunity  set  and  the  weight  placed  on  earnings  or  the  relation  between  institutional 
ownership  concentration  and  CEO  stock  option  grant  incentives.  In  sum,  these  findings  suggest 
that  tests  that  compare  groups  of  firms  within  the  ExecuComp  database  should  not  be  used  as 
substitutes  for  tests  that  compare  ExecuComp  and  non-ExecuComp  firms  to  assess  the  impact  of 
contracting  environments  on  the  design  of  CEO  incentive  contracts. 

Overall,  we  document  that  ExecuComp  firms  are  more  complex,  attract  a  greater  degree  of 
analyst  following,  have  higher,  but  less  concentrated  institutional  ownership,  and  have  a  greater 
degree  of  stock  liquidity  than  non-ExecuComp  firms.  We  refer  to  these  factors  collectively  as  the 
firm’s  contracting  environment  and  examine  whether  the  differences  in  contracting  environments 
between  ExecuComp  and  non-ExecuComp  firms  lead  to  predictable  differences  in  the  design  of 
CEO  incentive  contracts.  We  find  that  ExecuComp  firms  place  a  greater  weight  on  earnings  and 
stock  returns  in  setting  CEO  cash  compensation,  and  that  the  weight  placed  on  earnings  is  more 
sensitive  to  the  extent  of  the  firm’s  growth  opportunities  for  ExecuComp  firms  than  non- 
ExecuComp  firms.  Finally,  the  relation  between  institutional  ownership  concentration  and  CEO 
stock  option  grant  incentives  is  significantly  positive  only  for  ExecuComp  firms. 

Our  study  contributes  to  the  literature  on  the  determinants  of  executive  compensation  con¬ 
tracts  by  providing  insight  into  the  design  of  CEO  incentive  contracts  for  a  set  of  firms  that  have 
different  contracting  environment  characteristics  from  those  of  firms  typically  examined  in  em¬ 
pirical  studies  on  executive  compensation.  Our  findings  indicate  that  expanding  samples  to  include 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


Determinants  of  CEO  Pay:  A  Comparison  of  ExecuComp  and  Non-ExecuComp  Firms 


1515 


non-ExecuComp  firms  can  substantially  influence  empirical  results  and  provide  researchers  with 
additional  insights  into  executive  compensation  and  corporate  governance  issues."^ 

Finally,  we  find  evidence  to  support  the  Baber  et  al.  (1996)  prediction  regarding  the  impact  of 
growth  opportunities  on  the  relative  weight  on  earnings  and  the  Hartzell  and  Starks  (2003)  pre¬ 
diction  regarding  the  positive  relation  between  institutional  ownership  concentration  and  CEO 
stock  option  grant  incentives  in  the  ExecuComp  sample,  but  not  in  our  non-ExecuComp  sample. 
As  a  result,  researchers  should  exercise  caution  when  generalizing  results  based  on  an  ExecuComp 
sample  to  broader  samples. 

Section  II  discusses  our  sample.  Section  III  documents  differences  between  ExecuComp  and 
non-ExecuComp  firms  with  regard  to  firm,  CEO,  and  monitoring  environment  characteristics. 
Section  IV  develops  empirical  predictions  based  on  these  differences.  Section  V  reports  evidence 
on  differences  between  ExecuComp  and  non-ExecuComp  firm  CEO  compensation  functions. 
Section  VI  provides  the  results  of  tests  that  compare  S&P  500  firms  with  non-S&P  500  firms 
included  in  the  ExecuComp  database.  Section  VII  concludes. 

11.  DESCRIPTION  OF  EXECUCOMP  AND  NON-EXECUCOMP  FIRM  SAMPLES 

The  Standard  &  Poor’s  ExecuComp  database  currently  provides  data  from  1992  onward  for 
the  firms  in  the  S&P  1500  index,  which  includes  the  S&P  500,  S&P  400  mid-cap,  and  S&P  600 
small-cap  indices.  From  2000  to  2007,  ExecuComp  firms  comprise  approximately  25  percent  of 
the  domestic  firms  in  the  Compustat  database.  Over  this  same  period,  the  market  capitalization  of 
ExecuComp  firms  is  approximately  88  percent  of  the  market  capitalization  of  all  domestic  Com¬ 
pustat  firms. ^ 

According  to  Standard  &  Poor’s,  the  goal  of  the  committees  that  determine  the  composition  of 
the  S&P  1500  indices  is  to  ensure  that  the  firms  act  as  leading  indicators  of  large-cap,  medium- 
cap,  and  small-cap  U.S.  equities  and  have  sufficient  liquidity  to  ensure  efficient  portfolio  trading. 
As  a  result,  firms  that  are  included  must  meet  a  number  of  criteria.  One  of  these  criteria  is  financial 
viability,  which  is  usually  determined  by  four  consecutive  quarters  of  positive  GAAP  earnings 
before  discontinued  operations  and  extraordinary  items.  S&P  also  requires  each  firm  to  have 
adequate  stock  liquidity.  As  a  result  of  these  criteria,  the  ExecuComp  database  is  likely  to  include 
a  relatively  high  proportion  of  large  and  profitable  firms  with  stable  cash  flows  and  liquid  shares. 

We  obtain  compensation  data  for  non-ExecuComp  firms  for  the  fiscal  years  2000-2007  from 
the  Momingstar  Historical  Governance  database.  This  database  can  be  purchased  from  Morning- 
star,  and  provides  machine-readable  compensation  data  for  firms  included  in  the  Russell  3000  and 
S&P  1500  indices.  Hence,  the  non-ExecuComp  firms  we  study  are  included  in  the  Russell  3000 
index,  but  not  in  the  S&P  1500  indices.  The  Russell  3000  index  consists  of  firms  in  the  Russell 
lai-ge-cap  1000  (the  1000  largest  firms  in  the  U.S.  in  terms  of  market  capitalization  in  a  given 
year),  and  the  Russell  small-cap  2000  (the  next  2000  largest  firms  in  the  U.S.  in  terms  of  market 
capitalization  in  a  given  year).  We  note  that  according  to  Russell  Investments,  the  Russell  2000 
small-cap  index  makes  up  almost  10  percent  of  the  market  capitalization  of  U.S.  equities.  Thus, 


In  addition  to  using  data  collected  from  the  Momingstar  Historical  Governance  database,  our  source  for  compensation 
data  for  non-ExecuComp  firms,  other  data  sources  available  to  researchers  wishing  to  study  broader  samples  include 
Equilar  and  the  Corporate  Library.  Further,  in  some  instances  hand-collection  of  compensation  data  for  non-ExecuComp 
firms  could  be  warranted  and  necessary  to  gain  specific  insights  about  compensation  practices. 

^  Percentages  were  calculated  using  the  sample  of  firms  with  data  available  on  Compustat  excluding  firms  incorporated 
abroad  for  the  fiscal  years  2000  through  2007.  Market  values  are  measured  as  the  price  of  the  firm’s  stock  at  the  calendar 
year-end  multiplied  by  the  number  of  outstanding  shares  at  calendar  year-end. 
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our  sample  of  non-ExecuComp  firms  enables  us  to  study  the  compensation  practices  of  a  group  of 
firms  that  we  know  little  about,  even  though  this  group  of  firms  is  economically  important.^ 

As  shown  in  Panel  A  of  Table  1,  from  2000  through  2007,  there  are  a  total  of  13,828 
firm-years  available  on  ExecuComp  and  a  total  of  17,480  firm-years  available  for  Momingstar- 
non-ExecuComp  firms.  After  deleting  observations  with  insufficient  data  for  the  tests  that  follow, 
our  final  sample  consists  of  11,928  ExecuComp  and  11,168  non-ExecuComp  firm-years.  Panel  B 
provides  information  regarding  the  distribution  of  observations  across  the  two  samples  by  year. 
The  number  of  Momingstar-non-ExecuComp  firms  increases  over  time.  There  is  a  large  increase 
in  this  number  of  firms  between  2001  and  2002.  This  large  increase  occurs  because  the  years  2000 
and  2001  are  the  first  years  for  which  the  Momingstar  Historical  Governance  database  has  com¬ 
prehensive  CEO  compensation  data  and  during  these  two  years  CEO  compensation  data  was  not 
collected  for  all  the  firms  in  the  Russell  3000  and  S&P  1500  indices.  Subsequent  to  2002,  there  is 
a  gradual  annual  increase  in  the  number  of  Momingstar-non-ExecuComp  firms  because  once 
Momingstar  covers  a  firm,  they  continue  to  track  the  firm  as  long  as  it  is  traded  on  a  U.S.  stock 
exchange.  As  a  result,  the  sample  of  non-ExecuComp  firms  in  the  database  increases  over  time,  as 
firms  are  dropped  from  the  Russell  3000  and  new  firms  are  added  to  this  index.  In  contrast  to  the 
Momingstar-non-ExecuComp  firms,  the  number  of  ExecuComp  firms  decreases  over  our  sample 
period  because,  although  ExecuComp  likewise  retains  firms  that  are  dropped  from  one  of  the  S&P 
1500  indices,  it  also  backfills  the  data  for  new  firms  that  are  added  to  an  index. 

With  respect  to  industry  composition.  Table  1 ,  Panel  C  shows  that  the  two  sets  of  firms  appear 
to  be  relatively  similar,  with  the  notable  exceptions  of  a  lower  proportion  of  financial  institutions 
(12.2  percent  versus  25.8  percent)  and  a  higher  proportion  of  transportation  and  gas  companies 
(10.9  percent  versus  6.0  percent)  and  wholesale  and  retail  firms  (11.4  percent  versus  6.9  percent) 
in  the  ExecuComp  sample.  The  difference  in  the  proportion  of  financial  institutions  is  driven  by 
the  banking  industry  (two-digit  SIC  code  60),  and  the  difference  for  transportation  and  gas  com¬ 
panies  is  driven  by  the  utility  industry  (two-digit  SIC  code  49).’  The  difference  in  the  wholesale 
and  retail  sector  is  not  driven  by  a  specific  two-digit  SIC  industry  within  this  sector. 

III.  DIFFERENCES  BETWEEN  EXECUCOMP  AND  NON-EXECUCOMP  EIRM 

CHARACTERISTICS 

Table  2  compares  firm,  CEO,  institutional  ownership,  and  incentive  compensation  character¬ 
istics  for  the  two  samples,  both  for  the  full  samples  and  after  excluding  banking  and  utility  firms. 
In  Panel  A  of  Table  2  we  report  statistics  of  firm  characteristics  for  firms  in  the  ExecuComp, 
Momingstar-non-ExecuComp,  and  Compustat  samples.  Overall,  non-ExecuComp  firm  character¬ 
istics  significantly  differ  from  those  of  ExecuComp  firms.  In  addition,  their  characteristics  more 
closely  resemble  those  of  the  broad  sample  of  Compustat  firms  than  those  of  ExecuComp  firms. 

Given  the  selection  criteria  for  inclusion  in  the  three  major  S&P  indices,  it  is  not  surprising  to 
find,  in  Panel  A  of  Table  2,  that  ExecuComp  firms  are  substantially  larger  and  more  profitable  than 
are  Momingstar-non-ExecuComp  firms.  Median  market  capitalization  (return  on  assets)  for  Ex¬ 
ecuComp  and  non-ExecuComp  firms  is  $1.56  billion  dollars  (4.5  percent)  and  $165  million  dollars 
(1.1  percent),  respectively.  ExecuComp  firms  are  also  more  highly  leveraged  and  have  lower 


^  When  there  was  an  overlap  between  ExecuComp  and  Momingstar  data,  we  found  agreement  among  the  data  sources  for 
various  data  fields.  In  the  analyses  that  follow,  the  ExecuComp  sample  is  restricted  to  firms  with  data  available  from 
ExecuComp  and  the  Momingstar  non-ExecuComp  sample  is  restricted  to  observations  that  do  not  appear  in  the  Execu¬ 
Comp  srunple. 

’  Approximately  5.4  percent  of  ExecuComp  firms  are  in  the  banking  industry,  as  opposed  to  18.5  percent  of  firms  in  the 
Momingstar-non-ExecuComp  sample.  Also,  about  6.0  percent  of  ExecuComp  firms  are  utility  firms,  while  only  1.8 
percent  of  Morningstar-non-ExecuComp  firms  are  utility  firms. 
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idiosyncratic  risk.  Differences  in  growth  prospects  across  the  two  samples  are  less  clear.  Non- 
ExecuComp  firms  have  higher  mean  values  for  the  market-to-book  ratio  and  the  investment 
opportunity  set  factor  score  (lOS),  but  ExecuComp  firms  have  a  higher  median  value  for  the 
market-to-book  ratio. 

Panel  A  of  Table  2  also  reports  information  on  analyst  coverage,  firm  complexity,  and  stock 
liquidity.  Eight  analysts  follow  the  median  ExecuComp  firm,  while  only  one  analyst  tracks  the 
median  non-ExecuComp  firm.  This  suggests  that  there  is  likely  to  be  greater  public  scrutiny  of 
ExecuComp  than  non-ExecuComp  firms.  We  also  find  that  ExecuComp  firms  have  more  complex 
operations  than  non-ExecuComp  firms,  where  complexity  is  measured  as  in  Bushman  et  al. 
(2004),  using  a  firm’s  reported  number  of  business  or  geographic  segments  and  Herfindahl- 
Hirschman  indices  measuring  a  firm’s  business  or  geographic  diversification.  Further,  using  the 
Amihud  (2002)  measure  of  the  illiquidity  of  a  firm’s  stock  (average  daily  ratio  of  stock  return  to 
dollar  volume),  we  document  that  the  shares  of  ExecuComp  firms  are  significantly  more  liquid 
than  those  of  non-ExecuComp  firms.  These  differences  persist  after  excluding  banking  and  utility 
firms. 

Table  2,  Panel  B  presents  a  comparison  of  CEO  and  institutional  ownership  characteristics 
between  the  two  samples.  CEOs  of  ExecuComp  firms  are,  on  average,  older  and  have  shorter 
tenures.  Also,  CEOs  of  ExecuComp  firms  own  a  smaller  percentage  of  the  firm’s  outstanding 
shares.  The  median  CEO  ownership  for  ExecuComp  and  non-ExecuComp  firms  is  0.2  percent  and 
0.9  percent,  respectively. 

We  also  find  significant  differences  in  institutional  ownership  structure  between  the  two 
groups  of  firms.  Although  institutions  own  approximately  twice  the  proportion  of  outstanding 
shares  of  ExecuComp  firms,  this  ownership  is  distributed  across  more  institutions  and  is  thereby 
less  concentrated  than  the  institutional  ownership  of  non-ExecuComp  firms.  Because  institutional 
monitoring  should  be  more  intense  when  institutional  ownership  concentration  is  higher  (e.g., 
Shleifer  and  Vishny  1986;  Hartzell  and  Starks  2003),  institutions  may  monitor  the  management  of 
non-ExecuComp  firms  with  greater  intensity  than  they  would  the  management  of  ExecuComp 
firms. 

Finally,  Table  2,  Panel  C  compares  incentive  compensation  for  CEOs  across  the  ExecuComp 
and  non-ExecuComp  samples.  We  find  that  the  bonus  is  a  greater  percentage  of  total  cash  com¬ 
pensation  for  firms  in  the  ExecuComp  sample,  and  that  total  compensation  varies  more  from  year 
to  year.  This  panel  also  shows  that  ExecuComp  firms  grant  a  larger  percentage  of  total  compen¬ 
sation  in  the  form  of  stock  options  and  restricted  stock.  Further,  this  panel  documents  that  the 
sensitivity  of  the  value  of  the  CEO’s  annual  option  and  restricted  stock  grants  to  a  1  percent 
change  in  firm  value  is  markedly  higher  for  ExecuComp  firms  than  it  is  for  non-ExecuComp  firms. 

Table  3  provides  evidence  regarding  the  extent  to  which  contracting  environment  character¬ 
istics  explain  the  likelihood  that  a  firm  is  included  in  the  ExecuComp  database.  This  analysis  also 
addresses  the  concern  that  firm  size  fully  captures  the  differences  between  ExecuComp  and  non- 
ExecuComp  samples.  Using  the  combined  sample  of  ExecuComp  and  non-ExecuComp  firms,  we 
estimate  a  probit  regression  that  predicts  whether  a  firm  is  included  in  the  ExecuComp  database 
with  variables  that  proxy  for  firm  size,  growth  opportunities,  the  level  and  concentration  of  insti¬ 
tutional  ownership,  analyst  coverage,  the  number  of  reported  business  segments,  and  the  illiquidity 
of  a  firm’s  stock.  To  separately  evaluate  the  impact  of  firm  characteristics  correlated  with  firm  size 


*  Because  our  sample  period  starts  after  1998,  when  SPAS  No.  131  was  adopted,  the  number  of  business  segments 
reported  by  a  firm  is  unlikely  to  be  affected  by  discretionary  disclosure  issues.  Consequently,  this  number  should  be  a 
valid  measure  of  firm  complexity. 
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TABLE  3 


ExecuComp  Prediction  Model 


Panel  A:  Probit  Regression  Model^ 

(1) 

(2) 

(3) 

Intercept 

-0.091 

-3.631*** 

(-0.72) 

(-8.89) 

(-29.95) 

Market-to-book  equity 

0.033*** 

0.029*** 

(4.11) 

(3.51) 

lOS  factor  score 

-17.237*** 

-18.666*** 

(-10.74) 

(-11.54) 

Institutional  ownership 

1.193*** 

1.154*** 

(11.78) 

(11.19) 

Institutional  ownership 

-2.921*** 

-2.038*** 

concentration 

(-19.02) 

(-12.19) 

Analyst  coverage 

0.061*** 

0.024*** 

(12.60) 

(4.65) 

Number  of  segments 

0.061*** 

0.054*** 

(11.76) 

(10.02) 

Illiquidity  of  firm’s  shares 

0.000 

0.003*** 

(0.25) 

(2.90) 

Log  of  the  market  value  of 

0.268*** 

0.590*** 

equity 

(10.14) 

(30.59) 

Pseudo 

0.422 

0.440 

0.335 

n 

23,096 

23,096 

23,096 

Panel  B:  Prediction  Errors  Using  p  =  0.50  Cut-Off'’ 

Predicted  Classification 

ExecuComp  Non-ExecuComp 

Total 

Model  Excluding  Size 

True  Classification 

ExecuComp 

10,444 

1,484 

11,928 

(87.6%) 

(12.4%) 

Non-ExecuComp 

2,346 

8,822 

11,168 

(21.0%) 

(79.0%) 

Total 

12,790 

10,306 

23,096 

Model  Including  Size 

True  Classification 

ExecuComp 

10,510 

1,418 

11,928 

(88.1%) 

(11.9%) 

Non-ExecuComp 

2,202 

8,966 

11,168 

(19.7%) 

(80.3%) 

Total 

12,712 

10,384 

23,096 

Model  Including  Only  Size 

True  Classification 

ExecuComp 

9,737 

2,191 

11,928 

(81.6%) 

(18.4%) 

(continued  on  next  page) 
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Panel  B:  Prediction  Errors  Using  p 

=  0.50  Cut-Off** 

Predicted  Classification 

ExecuComp 

Non-ExecuComp 

Total 

N  on-ExecuComp 

2,375 

8,793 

11,168 

(11.0%) 

(89.0%) 

Total 

12,112 

10,984 

23,096 

***  Indicates  that  coefficients  are  different  from  zero  at  the  1  percent  significance  level. 

The  sample  period  is  2000-2007.  The  models  are  probit  regression  models  in  which  the  dependent  variable  equals  1  if 
a  firm  is  included  in  the  ExecuComp  database  and  equals  0  if  a  firm  is  a  non-ExecuComp  firm  included  in  the 
Momingstar  Historical  Governance  database.  Market-to-book  equity  is  the  market  value  of  the  firm’s  equity  at  the  end 
of  the  fiscal  year  scaled  by  the  book  value  of  the  firm’s  equity.  lOS  factor  score  represents  the  investment  opportunity 
set  based  on  investment  intensity,  market-to-book  assets,  geometric  mean  annual  growth  rate  of  market  value  of  assets, 
and  R&D  expenditures  scaled  by  assets  following  Baber  et  al.  (1996).  Institutional  ownership  equals  the  number  of 
shares  held  by  institutions  as  a  percentage  of  total  outstanding  shares.  Institutional  ownership  concentration  is  the 
fractional  ownership  held  by  the  top  five  institutions  scaled  by  the  fraction  of  outstanding  shares  held  by  all  institutions. 
Logarithm  of  market  value  of  equity  is  calculated  at  the  end  of  the  fiscal  year.  Illiquidity  of  firm’s  shares  is  measured  as 
in  Amihud  (2002)  and  represents  the  average  daily  ratio  of  stock  return  to  dollar  volume.  Year  indicator  variables  are 
included  in  all  the  models,  t-statistics  for  two-tailed  tests  are  in  parentheses  and  are  based  on  standard  errors  that  are 
adjusted  for  the  clustering  of  observations  at  the  firm  level. 

’’  A  firm  is  predicted  to  be  an  ExecuComp  firm  if  using  the  fitted  values  from  the  probit  model  for  whether  a  firm  is 
included  in  the  ExecuComp  database  we  find  that  the  probability  that  the  firm  is  an  ExecuComp  firm  is  at  least  50 
percent.  In  Panel  B  the  rows  provide  information  on  the  correct  classification  of  ExecuComp  and  non-ExecuComp  firms 
using  the  fitted  values  from  the  probit  model.  The  columns  provide  information  on  the  number  of  ExecuComp  or 
non-ExecuComp  firms  predicted  to  be  one  or  the  other  of  the  two  firm  types  using  the  fitted  values  from  the  probit 
model. 


on  the  classification  of  firms  across  the  two  samples,  we  first  estimate  the  regression  without  size 
as  an  explanatory  variable  and  then  add  size  to  the  regression  model.  Finally,  we  run  the  regression 
using  firm  size  as  the  only  explanatory  variable. 

The  results  in  the  first  column  in  Panel  A  of  Table  3,  in  which  size  is  excluded  from  the 
model,  show  that  all  of  the  independent  variables  except  for  the  illiquidity  of  a  firm’s  stock  have 
a  significant  incremental  effect  in  explaining  whether  a  firm  is  included  in  the  ExecuComp  data¬ 
base.  In  the  second  column  in  Panel  A  of  Table  3,  we  include  the  natural  logarithm  of  the  market 
value  of  equity  as  an  explanatory  variable.  All  of  the  significant  associations  documented  in  the 
first  column  are  robust  to  the  inclusion  of  firm  size,  indicating  that  differences  in  firm  size  between 
the  two  groups  of  firms  do  not  completely  drive  the  significant  associations  documented  in  the  first 
column.  Although  the  coefficient  on  firm  size  is  positive  and  significant,  as  expected,  the  resulting 
increase  in  the  explanatory  power  of  the  probit  regression  after  including  firm  size,  as  measured  by 
the  pseudo  R^,  is  relatively  small  (0.422  to  0.440). 

The  third  column  in  this  panel  shows  that  when  firm  size  is  the  only  explanatory  variable  in 
the  probit  model,  the  pseudo  is  0.335.  However,  the  increase  in  the  pseudo  R^  when  we  add  the 
other  variables  to  the  regression  (0.335  to  0.440)  further  supports  the  contention  that  firm  size  is 
an  important,  but  not  all-encompassing  variable  in  distinguishing  ExecuComp  and  non- 
ExecuComp  firms. 

To  further  evaluate  the  economic  determinants  of  the  ExecuComp  classification,  we  use  the 
fitted  values  from  the  three  models  in  Panel  A  of  Table  3  to  predict  whether  a  firm  is  included  in 
the  ExecuComp  database.  We  classify  a  firm  as  predicted  to  be  an  ExecuComp  firm  if  the  prob¬ 
ability  that  the  firm  is  an  ExecuComp  firm  is  greater  than  50  percent.  The  results,  shown  in  Panel 
B  of  Table  3,  indicate  that  the  first  model  in  Panel  A  of  Table  3  correctly  classifies  observations  as 
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being  ExecuComp  observations  87.6  percent  of  the  time.  While  this  is  substantially  higher  than  a 
naive  prediction  that  every  firm  is  an  ExecuComp  firm,  which  would  be  correct  51.6  percent 
(11,928/23,096)  of  the  time  given  the  distribution  of  our  sample,  some  error  still  remains.  Panel  B 
also  shows  that  the  second  model  in  Panel  A  of  Table  3  correctly  classifies  ExecuComp  firm 
observations  88.1  percent  of  the  time.  Thus,  adding  firm  size  to  the  prediction  model  slightly 
improves  its  predictive  accuracy.  Finally,  the  model  that  only  includes  firm  size  as  an  explanatory 
variable  correctly  classifies  ExecuComp  firm  observations  81.6  percent  of  the  time.  Consistent 
with  our  interpretation  of  the  results  in  Panel  A  of  Table  3,  the  results  presented  here  suggest  that, 
although  firm  size  is  an  important  determinant  of  a  firm’s  inclusion  in  the  ExecuComp  database, 
other  characteristics  are  also  important  for  predicting  this  likelihood.^ 

IV.  EMPIRICAL  PREDICTIONS 

A  compensation  contract  consists  of  a  set  of  performance  measures  and  weights  that  convert 
outcomes  to  pay,  through  mechanisms  such  as  a  bonus  schedule.  Theoretically,  assuming  costless 
contracting,  an  employment  contract  could  contain  an  infinite  number  of  performance  measures,  as 
long  as  each  performance  measure  provides  incremental  information  regarding  the  agent’s  actions 
(Holmstrom  1979).  However,  under  costly  contracting,  a  firm  faces  a  cost-benefit  trade-off  when 
deciding  whether  to  add  a  performance  measure  to  the  contract.  Because  the  costs  of  adding 
performance  measures  are  generally  increasing  in  the  number  of  existing  performance  measures, 
while  the  incremental  benefit  of  adding  performance  measures  decreases,  firms  typically  use  a 
relatively  small  number  of  performance  measures  in  contract  design. 

Firms  commonly  use  aggregate  financial  measures,  such  as  earnings  or  stock  returns,  because 
they  are  readily  available,  and  are  either  audited  (accounting  earnings),  or  determined  by  indepen¬ 
dent  parties  (market  returns).  However,  as  noted  by  Ittner  et  al.  (1997),  firms  can  increase  effi¬ 
ciency  by  supplementing  the  contract  with  nonfinancial  performance  measures,  such  as  product 
quality  measures  or  customer  satisfaction  (i.e.,  transferring  a  portion  of  the  weight  on  the  aggre¬ 
gate  financial  measures  to  the  nonfinancial  measures).  The  value  of  adding  nonfinancial  measures 
is  determined  by  the  firm’s  strategy,  the  incremental  information  provided  by  the  nonfinancial 
measures,  and  the  extent  of  exogenous  noise  in  the  financial  measures. 

In  a  relatively  simple  firm,  the  board  can  readily  identify  the  firm’s  strategy  and  define 
performance  measures  that  capture  the  key  aspects  to  generating  success,  such  as  customer  satis¬ 
faction.  However,  in  a  more  complex  firm,  optimal  strategies  and  keys  to  success  are  likely  to 
differ  across  business  units.  For  instance,  while  customer  satisfaction  may  be  the  key  to  profit¬ 
ability  in  one  business  unit,  increasing  market  share  or  minimizing  product  defect  rates  could  be 
of  greater  importance  in  other  business  units.  Thus,  as  the  complexity  of  the  firm  increases,  it 
becomes  increasingly  costly  for  a  firm  to  balance  conflicting  strategies  across  business  units  and 
determine  the  appropriate  weights  to  place  on  individual  performance  measures.  Consequently,  we 
expect  that  the  more  complex  ExecuComp  firms  are  more  likely  to  rely  on  aggregate  financial 
performance  measures  that  allow  their  CEOs  to  use  their  presumably  superior  knowledge  of  firm 
operations  to  determine  the  optimal  allocation  of  effort  across  activities  and  business  units. 


^  We  also  use  the  combined  sample  of  ExecuComp  and  non-ExecuComp  firms  to  estimate  the  basic  regressions  found  in 
Tables  4-6  on  each  size  quartile.  Overall,  we  do  not  find  results  across  the  size  quartiles  that  are  consistent  with  the 
results  in  these  tables  being  driven  by  firm  size. 

The  cost  of  adding  a  performance  measure  includes  the  cost  of  determining  what  to  measure,  how  to  measure  it,  and  the 
optimal  weight  to  place  on  the  measure.  Because  each  of  these  steps  requires  a  firm  to  assess  the  interaction  of  the 
proposed  measure  with  all  of  the  other  measures  in  the  contract,  the  cost  of  adding  a  performance  measure  increases 
with  the  number  of  measures  in  the  contract. 
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At  the  same  time,  the  higher  analyst  following  and  more  liquid  shares  of  ExecuComp  firms 
reduce  the  exogenous  noise  in  earnings  and  stock  returns  for  these  firms.  Specifically,  the  addi¬ 
tional  scrutiny  of  ExecuComp  firms  by  analysts  leads  these  firms  to  increase  their  investment  in 
information  systems  and  reduces  the  ability  of  managers  to  manipulate  earnings.  Likewise,  com¬ 
petition  among  analysts  generates  additional  information  that  leads  to  the  capital  market  having 
more  precise  information  to  evaluate  the  firm’s  earnings  (Piotroski  and  Roulstone  2004).  Further, 
the  higher  stock  liquidity  of  ExecuComp  firms  improves  the  efficiency  with  which  information  is 
impounded  into  stock  prices,  and  as  a  result,  the  stock  returns  of  ExecuComp  firms  are  likely  to 
better  reflect  the  value  implications  of  the  CEO’s  actions. 

Overall,  we  expect  the  combination  of  the  greater  complexity  of  firm  operations,  higher 
analyst  following,  and  greater  stock  liquidity  of  ExecuComp  firms  to  increase  the  cost  of  using 
individual  performance  measures  and  reduce  the  noise  associated  with  earnings  and  stock  returns 
as  measures  of  performance.  As  a  result,  we  predict  that  ExecuComp  firms  exhibit  stronger 
relations  between  CEO  pay  with  earnings  and  stock  returns  than  do  non-ExecuComp  firms. 

The  hypothesis  that  non-ExecuComp  firms  rely  more  heavily  on  individual  performance 
measures  and  less  on  aggregate  financial  performance  measures  also  has  implications  for  shifts  in 
the  relative  weights  between  earnings  and  stock  returns.  For  example,  Baber  et  al.  (1996)  and 
Ittner  et  al.  (1997)  argue  that,  due  to  the  historical  basis  of  earnings,  as  the  importance  of  invest¬ 
ment  opportunities  increases  relative  to  assets  in  place,  the  value  of  accounting-based  measures  of 
performance  declines.  However,  this  reasoning  assumes  that  firms  with  relatively  low  levels  of 
investment  opportunities  place  a  relatively  high  weight  on  earnings.  As  noted  above,  non- 
ExecuComp  firms  are  likely  to  substitute  individual  performance  measures  for  earnings,  even  in 
the  absence  of  growth  opportunities.  Thus,  for  non-ExecuComp  firms,  we  would  expect  growth 
firms  to  shift  the  weights  between  individual  performance  measures  rather  than  between  aggregate 
financial  performance  measures.  While  the  individual  performance  measures  in  CEO  bonus  con¬ 
tracts  are  generally  not  publicly  disclosed,  the  underlying  theory  leads  to  the  empirical  prediction 
that  the  sensitivity  of  the  weight  on  earnings  to  the  firm’s  investment  opportunity  set  should  be 
greater  for  ExecuComp  than  non-ExecuComp  firms. 

Finally,  we  expect  the  differences  in  the  contracting  environment  to  affect  the  monitoring  role 
played  by  institutional  investors.  Hartzell  and  Starks  (2003)  document  a  significantly  positive 
relation  between  institutional  ownership  concentration  and  the  extent  of  incentives  provided  to 
executives  through  annual  stock  option  grants.  Their  interpretation  is  that  institutional  investors 
monitor  the  contracting  process  and  use  their  influence  to  increase  the  degree  of  management 
incentives.  They  also  argue  that  because  institutions  that  monitor  bear  the  full  cost  of  the  moni¬ 
toring  activity  and  share  the  benefit  with  the  other  shareholders,  the  extent  of  monitoring  increases 
with  the  degree  of  concentration  of  institutional  ownership,  and  an  individual  institution’s  influ¬ 
ence  over  the  board  increases  as  its  ownership  becomes  more  concentrated.  We  expect  the  positive 
relation  documented  by  Hartzell  and  Starks  (2003)  is  less  likely  to  hold  for  non-ExecuComp  firms 
because  the  lower  degree  of  complexity  and  the  associated  increase  in  the  availability  of  individual 
performance  measures  for  non-ExecuComp  firms  reduces  the  cost  for  institutions  to  directly  evalu¬ 
ate  management’s  decisions  or  use  alternative  performance  measures  to  evaluate  management’s 
effectiveness.  In  addition,  the  expected  higher  degree  of  noise  in  stock  prices  for  non-ExecuComp 
firms  increases  the  noise  in  stock-based  incentives,  such  as  stock  options,  for  those  firms.  Conse¬ 
quently,  institutions  that  invest  in  non-ExecuComp  firms  are  more  likely  to  directly  monitor 
management,  or  use  individual  performance  measures  to  evaluate  management.  As  a  result,  a 
positive  relation  between  institutional  ownership  concentration  and  stock  option  grant  incentives  is 
less  likely  for  these  firms. 
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V.  PRIMARY  RESULTS 

Weights  on  Aggregate  Financial  Performance  Measures 

To  assess  whether  ExecuComp  firms  place  greater  weights  on  aggregate  financial  perfor¬ 
mance  measures,  we  estimate  the  following  regression  relating  changes  in  compensation  to  ac¬ 
counting  and  stock  return  measures  of  performance  (see  Lambert  and  Larcker  1987;  Sloan  1993; 
Baber  et  al.  1996;  Baber  et  al.  1999;  Matsunaga  and  Park  2001).'* 

Acomp;;  =0^0+  AROE,y  +  Q;2Annualized  stock  return, ,  -l-  a3Exec  +  Q;4(Exec  *  AROE„) 

+  a5(Exec  *  Annualized  stock  return,,)  -i-  Cj  (1) 

where: 

Acomp  =  the  annual  change  in  the  natural  logarithm  of  the  CEO’s  total  cash  compensation 
or  total  compensation; 

AROE  =  the  annual  change  in  income  before  extraordinary  items  deflated  by  average 
stockholders  equity; 

Annualized 

stock  return  =  annualized  stock  return  assuming  dividend  reinvestment;  and 
Exec  =  1  if  the  firm  is  in  the  ExecuComp  database,  and  0  otherwise. 

We  predict  that  ExecuComp  firms  place  greater  weights  on  aggregate  financial  performance 
measures  in  determining  CEO  compensation  (i.e.,  0:4  and  0:5  >  0).  Consistent  with  prior  research, 
we  exclude  banking  and  utility  firms  because  these  firms  are  regulated  and  likely  to  compensate 
their  CEOs  differently  from  firms  in  unregulated  industries.  We  first  restrict  the  Exec  indicator 
variable  and  interaction  terms  from  the  compensation  model  and  estimate  the  model  separately  for 
ExecuComp  and  non-ExecuComp  firms.  We  then  expand  the  model  and  estimate  Equation  (1)  on 
the  pooled  sample  to  test  whether  the  relations  between  compensation  and  the  performance  met¬ 
rics  are  significantly  different  between  the  samples. 

Table  4,  Panel  A  presents  the  results  from  estimating  Equation  (1),  using  CEO  cash  compen¬ 
sation  as  the  dependent  variable.  We  find  evidence  of  the  expected  positive  relation  between  CEO 
cash  compensation  and  both  accounting  earnings  and  stock  returns  for  both  the  ExecuComp 
sample  (Column  1)  and  the  non-ExecuComp  sample  (Column  2).  Eor  the  pooled  sample,  the 
results  reported  in  Column  (3)  indicate  that  the  coefficients  on  the  interactions  of  the  ExecuComp 
indicator  variable  with  the  change  in  ROE  and  stock  return  variables  are  significantly  positive. 
These  findings  suggest  that  ExecuComp  firms  place  a  greater  weight  on  aggregate  financial  per¬ 
formance  measures  in  determining  cash  compensation,  which  is  consistent  with  our  prediction  that 
the  greater  complexity  of  ExecuComp  firms  increases  the  cost  for  these  firms  of  using  individual 
performance  measures,  while  the  higher  analyst  following  and  stock  liquidity  levels  of  Execu¬ 
Comp  firms  reduce  the  noise  in  aggregate  performance  measures.  At  the  same  time,  the  relatively 
weaker  relations  between  cash  compensation  and  aggregate  performance  measures  for  the  non- 
ExecuComp  sample  is  consistent  with  firms  excluded  from  the  ExecuComp  sample  placing  a 
greater  weight  on  individual  nonfinancial  performance  measures. 

We  conduct  two  additional  tests  to  rule  out  alternative  explanations.  The  first  explanation  is 
that  there  is  a  weaker  relation  between  CEO  cash  compensation  and  aggregate  financial  perfor- 


"  See  also  the  regression  models  in  Dechow  et  al.  (1994)  and  Gaver  and  Gaver  (1998)  for  evidence  on  whether  restruc¬ 
turing  or  other  nonrecurring  accounting  charges  modify  the  relation  between  cash  compensation  and  earnings,  Natarajan 
(1996)  for  results  on  how  cash  flows  from  operations  modifies  this  relation,  and  Baber  et  al.  (1998)  for  an  examination 
of  how  earnings  persistence  affects  this  relation. 
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mance  measures  for  loss  firms  and  that  the  non-ExecuComp  sample  includes  a  higher  proportion 
of  loss  firms  (Gaver  and  Gaver  1998).  To  address  this  issue,  we  estimate  the  model  on  the 
combined  sample  after  excluding  loss  firm-year  observations.  The  results,  reported  in  Column  (4), 
show  that  excluding  loss  firms  does  not  qualitatively  affect  our  findings.  The  second  alternative 
explanation  is  that  the  weaker  relation  between  CEO  cash  compensation  and  aggregate  financial 
performance  measures  is  driven  by  differences  between  the  two  groups  of  firms  with  respect  to 
CEO  ownership.  Because,  as  shown  in  Table  2,  CEO  ownership  is  greater  for  non-ExecuComp 
firms,  the  agency  problem  resulting  from  the  separation  of  ownership  and  control  could  be  lower 
for  non-ExecuComp  firms,  thereby  reducing  the  importance  of  providing  the  CEOs  of  these  firms 
with  contractual  incentives.  To  address  this  concern,  we  estimate  the  model  on  the  combined 
sample  after  excluding  observations  in  the  highest  CEO  ownership  decile  of  the  non-ExecuComp 
sample.  This  yields  a  sample  for  which  there  is  not  a  significant  difference  in  mean  CEO  owner¬ 
ship  across  ExecuComp  and  non-ExecuComp  firms.  The  results,  reported  in  Column  (5),  show 
that  the  significant  and  positive  coefficients  on  the  interactions  of  the  ExecuComp  indicator  vari¬ 
able  with  the  change  in  ROE  and  stock  return  variables  are  robust  to  the  exclusion  of  non- 
ExecuComp  firms  with  high  CEO  ownership. 

Using  the  fitted  values  from  the  probit  model  in  Column  (2)  of  Table  3,  Panel  A,  we  inves¬ 
tigate  whether  the  larger  weights  on  accounting  earnings  and  stock  returns  for  ExecuComp  firms 
are  due  to  differences  in  contracting  environment  characteristics  between  the  two  samples.  Spe¬ 
cifically,  we  replace  the  ExecuComp  indicator  variable  with  an  indicator  variable  set  equal  to  1  if 
the  probability  that  a  firm  is  in  ExecuComp  based  on  its  contracting  environment  characteristics  is 
greater  than  0.50,  and  0  otherwise.  The  results,  reported  in  Column  (6)  of  Table  4,  Panel  A,  are 
consistent  with  our  other  findings.  We  find  that  cash  compensation  is  significantly  more  sensitive 
to  earnings  and  stock  returns  for  firms  predicted  to  be  in  ExecuComp.  These  results  suggest  that 
the  differences  in  the  weights  on  the  performance  measures  are  due  to  contracting  environments 
that  are  indicative  of  ExecuComp  firms. 

Table  4,  Panel  B  reports  the  results  using  total  compensation.  Although  adding  equity  grants 
provides  a  more  comprehensive  measure  of  CEO  compensation,  the  extent  to  which  equity  grants 
are  used  to  set  long-term  incentives  (Core  and  Guay  1999),  as  opposed  to  providing  a  reward  for 
short-term  performance,  will  likely  reduce  the  power  of  the  test.  As  before,  we  find  significant 
positive  relations  between  stock  returns  and  CEO  compensation  for  both  groups  of  firms,  and  also 
show  that  ExecuComp  firms  place  a  greater  weight  on  stock  returns  in  determining  total  compen¬ 
sation.  We  also  find  that,  although  the  relation  between  accounting  earnings  and  total  CEO  com¬ 
pensation  is  significantly  positive  for  ExecuComp  firms  and  not  significant  for  non-ExecuComp 
firms,  the  coefficients  on  the  change  in  ROE  variable  do  not  significantly  differ  between  the  two 
groups  of  firms.  Columns  (4)  and  (5)  of  Table  4,  Panel  B  show  that  the  results  are  not  sensitive  to 
the  exclusion  of  loss  firms  or  to  the  exclusion  of  high  CEO  ownership  firms.  Einally,  the  results 
reported  in  Column  (6)  show  that  the  greater  weight  on  stock  returns  for  ExecuComp  firms  holds 
using  the  predicted  classification. 

The  Impact  of  the  Investment  Opportunity  Set  on  the  Relative  Weight  on  Earnings  versus 
Returns 

To  test  our  prediction  that  the  relative  weight  on  earnings  will  be  more  sensitive  to  the 
investment  opportunity  set  for  ExecuComp  firms,  we  estimate  the  following  regression  from 
Baber  et  al.  (1996): 

Acomp^f  =  70  +  7iAROE,f  +  72AROEit  *  IOS,;+  7-3 Annualized  stock  return^-^ 

+  74Annualized  stock  return, *  lOS,-,  -f  yslOSi,  +  e2  (2) 

where  lOS  is  the  factor  score  representing  the  investment  opportunity  set  based  on  investment 
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intensity,  market-to-book  assets,  geometric  mean  annual  growth  rate  of  market  value  of  assets,  and 
R&D  expenditures  scaled  by  assets  following  Baber  et  al.  (1996). 

As  in  Baber  et  al.  (1996),  banking  and  utility  firms  are  excluded  from  the  estimation.  Baber  et 
al.  (1996)  predict  that  the  relative  weight  on  earnings  decreases  as  the  investment  opportunity  set 
increases  (i.e.,  72  <  0)-'^  We  report  the  results  for  cash  compensation  and  total  compensation  in 
Panels  A  and  B  of  Table  5,  respectively.  Focusing  on  cash  compensation,  we  find  the  predicted 
negative  coefficient  on  the  interaction  of  the  investment  opportunity  set  and  accounting  earnings 
for  the  ExecuComp  sample  in  Column  (1),  but  an  insignificant  coefficient  on  this  interaction  in  the 
non-ExecuComp  sample  in  Column  (2).  Eor  the  non-ExecuComp  sample,  we  surprisingly  find  a 
significant  negative  coefficient  on  the  interaction  of  the  investment  opportunity  set  and  stock 
returns,  which  suggests  that  non-ExecuComp  firms  shift  the  weight  away  from  stock  returns  as 
investment  opportunities  increase.  In  the  pooled  sample,  reported  in  Column  (3),  we  find  that  the 
coefficient  on  the  AROE  *  lOS  interaction  is  significantly  more  negative  for  the  ExecuComp 
sample.  However,  the  coefficient  on  the  Annualized  stock  return  *  lOS  interaction  does  not 
significantly  differ  between  the  two  samples.  Finally,  the  results  reported  in  Columns  (4)-(6) 
indicate  that  the  inferences  are  robust  to  eliminating  loss  firms  (Column  (4)),  controlling  for 
differences  in  CEO  ownership  across  samples  (Column  (5)),  and  using  the  predicted  ExecuComp 
classification  rather  than  the  actual  classification  (Column  (6)).*^ 

Eor  total  compensation,  reported  in  Table  5,  Panel  B,  we  find  that  for  ExecuComp  firms  the 
coefficient  on  the  AROE  *  lOS  interaction  is  significant  and  negative  and  that  the  coefficient  on 
the  Annualized  stock  return  *  lOS  interaction  is  significant  and  positive.  Both  of  these  findings  are 
consistent  with  ExecuComp  firms  shifting  away  the  weight  on  accounting  earnings  toward  stock 
returns  as  the  investment  opportunity  set  increases.  However,  for  non-ExecuComp  firms,  we  find 
that  the  coefficients  on  these  two  interactions  do  not  significantly  differ.  Einally,  in  the  pooled 
sample  (Column  (3)),  we  find  evidence  that  the  coefficient  on  the  Annualized  stock  return  *  lOS 
interaction  is  significantly  more  positive  for  ExecuComp  firms  than  it  is  for  non-ExecuComp 
firms.  This  difference  remains  significant  when  we  adjust  for  differences  in  CEO  ownership 
(Column  (5))  or  use  the  predicted  classifications  (Column  (6)),  although  the  difference  is  not 
significant  when  we  exclude  loss  firms  (Column  (4)). 

Overall,  the  Table  5  findings  support  the  prediction  that  the  sensitivity  of  the  relative  weight 
on  earnings  versus  stock  returns  to  the  firm’s  investment  opportunity  set  is  greater  for  ExecuComp 
firms.  Our  tests  also  provide  an  example  of  how  differences  in  the  contracting  environment  can 
lead  to  inferences  based  on  an  ExecuComp  sample  that  do  not  hold  for  a  sample  of  non- 
ExecuComp  firms. 

The  Determinants  of  Stock  Option  Grants 

To  test  our  prediction  that  the  positive  relation  between  institutional  ownership  concentration 
and  CEO  stock  option  grants  will  be  stronger  for  ExecuComp  firms  we  estimate  the  following 
model  from  Hartzell  and  Starks  (2003): 


Technically,  the  prediction  is  that  the  weight  on  earnings  relative  to  stock  returns  will  be  lower.  This  would  be  consistent 
with  a  significantly  negative  72.  and  a  positive,  or  insignificant  74,  or  a  significantly  positive  74  and  a  negative,  or 
insignificant  72. 

As  an  alternative  control  for  loss  observations,  in  the  Table  4  and  Table  5  models  we  also  tried  including  in  the  models 
run  on  only  the  ExecuComp  sample  or  non-ExecuComp  sample  an  indicator  variable  for  whether  earnings  during  the 
current  year  are  negative  and  in  the  models  run  on  the  combined  sample  of  ExecuComp  and  non-ExecuComp  firms  an 
indicator  variable  for  whether  earnings  during  the  current  year-  are  negative  and  the  interaction  of  this  variable  with  the 
ExecuComp  indicator  variable.  We  found  that  including  these  control  variables  in  these  models  did  not  affect  the  main 
findings  from  the  models. 
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Option;';  =  ^0  +  ^1  Institutional  ownership  concentration;;.  i  +  02lnstitutional  ownership,;.  i 
+  03Market-to-book  assets,;,  j  +  04Log  of  the  market  value  of  equity,;.  i 
+  ^sChange  in  shareholder  wealth,;  +  ^gChange  in  shareholder  wealth,;_i  +  e3  (3) 

where: 


Option  =  the  change  in  the  value  of  the  CEO’s  annual  option  grant  for  a 
$1,000  increase  in  firm  market  value; 

Institutional  ownership 

concentration  =  the  fractional  ownership  held  by  the  top  five  institutions  scaled  by 
the  fraction  of  outstanding  shares  held  by  all  institutions; 

Institutional  ownership  =  the  fraction  of  outstanding  shares  held  by  institutions; 

Market-to-book  assets  =  the  market  value  of  the  firm’s  assets  scaled  by  the  book  value  of  the 

firm; 

Log  of  the  market  value 

of  equity  =  the  natural  logarithm  of  the  market  value  of  the  firm;  and 
Change  in  shareholder 

wealth  =  change  in  the  value  of  the  shares  outstanding  times  stock  price  from 
period  t-\  to  period  t. 

Columns  (1)  and  (2)  of  Table  6  provide  the  results  when  estimating  Equation  (3)  on  the 
ExecuComp  and  non-ExecuComp  samples,  respectively.  In  Column  (1),  consistent  with  Hartzell 
and  Starks  (2003),  we  find  a  significantly  positive  coefficient  on  institutional  ownership  concen¬ 
tration.  However,  Column  (2)  shows  that  this  association  is  negative  and  significant  for  non- 
ExecuComp  firms,  which  suggests  that  for  these  firms  more  intense  institutional  monitoring  leads 
to  a  reduced  flow  of  equity  incentives  to  the  CEO.  The  results  for  the  pooled  sample  (Column  (3)), 
where  we  include  interactions  of  each  independent  variable  with  the  ExecuComp  firm  indicator 
variable,  show  that  the  coefficient  on  institutional  ownership  concentration  is  significantly  more 
positive  for  ExecuComp  firms.  This  finding  holds  after  controlling  for  differences  in  CEO  own¬ 
ership  across  samples  (Column  (4))  and  using  predicted  classifications  (Column  (5)).  These  results 
are  consistent  with  our  prediction  that  for  non-ExecuComp  firms  the  lower  complexity  of  opera¬ 
tions  combined  with  the  availability  of  alternative  performance  measures  reduces  the  positive 
association  between  institutional  ownership  concentration  and  CEO  stock  option  grant  incentives. 

VI.  TESTS  WITHIN  THE  EXECUCOMP  SAMPLE 

The  prior  series  of  tests  document  significant  differences  in  the  structure  of  CEO  compensa¬ 
tion  between  ExecuComp  and  non-ExecuComp  firms.  We  next  examine  whether  similar  differ¬ 
ences  can  be  found  between  broad  groups  of  firms  within  the  ExecuComp  data  set.  Specifically, 
we  compare  the  CEO  compensation  functions  of  S&P  500  firms  with  those  of  non-S&P  500  firms 
included  in  the  ExecuComp  sample.  These  tests  provide  guidance  to  researchers  regarding 
whether  ExecuComp  data  can  be  used  to  assess  the  sensitivity  of  results  to  different  contracting 
environments. 

Table  7  provides  descriptive  firm,  monitoring,  and  compensation  statistics  for  the  three  indi¬ 
ces  within  the  S&P  1500.  The  statistics  suggest  similar  systematic  differences  in  the  contracting 
environment  and  compensation  characteristics  across  firms  in  the  ExecuComp  sample.  Eirm  size, 
firm  complexity,  analyst  coverage,  stock  liquidity,  the  lOS  factor  score,  and  the  market-to-book 
ratio  all  decrease  as  one  moves  from  the  S&P  500  index  to  the  S&P  400  mid-cap  and  the  S&P  600 
small-cap  indices.  Further,  CEO  ownership  and  institutional  ownership  concentration  increases 
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TABLE  7 

Comparison  of  S&P  500  Large-Cap,  S&P  400  Mid-Cap,  and  S&P  600  Small-Cap  Firms 

Mean  (Median) 


Panel  A:  Firm  Characteristics'* 


Market  value  of  equity 

Book  value  of  assets 

Return  on  assets 

Return  on  equity 

Long-term  debt/book  value  of 
assets 

Annualized  stock  return 

Idiosyncratic  risk 

Market-to-book  equity 

lOS  Factor  Score 

Analyst  coverage 

Number  of  reported  business 
segments 

Number  of  reported  geographic 
segments 

Herfindahl  index  of 

within-industry  concentration 
Herfindahl  index  of  geographic 
concentration 
Illiquidity  of  firm’s  shares 


S&P  500  S&P  400  S&P  600 

Large-Cap  Mid-Cap  Small-Cap 


22,865 

2,153 

764*** 

(9,045) 

(1,889) 

(621)*** 

34,230 

3,180 

1,112*** 

(8,611) 

(1,779) 

(579)*** 

6.3% 

5.9% 

4.8%*** 

(5.9%) 

(5.4%) 

(49%)*** 

17.1% 

13.4% 

10.3%*** 

(16.3%) 

(13.3%) 

(11.1%)*** 

0.19 

0.18 

0.15*** 

(0.17) 

(0.17) 

(0  11)*** 

13.7% 

14.4% 

14.8%*** 

(8.9%) 

(9.4%) 

(7.0%)*** 

1.9% 

2.0% 

2.5%*** 

(1.6%) 

(1.8%) 

2.2%*** 

3.88 

2.97 

2  62 *** 

(2.99) 

(2.33) 

(2.01)*** 

-0.011 

-0.012 

-0.013*** 

(-0.013) 

(-0.014) 

(-0.016)*** 

15.6 

8.4 

6  4*** 

(15) 

(7) 

(5)*** 

6.3 

5.8 

4  q*** 

(5) 

(5) 

(3)*** 

7.8 

6.3 

63*** 

(6) 

(6) 

(4.5)*** 

0.42 

0.40 

0  42*** 

(0.33) 

(0.32) 

(0.33)*** 

0.36 

0.40 

0  42*** 

(0.25) 

(0.32) 

(0.31)*** 

0.001 

0.004 

0.019*** 

(0.000) 

(0.001) 

(0.005)*** 

Panel  B:  CEO  and  Institutional  Investor  Ownership 

S&P  500 
Large-Cap 


Characteristics** 


S&P  400 
Mid-Cap 


S&P  600 
Small-Cap 


CEO  age 

55.9 

56.0 

55.9*** 

(56) 

(56) 

(56)*** 

CEO  tenure 

6.1 

7.6 

g  Q*** 

(4) 

(5) 

(5)*** 

CEO  ownership  percentage 

0.8% 

2.7% 

2.6%*** 

(0.1%) 

(0.3%) 

(0.4%)*** 

Institutional  ownership 

69.4% 

72.5% 

72.3%*** 

percentage 

(70.0%) 

(75.7%) 

(76.4%)*** 

Institutional  ownership 
concentration 

0.35 

(0.34) 

0.38 

(0.37) 

/\  ^  ^ 

(040)*** 

(continued  on  next  page) 
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Panel  C:  CEO  Incentive  Compensation'’ 


S&P  500 

S&P  400 

S&P  600 

Large-Cap 

Mid-Cap 

Small-Cap 

Bonus  as  a  percentage  of  total 

29.0% 

28.0% 

25.6%*** 

cash  compensation 

(26.0%) 

(25.9%) 

(23.5%)*** 

Absolute  percentage  change  in 

36.5% 

34.6% 

30.3%*** 

total  cash  compensation 

(19.4%) 

(20.3%) 

(18.7%)*** 

Value  of  annual  option  and 

3,678.1 

1,371.9 

699.6*** 

restricted  stock  grants 
($000s) 

(2,112.1) 

(678.9) 

(252.3)*** 

Ratio  of  value  of  annual  option 

45.6% 

34.6% 

28.0%*** 

and  restricted  stock  grants  to 
total  cash  compensation 

(49.0%) 

(34.8%) 

(23.6%)*** 

Sensitivity  of  annual  option 

109.7 

42.4 

20.2*** 

and  restricted  stock  grants 

(65.8) 

(20.9) 

(7.3)*** 

for  a  1  %  change  in  firm 
value  ($000s) 


a  ***  Represents  significance  at  the  1  percent  level  for  two-tailed  t-tests  or  Wilcoxon  rank-sum  tests  to  determine 
whether  means  or  medians  differ  between  the  S&P  500  large-cap  and  S&P  600  small-cap  samples. 

Data  are  taken  from  a  sample  of  3,209  S&P  500  large-cap  firm-year  observations,  2,204  S&P  400  mid-cap  firm-year 
observations  and  2,738  S&P  600  small-cap  firm-year  observations  from  2000-2007.  All  firms  are  included  on  the  Execu- 
Comp  database.  Market  value  of  equity  is  the  number  of  outstanding  shares  multiplied  by  the  stock  price  at  the  end  of  the 
year.  Book  value  of  assets  is  the  average  book  value  of  the  firm’s  assets.  Return  on  assets  is  net  income  before  extraor¬ 
dinary  items  divided  by  average  book  assets.  Return  on  equity  is  net  income  before  extraordinary  items  divided  by  the 
average  book  value  of  equity.  Long-term  debt/book  value  of  assets  is  as  of  the  end  of  the  fiscal  year.  Annualized  stock 
return  is  the  firm  stock  remrn  based  on  the  annualized  monthly  stock  returns  for  the  fiscal  year,  assuming  dividend 
reinvestment.  Idiosyncratic  risk  is  the  standard  deviation  of  the  residual  from  a  market  model  regression  estimated  over  253 
days  using  daily  stock  returns  (subject  to  a  minimum  of  125  daily  stock  returns).  Market-to-book  equity  is  the  market  value 
of  the  firm’s  equity  scaled  by  the  book  value  of  the  firm’s  equity  at  the  end  of  the  fiscal  year.  lOS  factor  score  represents 
the  investment  opportunity  set  based  on  investment  intensity,  market-to-book  assets,  geometric  mean  annual  growth  rate  of 
market  value  of  assets,  and  R&D  expenditures  scaled  by  assets  following  Baber  et  al.  (1996).  Analyst  coverage  data  is 
collected  from  the  Institutional  Brokers’  Estimate  System  (I/B/E/S).  Herfindahl  index  of  within-industry  concentration  is 
calculated  by  first  determining  what  are  the  fractions  of  a  firm’s  total  sales  accounted  for  by  its  different  business  segments 
and  then  summing  the  squares  of  these  fractions.  Herfindahl  index  of  geographic  concentration  is  calculated  by  first 
determining  what  are  the  fractions  of  a  firm’s  total  sales  accounted  for  by  its  different  geographic  segments  and  then 
summing  the  squares  of  these  fractions.  Illiquidity  of  firm’s  shares  is  measured  as  in  Amihud  (2002)  and  represents  the 
average  daily  ratio  of  stock  return  to  dollar  volume. 

b  Represents  significance  at  the  1  percent  level  for  two-tailed  t-tests  or  Wilcoxon  rank-sum  tests  to  determine 

whether  means  or  medians  differ  between  the  ExecuComp  and  non-ExecuComp  samples.  Institutional  ownership  con¬ 
centration  is  the  fractional  ownership  held  by  the  top  five  institutions  scaled  by  the  fraction  of  outstanding  shares  held 
by  all  institutions.  Bonus  is  the  annual  cash  bonus  paid  to  the  executive.  Total  cash  compensation  is  salary  plus  cash 
bonus.  Absolute  percentage  change  in  cash  compensation  is  the  absolute  value  of  the  change  in  total  cash  compensation 
scaled  by  total  cash  compensation  in  the  prior  fiscal  year.  Value  of  annual  option  grants  is  the  Black-Scholes  value  of  the 
options  granted  to  the  CEO  throughout  the  fiscal  year  in  thousands  of  dollars.  Sensitivity  of  annual  option  and  restricted 
stock  grants  for  a  1  percent  change  in  firm  value  ($000s)  is  the  dollar  change  (in  thousands  of  dollars)  of  the  CEO’s 
annual  option  and  restricted  stock  grants  for  a  1  percent  change  in  the  value  of  the  firm’s  equity. 


across  the  three  groups  of  firms,  while  the  use  of  incentive-based  compensation  decreases.  How¬ 
ever,  although  there  appears  to  be  heterogeneity  within  the  ExecuComp  sample,  the  small-cap 
firms  in  the  ExecuComp  database  more  closely  resemble  ExecuComp  firms  than  non-ExecuComp 
firms. 

Table  8  compares  the  results  from  the  ExecuComp/non-ExecuComp  classification  to  the  S&P 
500/non-S&P  500  classification  within  the  ExecuComp  database.  These  tests  provide  evidence  as 
to  whether  the  differences  in  compensation  functions  between  ExecuComp  and  non-ExecuComp 
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TABLE  8 

S&P  500  Indicator  within  the  ExecuComp  Sample 


Panel  A:  Performance  Measure  Weights  for  Cash  Compensation  (Table  4  Panel  A) 

ExecuComp  Indicator  S&P  500  Indicator 


Indicator  variable  *  AROE 

0.157** 

0.129 

(4.48) 

(1.59) 

Indicator  variable  =»=  Annualized 

0.113*** 

0.098*** 

stock  return 

(6.99) 

(2.68) 

n 

13,277 

7,342 

Panel  B:  The  Impact  of  the  lOS  on  the  Weight  on  Earnings  in  Cash  Compensation  (Table  5 

Panel  A) 

ExecuComp  Indicator 

S&P  500  Indicator 

Indicator  variable  *  AROE 

0.683*** 

0.490 

(3.92) 

(1.25) 

Indicator  variable  *  AROE  * 

-0.287** 

0.186 

lOS 

(-2.53) 

(0.37) 

Indicator  variable  *  Annualized 

0.268*** 

0.392** 

stock  return 

(4.47) 

(2.98) 

Indicator  variable  *  Annualized 

0.147 

0.181 

stock  return  *  lOS 

(1.64) 

(0.69) 

Indicator  variable  *  lOS 

-0.057* 

-0.027 

(-1.73) 

(-0.38) 

n 

13,277 

7,342 

Panel  C:  The  Pay-For-Performance  Sensitivity  of  Option  Grants  (Table  6) 

ExecuComp  Indicator 

S&P  500  Indicator 

Indicator  variable  *  Log  of  the 

1.312** 

0.315*** 

market  value  of  value  of 

(2.26) 

(3.03) 

equity 

Indicator  variable  * 

-0.000 

-0.005 

Market-to-book  assets 

(-0.64) 

(-0.66) 

Indicator  variable  *  Change  in 

0.000 

-0.000 

shareholder  wealth 

(0.74) 

(-0.11) 

Indicator  variable  *  Prior  year 

-0.000** 

-0.000*** 

change  in  shareholder  wealth 

(-1.98) 

(-2.89) 

Indicator  variable  * 

7.608*** 

-2.131 

Institutional  ownership 

(2.71) 

(-1.59) 

concentration 

Indicator  variable  * 

0.989 

-0.840 

Institutional  Ownership 

(1.08) 

(-1.41) 

n 

23,096 

11,928 

**,***  Indicate  that  coefficients  are  different  from  zero  at  the  5  and  1  percent  significance  levels,  respectively. 

Results  in  the  ExecuComp  Indicator  column  provide  regression  coefficients  for  selected  variables  from  the  models  in  Table 
4,  Panel  A,  Table  5,  Panel  A,  and  Table  6.  Results  in  the  S&P  500  Indicator  column  provide  regression  coefficients  for 
selected  variables  from  the  models  in  Table  4,  Panel  A,  Table  5,  Panel  A,  and  Table  6  in  which  only  the  ExecuComp  sample 
is  examined  and  the  ExecuComp  indicator  variable  is  replaced  with  an  S&P  500  indicator  variable,  t-statistics  for  two- 
tailed  tests  are  in  parentheses  and  are  based  on  standard  errors  that  are  adjusted  for  the  clustering  of  observations  at  the  firm 
level. 
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firms  are  present  within  the  ExecuComp  database.  For  brevity,  we  only  tabulate  the  results  for  the 
key  interactions  of  interest.  To  facilitate  comparison  with  our  prior  findings  we  also  include  the 
comparable  results  from  Tables  4-6. 

In  Table  8,  Panel  A,  consistent  with  our  results  comparing  ExecuComp  and  non-ExecuComp 
firms,  we  find  that  cash  compensation  is  more  sensitive  to  stock  returns  for  S&P  500  than  for 
non-S&P  500  firms.  On  the  other  hand,  we  do  not  observe  the  significant  difference  in  the  weight 
on  earnings  that  we  found  between  ExecuComp  and  non-ExecuComp  firms.  Table  8,  Panel  B 
shows  that  we  do  not  find  significant  differences  between  S&P  500  and  non-S&P  500  firms  with 
respect  to  the  predicted  negative  association  between  the  firm’s  investment  opportunity  set  and  the 
weight  placed  on  earnings  versus  stock  returns.  Likewise,  Table  8,  Panel  C  documents  that  the 
relation  between  institutional  ownership  concentration  and  CEO  stock  option  grant  incentives  is 
not  significantly  different  across  the  two  groups  of  firms. Overall,  these  findings  indicate  that 
tests  conducted  between  groups  of  firms  within  the  ExecuComp  database  are  not  substitutes  for 
tests  conducted  between  samples  of  ExecuComp  and  non-ExecuComp  firms  in  assessing  the 
impact  of  contracting  environments  on  the  design  of  CEO  incentive  contracts. 

VIL  CONCLUSION 

The  ExecuComp  database  provides  an  easy-to-use  data  source  of  a  relatively  broad  range  of 
firms,  including  the  largest  and  arguably  most  important  firms  in  the  economy.  As  a  result,  the 
database  is  extensively  used  to  investigate  a  wide  range  of  governance  and  compensation  issues, 
although  little  is  known  with  regard  to  the  governance  structures  and  compensation  functions  of 
firms  not  included  in  the  database,  or  how  they  differ  from  those  of  firms  included  in  ExecuComp. 
In  this  study  we  document  systematic  differences  in  contracting  environment  characteristics  be¬ 
tween  ExecuComp  and  non-ExecuComp  firms  that  are  likely  to  impact  firms’  executive  compen¬ 
sation  contracts.  We  hypothesize  that  the  greater  firm  complexity,  analyst  following,  and  stock 
liquidity  associated  with  ExecuComp  firms  alters  the  contracting  environment  for  these  firms, 
which  leads  to  three  empirical  predictions.  First,  we  predict  that  ExecuComp  firms  place  a  greater 
weight  on  earnings  and  stock  returns  in  CEO  cash  compensation  contracts.  Second,  we  predict  that 
the  weight  on  earnings  will  be  more  sensitive  to  a  firm’s  growth  opportunities  for  ExecuComp 
firms.  Third,  we  predict  that  the  positive  association  between  institutional  ownership  concentration 
and  CEO  stock  option  incentives  documented  by  Hartzell  and  Starks  (2003)  will  be  stronger  for 
ExecuComp  firms  than  for  non-ExecuComp  firms. 

The  findings  support  our  hypotheses  and  suggest  that  differences  in  contracting  environments 
between  ExecuComp  and  non-ExecuComp  firms  lead  to  differences  in  the  structure  of  CEO 
incentive  contracts  and  the  role  played  by  external  monitors,  such  as  institutional  shareholders,  in 
setting  CEO  incentives.  Overall,  our  findings  highlight  the  importance  for  future  researchers  to 
consider  how  a  firm’s  contracting  environment  can  affect  results  of  empirical  tests  of  executive 
compensation.  Researchers  should  use  caution  when  generalizing  results  to  firms  that  are  not 
included  in  the  ExecuComp  data  set. 


REFERENCES 

Amihud,  Y.  2002.  Illiquidity  and  stock  returns:  Cross-section  and  time-series  effects.  Journal  of  Financial 
Markets  5:  31-56. 


For  the  Table  8  analyses  we  define  whether  a  firm  is  an  S&P  500  firm  by  considering  whether  it  is  included  in  the  S&P 
500  index  during  the  most  recent  year  for  which  ExecuComp  has  collected  executive  compensation  data.  If  instead,  for 
these  analyses,  we  use  an  indicator  variable  that  indicates  whether  in  a  particular  year  a  firm  is  included  in  this  index, 
we  find  similar  results. 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


Determinants  of  CEO  Pay:  A  Comparison  of  ExecuComp  and  Non-ExecuComp  Firms 


1543 


Baber,  W.  R.,  S.  N.  Janakiraman,  and  S.  Kang.  1996.  Investment  opportunities  and  the  structure  of  executive 
compensation.  Journal  of  Accounting  and  Economics  21;  297-318. 

- ,  S.  Kang,  and  K.  R.  Kumar.  1998.  Accounting  earnings  and  executive  compensation:  The  role  of 

earnings  persistence.  Journal  of  Accounting  and  Economics  25:  169-193. 

- , - ,  and - .  1999.  The  explanatory  power  of  earnings  levels  vs.  earnings  changes  in  the  context 

of  executive  compensation.  The  Accounting  Review  74:  459-472. 

Banker,  R.,  and  S.  Datar.  1989.  Sensitivity,  precision,  and  linear  aggregation  of  accounting  signals.  Journal 
of  Accounting  Research  27:  21-39. 

Bushman,  R.,  Q.  Chen,  E.  Engel,  and  A.  Smith.  2004.  Financial  accounting  information,  organizational 
complexity  and  corporate  governance  systems.  Journal  of  Accounting  and  Economics  37:  167-201. 

Core,  J.  E.,  and  W.  R.  Guay.  1999.  The  use  of  equity  grants  to  manage  optimal  equity  incentive  levels. 
Journal  of  Accounting  and  Economics  28:  151-184. 

Dechow,  R  M.,  M.  R.  Huson,  and  R.  G.  Sloan.  1994.  The  effect  of  restructuring  charges  on  executives’  cash 
compensation.  The  Accounting  Review  69:  138-156. 

Gaver,  J.  J.,  and  K.  M.  Gaver.  1998.  The  relation  between  nonrecurring  accounting  transactions  and  CEO 
cash  compensation.  The  Accounting  Review  73:  235-253. 

Hartzell,  J.  C.,  and  L.  T.  Starks.  2003.  Institutional  investors  and  executive  compensation.  The  Journal  of 
Finance  58:  2351-2374. 

Holmstrom,  B.  1979.  Moral  hazard  and  observability.  The  Bell  Journal  of  Economics  10:  74-91. 

Ittner,  C.  D.,  D.  F.  Larcker,  and  M.  V.  Rajan.  1997.  The  choice  of  performance  measures  in  annual  bonus 
contracts.  The  Accounting  Review  72:  231-255. 

Lambert,  R.  A.,  and  D.  F.  Larcker.  1987.  An  analysis  of  the  use  of  accounting  and  market  measures  of 
performance  in  executive  compensation  contracts.  Journal  of  Accounting  Research  25:  201-227. 

Matsunaga,  S.  R.,  and  C.  W.  Park.  2001.  The  effect  of  missing  a  quarterly  earnings  benchmark  on  the  CEO’s 
annual  bonus.  The  Accounting  Review  26:  85-125. 

Natarajan,  R.  1996.  Stewardship  value  of  earnings  components:  Additional  evidence  on  the  determinants  of 
executive  compensation.  The  Accounting  Review  1 1 :  1-22. 

Piotroski,  J.,  and  D.  Roulstone.  2004.  The  influence  of  analysts,  institutional  investors,  and  insiders  on  the 
incorporation  of  market,  industry  and  firm-specific  information  into  stock  prices.  The  Accounting 
Review  79:  1119-1152. 

Shleifer,  A.,  and  R.  Vishny.  1986.  Large  shareholders  and  corporate  control.  The  Journal  of  Political 
Economy  94:  461—488. 

Sloan,  R.  G.  1993.  Accounting  earnings  and  top  executive  compensation.  Journal  of  Accounting  and  Eco¬ 
nomics  16:  55-100. 


The  Accounting  Review 


September  2010 
American  Accounting  Association 


t'iiW  jiiiA -v 


"6^ 


Bpi^llll  l»*tr-»  '■ij  'il  Tl*r*<’  ’>  '•  •  4  .•••‘''  ■•  ‘i'  V  t  'fwt  ^ 

?»  »>  IC'.  . 

^  .V--.  V  ■«.  .  ■  il|ii:f>^t  <r-  r.:)  .  J  - ' '•«- '  %<,  .  ‘  '- 

IB  -.•  »•  ■;  ’  i  -wv  wi.’Cij*:  ->  m  .A4q»iy  ,  4n  ■'t'STvTcwr 

Bfe|^taars«'V'''..?t  *,  ,  i«Xjaifi(SyB;ij*fe;  ^  h>  •  V/'  «  .>i|lUy.tf«fej^->i’;Wij: 

■.li'i'A  ;.-.  ^  **  w- "*■«';  •'■  •  r  '•'  - .- ..  ..a;--  -5 

^5  utrrr-v^^  ■'JK  r  '  -^w?!  :5''n'  '>■'*  v -r««y:;  Ji- ^.^it  y»«h-  .>'nf?' 

iii' u  ‘-'•*  ■“•*■*  ?•*  nv,.  <>*li,l- ff4f<Al^.\’y^\.i.>\>-.  ,,  ^ 

^-■'fe  -.‘(M'S,  ■•  .'•■  tWT  '  ’r,t'  '  j 

'/^  f  fl*;  ,ii  J  <1fy  *0’  ■, 

, ,  '■  ijiV  - .  -^Vdvj  <88^  X  J'  ,.'<  M 

<»»  Jr^Tv^  4'r't  •  .Vlf*‘«”  tj.  Ait'  V  Tl^'.ret. 

■...♦j(|(lfe"Cl^  . 

-/  ^  :^/ a- • 'vs2rA?' >*• 

•...•  <1  n-Ktxwnl-l^* rwiKi,;ltfi)idK fcr  jn.-jm  ,-iv^  flif,-,  ' 

i  /Wj  *■* 'r^  Y-ifchfi/ '  -  » 

<frii.;<./  7!(*<>.>i».n.  f-ih,  .  V  v<.T<l^tlhrtyiA^i44^*v  '* 

.  *--H-’'  •'  '•'■■^  '  .-1.  t  '  ^'rrf.Jli^' ’*1  ^^•  .- *■  ■'•' 

V  '  ■esit=^'’-’  ■  ’♦'■^  v-<i;  om%ji^p¥'ft  ^ 

'^^t'  i^.  v^iti'i'  •  *1  iitj*)-  ~  HP'  V' 


»v-r-  »'{,■  i» 


4f'*.  w- 

‘'yii  UlTi 


vr  ,  Fr4*#icri,  ^ 

-,  f*^-.  V, '  i'.>’'t>v'flV^!us|8lHK^ 


''v.vi.-»^Xnj|>i^'l  .ri'Vf_'V  ,-!^  b^-'|g'i'*<  ‘<*k»J«  lvvj|gwl| 

.jniti^tiv.  L' '*«r  '  ''  -•' V  '.'!  >  ■-.vt'^J?-'.^‘5 


!;.■<  u;»ffcKMj.  Voi'r  #» 

r--  ■  . ,  •  ,  ..■  ♦ -J 

3T^'»vH 

■  -J  ' "l-V  ■ 


W<'7N;Ai4ite. 


f, 


- 1  - 


'*  « 


..4 

/I 


.1 

•  - 


•>  Y' 


-  ■■  * •  ' 

:-.yft«v».jf  Jliiriijl '‘.  r  *•  ►!'  >  .  . . 

J^',4^  «s'4i/.-fy,  Vir*  i  •'  *?>.  '•'  ■  «M*  ‘J*  •  . .  ' 


if|  W 


*  .yr-C: 


•  A*'  A  .I* 

.r\-v»^'Ct  • ' 4 ;  tte 

-*t  »i<^j  '  ri<r 

■  ,  J>  <  iiviJ|a1»  '  'pVi 

.._  ‘  .  ■  ,  'fe. 


».  1» 

.w'-. 


>~ 


/*  .  .v.  .  iAe  '  /t 


,j 

^  Jr  4r 


l^M  ^  'r*  ^  _f 


American  Accounting  Association 
DOI:  10.2308/accr.2010.85.5.1545 


THE  ACCOUNTING  REVIEW 
Vol.  85,  No.  5 
2010 

pp. 1545-1575 

Ranking  Performance  Measures  in 
Multi-Task  Agencies 

Peter  O.  Christensen 

Aarhus  University 

Florin  §abac 
Jie  (Joyce)  Tian 

University  of  Alberta 

ABSTRACT:  We  derive  sufficient  conditions  for  ranking  performance  evaluation  sys¬ 
tems  in  multi-task  agency  models  (using  both  optimal  and  linear  contracts)  in  terms  of 
a  second-order  stochastic  dominance  (SSD)  condition  on  the  likelihood  ratios.  The  SSD 
condition  can  be  replaced  by  a  variance-covariance  matrix  of  likelihood  ratios  (VCM) 
condition  when  the  utility  function  is  square-root,  the  performance  measures  are  nor¬ 
mally  distributed,  and  for  LEN  models.  We  identify  existing  results  derived  under  the 
LEN  assumptions  that  rely  on  the  VCM  condition  and,  thus,  also  hold  for  optimal  con¬ 
tracts. 

Keywords:  likelihood  ratios;  multiple  tasks;  LEN. 

I.  INTRODUCTION 

The  choice  of  performance  evaluation  systems  is  at  the  heart  of  accounting  and  accounting 
research.  The  principal-agent  framework  has  provided  profound  insights  into  the  determi¬ 
nants  of  the  optimal  choice  among  alternative  systems  and  how  the  information  is  opti¬ 
mally  exploited  in  the  associated  reward  system,  such  as  the  identification  of  the  conditional 
controllability  principle  for  the  value  of  an  additional  performance  measure  (e.g.,  Antle  and 
Demski  1988).  However,  most  of  the  research  in  this  area  either  is  in  single-period  and  single-task 
settings,  or  has  extended  the  analysis  to  multiperiod  and  multi-task  settings  using  the  simplifying 
assumptions  of  the  LEN  framework  (Linear  contracts.  Exponential  utility.  Normally  distributed 
performance  measures).  The  multi-task  models  of  Eeltham  and  Xie  (1994)  and  Holmstrom  and 
Milgrom  (1991)  are  prominent  examples  of  LEN  models  that  have  had  a  significant  impact  on 
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accounting  research,  broadening  our  understanding  of  performance  measure  design  and  other 
organizational  issues/ 

The  advances  made  by  using  LEN  models  came  at  a  price:  (1)  basic  questions  on  the  choice 
of  performance  evaluation  systems  remained  unanswered  in  general;  and  (2)  the  validity  of  the 
results  obtained  with  linear  contracts  is  uncertain  because  performance  information  is  not  ex¬ 
ploited  optimally  in  the  reward  system.  Our  study  speaks  to  both  issues.  We  investigate  the  choice 
among  performance  evaluation  systems  in  a  single-period,  multi-task  setting  with  optimal  con¬ 
tracts.  The  likelihood  ratios  reveal  the  underlying  information  economics  not  only  for  optimal 
contracts,  but  also  for  the  LEN  framework.  We  do  not  claim  that  LEN  results  have  general 
validity,  but  we  identify  a  set  of  LEN  results  for  which  the  assumption  of  linear  contracts  is  not 
restrictive. 

We  determine  sufficient  conditions  for  a  partial  ranking  of  information  systems  in  multi-task 
agencies  in  terms  of  a  second-order  stochastic  dominance  (SSD)  condition  on  the  likelihood  ratios. 
In  general,  the  SSD  condition  is  implied  by  Blackwell  garbling.  For  normally  distributed  perfor¬ 
mance  measures,  the  SSD  condition  is  also  implied  by  a  dominance  condition  for  the  variance- 
covariance  matrices  (VCM)  of  the  likelihood  ratios.  The  same  VCM  condition  is  sufficient  to  rank 
information  systems  when  the  agent  has  square-root  utility  or  in  the  LEN  framework.  Using  the 
VCM  condition,  we  extend  Holmstrom’s  (1979)  sufficient  statistic  result  to  the  multi-task  setting 
of  normally  distributed  performance  measures  with  both  optimal  and  LEN  contracts. 

We  provide  the  first  results  in  the  literature  on  ranking  information  systems  for  multi-task 
agency  problems  using  optimal  contracts.  Our  SSD  ranking  condition  for  the  multi-task  setting 
generalizes  that  in  Kim  (1995)  for  single-task  settings.  The  sufficiency  of  the  VCM  dominance 
condition  for  multi-task  settings  and  either  square-root  agent  utility  or  normally  distributed  per¬ 
formance  measures  generalizes  the  single-task  results  in  Kim  and  Suh  (1991).  The  sufficiency  of 
the  VCM  condition  for  ranking  information  systems  in  the  LEN  model  is  a  new  result  based  on  a 
novel  derivation  of  the  optimal  linear  contracts  using  likelihood  ratios. 

In  the  single-task  setting  of  Kim  (1995),  the  SSD  condition  on  the  likelihood  ratio  reflects  that 
a  larger  variation  in  the  likelihood  ratio  provides  more  precise  inferences  about  the  agent’s  action 
and,  thus,  the  risk  premium  required  to  induce  a  particular  action  can  be  reduced.  In  the  extreme, 
if  there  is  no  variation  in  the  likelihood  ratio,  the  performance  measure  provides  no  information 
about  the  agent’s  action  and,  consequently,  the  performance  measure  cannot  be  used  to  provide 
incentives  for  the  agent. 

The  same  interpretation  applies  in  our  multi-task  setting,  except  that  we  require  a  SSD  con¬ 
dition  on  the  (multi-dimensional)  likelihood  ratio  weighted  by  the  optimal  multiplier  on  the  in¬ 
centive  compatibility  constraints.  Because  these  multipliers  are  endogenously  determined,  our 
general  SSD  condition  depends  on  the  specifics  of  the  incentive  problem,  such  as  the  agent’s 
preferences.  This  is  the  key  difference  relative  to  the  single-task  setting. 

In  the  single-task  setting  of  Kim  (1995),  the  SSD  condition  on  the  (one-dimensional)  likeli¬ 
hood  ratios  is  independent  of  the  multipliers  because  a  stochastic  dominance  relation  between 
distributions  of  two  random  variables  is  preserved  if  both  variables  are  scaled  by  the  same  factor. 
In  other  words,  the  multiplier  “cancels  out,’’  and  the  ranking  of  performance  measures  is  based 


'  Despite  the  fact  that  linear  contracts  are  not  optimal  in  the  many  cases  of  interest  to  which  the  LEN  model  has  been 
applied,  its  tractability  has  allowed  exploration  of  many  interesting  contracting  settings  that  would  have  been  otherwise 
intractable.  A  partial  list  includes:  Gibbons  and  Murphy  (1992)  on  cai'eer  concerns;  Meyer  and  Vickers  (1997)  on 
relative  performance  evaluation;  Bushman  and  Indjejikian  (1993),  Kim  and  Suh  (1993),  and  Feltham  and  Wu  (2000)  on 
earnings  and  stock  prices  as  performance  measures;  Datar  et  al.  (2001)  on  relative  incentive  rates;  Dutta  and  Reichel- 
stein  (1999,  2002,  2003)  on  dynamic  performance  measurement;  Indjejikian  and  Nanda  (1999),  Christensen  et  al.  (2003, 
2005),  and  §abac  (2007,  2008)  on  dynamic  models  with  renegotiation;  McLaughlin  (1994),  Huddart  and  Liang  (2005), 
Adams  (2006),  and  Liang  et  al.  (2008)  on  team  incentives. 
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only  on  the  informational  characteristics  of  the  performance  measures.  For  general  multi-task 
agencies,  the  optimal  multipliers  enter  the  ranking  condition  for  performance  evaluation  systems.^ 

We  provide  several  sufficient  conditions  such  that  the  ranking  of  performance  evaluation 
systems  depends  only  on  the  informational  characteristics  of  the  performance  measures.  First,  we 
show  that  in  a  setting  with  stochastically  independent  task-specific  (technologically  independent) 
performance  measures,  a  SSD  condition  for  each  of  the  task-specific  likelihood  ratios  implies  the 
general  SSD  condition  (independently  of  the  optimal  multipliers).  In  other  words,  if  there  is  more 
variation  in  the  likelihood  ratio  for  each  task,  then  the  risk  premium  required  for  inducing  a 
particular  multi-task  action  can  be  reduced. 

Second,  we  extend  Kim  (1995)  and  show  that  Blackwell  garbling  implies  the  general  SSD 
condition  for  the  weighted  likelihood  ratios  in  multi-task  settings  independently  of  the  optimal 
multipliers.  Including  more  “noise”  in  the  performance  measures  implies  less  precise  inferences 
about  the  agent’s  action  and,  thus,  a  higher  risk  premium  to  induce  a  particular  action. 

Third,  we  examine  settings  in  which  only  the  variance  of  the  weighted  likelihood  ratios 
matters  (their  mean  is  always  zero),  either  because  of  particular  agent  preferences  (square-root 
utility)  or  because  the  distribution  of  the  likelihood  ratios  is  fully  characterized  by  their  variance- 
covariance  matrix  (normal  distributions).  When  comparing  two  performance  evaluation  systems, 
the  variance  of  the  weighted  likelihood  ratios  is  larger  with  one  system  than  the  other  indepen¬ 
dently  of  the  multiplier  if,  and  only  if,  the  corresponding  variance-covariance  matrix  of  the  like¬ 
lihood  ratios  is  larger  (i.e.,  their  difference  is  positive  definite).  This  is  our  VCM  condition. 

When  the  performance  measures  are  both  stochastically  and  technologically  independent,  i.e., 
each  performance  measure  is  affected  by  only  one  task,  the  variance-covariance  matrix  of  the 
likelihood  ratios  is  a  diagonal  matrix:  the  VCM  condition  requires  that  the  variances  of  the 
likelihood  ratios  are  larger  with  one  information  system  than  the  other  system  for  all  tasks.  When 
the  performance  measures  are  either  technologically  and/or  stochastically  dependent,  the  covari¬ 
ances  of  the  likelihood  ratios  across  the  different  tasks  also  enter  the  VCM  condition;  in  these 
settings,  the  notion  of  more  variability  in  the  likelihood  ratio  and,  thus,  more  precise  inferences 
about  the  agent’s  multiple  tasks,  is  a  statement  about  the  size  of  the  full  variance-covariance 
matrix  of  the  likelihood  ratios  and  not  only  about  their  variances. 

We  use  the  VCM  dominance  condition  to  derive  results  that  hold  for  both  optimal  and  LEN 
contracts.  While  this  does  not  imply  that  LEN  results  have  general  validity,  those  that  hinge  on  the 
VCM  condition  do.  Eor  example,  our  sufficient  statistic  result  on  the  value  of  an  additional 
performance  measure  generalizes  Holmstrom’s  (1979)  informativeness  result  to  a  multi-task  set¬ 
ting  with  normally  distributed  performance  measures  and  Eeltham  and  Xie’s  (1994)  LEN  infor¬ 
mativeness  result  to  optimal  contracts.  The  VCM  condition  allows  a  generalization  of  the  condi¬ 
tional  controllability  principle  to  the  multi-task  setting  and  explains  why  the  LEN  results  of 
Eeltham  and  Xie  (1994)  point  in  the  right  direction.  Other  examples  include  the  value  of  more 
precise  information  and  linear  aggregation  of  performance  measures. 

We  find  the  concept  of  congruity  too  limited  for  ranking  information  systems  over  the  set  of 
implementable  actions.  Congruity  can  be  defined  only  in  LEN  models  on  the  equilibrium  path  in 
terms  of  the  relative  position  of  the  first-best  and  second-best  action  choice  vectors.  The  VCM 
condition  ranks  information  systems  for  all  implementable  actions,  not  only  on  the  equilibrium 
path.  Eor  both  the  LEN  model  and  square-root  utility  with  optimal  contracts,  the  VCM  condition 
also  determines  the  set  of  implementable  actions.  By  not  restricting  ourselves  to  the  equilibrium 


^  As  shown  in  the  proof  of  Proposition  1,  the  optimal  multiplier  on  the  incentive  compatibility  constraints  is  the  optimal 
multiplier  that  implements  a  given  action  when  the  principal  randomizes  over  the  two  information  systems.  In  the 
single-task  setting,  stochastic  dominance  of  the  distributions  of  the  likelihood  ratios  alone  is  sufficient  because  any 
(scalar)  multiplier  “cancels  out”  (see  Lemma  A.l  in  Appendix  A). 


The  Accounting  Review 


September  2010 
American  Accounting  Association 


1548 


Christensen,  §abac,  and  Tian 


path,  our  analysis  of  the  value  of  information  in  a  multi-task  setting  differs  significantly  from  the 
accounting  literature  that  followed  Feltham  and  Xie  (1994),  such  as  Feltham  and  Wu  (2000),  Datar 
et  al.  (2001),  or  Demski  et  al.  (2007). 

Our  analysis  demonstrates  that  some  LEN  results,  including  many  of  the  key  results  in 
Feltham  and  Xie  (1994),  are  robust  to  removing  the  linear  contract  restriction  and  shows  that  the 
key  assumption  driving  these  results  (via  the  VCM  condition)  is  the  linear  structure  of  the  perfor- 
mance  measures  with  normally  distributed  additive  noise  independent  of  action  choices.  We  do 
not  claim  that  results  from  LEN  models  are  likely  to  be  valid  in  most  performance  evaluation 
settings  of  interest."*  Rather,  we  provide  a  new  formulation  of  the  LEN  framework  in  terms  of 
likelihood  ratios  as  for  optimal  contracts,  which  can  provide  the  basis  for  a  deeper  understanding 
of  when  and  why  LEN  results  may  have  more  general  validity. 

II.  THE  GENERAL  MULTI-TASK  AGENCY  MODEL 

The  model  in  this  section  is  based  on  the  single-period  multi-task  model  of  Holmstrom  and 
Milgrom  (1991)  and  Feltham  and  Xie  (1994)  (without  the  LEN  restrictions  we  introduce  in  the 
next  section).  A  risk-neutral  principal  owns  a  technology  that  requires  productive  effort  on  m  tasks 
a  =  (ai ,  02  , ... ,  o^)  e  R'”  from  a  risk-  and  effort-averse  agent. ^  The  agent’s  preferences  over  com¬ 
pensation  and  effort  are  given  by: 

u{c,a)  =  u{c)k{a) -v{a),  (1) 

where  the  utility  of  compensation,  m(c),  is  increasing  and  concave,  and  the  effort  cost  is  either 
multiplicatively  separable  (v(a)  =  0)  or  additively  separable  (k{a)  =  1).  The  agent’s  effort  cost, 
v(o)  or  k{a),  is  increasing  in  all  directions  and  convex. 

The  principal  minimizes  the  expected  cost  of  compensation  c{y)  that  induces  the  agent  to 
choose  a  given  action  a.  The  contract  c(y)  the  principal  offers  the  agent  is  a  function  of  n 
performance  measures  y  e  R”  whose  distribution  has  joint  density  </)(y|a,77),  where  rj  is  the  infor¬ 
mation  system  that  produces  the  contractible  performance  measures  y.  The  contracts  are  bounded 
from  below,  c(y)  >  c,  to  avoid  the  Mirrlees  (1999)  problem  of  non-existence  of  an  optimal  solu¬ 
tion  to  the  principal’s  problem.  The  agent  accepts  the  contract  if  its  expected  utility  is  greater  than 
or  equal  to  his/her  reservation  utility  (the  agent’s  participation,  or  individual  rationality  constraint) 

W{c,a\r])  =  j  u(c(y))k(a)(f>(yla,y)dy  -  u(a)  >  [/.  (2) 

The  agent  chooses  action  a  to  maximize  his/her  expected  utility,  which  we  assume  is  equiva¬ 
lent  to  a  satisfying  the  first-order  condition  (the  agent’s  incentive  compatibility  constraint): 


^  Holmstrom  and  Milgrom  (1987)  introduced  the  LEN  framework  in  the  agency  literature  as  the  limiting  case  of  an 
optimal  contract  setting  and  the  study  has  been  much  cited  in  support  of  using  a  LEN  model  in  the  early  years  after  its 
publication.  However,  their  approach  has  been  shown  to  be  fragile  to  changing  any  of  the  assumptions,  see  Lambert 
(2001),  Christensen  and  Feltham  (2005),  or  Hellwig  and  Schmidt  (2002). 

If  the  agent’s  actions  affect  the  level  of  risk  in  the  performance  measures,  then  optimal  contracts  shield  the  agent  from 
low  outcomes,  which  is  not  possible  with  linear  contracts  (Christensen  and  Feltham  2005,  Ch.  21).  In  multiperiod 
settings,  optimal  full-commitment  contracts  exploit  intertemporal  correlation  to  a  greater  extent  than  can  be  achieved 
with  linear  contracts,  while  optimal  renegotiation-proof  contracts  yield  similar  results  to  those  obtained  with 
renegotiation-proof  linear  contracts  (Christensen  and  Feltham  2005,  Chs.  27  and  28). 

^  In  what  follows,  we  use  the  following  vector  and  matrix  notation.  Vectors  are  thought  of  as  column  vectors  in  all  cases, 
and  the  scalar  product  of  two  vectors  a  and  b  in  R™  is  denoted  by  a  b.  For  a  matrix  M  =  with  n  rows 

and  m  columns,  we  do  not  distinguish  between  the  matrix  and  the  associated  linear  operator  M :  R"  — >  R"  defined  by 
=  (S|sy<:m'”;7^;)is,s«-  We  denote  the  transpose  matrix  as  M^.  Throughout,  we  use  A^a-b  =  a-  Ab  and  {ABY 
=  B'^A'^. 
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V„t/“(c,a|  v)  =  j  u{c{y))[V^k{a)(f){y\a,  tj)  +  k{a)V Tj)]dy  -  V^v{a)  =  0,  (3) 

where  denotes  the  gradient  vector  of  partial  derivatives  {dl  dui  ,dl  da2  ,  ...,d  I  da^). 

The  optimal  contract  that  implements  a  minimizes  the  agent’s  expected  compensation  subject 
to  the  participation  and  incentive  compatibility  constraints  (2)  and  (3).  The  first-order  condition 
with  respect  to  c(y)  (when  c(y)  >  c)  characterizes  the  contract  as  follows: 


a),  (4) 

where  \{a)  and  p(a)  are  the  multipliers  for  the  participation  and  incentive  compatibility  con¬ 
straints,  respectively,  and  L{ip(y),7^a)  is  the  likelihood  ratio: 


1 


u'iciy)) 


=  k{a)X{a)  +  ^  J<.{a)  -f  ^(a)/z(a)  •  L{4>{y),  rj\ 


L{(f){y),7]\a)^ 


Va(f){y\a,Tj) 

4>{y\a,v) 


(5) 


Kim  (1995)  shows  that  the  efficiency  of  information  systems  for  implementing  a  single-task 
action  a  can  be  compared  using  the  second-order  stochastic  dominance  (SSD)  criterion  for  the 
distributions  of  the  likelihood  ratios  (or,  equivalently,  by  mean-preserving  spreads  using  the  fact 
that  likelihood  ratios  have  zero  mean).  In  a  single -task  setting,  the  likelihood  ratio  is  one¬ 
dimensional  (even  with  multiple  performance  measures),  while  in  a  multi-task  setting  it  is  multi¬ 
dimensional  and  cannot  be  compared  across  information  systems  using  the  SSD  criterion  (which 
is  a  single-dimensional  concept).  However,  the  likelihood  ratio  enters  the  characterization  of 
optimal  contracts,  see  (4),  as  the  single-dimensional  weighted  likelihood  ratio  \i{a)-L{(f),r]\a).  Our 
main  result  shows  that  the  SSD  condition  on  the  weighted  likelihood  ratio  is  sufficient  to  rank 
information  systems  and,  thus,  generalizes  Kim  (1995,  Proposition  1)  to  a  multi-task  setting. 

Proposition  1:  Assume  the  agent’s  utility  is  either  additively  or  multiplicatively  separable  in 
compensation  and  effort,  the  agent’s  incentive  compatibility  constraints  can 
be  represented  by  their  first-order  conditions,  and  that  a  is  an  implementable 
action  under  two  information  systems  and  rj^.  There  exists  a  vector  of 
multipliers  p(a)  such  that,  if  the  distribution  function  of  iJL{a)  ■  L{4> ,  \  a) 

second-order  stochastically  dominates  the  distribution  of  fx^a)  •  L{(f),  rj^\a), 
then  the  principal  prefers  rf-  to  if  for  implementing  action  a. 

All  proofs  are  in  Appendix  A. 

As  shown  in  the  proof,  p(a)  is  the  optimal  multiplier  that  implements  a  when  the  principal 
randomizes  over  the  two  information  systems.  In  the  single-task  setting,  stochastic  dominance  of 
the  distributions  of  the  likelihood  ratios  alone  is  sufficient  because  any  (scalar)  multiplier  \i{a) 
“cancels  out”  (see  Lemma  A.l  in  Appendix  A). 

We  do  not  restrict  the  type  of  utility  functions  (other  than  effort  cost  separability  in  (1))  or  the 
distributions  of  the  performance  measures.*’  Relying  on  the  first-order  approach  restricts  the  set¬ 
tings  for  which  Proposition  1  is  applicable.^  We  show  in  Appendix  B  that  the  same  SSD  condition 
on  the  weighted  likelihood  ratios  is  sufficient  to  rank  information  systems  in  settings  with  finite 


^  Kim  (1995)  and  Kim  and  Suh  (1991)  assume  additively  separable  effort  cost;  we  also  allow  multiplicatively  separable 
effort  cost  because  this  is  standard  in  the  LEN  models  we  examine  later.  We  use  integral  notation  throughout  for 
simplicity;  the  proofs  also  hold  for  discrete  distributions  of  the  performance  measures. 

^  There  are  no  sufficient  conditions  reported  in  the  literature  for  the  applicability  of  the  first-order  approach  in  multi-task 
settings.  Mirrlees  (1999),  Rogerson  (1985),  Jewitt  (1988),  Sinclair-Desgagne  (1994,  2009),  and  Conlon  (2009)  derive 
sufficient  conditions  for  the  applicability  of  the  first-order  approach  in  single-task  agencies. 
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action  sets:  Proposition  B.l  is  a  discrete  action  version  of  Proposition  1  that  does  not  rely  on  the 
first-order  approach.  With  a  finite  action  set,  the  ranking  involves  likelihood  ratios  for  all  the 
binding  incentive  compatibility  constraints,  while  the  first-order  approach  is  akin  to  assuming  that 
only  the  “adjacent”  incentive  constraints  are  binding. 

We  assume  that  the  action  a  is  implementable  with  either  information  system.  In  general,  we 
cannot  characterize  the  implementable  actions,  although  with  either  square-root  utility  or  under  the 
LEN  assumptions  the  problem  becomes  tractable  through  the  explicit  calculation  of  the  multipliers 
on  the  first-order  incentive  compatibility  constraints — see  Proposition  4  and  Lemma  1  below. 

The  SSD  condition  in  Proposition  1  depends  on  the  endogenously  determined  optimal  mul¬ 
tiplier  on  the  incentive  compatibility  constraints  and,  thus,  on  the  specific  characteristics  of  the 
incentive  problem,  such  as  agent  preferences.  In  the  following  subsections,  we  identify  conditions 
for  ranking  information  systems  independently  of  these  characteristics. 

Technological  and  Stochastic  Independence 

We  begin  with  a  setting  in  which  the  performance  measures  are  technologically  and  stochas¬ 
tically  independent;  that  is,  the  performance  measures  can  be  grouped  into  a  set  of  (possibly 
multi-dimensional)  task-specific  performance  measures  y  =  (yi ,  ...  ,y^)  e  S'",  such  that: 

</>(y|a,77)  =  0(yi|a,77) ...  (}){yja,7])  =  (j){y^\a^,r]) ...  (6) 

where  the  first  factorization  is  stochastic  independence  and  the  second  factorization  is  technologi¬ 
cal  independence.  Then,  the  likelihood  ratios,  L;(y,- 1  a, ,  rj)  =  I  >  v) !  4>{yi  I  >  ''?)>  ^re  sto¬ 
chastically  independent,  and  the  SSD  condition  in  Proposition  1  is  implied  by  SSD  conditions  for 
each  task  separately,  independently  of  the  multipliers;  the  strong  separability  of  performance 
measures  across  tasks  allows  “cancellation”  of  the  multipliers  as  in  the  single-task  case. 

Proposition  2:  Assume  the  performance  measures  are  technologically  and  stochastically  in¬ 
dependent  and  that  a  is  an  implementable  action  under  two  information  sys¬ 
tems  77'  and  rf-.  If  the  distribution  function  of  L^y/la,-,  iff)  second-order 
stochastically  dominates  the  distribution  of  L,(y,- 1  a,' ,  rf)  for  all  /  =  1,  ...  ,m, 
then  the  principal  prefers  rf  to  rf  for  implementing  action  a. 

This  result  is  parallel  to  the  single-task  result  in  Kim  (1995,  Proposition  1):  if  the  performance 
measures  are  technologically  and  stochastically  independent,  then  more  variation  in  the  likelihood 
ratio  for  each  task  implies  that  the  risk  premium  required  for  inducing  a  particular  multi-task 
action  can  be  reduced.^ 

Blackwell  Garbling 

We  next  show  that  Blackwell  garbling  implies  the  SSD  condition  in  Proposition  1  for  any 
multiplier.  For  performance  measures  yj  and  y2  associated  with  performance  evaluation  systems 
rf  and  rf,  we  say  that  yj  is  a  Blackwell  garbling  of  y2  if  there  exists  a  function  h{y\  ,yf)>Q  such 
that  the  following  three  conditions  hold  (Kim  1995;  Marschak  and  Miyasawa  1968,  Condition  B): 

0(yi|a)=  h{y^,y2)4>{y2\a)dy2  (7a) 


Dewatripont  et  al.  (1999)  discuss  the  action  implemented  in  equilibrium  in  a  context  with  multiple  tasks  and  implicit 
contracts  based  on  career  concerns.  Note,  however,  the  differences:  we  are  interested  in  endogenous  optimal  explicit 
contracts,  while  they  work  with  an  implicit  contract  that  is  the  expectation  of  an  unobservable  parameter  conditional  on 
observing  performance  measures. 

^  Proposition  2  also  holds  in  the  discrete-action  setting  of  Appendix  B  when  only  the  “adjacent”  incentive  constraints  are 
binding. 
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I 


h{y\,y:ddyi  =  1 


for  all  y2 


(7b) 


0<J  hiyi,y'2)dy2<^  for  all  Ji-  (7c) 

Proposition  3:  If  3^1  is  a  Blackwell  garbling  of  y2,  then  jx  ■  L{(f) ,  rj^  \  a)  second-order  stochas¬ 
tically  dominates  fx  ■  L{(f) ,  rf-  \  a)  for  all  g  and  all  a.  Thus,  for  any  aetion  a 
implementable  under  77^  and  rf-,  the  principal  prefers  rf-  to  77^  for  implement¬ 
ing  a. 

Proposition  3  generalizes  Kim  (1995,  Proposition  4)  to  our  multi-task  setting:  including  more 
“noise”  in  the  performance  measures  implies  less  precise  inferences  about  the  agent’s  action  and, 
thus,  a  higher  risk  premium  to  induce  a  particular  action.  In  addition,  the  proof  of  Proposition  3 
carries  through  without  modifications  to  the  setting  with  a  finite  action  set  in  Proposition  B.l, 
Appendix  B.^° 

VCM  Dominance 

As  demonstrated  by  Kim  and  Suh  (1991)  for  the  single-task  setting,  if  either  the  agent’s  utility 
function  is  restricted  or  the  performanee  measures  have  particular  distributions,  the  ranking  of 
information  systems  reduces  to  ranking  the  varianees  of  the  respective  likelihood  ratios.  In  a 
multi-task  setting,  under  similar  assumptions,  the  ranking  is  based  on  the  variances  of  the 
weighted  likelihood  ratios,  i.e.,  SL((/>,77|a)p-p,  where  is  the  variance-covariance  matrix 

of  the  likelihood  ratios.  To  obtain  a  ranking  independent  of  the  optimal  multipliers  on  the  incentive 
compatibility  constraints,  we  rank  the  varianee-covarianee  matrices  of  the  likelihood  ratios  and 
define  the  VCM  dominance  condition  as  follows. 

Definition  1  (VCM  Dominance):  Information  system  77^  strictly  dominates  information  sys¬ 
tem  77'  at  a  in  the  VCM  (Variance-Covariance  Matrix) 
sense  if: 

SL((/>,  77^|a)  >  SL((/>,  77^1a).  (8) 

The  strict  inequality  in  the  VCM  condition  (8)  means  that  the  difference  between  the  two 
matrices  is  positive  definite.  A  weak  dominance  relation  is  defined  by  a  weak  inequality  in  (8),  i.e., 
the  difference  between  the  matrices  is  semi-positive  definite.  The  VCM  condition  gives  only  a 
partial  ordering  in  the  multi-task  setting,  because  we  are  ranking  multi-dimensional  matrices. 
When  the  performance  measures  are  technologically  and  stochastically  independent,  their 
variance-covariance  matrix  is  diagonal  and  the  VCM  condition  reduces  to  comparing  the  variances 
of  the  likelihood  ratios  for  each  task  separately;  otherwise,  the  VCM  dominance  condition  must 
also  include  the  covariances  among  the  likelihood  ratios. 


Kim  (1995)  points  out  that  the  Blackwell  garbling  condition  is  a  strictly  stronger  condition  than  both  ranking  two 
information  systems  for  all  agency  contexts  (his  Proposition  3)  and  for  the  SSD  condition  (his  Proposition  5).  His  proofs 
rely  on  discrete  action  (and  discrete  performance  measure)  counter-examples;  because  our  Proposition  3  also  holds  in 
these  settings,  Kim’s  (1995)  analysis  carries  through  in  our  multi-task  setting. 

“  A  square  matrix  A  is  positive  (semi-positive)  definite  if  Ax-x  >  0  {Ax-x  >  0),  for  all  nonzero  vectors  x.  In  a  sense,  the 
VCM  condition  is  too  strong,  since  it  ensures  that  2L(  <^ ,  97^  |  a)  A  •  /r,  >  XC(  f f  for  all  /x  e  S'".  For  ex¬ 

ample,  it  is  easy  to  show  (using  a  similar  argument  as  in  Jewitt  [1988,  Lemma  1])  that  if  the  agent  has  additively 
separable  utility,  then  /a  •  V„u(fl)  &  0  (contrary  to  the  single-task  setting,  this  does  not  imply  that  all  the  multipliers  are 
positive).  Thus,  the  difference  between  the  variance  matrices  needs  to  be  positive-definite  only  on  this  half-space. 


The  Accounting  Review 


September  2010 
American  Accounting  Association 


1552 


Christensen,  §abac,  and  Tian 


The  next  two  results  demonstrate  the  sufficiency  of  the  VCM  condition  in  ranking  information 
systems  if  either  (1)  the  agent’s  utility  is  additively  separable  with  square-root  utility  of  compen¬ 
sation  or  (2)  the  performance  measures  are  normally  distributed  with  a  mean  linear  in  agent  effort 
and  a  variance  independent  of  agent  effort. 

Proposition  4:  Assume  that  the  agent’s  utility  is  additively  separable  with  square-root  utility 
of  compensation,  i.e.,  u{c  ,  a)  -2\lc  -  v{a),  and  that  the  optimal  compensa¬ 
tion  is  everywhere  interior.  For  any  implementable  action  a,  if  the  variance- 
covariance  matrices  of  the  likelihood  ratios  satisfy  4>,  rj^\  a) 
>  XL{<f> ,  77*  \a),  then  the  principal  strictly  prefers  information  system  77^  to 
77'  for  implementing  a.  The  set  of  implementable  actions  is  A{7])  =  {a 
e  R'”  I  V^u(a)  e  Im(SL(<^ ,  77 1  a))}. 

Proposition  4  generalizes  Kim  and  Suh  (1991,  Proposition  1)  to  the  multi-task  setting.'^  The 
implementable  actions  are  characterized  by  the  row  space  of  the  variance-covariance  matrix  of  the 
likelihood  ratios,  while  in  the  single-task  setting  any  action  can  be  implemented  as  long  as  a 
nontrivial  effort  level  can  be  induced. 

The  following  example  illustrates  how  the  implementable  actions  are  constrained  by  the 
performance  information  available.  Suppose  the  agent  performs  two  tasks  a  =  (aj  ,  02)  and  there  is 
only  one  performance  measure  available  for  contracting,  y  =  m^Ui  +  777202  +  If  0(e)  is  the  prob¬ 
ability  density  function  of  e,  then  the  probability  density  function  of  y  is  0(y  |  a)  =  ^y-TTZiOj 
-  777202)  the  likelihood  ratio  vector  is: 


L(0,y|o)  =  -  (777 1,7772) (07 0)(y  -  777iOi  -  7772O2)  =  -  (777,,7772)L((/7,e). 

The  variance-covariance  matrix  of  the  likelihood  ratios  is  singular  because: 


2L(0|o) 


777  i  777  i  7772 
777  i7772  777^ 


var(L(^,s)) 


(9) 


and  both  rows  are  collinear  with  the  vector  (7771 , 7772);  the  row  space  of  the  variance-covariance 
matrix  (9)  is  the  single-dimensional  space  (line)  determined  by  the  vector  (777]  ,7772).  In  other 
words,  the  only  implementable  actions  are  those  for  which  the  marginal  effort  cost  V^uCo)  is 
collinear  with  (7771 ,  m2).  In  particular,  for  a  simple  quadratic  effort  cost  of  the  form  v{a)  =  ra  •  a, 

1 A  ^ 

the  implementable  actions  must  satisfy  Oi  /  02  =  /  ^2- 

Next,  assume  the  performance  measures  are  joint  normally  distributed,  with  y  =  Ma  4-  s, 
where  M  is  an  77  X  777  matrix  of  sensitivities  of  the  77  performance  measures  to  the  m  tasks,  s  ~ 
A^(0,X),  and  %  is  the  positive  definite  n  X  n  variance-covariance  matrix  of  the  77-dimensional 
performance  measure  noise  vector  s.  The  777-dimensional  likelihood  ratio  is  then  L(0(y)  ,7]\a) 
=  -  Ma),  and  its  777  X  777  variance-covariance  matrix  is  SL(0,  77)  =  M^“'M.^^  Thus, 


The  ranking  of  information  systems  also  holds  in  the  discrete  action  setting  of  Appendix  B,  if  we  replace  the  VCM 
condition  with  ranking  the  variances  of  the  likelihood  ratios  in  (B7).  Proposition  4  is  derived  explicitly  from  the 
first-order  condition  for  optimal  contracts  (4)  with  square-root  utility,  and  not  from  the  general  SSD  condition  in 
Proposition  1.  Even  if  %L{4>,  rf-\a)>  rf  |  a),  it  is  not  possible  to  establish  the  SSD  condition  in  Proposition  1 

for  general  distributions. 

The  row  (column)  space  of  a  matrix  is  the  linear  subspace  of  vectors  that  can  be  written  as  a  linear  combination  of  the 
row  (column)  vectors  of  the  matrix.  Similarly,  the  “span”  of  a  set  of  vectors  is  the  linear  subspace  of  vectors  that  can  be 
written  as  a  linear  combination  of  these  vectors. 

'''  Because  the  implementable  actions  have  the  same  characterization,  this  example  also  works  for  LEN  models,  see 
Lemma  1  and  Lemma  2  below,  and  the  discussion  following.  For  LEN  models,  this  example  is  familiar  from  the 
discussion  of  noncongruity  in  Feltham  and  Xie  (1994). 

The  likelihood  ratio  depends  on  the  action  a,  but  is  a  linear  function  of  the  signal  y  and,  thus,  its  variance-covariance 
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are  scalar  zero-mean  normally  distributed  random  variables  and  the  SSD  condition 
is  equivalent  to  comparing  their  variances,  SL(0,77)p  •  p.  If  the  VCM  condition  tLif ,  rf-) 
rj^)  is  satisfied,  then  [SL(0,  rf-) -%L{4> ,  rf)']iJL-  yu  >  0  for  all  p  and  the  variance  is 
higher  for  system  rf  than  for  system  if.  Hence,  in  this  setting,  the  VCM  condition  implies  the 
general  SSD  condition  in  Proposition  1  and  is  sufficient  for  ranking  the  two  information  systems. 

Proposition  5:  If  the  performance  measures  are  normally  distributed  with  y  ~  N{MaX)  and 
if  ,  rf)>XL{4>,  rf),  then  the  principal  strictly  prefers  information 

system  rf  to  information  system  rf  for  implementing  a  given  action  a.  For  a 
weak  preference  relation,  one  replaces  the  strict  inequality  by  a  weak  inequal¬ 
ity.  In  addition,  if  yj  is  a  Blackwell  garbling  of  y^,  then  rf  dominates  rf  in 
the  VCM  sense. 

Proposition  5  generalizes  Kim  and  Suh  (1991,  Proposition  2)  to  the  multi-task  setting.  We 
cannot  prove  or  disprove  that  Blackwell  garbling  is  strictly  stronger  than  the  VCM  condition. 
Blackwell  garbling  requires  a  kernel  h{yi  ,y2)  that  is  independent  of  actions.  Kim’s  (1995)  ex¬ 
amples,  showing  that  neither  the  ranking  of  information  systems  in  an  agency  context  nor  the  SSD 
condition  imply  Blackwell  garbling,  are  based  on  the  local  nature  of  the  likelihood  ratio.  However, 
under  the  assumptions  in  Proposition  5,  the  VCM  condition  is  independent  of  the  action  induced. 
This  is  because  the  agent’s  action  does  not  affect  the  variance  of  the  performance  measures,  the 
variance-covariance  matrix  of  the  likelihood  ratio  does  not  depend  on  a  and  the  VCM  condition  is 
“global”  with  respect  to  a.  As  a  result,  Kim’s  (1995)  counter-example  (see  his  Proposition  3  and 
Proposition  5)  that  the  SSD  condition  does  not  imply  Blackwell  garbling  cannot  be  extended  in 
our  context  to  the  VCM  condition. 

We  also  cannot  prove  or  disprove  that  the  variance  of  likelihood  ratio  condition  is  necessary 
for  ranking  two  information  systems  in  the  agency  model.  We  conjecture  that  the  ranking  of 
information  systems  in  all  agencies  does  not  imply  the  VCM  condition.  The  VCM  condition  at  a 
requires  that  for  all  /r  e  S'”: 

%L((f),rf\a)/JL-  /LL>l,L{c/),r/^\a)/ii-  fji.  (10) 

By  Proposition  1,  when  the  performance  measures  are  normally  distributed,  (10)  is  sufficient  to 
rank  the  information  systems  for  all  implementable  actions  a  if  it  satisfied  for  all  optimal  multi¬ 
pliers  p(a).  However,  if  the  optimal  multipliers  p(a)  for  all  implementable  actions  a  do  not  span 
R"*,  even  if  the  principal  prefers  rf  over  rf  for  all  implementable  a,  we  cannot  expect  to  prove 
that  (10)  holds  for  all  fx  e  R"*.  This  is  not  an  issue  in  the  single-task  case  because 
XL{4>  ,  rf)  yu  ■  ,  rf)ix-  IX  for  one  nonzero  multiplier  p  is  sufficient  to  prove  that  the 

scalar  variances  satisfy  '%L{4> ,  rf)  >  XL{(f),  97*) — see  Kim  and  Suh  (1991,  Proposition  2). 

III.  THE  MULTI-TASK  LEN  MODEL 

In  this  section,  we  introduce  the  LEN  assumptions  in  the  multi-task  agency  model.  The  agent 
has  exponential  utility  of  consumption  u{c)  =  -exp(-r[c  -  /<:(a)])  with  multiplicatively  separable 
effort  cost  /c(a),  which  is  a  convex  function  of  the  agent’s  action  and  is  increasing  in  each  task 
separately.  The  principal  offers  the  agent  a  linear  compensation  contract  c(y)  =  /  +  v-y,  where  y 
is  a  vector  of  n  performance  measures  reported  by  information  system  77.  As  in  the  setting  of 
Proposition  5,  the  performance  measures  are  joint  normally  distributed  with  y  ~  V(Ma,X). 


matrix  does  not  depend  on  the  agent’s  action.  This  is  reflected  in  the  notation  L{(f>,rj\a)  for  the  likelihood  ratio  and 
%L{(p,Tj)  for  its  variance-covariance  matrix. 

We  have  this  result  for  the  “standard”  LEN  model:  in  Proposition  7  below,  the  ranking  of  information  systems  in  all 
agencies  on  the  equilibrium  path  implies  the  VCM  condition. 
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The  LEN  assumptions  imply  the  agent’s  compensation  is  normally  distributed  and  the  agent’s 
preferences  have  a  mean-variance  representation  given  by  the  certainty  equivalent: 

C¥,{c{y)\a,r])  =  E[c{y)\a,rj\-Kydx{c{y)\r])  -  K{a) .  (11) 

The  principal  seeks  to  induce  an  action  a  with  a  linear  contract  c(y)  at  minimum  expected 
cost.  The  agent  accepts  the  contract  if  its  certainty  equivalent  is  greater  than  or  equal  to  his/her 
reservation  certainty  equivalent  CEq  (the  agent’s  participation,  or  individual  rationality  constraint): 

CE(c(y)|a,77)>CEo.  (12) 

The  agent’s  action  choice  maximizes  his/her  certainty  equivalent  of  compensation  net  of  effort 
costs,  which  in  this  case  is  equivalent  to  a  satisfying  the  first-order  (incentive  compatibility) 
constraint: 

V^CE(c(y)la,77)  =  V^E[c(y)|a,77]- V„/c(a)  =  0,  (13) 

where  the  agent’s  action  does  not  impact  the  variance  of  compensation  because  it  does  not  impact 
the  variance  of  y  and  the  contracts  are  linear.'^  The  principal’s  problem  is  to  minimize  expected 
compensation  cost  over  the  vector  space  of  linear  contracts  C,  minc(y)  ecE[c(y)  \ci  ,v\  subject  to 
the  individual  rationality  constraint  (12)  and  the  incentive  compatibility  constraint  (13)  for  a  given 
action  a. 

Lemma  1:  Under  the  assumptions  of  the  LEN  model: 

(i)  Eor  an  implementable  action  a,  an  optimal  multiplier  on  the  incentive  compatibility 

constraint  (13)  is  denoted  by  ix  =  r'l,L{(f) ,  and  is  characterized  by: 

=  V  aK{a).  (14) 

r 

(ii)  The  set  of  implementable  actions  A(77)  is  characterized  by  the  row  space  of  M  and  the 
cost  of  effort  K{a),  A{rj)  =  {a  e  R'”  1  VQ«:(a)  e  Im(M^}. 

(iii)  Eor  any  a  e  Airj),  the  optimal  linear  contract  that  implements  a  is  characterized  by: 

c(y)  =  CEq  +  k(a)  -f-  ^rSL(^,  77)“’ V^/c(a)  •  V„/c(a)  +  SL(</),  77)“' /<:(«)•  L(</),  r]\a). 

(15) 

The  contract  is  uniquely  determined,  even  when  the  multiplier  p  that  satisfies  (14)  is  not. 

Lemma  1  recasts  the  LEN  model  in  a  new  form.  Using  the  certainty  equivalent  representation 
of  the  agent’s  utility,  we  rewrite  the  principal’s  problem  to  include  the  mean  and  variance  of  the 
agent’s  compensation;  this  leads  to  optimal  linear  contracts  that  depend  on  the  likelihood  ratio  and 
on  multipliers  that  can  be  explicitly  characterized. 

Action  a  is  implementable  if,  and  only  if,  the  marginal  effort  cost  vector  VQ/c(a)  is  in  the 
image  of  SL((/>,77),  which  is  equivalent  to  ax{a)  being  in  the  row  space  of  the  sensitivity  matrix 
M.  Eor  example,  if  the  agent’s  effort  cost  is  of  the  simple  quadratic  form  Ka  a,  where  ^  is  a 
positive  definite  matrix,  and  XL{(f),r})  is  non-singular,  or  equivalently  M  is  of  rank  m,  then  any 


We  allow  for  the  possibility  that  the  first-order  condition  (13)  does  not  uniquely  determine  the  agent’s  action  choice.  For 
example,  if  K{a)  =  K{p-a),  then  V„/c(p  •  a)  =  pK'{p  ■  a),  and  the  agent  is  indifferent  between  vectors  of  tasks  for  which 
p  ■  a  is  the  same.  In  this  case,  we  assume  that  the  principal  can  specify  the  agent’s  action. 
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action  a  e  K"*  is  implementable.  For  an  effort  cost  function  of  the  form  K(p-a),  all  actions  are 
implementable  if  p  is  in  the  row  space  of  M,  and  the  principal  specifies  the  agent’s  action  among 
those  over  which  the  agent  is  indifferent.  If  p  is  not  in  the  row  space  of  M,  then  there  is  no  interior 
solution  for  the  agent’s  action  choice.^* 

An  optimal  LEN  contract  pays  the  agent  a  fixed  wage  consisting  of  his/her  reservation  cer¬ 
tainty  equivalent,  the  cost  of  effort,  the  risk  premium,  and  a  linear  mean  zero  variable  wage.  Both 
the  risk  premium  and  the  variable  wage  depend  on  the  multiplier  p  on  the  agent’s  incentive 
compatibility  constraint,  which  in  turn  depends  on  the  action  a  to  be  implemented.  The  LEN 
model  allows  an  explicit  characterization  of  this  multiplier,  although  it  may  not  be  uniquely 
determined  if  is  singular.  However,  as  we  show  in  the  proof  of  Lemma  1,  both  pi  ■  V  ^Kia) 

and  p-L((^,77  |  a)  are  uniquely  determined;  this  in  turn  allows  an  explicit  calculation  of  the  expected 
cost  of  implementing  action  a  with  an  optimal  linear  contract: 

E[c(y)|a,77]  =  CEo+  /c(a)  +  -rSL(0,77)“^V„/c(a)  •  Va/c(fl).  (16) 

Because  the  principal  prefers  the  information  system  that  allows  implementing  a  with  a  lower  risk 
premium,  and  this  is  determined  by  the  variance-covariance  matrix  of  the  likelihood  ratio,  we  are 
led  again  to  the  VCM  condition. 

To  compare  the  efficiency  of  two  information  systems  in  implementing  a  given  action,  we 
need  to  consider  actions  that  are  implementable  under  both.  The  VCM  condition  ranks  the  sets  of 
implementable  actions. 

Lemma  2:  If  ,  rf')  >  77’),  then  A(77^)  D  A(77^). 

In  other  words,  if  rf-  dominates  77^  in  the  VCM  sense,  the  principal  can  implement  at  least  as 
many  actions  with  if  as  with  77^ ;  in  particular,  the  actions  implementable  under  both  77'  and  rf  are 
the  same  as  the  actions  implementable  under  rf. 

The  next  result  demonstrates  the  sufficiency  of  the  VCM  condition  for  ranking  information 
systems  in  LEN  models  when  the  same  actions  can  be  implemented  with  both. 

Proposition  6:  Under  the  LEN  assumptions  for  two  information  systems  77^  and  rf,  if  the 
row  spaces  of  and  are  equal,  then  the  strict  VCM  condition 

SL(  (f) ,  if)  >  77^)  is  sufficient  for  the  principal  to  strictly  prefer  if  to 

77*  for  implementing  any  action  a  &  A{if)  =A{if).  The  weak  VCM  condi¬ 
tion  is  sufficient  for  a  weak  preference  of  if  over  if. 

The  VCM  condition  is  sufficient  for  ranking  information  systems  in  the  LEN  case  because  the 
optimal  linear  contracts  depend  on  the  same  linear  function  of  the  likelihood  ratio  as  the  general 
optimal  contracts.  The  VCM  condition  in  Proposition  6  does  not  depend  on  particular  functional 
forms  for  the  principal’s  expected  benefit  and  for  the  agent’s  action  cost  K{a).  Thus,  we  rank 
performance  measures  in  any  agency  for  all  implementable  actions  and  not  only  for  the  optimal 
action  in  a  particular  parametrization  of  the  model.  However,  the  set  of  implementable  actions 
A(  77^)  C  A(  77^)  may  depend  on  the  agent’s  effort  cost  function  /c(a):  if  either  or  M2  are  not  of 
full  rank  m,  the  set  of  implementable  actions  is  limited  and  enters  its  characterization  (see 

also  (A28)  in  Appendix  A  and  the  subsequent  discussion). 


**  As  a  simple  example,  consider  the  case  of  a  single  performance  measure  and  two  tasks  specified  as  y  =  WiOi  +  m2a2 
+  s,  K{a^ ,  02)  =  r:{ai  +  flj),  and  mj  >  m2.  For  any  contract  c  =  /  -f  vy,  if  v  >  0,  and  fl; ,  <22  are  not  exogenously  bounded, 
the  agent  can  choose  arbitrarily  large,  02  =  -Oi.  and  derive  an  unboundedly  large  benefit  i;(mi  -  while  keeping 
effort  costs  constant  at  /c(0).  This  leads  to  a  contract  without  incentives,  v  =  0  as  in  Holmstrom  and  Milgrom  (1991),  or 
to  boundary  solutions  determined  by  constraints  on  Oj ,  a2- 
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We  have  assumed  that  the  same  actions  are  implementable  under  both  information  systems. 
When  'liL{4> ,  77O  is  non-singular  and,  thus,  has  full  rank  m,  all  actions  are  implementable  under 
both  information  systems  because  both  variance-covariance  matrices  have  full  rank.  Otherwise, 
when  SL(  ^ is  singular  and  not  all  actions  are  implementable,  if  the  row  spaces  of  M  j  and  M2 
are  equal,  then  the  same  actions  are  implementable  with  both  systems.  If  rf-  allows  implementa¬ 
tion  of  a  larger  set  of  actions  than  rj^,  then  Proposition  6  is  still  applicable  with  some  minor 
modifications.  Specifically,  if  the  VCM  condition  is  satisfied  on  a  linear  subspace  S  in  the  inter¬ 
section  of  the  row  spaces  of  and  M2,  and  if  S  is  invariant  to  both  variance-covariance  matrices 
XL{(f>,  7/),  then  the  principal  prefers  rf  to  77'  for  implementing  any  action,  such  that  VaK{a) 
e  S. 

The  “Standard”  Multi-Task  LEN  Model 

Feltham  and  Xie  (1994)  use  a  particular  parametrization  of  the  principal’s  benefit  function  and 
the  agent’s  effort  cost  that  allows  an  explicit  characterization  of  the  optimal  action  to  be  induced 
in  equilibrium.  We  re-examine  their  “standard”  formulation  of  the  LEN  model  and  show  that  the 
VCM  condition  is  both  necessary  and  sufficient  to  rank  information  systems  on  the  equilibrium 
path.  As  in  Feltham  and  Xie  (1994),  we  assume  the  principal’s  expected  benefit  is  b{a)  =  b  ■  a, 
where  b  =  {bi, ... ,  b^)  are  the  marginal  benefits  from  each  of  the  agent’s  tasks,  and  K{a)  =  2^-  a. 
The  principal  maximizes  his/her  expected  benefit  net  of  compensation  costs,  U^{a  ,rj)  =  b-  a 
-E[c(y)  I  a,  77],  by  choosing  the  optimal  action  to  be  induced  in  equilibrium  with  an  optimal 
linear  contract  as  determined  in  Lemma  1.  The  next  result  characterizes  the  equilibrium. 

Lemma  3:  In  equilibrium,  the  optimal  action  induced  is: 
a  =  (2L(0,  77)  -I-  r/)“^XL(0,  rj)b, 

and  the  optimal  linear  contract  and  the  principal’s  optimal  expected  surplus  are: 
c(y)  =  CEo  +  ^(SL(0,  77)  -I-  r/)"‘2L(0,  rj)b-b  +  (XL(</»,  77)  -l-  rlY'^b  ■  L{(t>,  y\a) 

(17) 

UP  =  i(/-  r(XL(</.,  77)  -t  rir^)b  ■  b  -  CEq.  (18) 

In  Lemma  1  we  showed  that  the  optimal  linear  contract  is  linear  in  the  likelihood  ratio,  and 
the  LEN  assumptions  allow  us  to  explicitly  characterize  the  multipliers  X  and  p.  Lemma  3  deter¬ 
mines  all  the  equilibrium  parameters — action,  contract,  and  payoffs — in  terms  of  the  variance- 
covariance  matrix  of  the  likelihood  ratio  in  the  parametrization  of  the  “standard”  LEN 

model,  the  optimal  multiplier  p  and  the  optimal  action  in  equilibrium  depend  on  XL(</>,77).'^  Using 
the  explicit  expression  (18)  for  the  principal’s  surplus  in  equilibrium,  we  can  rank  information 
systems  for  inducing  the  optimal  action  for  an  arbitrary  marginal  benefit  vector  b. 

Proposition  7:  The  principal  strictly  prefers  if  to  77'  for  implementing  the  optimal  action  for 
all  benefit  vectors  b,  if,  and  only  if,  rf  strictly  dominates  77'  in  the  VCM 


To  reconcile  the  expressions  for  the  principal’s  utility  in  (18)  from  Lemma  3,  (A40)  in  Appendix  A,  and  those  in 
Feltham  and  Xie  (1994),  it  is  sufficient  and  straightforward  to  show  that: 

SL(</.,  V)  +  riy^  =  +  r/)-'  =  +  rtY^M . 
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sense,  XL{(f),  rf-')  >  XL{(f) ,  y^).  A  weak  preference  relation  in  all  agencies  is 
equivalent  to  the  weak  VCM  condition. 

The  sufficiency  of  the  VCM  condition  follows  directly  from  the  explicit  expression  of  the 
principal’s  expected  utility  (18).  To  see  that  the  VCM  condition  in  Proposition  7  is  also  necessary 
if  the  ranking  of  information  systems  holds  on  the  equilibrium  path  for  all  b  e  R"*,  note  that 
{XL{4>,  rf)  -  SL(0  ,  •  h  >  0  for  all  b  implies  %L{(f) ,  rf-)  -  Xhitf),  y^)  >  0.  Two  information 

systems  are  equivalent  if  they  have  the  same  variance-covariance  matrix  of  the  likelihood  ratio. 

For  the  stemdard  LEN  model  with  a  quadratic  effort  cost,  the  set  of  implementable  actions  is 
the  column  space  of  the  variance-covariance  matrix  of  the  likelihood  ratio,  which  is  the  same  as 
the  row  space  of  the  sensitivity  matrix  M  (this  follows  from  (14)  by  using  VaK{a)  =  a);  in  our 
notation,  A{y)  =  lm(%L{cf) ,  y))^lm{M'^.  By  Lemma  2,  if  the  VCM  dominance  condition  in 
Proposition  7  is  satisfied,  then  the  space  of  implementable  actions  under  y^  is  at  least  as  large  as 
that  under  yK  By  Proposition  6,  any  action  that  is  implementable  under  y^  is  implementable  under 
77^  at  the  same  or  at  a  lower  cost.  Thus,  it  is  not  surprising  that  the  VCM  condition  is  sufficient  to 
rank  information  systems  on  the  equilibrium  path. 

The  ranking  is  valid,  even  if  the  variance-covariance  matrices  of  the  likelihood  ratios  are 
singular  and  the  row  spaces  of  and  M2  are  different  (i.e.,  the  sets  of  implementable  actions 
differ  under  the  two  information  systems).  As  opposed  to  Proposition  6,  where  we  rank  the 
optimal  linear  contracts  for  the  same  actions  that  are  implementable  under  both  information 
systems,  in  Proposition  7,  we  rank  the  information  systems  on  the  equilibrium  path  for  the  en¬ 
dogenously  determined  optimal  actions  (which  generally  differ  for  different  information  systems). 

Although  the  VCM  condition  is  shown  to  be  sufficient  for  ranking  information  systems  in 
Proposition  6,  it  does  not  follow  that  this  condition  is  necessary,  even  if  the  principal  prefers  one 
information  system  over  the  other  for  inducing  any  implementable  action.  For  example,  under  the 
conditions  for  which  we  proved  Proposition  6,  if  XL(0,77)  is  non-singular,  but  K{a)  =  Kip-a),  any 
action  is  implementable  with  multipliers  that  are  all  proportional  to  the  single  vector  ,  77)  p 
(also  footnote  17).  It  follows  that  the  principal  prefers  rf-  to  77^  for  implementing  any  action  a 
e  S'”  if,  and  only  if,  XL(  (f ,  77^"^  ■p>XL{cf),  •  p.  Being  satisfied  for  a  single  vector,  this 

does  not  rank  the  variance-covariance  matrices  and,  thus,  does  not  imply  the  VCM  condition. 

There  is  no  notion  of  “congruity”  here.  In  Feltham  and  Xie  (1994)  and  Datar  et  al.  (2001), 
congmity  is  defined  relative  to  h  e  R'”.  As  Demski  et  al.  (2007)  show,  congruity  can  be  interpreted 
in  terms  of  the  relative  positions  of  the  benefit  vector  b  and  the  row  space  of  M  (which  is  the  set 
of  implementable  actions).  The  VCM  condition  holds  uniformly  with  respect  to  b  and  only  deter¬ 
mines  how  large  is  the  set  of  implementable  actions.  Improvements  in  congruity,  when  the  set  of 
implementable  actions  does  not  change,  are  not  reflected  in  the  variance-covariance  of  the  likeli¬ 
hood  function,  because  such  improvements  are  about  the  position  of  a  given  b  relative  to  M. 

IV.  APPLICATIONS  OF  THE  VCM  CONDITION 

In  this  section  we  present  a  series  of  results  that  rely  on  the  sufficiency  of  the  VCM  condition 
for  ranking  information  systems  for  the  three  previously  considered  particular  cases  with  normally 
distributed  performance  measures:  (1)  optimal  contracts  that  implement  a  given  action,  as  in 
Proposition  5;  (2)  optimal  LEN  contracts  that  implement  a  given  action,  as  in  Proposition  6;  and 
(3)  optimal  LEN  contracts  that  implement  the  optimal  action  in  equilibrium,  as  in  Proposition  7. 

Value  of  Additional  Information 

Consider  two  joint  normally  distributed  performance  measures  ,  y2^  where  y,-  =  Mp  +  Sj, 
and  (81 ,  £2)  ~  MO  ,  X)  with  joint  variance-covariance  partitioned  as: 
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Sn 

^12 

S21 

S22  - 

(19) 


We  say  that  >2  has  no  incremental  value  over  if  the  principal  is  indifferent  between  the 
information  systems  {yj}  and  either  with  respect  to  implementing  any  action  that  is 

feasible  under  both  information  systems  or  for  implementing  the  optimal  actions.  Our  next  result 
characterizes  several  equivalent  conditions  for  y2  to  have  no  incremental  value  over  yj. 

Proposition  8:  If  the  performance  measures  y^  and  y2  are  joint  normally  distributed  as  above, 
then  the  following  conditions  are  equivalent: 

(i)  y2  has  no  incremental  value  over  yi  in  implementing  a  given  action  with  either  optimal 
or  LEN  contracts,  or  in  implementing  the  optimal  action  with  LEN  contracts; 

(ii)  there  exists  a  matrix  Q  such  that  M2  =  QM^  and  S21  =  2Sn; 

(iii)  there  exists  a  matrix  Q  such  that  y2  =  Qyy  +  8,  where  e  ~  A^(0  ,  SJ,  and  yi  and  e  are 
independent; 

(iv)  yi  is  a  sufficient  statistic  for  (y,  ,y2)  for  all  a:  there  exists  a  function  h{yi  ,y2)  such 

that  4>{y^ ,  ^2  I  «)  =  h{y\ ,  I  a),  for  all  yj,  y2  and  all  a; 

(v)  SL(0,yi  ,y2)  =  SL(0,yi); 

In  addition,  any  of  the  equivalent  conditions  (i)-(v)  imply  that  y2  is  a  Blackwell  garbling  of  yj.^” 

Conditions  (ii)-(v)  are  purely  statistical  properties  of  the  performance  measures,  one  of  them 
being  that  yj  is  a  sufficient  statistic  for  (yj  ,y2).  The  key  link  between  condition  (v)  and  the 
ranking  of  inclusive  information  systems  in  (i)  is  provided  by  the  VCM  condition  that  underlies 
our  three  ranking  results  for  normally  distributed  performance  measures:  Proposition  5  on  imple¬ 
menting  a  given  action  with  optimal  contracts;  Proposition  6  on  implementing  a  given  action  with 
LEN  contracts;  and  Proposition  7  on  implementing  the  optimal  action  with  LEN  contracts. 

Condition  (v)  in  Proposition  8  is  the  same  as  the  weak  mean-preserving  spread  (MPS)  con¬ 
dition  of  Kim  (1995)  because  we  have  normally  distributed  performance  measures.  In  our  case,  the 
cumulative  distribution  functions  of  the  likelihood  ratios  are  mean-preserving  spreads  of  each 
other,  either  for  all  a  or  for  no  a  because  the  agent’s  effort  does  not  affect  the  vaiiance  of  the 
performance  measures.  As  a  result,  the  equivalence  between  condition  (v)  and  the  sufficient 
statistic  condition  (iv)  is  similar  to  Kim  (1995,  Proposition  2).  Condition  (ii)  in  Proposition  8  is  the 
same  as  condition  (13)  in  Eeltham  and  Xie  (1994)  and  is  equivalent  to  condition  (iii)  that  one 
performance  measure  is  a  noisy  linear  function  of  the  other. 

As  in  Holmstrdm  (1979)  and  Eeltham  and  Xie  (1994),  we  can  use  the  sufficient  statistic 
condition  to  determine  whether  an  additional  signal  is  incrementally  valuable  in  a  multi-task 
agency.  Proposition  8  generalizes  Holmstrom  (1979,  Proposition  3)  to  a  multi-task  agency  with 
normally  distributed  performance  measures  whose  mean  is  linearly  affected  by  effort.  That  is,  an 
additional  performance  measure  y2  is  of  no  value  if,  and  only  if,  yj  is  a  sufficient  statistic  for  the 
pair  (yi  ,y2).  Our  result  also  incorporates  Eeltham  and  Xie  (1994,  Proposition  3)  on  the  value  of 
additional  information  for  LEN  models  on  the  equilibrium  path.  Eeltham  and  Xie’s  (1994)  proof 
relies  on  the  “standard”  LEN  formulation  and  direct  calculation  of  payoffs.  We  rely  instead  on  the 
VCM  condition  to  rank  information  systems  when  the  performance  measures  are  normally  dis¬ 
tributed.  Thus,  we  extend  Eeltham  and  Xie  (1994,  Proposition  3)  to  optimal  linear  contracts  that 
implement  any  given  action  in  a  more  general  formulation  of  the  LEN  model. 


20 

The  Blackwell  garbling  condition  does  not  imply  any  of  the  equivalent  conditions  (i)-(v).  If  y,  is  a  Blackwell  garbling 
of  y2,  then,  by  Proposition  5,  y2  weakly  dominates  y,  in  the  VCM  sense.  However,  if  y2  weakly  dominates  y,  in  the 
VCM  sense,  it  does  not  follow  that  yj  has  no  incremental  value  over  y2. 
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The  linear  structure  of  the  normally  distributed  performance  measures  not  only  guarantees  the 
sufficiency  of  the  VCM  condition  for  ranking  information,  but  also  ensures  the  condition  is 
independent  of  the  agent’s  actions.  This  is  important  in  that  it  avoids  the  problems  associated  with 
the  “all  a  or  no  a  condition.”  Amershi  et  al.  (1990)  show  that  the  all  a  or  no  a  condition  must  hold 
when  the  sufficient  statistic  condition — which  is  stated  for  all  a — is  necessary  for  additional 
information  to  have  no  value.^'  The  all  a  or  no  a  condition  requires  that  the  information  aggregate 
\i{a)-L{cf),7j\d)  that  is  sufficient  to  implement  a — and  is  generally  local  with  respect  to  a — matches 
the  global  nature  of  the  sufficient  statistic  condition  with  respect  to  a.  The  optimal  contract  that 
implements  a  can  be  written  on  a  sufficient  statistic  with  respect  to  all  a  if,  and  only  if,  the 
probability  distributions  of  performance  measures  satisfy  the  all  a  or  no  a  condition. 

In  general,  even  in  a  single-task  agency,  the  necessity  of  the  equivalent  conditions  (ii)-(v)  for 
additional  information  to  have  no  value  does  not  hold.  Which  information  system  is  preferred  also 
depends  on  which  action  the  principal  wants  to  induce.  For  a  multi-task  agency,  p(a)  is  an 
m-dimensional  vector  and  depends  on  the  implemented  actions  (for  example,  in  the  TEN  case, 
XLif,  r]i)fJb{a)  I  r  =  VaK{a),  see  (14)).  Thus,  even  if  the  information  content  of  the  likelihood  ratio 
is  independent  of  a,  the  information  content  of  \i{a)-L{cf),r]\a)  changes  with  For  this  reason,  the 
SSD  condition  on  p(a) •!(</),  77]  a)  in  Proposition  1  is  unlikely  to  be  necessary  for  ranking  informa¬ 
tion  systems  in  general.  For  the  linear  structure  of  normally  distributed  performance  measures 
assumed  here,  the  SSD  condition  can  be  replaced  with  the  stronger  VCM  condition  that  applies  for 
all  p  and  does  not  depend  on  a.  The  VCM  condition  has  the  global  nature  to  evaluate  the 
information  content  of  performance  measures  and  allows  the  equivalent  conditions  (ii)-(v)  to  be 
necessary  for  additional  information  to  have  no  value. 

Conditional  Controllability 

The  controllability  principle  is  fundamental  in  performance  evaluation:  a  manager  should  only 
be  held  accountable  for  that  which  s/he  can  control.  A  key  implication  of  Holmstrom’s  (1979) 
results  on  the  value  of  an  additional  performance  measure  is  that  even  though  an  agent  cannot 
affect  a  specific  performance  measure,  this  measure  may  well  enter  the  agent’s  optimal  contract  if 
it  is  correlated  with  uncontrollable  events  affecting  other  primary  performance  measures.  This  is 
Antle  and  Demski’s  (1988)  conditional  controllability  principle:  an  additional  performance  mea¬ 
sure  is  valuable  if  it  is  controllable  by  the  agent  conditional  on  other  available  performance 
information. 

Consider  two  sets  of  performance  measures  77^  =  {yi}  and  77^  =  {yi  ,y2}  as  in  the  preceding 
subsection.  Define  ^  =  {z},  where  z  is  the  “filtered”  measure  z  =  y2-E[y2|}'i  = 

+  S2iS7|(ji -^1^^)]-  The  joint  density  of  the  two -performance  measures  can  be  written  as 
,  z  I  a)  =  0(yi  I  a)(/)(z  |  a).  The  likelihood  ratio  for  the  joint  distribution  is: 


L{4>,yi,z\a) 


^a4>i.z\a) 

</>(yi|a)  4>{z\a) 


=  Li(f),  7}^\a)  +  L{(f),^\a) . 


(20) 


The  two  likelihood  ratios  are  stochastically  independent  because  the  performance  measures  y  i  and 
z  are  stochastically  independent.  It  follows  that: 


The  all  a  or  no  a  condition  holds  if  the  likelihood  ratio  measure  yL{a)-Li(l>,v\a)  has  the  same  information  content  for  all 
ia-ixia))  pairs,  where  p(a)  is  the  optimal  multiplier  in  the  optimal  contract  that  implements  a.  That  is,  for  any  action^to 
be  implemented,  d,  and  for  all  other  actions  a,  there  exists  a  function  such  that  ipfix{a)-L{<t),d)) 

=  Ma)  ■  L{4> ,  r]\a).  j-r>  1 

“  The  all  a  or  no  a  condition  is  also  the  reason  why  the  relative  incentive  rates  in  the  optimal  linear  aggregate  ot  Banker 

and  Datar  (1989)  are  independent  of  a  in  the  single-task  case,  but  vary  with  a  in  the  multi-task  case. 
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XL{ct>,  rj\C)  =  SL(0, 77O  +  ^L{4>,0  ^  SL(</>,  rj^) .  (21) 

In  other  words,  the  filtered  performance  measures  ^  =  {z}  are  incrementally  valuable  if  they  are 
controllable.  Note  that  {rf  is  equivalent  to  rf-.  In  addition,  z  is  controllable 
(V^^(z  I  a)  /  0(z  I  <3)  0  on  a  set  of  realizations  of  z  of  positive  measure),  if,  and  only  if,  y2  is 
conditionally  controllable  ^ a4>iyi  I  3^1  >  <2)  /  (i>{y2  |  yi ,  «)  ^  0  on  a  set  of  realizations  of  (yj  ,  y2)  of 
positive  measure).  Thus,  the  additional  performance  measures  y2  ^0  valuable  if  they  are  condi¬ 
tionally  controllable. 

This,  of  course,  is  Antle  and  Demski’s  (1988)  conditional  controllability  principle  for  the 
special  case  of  normal  distributions,  and  it  can  also  be  found  in  Feltham  and  Xie  (1994,  Propo¬ 
sition  4)  for  the  special  case  of  a  multi-task  LEN  model.  We  generalize  Antle  and  Demski  (1988) 
to  the  multi-task  case  and  Feltham  and  Xie  (1994,  Proposition  4)  to  the  optimal  contract  case.  The 
same  proof  works  for  both  linear  and  optimal  contracts  because  it  relies  on  the  VCM  condition. 

If  y2  is  not  controllable,  then  y2  =  -^2^  +  with  M2  =  0,  y2  ^n^Y  still  be  conditionally  con¬ 
trollable  if  £2  is  correlated  with  Sj.  The  variance-covariance  niatrix  of  the  likelihood  ratio  with  zf- 
is: 


SL(0,772)  =  [m[  0] 


^11 

S12 

-1 

Ml 

-^21 

^22. 

.  0  _ 

(22) 


The  column  space  of  SL(0,  r^)  is  the  same  as  the  column  space  of  XL{(/)  ,  77')  and  equal  to  the 
row  space  of  M^.  By  Proposition  4  and  Lemma  1,  the  sets  of  implementable  actions  with  square- 
root  utility  and  in  the  LEN  model  are  the  same  for  77^  and  77^,  A(  77^)  =  A(  77^).  Thus,  the  uncon¬ 
trollable  performances  measures  ^2  do  not  expand  the  set  of  implementable  actions,  but  they  can 
be  used  to  reduce  the  risk  premium  to  induce  a  particular  action  if  S21  ^  0. 

Value  of  More  Precise  Information 

Assume  there  are  two  information  systems  77'  and  rf-  with  the  same  number  of  performance 
measures  and  the  same  sensitivity  to  effort  M,  where  y,  =  Ma  +  £,  and  £  ~  N{0  ,  S,),  such  that  one 
is  “noisier”  than  the  other  in  the  sense  that  2(77^  >  2(77^).  The  VCM  condition  holds  because 
2l( 0,7/)  =  22'  >  2]\  and  for  any  vector  /jl  e.  R"*  we  have; 

Tf-)jx  ■  jx  -  •  IX  =  'V^Mjx  •  Mfx  >  27'M7z  •  M/x  =  M^27^M/x  ■  /x 

=  2L(0,77O7x- /X.  (23) 

In  other  words,  the  less  noisy  system,  that  is  rf-,  is  preferred  for  inducing  any  action  a. 

Informativeness  and  Linear  Aggregation  in  Single-Task  LEN  Models 

We  now  show  that  the  optimal  linear  aggregate  of  two  signals  maximizes  the  variance  of  the 
likelihood  ratio,  which  is  the  informativeness  measure  in  the  single-task  case.  This  links  the  linear 
aggregation  results  of  Banker  and  Datar  (1989)  to  the  informativeness  results  of  Kim  and  Suh 
(1991)  and  explains  why  we  get  the  “correct”  linear  aggregation  with  LEN  models. 

A  direct  calculation  shows  that,  in  the  single-task,  single  performance  measure  “standard” 
LEN  model  with  y  =  ma  +  £,  £  ~  A(0  ,  cr^),  the  equilibrium  action,  contract,  and  payoffs  can  all 
be  expressed  in  terms  of  rr?  /  This  suggests  itself  as  an  informativeness  measure  for  the 
performance  measure  y.  The  likelihood  ratio  is  L(  4>{y) ,  77 1  <3)  =  ma~^{y  -  ma)  and  its  variance  is 
which  shows  that  !  cr^  is  the  right  informativeness  measure. 

Consider  two  performance  measures  y,.  =  m,a  +  £,-,  i  =  1,2,  where  (cj ,  £2)  ~  V(0 , 2), 
var(£i)  =  of,  and  cov(£] ,  £2)  =  ^criO'2-  Then,  the  likelihood  ratio  of  (y^ ,  y2)  and  its  variance  are: 
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Uy\,y2\a)  = 


'^Uy^yi)  = 


1  I  >fh  _  ^^2 

\(j\  0-10-2 

1 


]iyi-mia)  + 


m2  ^m 
,  o^  0-20-1 


"■i/ 


1-^- 


m. 


L  o^  0-,  0-2  ^  o^ 


O'!  0-2  U2 

The  likelihood  ratio  and  its  variance  for  any  linear  aggregate  y  =  ayi  +  f3y2  are: 

am^  +  /3m2 


L{y\a)  = 


{a^cr^  +  2^a/3aia2  + 


1/2 


(y  -  (omi  +  /3m2)a) 


(24) 


2L(y)  = 


(ami  + 

Op'O^  +  2^aySo-i02  +  0^02 


(25) 


From  the  above,  it  is  straightforward  to  show  that  aggregation  is  generally  not  desirable  and  that 
optimizing  the  variance  of  the  likelihood  ratio  of  the  aggregate  system  in  (25)  gives  an  optimal 
linear  aggregate  with  the  same  variance  of  the  likelihood  function  as  in  (24).  The  relative  incentive 
weights  in  the  optimal  linear  aggregate  of  the  two  signals  are  a^l  0  =  {mia\ 
-  ^m2(Ti(T2)  /  (m20^-  ^miO-io-2).  Thus,  there  is  no  loss  from  aggregation  if,  and  only  if,  it  is  the 
optimal  linear  aggregation  characterized  in  Banker  and  Datar  (1989). 

The  linear  aggregation  of  Banker  and  Datar  (1989)  holds  for  the  LEN  model,  because  the 
contracts  are  linear  in  the  likelihood  ratios.  While  any  linear  contract  by  definition  linearly  aggre¬ 
gates  the  performance  measures,  an  optimal  LEN  contract  is  linear  in  the  likelihood  ratio,  and  the 
optimal  linear  aggregate  is  determined  by  the  likelihood  ratios  as  with  optimal  contracts. 


V.  CONCLUSIONS 

Comparing  information  systems  in  agencies  is  the  basis  for  managerial  performance  evalua¬ 
tion  purposes.  It  has  been  long  recognized  that  managers  attend  to  several  tasks,  but  we  know  how 
to  compare  information  systems  only  in  single-task  agencies,  such  as  in  Holmstrom  (1979)  and 
Kim  (1995).  Most  of  what  we  know  about  the  value  of  information  in  a  multi-task  setting  comes 
from  the  multi-task  LEN  models  of  Holmstrom  and  Milgrom  (1991),  Eeltham  and  Xie  (1994),  and 
Datar  et  al.  (2001).  While  LEN  provides  much  tractability  to  the  multi-task  environment,  questions 
remain  as  to  the  robustness  of  the  results  given  the  linear  contract  restriction. 

We  derive  information  ranking  results  for  multi-task  models  with  both  optimal  and  LEN 
contracts.  Eor  optimal  contracts,  we  generalize  the  second-order  stochastic  dominance  (SSD) 
condition  in  Kim  (1995)  by  showing  that  it  is  sufficient  to  hold  for  the  likelihood  ratio  weighted 
by  the  optimal  multiplier  on  the  incentive  compatibility  constraints.  In  general,  Blackwell  garbling 
implies  our  version  of  the  SSD  condition.  In  particular  cases  (square-root  agent  utility  or  normally 
distributed  performance  measures),  the  SSD  condition  can  be  replaced  by  the  simpler  variance- 
covariance  matrix  of  likelihood  ratios  (VCM)  condition  that  requires  the  ranking  of  the  variance- 
covariance  matrices  of  the  likelihood  ratios.  The  partial  ordering  of  information  systems  by  the 
VCM  condition  is  the  multi-task  generalization  of  the  complete  ordering  by  the  variance  of  the 
likelihood  ratio  in  Kim  and  Suh  (1991).  By  recasting  the  LEN  model  using  likelihood  ratios,  we 
find  that  the  same  VCM  condition  is  sufficient  to  rank  information  systems  with  LEN  models.  We 
use  the  VCM  condition,  for  normally  distributed  performance  measures,  to  prove  a  series  of 
results  that  hold  both  with  and  without  the  LEN  assumptions:  the  sufficient  statistic  results  of 
Holmstrom  (1979)  and  Eeltham  and  Xie  (1994)  and,  thus,  the  conditional  controllability  principle; 
the  value  of  more  precise  information;  and  the  optimal  linear  aggregation  of  performance  infor¬ 
mation. 
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While  we  use  the  VCM  condition  in  many  of  our  results,  we  do  not  claim  it  has  general 
applicability  in  ranking  performance  measures.  For  example,  in  Feltham  and  Wu  (2001),  the 
agent’s  effort  affects  the  mean  and  the  variance  of  a  normally  distributed  performance  measure 
(the  standard  deviation  is  a  linear  function  of  the  agent’s  effort).  In  this  case,  the  likelihood  ratio 
is  a  quadratic  function  of  the  performance  measure  and,  thus,  it  has  a  noncentral  Chi-square 
distribution  and  the  VCM  condition  no  longer  applies  (unless  the  agent  has  square-root  utility). 

LEN  models  have  been  appropriately  criticized  on  the  grounds  that  suboptimal  use  of  infor¬ 
mation  casts  doubts  on  the  results;  “The  restriction  to  linear  contracts  has  been  a  controversial  one 
in  the  agency  literature.  In  particular,  agency  theory’s  intellectual  roots  lie  in  information  econom¬ 
ics,  where  information  systems  are  compared  based  on  the  optimal  use  of  information  generated 
by  the  system.  Restricting  the  contract  to  be  linear  is  a  very  significant  philosophical  departure” 
Lambert  (2001,  29;  emphasis  in  the  original).  What,  then,  is  the  use  of  LEN  models? 

Our  derivation  of  the  optimal  LEN  contracts  in  terms  of  likelihood  ratios  and  the  validity  of 
the  VCM  condition  for  both  optimal  contract  and  LEN  models  contributes  to  our  understanding  of 
the  limits  of  using  LEN  models.  The  same  VCM  condition  ranks  information  systems  in  three 
particular  cases:  square-root  utility,  normally  distributed  performance  measures,  and  LEN  models. 
However,  this  does  not  imply  that  the  ranking  of  information  systems  in  all  three  cases  is  the  same. 
In  other  words,  preferences  over  performance  measures  derived  from  one  model,  say  LEN,  do  not 
necessarily  hold  for  other  models  and  vice  versa.  Because  any  result  on  the  value  of  performance 
measures  relying  on  the  VCM  condition  will  hold  in  all  three  settings,  we  can  shed  some  light  on 
when  conclusions  derived  from  a  LEN-based  analysis  have  more  general  validity. 

The  main  justification  for  the  use  of  LEN  models  remains  their  tractability.  Holmstrdm  and 
Milgrom  (1991,  24)  point  out  that  the  general  agency  models  “have  remained  too  intractable  to 
effectively  address  broader  organizational  issues  such  as  asset  ownership,  job  design,  and  alloca¬ 
tion  of  authority,”  but  at  the  same  time  “because  the  model  is  so  tractable,  we  shall  not  avoid  the 
temptation  to  stretch  its  use  somewhat  further  in  this  article,”  Holmstrom  and  Milgrom  (1991,  31). 
Tractability  and  the  associated  explicit  calculations,  if  pushed  too  far,  can  obscure  the  information 
economics  of  the  problem.  Our  derivation  of  the  VCM  condition  for  LEN  models  avoids  some  of 
the  temptations  of  tractability:  we  found  the  “right”  result  by  deliberately  staying  away  from  the 
usual  calculations  along  the  equilibrium  path. 

Hemmer  (2004,  150)  goes  further  in  criticizing  LEN  models:  “[t]he  problem  is  that  it  is 
impossible  to  know  whether  the  results  provided  by  the  LEN  framework  are  actually  consistent 
with  the  analysis  of  optimal  contracts  unless  one  does  the  latter  part  as  well.  But  since  the  LEN 
approach  often  is  used  where  the  analysis  of  optimal  contracts  seems  too  complicated,  such 
verification  is  rarely,  if  ever,  provided.  Moreover,  if  the  general  analysis  could  actually  be  done, 
there  would  be  no  reason  to  restrict  attention  to  linear  contracts  in  the  first  place.”  Arguing  that 
nothing  can  be  learned  from  a  LEN  model  is  akin  to  “throwing  out  the  baby  with  the  bath  water.” 
Although  we  present  the  general  results  first,  we  did  use  the  LEN  model  as  a  guide  to  find  them. 
The  key  was  to  tease  out  the  information  economics  lurking  behind  the  tempting  explicit  calcu¬ 
lations  of  LEN  models;  we  hope  this  approach  will  work  in  other  cases  of  interest. 

APPENDIX  A 

Proof  of  Proposition  1 

The  proof  follows  the  lines  of  the  proof  of  Proposition  18.6  in  Christensen  and  Feltham 
(2005).  We  formulate  the  principal’s  problem  of  implementing  a  given  action  a  by  choosing  an  ex 


See  also  Liang  and  Nan  (2008)  for  a  related  analysis  in  which  the  agent’s  action  may  affect  the  variance  of  a  normally 
distributed  performance  measure. 
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ante  optimal  randomization  strategy  with  probabilities  (q;,1  —  a)  over  two  information  systems 
(77^ ,  77')  used  to  compensate  the  agent  ex  post  with  an  optimal  contract: 


c{y\v)  = 


ClW 

Cliy) 


if  77=  77^ 
ii  rj-  rf' 


(Al) 


The  principal  optimizes  over  a  and  the  contract.  At  contracting  time,  s/he  announces  the  random¬ 
ization  parameter  a  (and  commits  to  it)  and  offers  the  agent  the  contract  in  (Al).  The  principal’s 
expected  utility  given  randomization  a  over  the  two  information  systems  and  the  contract  is: 


U’’{c,a\a)  =  aU^(c2,a\ 77^)  +  (1  -  a)U^{ci,a\ 77'), 


(A2) 


UP{ci,a\  V')  =  -  j  T/)dyi.  (A3) 

Knowing  the  randomization  probability  a  over  the  two  contracts,  the  agent’s  expected  utility  is: 
U“ic,a\a)  =  af/"(c2,a|77^)  +  (1  -  a)C/“(ci,a| 77'),  (A4) 

W{ci,a\ rf)  =  j  u(cfyi))k(a)(^(yila,  7f)dyi  -  v{a).  (A5) 

We  assume  that  the  first-order  approach  is  valid,  hence  the  agent’s  action  choice  is  constrained  by: 

SJ^U%c,a\a)  =  aS!^W{c2,a\v^)  +  (1  -  a)V JJ%c^,aW)  =  0.  (A6) 


V„t/‘'(ci,a|770 


J 


u{ci{y^)\k{a)'^  ^(t>{y,\a,  rj)  +  V^/c(a)0(y;|a,  fY^dy^  -  V„t;(fl) . 


(A7) 


Let  T]  denote  the  agent’s  lower  bound  on  expected  utility.  Then,  the  principal’s  problem  of  induc¬ 
ing  a  given  action  a  with  an  optimal  contract  and  an  optimal  randomization  strategy  is  character¬ 
ized  by  the  following  Lagrangian: 


£  =  aUP{c2,a\ 77^)  +  (1  -  a)UP{ci,a\r]^)  +  \\_aW{c2,a\v^)  +  (1  -  aWic^,a\ 77^)  -  [/] 

+  p.  ■  \_aSJ ^U^{c2,a\Tf')  +  (1  -  «)VaC/‘'(ci,a|  V)]- 


(A8) 


The  principal’s  optimization  problem  is  where  the  contracts  cfy^  are  in  the  set  of 

functions  bounded  from  below  by  c.  Optimizing  the  Lagrangian  pointwise  with  respect  to  Ci(yi) 
and  C2(y2)  for  the  common  optimal  muitipliers  \  and  p  yields  the  following  first-order  conditions 
for  the  optimal  compensation  contracts,  i  =  1,2: 


wicfy^))  =  • 


Xk{a)  +  p  •  \k{a)L{(f){y^,  Tf\a)  -t-  V  fk{a)] 

if  \k{a)  +  p  ■  \k{a)L{(f){y^,  ri\a)  +  ^  Jcia)']  >  w(c) 


w{c)  otherwise 


(A9) 


where  w(c)  =  1  /  m'(c)  is  the  agent’s  inverse  marginal  utility  of  consumption.  For  a  risk-averse 
agent,  the  marginal  utility  of  consumption  m'(c)  is  decreasing,  so  w{c)  is  an  increasing  function. 

The  principal’s  expected  utility  and  the  agent’s  expected  utilities  are  both  linear  functions  of 
a  so  the  optimal  a  is  a  corner  solution  in  [0,1];  that  is,  a  =  0  or  a  =  1.  Differentiating  the 
Lagrangian  with  respect  to  a  yields: 
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B  =  UPic2,a\ 7^)  +  \lJ\c2A  ^)  +  /^  •  V«t/"(c2,a|  rf-)  -  [.UP{c,,a\  V)  +  \U%c„a\ 77') 

+  //.•  Vat/“(ci,<3|V)]  =  J  [ci(}'i)-M(ci(>’i))m(3^i|a,  V)]0(}'ik>  -  J  [c2(3'2) 

where  we  substitute  in  the  definitions  of  t/^(c; ,  a  \  rf),  ,  a  \  ?/),  VQf/“(ci ,  a  |  77*),  we  collect 
terms,  and  we  denote; 


m{yi\a,  rf)  =  \k{a)  +  72  •  {k{a)L{(f){y  ^ ,  rf\a)  +  Va^(fl)) . 


We  now  show  that  5  >  0,  so  the  principal  optimally  chooses  a  =  1. 

The  optimal  incentive  contracts  only  depend  on  the  information  system  rf  and  the  signal  y, 
through  the  likelihood  ratio  functions  with  the  common  multipliers  X.  and  p.  Hence,  we  write  both 
optimal  contracts  as  c,(m)  =  cc{m): 


cc{m)  - 


w 


-1 


(m) 


if  m  >  w(c) 
otherwise 


(All) 


where  the  likelihood  measure  m  is  defined  as  m  =  \k(a)  +  /x.  •  [^(fl)L(^(y,) ,  7/  |  a)  +  V ^kiaf]  over 
its  entire  support.  Only  the  distributions  of  the  likelihood  measure  m  depend  on  rf, 
(f){m  \  a,  rf-)  ^  ^{m\a,  rf),  while  the  common  function  cc(-)  includes  the  truncation  due  to  the 
lower  bound  on  compensation.  We  define /(m)  =  cc{m)  -  u{cc{m))m,  so  we  can  write  B  as: 


5  =  J  f{m)(f){m\a,rf)dm-  ^  f{m)4>{m\a,rf)dm.  (A12) 

The  function /(•)  is  (strictly)  concave  (for  m  >  w(c)).  To  see  this,  we  differentiate: 

f'{m)  =  cc'{m)  -  [u\cc{m))cc' {rn)rn  +  u(cc(m))]  =  -  u(cc(m)),  (A  13) 

where  the  second  equality  follows  from  m  =  1  /  u'{cc{m))  for  m  >  w(c),  and  cc'{m)  =  0  for  tti 
<  w(c).  Furthermore,  note  that  /'(m)  =  -M'(cc(m))cc'(w)  <  0  follows  from  m'  >  0  and  cc'{m) 
>  0  for  m  >  w{c).  Finally,  for  likelihood  measures  m'  <  w{c)  such  that  c(m')  =  c,  we  get  that 
f{m')  =  -uic)>-uicc{m))  for  m  >  w(c),  that  is,  for  cc(m)  >  c.  Hence,  the  function /(•)  is 
concave  over  the  full  support  of  the  likelihood  measure  m,  that  is,  over  the  interval  (-0°,  »),  and 
strictly  concave  over  the  interval  [w(c) ,  °o). 

The  final  step  is  to  note  that  if  $(m  |  a ,  77')  second-order  stochastically  dominates 
<i>{m\a,  rf)  and  the  two  distributions  have  the  same  mean,  then  for  any  concave  function /(•)  (be 
it  increasing  or  decreasing),  which  is  strictly  concave  over  some  range  of  nonzero  probability  at 
which  the  distributions  do  not  coincide,  it  holds  that,  see  Rothschild  and  Stiglitz  (1970,  Theorem 
2):^^ 


5  =  J  /(m)0(m|fl,  rf)dm  -  J  /(m)0(m|a,  rf)dm  >  0. 


(A14) 


□ 

The  following  lemma  is  useful  in  the  proof  of  Proposition  2  below;  see  Billingsley  (1995)  for 
the  probability  concepts  used  in  the  lemma  and  its  proof. 


Rothschild  and  Stiglitz  (1970)  only  prove  their  Theorem  2  for  distributions  with  compact  support,  but  the  result  holds 
for  general  distributions  under  additional  mild  regularity  conditions  (Hadar  and  Russell  1969;  Hanoch  and  Levy  1969). 
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Lemma  A.l:  Let  jc  e  R""  and  y  e  R"*  be  two  vectors  of  independent  zero-mean  random 


variables  defined  on  the  product  space  generated  by  the  probability 

spaces  (ft; ,  Ti ,  'P,)  for  the  JF,- measurable  random  variables  X;  and  y,-,  i  = 
1,  ...  ,m.  If  X;  second-order  stochastically  dominates  y,-,  for  all  i  =  1,  ...  ,m, 
then  a-x  second-order  stochastically  dominates  a-y  for  any  a  e  R"". 


Proof  of  Lemma  A.l 

Rothschild  and  Stiglitz  (1970,  Theorem  2)  show  that  x,  SSD  y,  if,  and  only  if,  there  exists  a 

P, -measurable  random  variable  S;  with  y;  =  X;  +  Si,  and  E[8;  |  xj  =  0.  The  cr-fields  in  the  prod¬ 
uct  space  are  independent;  thus,  if  X;  SSD  y,-,  i  =  1,  ...  ,m,  then  there  exists  a  P- measurable  vector 

of  independent  random  variables  s  e  R'”  with  y,=X;  +  Si  and  E[e,|x,]  =  0  for  /  =  1,  ...  ,m. 
Because  the  random  variables  y;  and  x,  +  s,  have  the  same  distribution  functions,  they  also  have 
the  same  characteristic  functions  defined  by  (pxit)  =  E[e‘'^] ,  t  e  R,  i.e.,  (Py{t)  =  (Px^+ei^t)-  Since  y  is 
a  vector  of  independent  random  variables,  the  characteristic  function  for  a-y  is  <Pa.yU) 
=  (Py^{af) . . .  cpy  Similarly,  x  +  s  is  a  vector  of  independent  random  variables,  so  we  have 

<Pa-{L)(t)  =  Since  (Pyit)  =  <Px,+£,.(0  implies  that  (p^.(ag)  =  <?;,.+«,( for 

all  fl;  e  R,  we  get  that  (Pa.y{t)  =  <Pa-{x+s)U)-  But  a-y  and  a-{x  +  e)  have  the  same  characteristic 

functions,  so  they  also  have  the  same  distribution  functions  (by  the  uniqueness  theorem  for 

d  ,  d 

characteristic  functions,  Billingsley  1995,  Theorem  26.2):  a-y  =  a|(x  +  8)  =  a-  x+  a-e. 

It  remains  to  show  that  E[a  •  s\a  •  x]  =  0.  This  is  clearly  satisfied  if  E[8;  |  a  •  x]  =  0  for  all  i. 
By  iterated  expectations  E[8;  |  a  •  x]  =  E[E[8;  |  x]  |  a  •  x].  Since  8,-  is  7y-measurable  and  Xj  is 
jFj-measurable,  8,-  and  Xj  are  independent  for  i  j.  Hence,  E[8;  |  x]  =  E[8j  |  x,].  The  result  now 
follows  from  the  fact  that  E[8;  |  x,]  =  0.  Hence,  there  exists  a  P-measurable  random  variable  z  = 

a-8  such  that  a-y  =  a-x+z,  where  E[z  1  a  •  x]  =  0.  It  then  follows  from  Rothschild  and  Stiglitz 
(1970,  Theorem  2)  that  a-x  SSD  a-y  for  any  a  e  R"".  □ 

Proof  of  Proposition  2 

Technological  and  stochastic  independence  imply  that  the  zero-mean  random  variables 
L.(y.  I  ai ,  77)  =  (paiyi  I  ’  v) !  I  ’  v)  independent  random  variables.  If  the  task-specific 
performance  measures  are  technologically  and  stochastically  independent  for  both  information 
systems,  then  the  likelihood  ratios  for  the  two  information  systems  are  vectors  of  independent 
random  variables  defined  on  a  common  product  space  as  in  Lemma  A.L  Hence,  if  L,(y;  |  a, ,  77^ 
SSD  L;(y;  I  ,  77^)  for  all  i,  it  follows  from  Lemma 'A.l  that  /x-L{y\a,  77O  SSD  -  L{y  \  a  ,  rf) 
for  any  /U,  e  R"*.  The  result  then  follows  from  Proposition  1 .  □ 

Proof  of  Proposition  3 

From  the  definition  of  likelihood  ratios,  it  follows  from  (7a)  that: 


(A15) 


where  cf>{y2  |  yi ,  a)  =  /j(yi ,  y2)0(y2  \  a) !  f^y  i\  a) .  Since  (7  a)  also  implies  that  J0(y2  |  yi ,  a)dy2 
=  1,  I  yi ,  a)  is  a  conditional  probability  density  function  with  respect  to  y2  for  any  given  y^ 
and  a.  Thus,  L((^(yi) ,  77’ |  a)  =  E[L((^,  77^  |  a)  1  y^ ,  a].  Define  the  random  vector  8 
=  L{cl){y2)  ,rf\a)-B[L{(p,Tf\a)\yua\  Then  L(0(y2) ,  ?7^  |  a)  =  L(<^(yi) ,  77' |  a)  +  8.  Since 
this  equality  holds  on  all  measurable  sets,  it  also  holds  in  distribution.  Hence,  for  all  /x  e  R  and 

all  a  e  A  we  have  that  fi  -  L{(f) ,  rf  \  a)  =n  -  L(4>,  7/ \a)  +  r- s. 
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It  remains  to  show  that  E[^t  •  s  |  •  L((^ ,  77’  |  a)]  =  0.  It  follows  from  iterated  expectations 

that: 

E[/7,  •  s\iJL  ■  L{(p,  rj^\a)'\  -  /x  •  E[E[£|yi,fl]|yU-  •  L{(f),  7j^\a)]. 

The  result  now  follows  from  the  definition  of  e: 

E[ebi,a]  =  E[L{(f)iy2),  rf\a)  -  E[L(0,  r^\a)\y^,a\\yua]  =  0. 

Hence,  there  exists  a  random  variable  z  =  p  e  such  that  ix-L{4>,  rj^  \  a)  = /i  ■  L( cf) ,  r/\a)+z, 
where  E[z  |  yu.  •  L(  </> ,  77*  |  a)]  =  0.  It  then  follows  from  Rothschild  and  Stiglitz  (1970,  Theorem  2) 
that  jx  ■  L{(t) ,  7]^  \  a)  SSD  /z  •  L( 0 , 77^  |  a)  for  all  p  and  a.  The  result  then  follows  from  Proposition 
1.  □ 

Proof  of  Proposition  4 

With  square-root  utility,  the  function /(m)  =  cc{m)  —  u{cc{m))m  is/(m)  =  -m^,  whenever  the 
optimal  compensation  is  interior.  Because  the  optimal  compensation  is  assumed  to  be  interior  over 
the  full  support  of  the  likelihood  measure,  (A14)  becomes  a  comparison  of  the  variances  of  the 
corresponding  likelihood  measures.  This  proves  the  first  result. 

To  determine  the  set  of  implementable  actions,  we  start  with  the  optimal  contract  (4)  for 
additively  separable  square-root  utility: 

=  \  + IX- -^{y\a,7j).  (A16) 

(p 

Multiplying  by  on  both  sides  of  (A16)  and  integrating,  we  have: 

j  V^VXy|a,77My  =  J  (x  +  7i- ^(y|a,77)j^^^^</>(y|a,77)Jy.  (A17) 

The  right-hand  side  is  'l,L{4>,ri  \  fl)p  and  the  left-  hand  side  is  jo{a)  by  the  incentive  compat¬ 
ibility  constraint,  thus  XL{(f>,  7i\a)ii=  An  action  a  is  implementable  if,  and  only  if, 

V^v{a)  is  in  the  image  space  of  'ZL{(f>,7i\a).  If  SL(0,77|a)  is  nonsingular,  then  77, 
=  1/2SL(^,  77 1  a)~^VaV(a). 

The  square-root  utility  function  also  allows  us  to  express  the  minimum  expected  compensa¬ 
tion  cost  from  (A  16)  as  a  function  of  the  variance-covariance  matrix  of  the  likelihood  ratio: 

E[c(y)|a,  77]  =  +  'ZL{(f),7i\a)ii  ■  fi.  (A18) 

With  square-root  utility  (assuming  the  optimal  compensation  is  everywhere  interior),  we  can 
solve  for  X  by  exploiting  the  binding  participation  constraint.  Integrating  (A  16)  and  using  the 
binding  participation  constraint,  we  have: 

X  =  J  'J^(f){y\a,')j)dy  =  j[v{a)  +  U].  (A19) 

Therefore,  X  is  a  constant  determined  by  the  induced  action  (and  the  reservation  utility).  Substi¬ 
tuting  X  and  p  into  the  expected  compensation  (A18),  we  have: 

E[c(y)|a,  77]  =  ^[(u(a)  +  Uf  +  SL(0,  77|a)"‘V^i;(a)  •  V„i;(a)],  (A20) 

which  is  a  direct  proof  that  the  expected  compensation  cost  of  implementing  an  action  a  is  lower 
whenever  the  variance-covariance  matrix  is  larger.  □ 
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Proof  of  Proposition  5 

Both  distributions  of  the  likelihood  measure  m  for  the  two  information  systems  and  irf-  are 
normal,  with  mean  \k{a) -k  jx-'^  Jc{d)  and  variances  k^(a)SL(0 ,  77^/^  • /r  and 
k^{a)XL{4>  ^  fkt,  respectively.  Hence,  the  distribution  0(m  |  a  ,  77')  second-order  stochasti¬ 
cally  dominates  the  distribution  0(m  |  a  ,  77^)  if,  and  only  if,  ,  77^)77-  •  /x  <  SL(^,  Tf)/X-  IX, 

which  is  the  case  if  1,L{4> ,  77^)  -  SL((/)  ,  77^)  is  positive  definite,  that  is,  the  VCM  condition  holds. 

Following  Proposition  3,  if  yj  is  a  Blackwell  garbling  of  y2,  the  distribution  function  of 
fx-  L{(f),  if  \  a)  second-order  stochastically  dominates  the  distribution  function  of  fx-  L{(f) ,  rj^  |  a). 
For  normally  distributed  yj  and  y2,  this  means  fx  ■  L{4> ,  rf- \  a)  has  a  higher  variance  than 
77,  •  L(^,  77^  I  a),  that  is,  ,  7^)77,  •  77,  >  ,  7707/- •  7x,  for  all  p.  Therefore,  SL(0,77^) 

-'ZL{4>  ,  7]^)  is  a  positive  semi-definite  matrix.  This  concludes  the  proof  that  ,  rf') 

^SL(0,77i).  □ 

Proof  of  Lemma  1 

Writing  explicitly  the  agent’s  certainty  equivalent  CE[^c(y)\a,7j\,  and  because,  with  linear 
contracts,  the  action  does  not  impact  the  variance  of  compensation,  the  individual  rationality 


constraint,  and  the  incentive  compatibility  constraint  become; 

j  c(y)0(y|fl,  77)7/y  -  jrJ  (c(y)  -  E[c(y)|a,  77])^</>(y|a,  77)7iy  - /c(a)  >  CEq,  (A21) 

V„CE[c(y)|a,77]  =  J  c(y)V„(^(y|a,  77)afy  -  V^/c(a)  =  0.  (A22) 

The  Lagrangian  for  the  principal’s  problem  is: 

£(c(y),X,/x)  =  -  E[c(y)|a,  77]  +  \(CE[c(y)|a,  77]  -  CEq)  +  77  •  V^CE(c(y)lfl,  77).  (A23) 

25 

The  first-order  condition  with  respect  to  c(y)  gives: 

-  ^(y|a,  77)  -I-  k[0(y|fl,  77)  -  r(c(y)  -  E[c(y)|a,  rj\)(}){y\a,  77)]  +  77,  •  V^0(yla,  77)  =  0.  (A24) 

Dividing  through  by  (f){y-a,rj)  gives  the  equivalent  condition: 

-  1  -I-  k[l  -  r(c(y)  -  E[c(y)|a,  77])]  +  77,  •  L(<^(y),  77|a)  =  0.  (A25) 

Integrating  (A24)  with  respect  to  y  gives  \  =  1  because: 

I  Va</)(y|a,77)7/y  =  0  and  I  (c(y)  -  E[c(y)la,  77])</>(y|a,  77)7/y  =  0.  (A26) 


Here  we  exploit  again  the  fact  that  c(y)  is  normally  distributed.  Because  the  vector  space  C,  over  which  we  optimize, 
only  contains  linear  (affine)  functions  of  y,  an  additional  step  is  necessary  to  ensure  that  the  first-order  condition  with 
respect  to  c(y)  is  satisfied  pointwise  in  y.  Specifically,  one  calculates: 

dCtdciy)  =  d£(c(y)  H-  th(y))ldt  at  t  =  0  for  all  h{y)  e  C. 

This  gives: 

J  [_  1  +  X(1  -  r(c(y)  -  E[c(y)|fl,  77]))  +  A '  V„(?!>(y|fl,  y)/<?!>(y|a,  v)^h{y)<k{y\a,  v)dy  =  j  /(y)/j(y)</>(y|a,  v)dy  =  0, 

where /(y)  denotes  the  term  in  square  brackets.  Note  that/(y)  is  the  same  as  the  left-hand  side  in  (A25)  and  is  a  linear 
function  of  y  because  it  is  linear  in  c(y)  and  in  the  likelihood  ratio  L(cl>,vl  a),  which  in  turn  is  linear  in  y.  Substituting /z(y) 
=  1  and  k(y)  =  f[y)  in  the  integral  form  of  the  first-order  condition,  imply  that  E[/(y)]  =  var(/(y))  -  0,  and  since  fly)  is 
normally  distributed,  it  follows  that/(y)  =  0. 
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Substituting  X,  =  1  and  solving  for  c(y)  in  (A25)  yields; 

c(y)  =  E[c(y)|a,  77]  +  -  •  L(^(y),  77|a) .  (A27) 

r 

We  substitute  the  optimal  linear  contract  (A27)  into  the  agent’s  first-order  incentive  compatibility 
constraint  (A22)  to  obtain: 

f  -  ■  L{cf){y),rf\a)L{(l)(y),Tj\a)(f>{y\a,y)dy  =  XL((f),7j)-  =  (^28) 

Jr  r 

This  proves  part  (i)  of  the  Lemma.  There  exists  a  p  that  satisfies  (A28)  if,  and  only  if,  V ^Kia)  is 
in  the  image  of  the  linear  operator  which  is  the  same  as  the  image  of  the  linear  operator 

26  proves  part  (ii)  of  the  Lemma. 

In  what  follows,  assuming  action  a  is  implementable,  we  write  for  any  p  that  satisfies  (A28): 

—  eXL{(f),7f)~^VaK(a).  (A29) 

r 

When  '1,L{4>,7])  is  singular  and  p  is  not  uniquely  determined,  XL{(f) ,  77)"*  stands  for  the  inverse 
image  of  the  given  vector,  otherwise  it  denotes  the  inverse  matrix.  The  participation  constraint 
(A21)  is  binding  in  equilibrium,  and  we  can  use  it  to  substitute  the  expected  compensation  in 
(A27)  by  the  sum  of  the  compensation  risk  premium  and  effort  cost.  Using  (A27),  the  variance  of 
the  agent’s  compensation  is: 

var(c(y))  =  var(  —  •  L(</>,  77|a) )  =  SL(</>,  77)—  •  — .  (A30) 

Substituting  into  (A27)  gives  the  optimal  linear  contract  that  implements  a\ 

c(y)  =  CLq  +  K(a)  -t-  \rXL{(f),  rj)—  .  —  +  —  .  Li(f){y),7j\a).  (A31) 

r  r  r 

Thus,  the  principal’s  expected  compensation  cost  for  a  given  action  is  given  by: 

E[c(y)\a,  77]  =  CLq  -l-  K{a)  +  \r1,L{(j),  77)—  •  — .  (A32) 

r  r 

Substituting  p  determined  by  (A29)  in  (A32),  gives  the  optimal  cost  (16)  and  the  optimal  linear 
contract  (15)  for  implementing  action  a.  This  proves  part  (iii)  of  the  Lemma.  To  conclude,  note 
that  the  optimal  contract  (15)  and  the  expected  cost  (16)  for  inducing  a  are  both  uniquely  deter¬ 
mined  for  any  p  that  satisfies  (A29),  even  when  the  multiplier  p  is  not  uniquely  determined,  that 
is  when  a  is  implementable  but  ^L{<p,rj)  is  singular. □ 


To  see  this,  note  that  ,  rj)  =  M^~'M  =  A'^A,  where  A  =  X~^'^M.  Because  Im(AT4)  C  Im  and  rank(A^A) 

=  rank(A^,  it  follows  that  Im(A^A)  =  Iin(A^.  To  conclude,  note  that  Ini(A^  = 

Indeed,  let  fi'  be  two  multipliers  that  both  implement  the  same  action  a,  that  is  both  satisfy  (A28).  Then, 
rj){/j,-  fi')  =  0  implies  7j)fi  •  fji  =  l,L/u,'  ■  /x'  because: 

XL{(f>,  7])ix-  iJL  =  2L(<^,  77) /U,'  •  /X  =  /x'  •  77) ax  =  /x'  ■  2L(<7>,  77) /x'  =  SL(<^,  77) ax'  ■  ax'  • 

Furthermore,  77 1  a)  is  a  zero  mean  normally  distributed  vector;  this  implies  ( /x  -  ax')  •  T(  </> ,  77 1  a)  is  normally  distrib¬ 
uted  and  has  mean  zero;  finally,  var((Ax- ax')  •  T(<A,  77I  a))  =2L((7>,  77)(ax- ax')  •  (ax- Ax')  =  0  implies  (ax 
-  fji')  ■  L((p ,  77la)  =  0  because  a  normally  distributed  random  variable  with  zero  mean  and  variance  is  zero  (almost) 
everywhere.  By  (A31),  the  optimal  contract  that  induces  a  is  the  same  for  both  p  and  ax'. 
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Proof  of  Lemma  2 

The  proof  is  in  two  steps.  In  the  first  step,  we  show  that,  if  A  >  5  >  0  are  two  symmetric 
semi-positive  definite  matrices,  then  Im(5)  C  Im(A). 

We  first  show  that  Ker(A)  C  Ker(5).  For  any  vector  x  e  Ker(A),  we  have  Ax  =  0,  then  Ax-x 
=  0.  The  condition  A  >  B  >  0  implies  Ax  x  >  Bx  x  >  0,  so  we  have  Bx-x  =  0.  Because  B  is 
semi-positive  definite,  there  exists  a  square  root  yjB.  It  follows  that  0  =  Bx  •  x  =  a/Bx  •  'Jbx  and,  thus 
Vbx  =  0.  Hence,  we  have  VbVbx  =  Bx  =  0. 

A  vector  x  belongs  to  the  kernel  of  a  matrix  if,  and  only  if,  x  is  orthogonal  to  the  row  space 
of  the  matrix.  When  the  matrix  is  symmetric,  x  is  also  orthogonal  to  the  column  space  of  the 
matrix,  which  is  the  image  of  the  associated  linear  operator.  Therefore,  because  B  is  symmetric,  x 
G  Im(B),  if,  and  only  if,  x  _L  Ker(B).  Because  we  have  established  that  Ker(A)  C  Ker(B),  then  x 
_L  Ker(B)  implies  x  ±  Ker(A)  and  x  e  Im(A).  This  completes  the  proof  of  the  first  step:  if  x  e 
Im(B),  then  x  g  Im(A). 

For  the  second  step,  recall  that  A(  if)  =  {a  e  S'”  |  e  Im(XL(  f ,  ?/))}.  If  a  e  A(  77^)  and 

rf-)  >  rf),  then  V^/<:(a)  g  lm{XL{cf) ,  rf))  C  Im(SL(^,  if)).  Thus,  a  g  A{rf), 

which  concludes  the  proof.  □ 

Proof  of  Proposition  6 

For  two  information  systems  77^,  rf,  the  actions  that  are  implementable  under  both  are  char¬ 
acterized  by: 


AW)T^A{7f)  =  {a  G  R'”|V^/f(a)  g  Im(SL(0, 77*))  n  Im(SL(0, 77^))} 

=  {a  G  W^\V^K{a)  G  Im(M[)  n  Im(M^)}. 

In  particular,  if  the  row  spaces  of  the  sensitivity  matrices  M-^  and  M2  are  equal,  the  actions  that  can 
be  implemented  with  both  information  systems  are  the  same.  In  this  case,  the  restrictions  of 
SL(0,77O  and  %L{(f),if)  to  the  subspace  Im(SL(0 ,  77^)  =  Im(SL(</> ,  77^))  =  Im(M[) 
=  Im(M2)  are  both  non-singular.  We  can  then  use  the  following  lemma  from  Goldberger  (1964, 
38): 

Lemma  A.2:  If  A,B  are  positive  definite  matrices  of  the  same  order  and  A  >  B,  then  B“^ 
>  A“'. 

Applying  Lemma  A.2  in  comparing  the  expected  compensation  cost  (16)  of  implementing  the 
same  action  a  proves  the  proposition.  '  Q 

Proof  of  Lemma  3 

Any  implementable  action  a  is  determined  by  the  multiplier  p  from  the  optimal  contract  that 
implements  a  as  follows  (see  (A28)): 


a  =  %L{(f),7f)  —  .  (A33) 

r 

The  principal’s  expected  surplus  from  inducing  a  with  an  optimal  contract  is: 

UP  =  b-a-  E[c(y)|a,  7j\  =  b-a-  CEo  -ja-a-  \rl,L{f,  77)^  •  ~ .  (A34) 

Recall  that  the  principal’s  expected  surplus  when  implementing  a  is  well  defined,  even  if  1,L{(f),if) 
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is  a  singular  matrix  and  |a.  is  not  uniquely  determined  by  (A28)?^  Substituting  (A33)  in  the 
principal’s  expected  surplus  gives: 

W  =  b-  XLicf>,  v)^  -  v)-  •  XL((/>,  v)- -  hr%L{cl>,  v)-'--  CEq.  (A35) 

r  r  r  r  r 

Differentiating  with  respect  to  p  gives  the  following  first-order  condition  with  respect  to  p: 

r/)b  -  rjf-—  -  rj)—  =  0.  (A36) 

r  r  r  r 

Rearranging  terms  and  solving  for  XL{(f),rj)[i/r  gives: 

a  =  77)—  =  77)  -I-  rf)b,  (A37) 

r 

where  I  denotes  the  m  X  m  identity  matrix.  While  p  may  not  be  uniquely  determined,  jx 
=  (SL(^,  77)  -I-  riy'^b  does  satisfy  the  first-order  condition,  and  the  corresponding  induced  action 
is  uniquely  determined.^^  Finally,  using  (A37),  we  can  explicitly  calculate  the  principal’s  expected 
surplus.  To  complete  the  proof,  it  remains  to  show  (18).  We  use  (A37)  to  substitute  into  the 
principal’s  expected  surplus: 

UP  =  b-  (SL((^,  77)  -I-  r/)“*SL(0, 77)^7  -  rj)  +  r/)"*SL(0,  Tj)b  ■  rj) 

+  r/)“'2L(</>,  rj)b  -  |r(SL(^,  77)  +  rI)~^XL{(t>,  rj)b  ■  ^  -  CEq  (A38) 

We  next  use  the  fact  that  (XLicf),  rj)  4-  r/)"*  and  'l^L{(f),rj)  are  symmetric  and  commute  to  further 
simplify  the  third  term  in  (A3  8): 

UP  =  (SL(^,  rj)  +  r/)"^SL(</),  rj)b  ■  \b  -  \{XL{(t),  rj)  +  r/)"^SL(0, 77)^7]  -  \r{%L{(t),  rj) 

+  rl)~^b  ■  (SL(</),  77)  -I-  r/)"^SL(</>,  rj)b  -  CEq.  (A39) 

Finally,  factoring  and  simplifying  gives  the  principal’s  expected  surplus  as: 

UP  -  rj)  +  r/)“‘SL(^,  77)^7  •  \b  -  |(SL(0,  rj)  +  r/)“HSL(0,  rj)  r/)Z7]  -  CEq 

=  {{XUcj),  rj)  +  r/)-‘SL(0,  rj)b-b-  CEq  =  -  r{^L{cf>,  77)  +  rl)-^)b  •  b  -  CEq.  (A40) 

□ 


Proof  of  Proposition  8 

To  prove  the  proposition,  it  is  sufficient  to  show:  (i)  =>  (ii)  =^>  (iii)  ^  (iv)  =>  (v)  =>  (i).  (i)  =5- 
(ii).  Let  z  =  y2  “  E[y2  I  for  a  given  d.  The  assumed  parametrization  of  the  performance 

measures  implies  that  E[y2  1  }'i  >  <2]  =  M2a  +  S2iS7{(>’i  ~  Mid)  and: 


The  optimal  contract  is  well  defined  by  choosing  fj.!  r=('ZL{<f>,  7f)  +  r[)~^b,  because  any  other  multiplier  fi'  that 
satisfies  (A33)  has  the  property  that  r}){n-  fi')  =  0,  which  in  turn  implies  (/x- /r')  •  L(</) ,  t;]  a)  =  0.  The  matrix 

+  rl  is  always  non-singular  because  it  is  positive  definite.  The  reason  is  that  XL{(f>,r})  is  non-negative  definite, 
and  rl  is  positive  definite. 

This  already  exploits  the  particular  structure  of  the  Feltham  and  Xie  (1994)  model,  where  the  agent  has  preferences  over 
how  effort  is  allocated  among  tasks.  While  in  general  a  may  not  be  uniquely  determined  by  the  first-order  condition 
r=V^K{a),  this  is  not  a  problem  when  K(a)  =  ^a-a  because  V„/c(a)  =  a  in  this  case.  The  specification 
Kia)  =-\a-  a  replaces,  through  a  simple  linear  re-scaling,  without  loss  of  generality,  any  quadratic  effort  cost  of  the  form 
K{a)  =  Ka  a  where  K  is  positive-definite. 
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—  (M2  —  S2iSi}Mi)(a  —  a)  +  82  ~  ^21^11®!  •  (A41) 

We  start  with  the  observation  that  y2  has  no  incremental  value  over  3^1  if,  and  only  if,  z[a)  has  no 
incremental  value  over  yj.  This  is  true  because  any  contract  that  can  be  written  on  (yj ,  y2)  can  be 
equivalently  written  on  (z(a)  ,yi)  and  vice  versa.  By  construction,  z  is  independent  of  yi,  thus 
0(yi ,  z  \  a  ,d)  =  <f{yi  \  a)(f>{z  \  a  ,d)  and  the  likelihood  ratio  is  additively  separable: 

L{(f)\yi,z{d),a)  =  L{(/)\yi,a)  +  L{(f)\z{d),a) .  (A42) 

The  likelihood  ratios  are  linear  in  the  independent  normally  distributed  random  variables  yj  and 
z{d),  and  are  independent  normally  distributed  random  variables  as  well  with  additively  separable 
variance-covariance  matrices: 

SL(^|y i,z(fl))  =  2L((/)|y  1)  -l-  XL((f)\z{d)) .  (A43) 

Our  assumption  that  y^  has  no  incremental  value  over  yj  implies  that  z(d)  has  no  incremental 
value  over  y^.  Thus,  |  z(d))  =  0;  otherwise  'hL{4>  \  yi ,  z{d))  >  SL((/>  |  y^)  implies  (by  Corol¬ 
lary  5)  that  zid)  has  incremental  value  over  yi  and  contradicts  our  assumption.  Furthermore, 
z{d))  =  0  implies  that  L{c[>\  z{d) ,  a)  =  0  because  it  is  a  normally  distributed  variable  with 
mean  and  variance-covariance  equal  to  zero.  This  is  only  possible  if  V^cf>{z{d)  |  a)  =  0  for  all  a, 
which  implies  M2  -  ^2^n^\  -  0  ^^d  proves  the  desired  result  with  Q  =  S2iS7}- 

(ii)  =>  (hi)  From  E[y2  |  yi ,  a]  =  M2a  +  S2iS7j (yi  -  M^a)  and  the  assumption  that  M2  =  QM^, 
S21  =  QXn,  we  have  E[y2  |  yi ,  a]  =  22iS7jyi.  Thus,  we  can  write: 

Ji  =  E[y2|yi,fl]  +  y2  -  E[y2|yi,a]  =  S2iS7jyi  +  e 
where  s  =  82  -  S2iS7jei  is  independent  of  yj. 

(iii)  =>  (iv)  It  is  sufficient  to  show  that  the  conditional  density  I  Ji  >  <2)  does  not  depend 

on  a  because  0(yi ,  y2  I  «)  =  (/»(>’2  1  Ji  ^  I  «);  because  the  signals  are  normally  distributed  and 

a  does  not  affect  their  variance,  it  is  sufficient  to  prove  that  E[y2  \y\,a]  does  not  depend  on  a.  But 
E[y2  I  Ji ,  «]  =  Qyi  +  E[e  I  yi ,  a]  =  Qyi  only  depends  on  y^  and  this  concludes  the  argument. 

(iv)  ^  (v)  A  direct  calculation  shows  that  L(</)  |  yi ,  y2  ,  a)  =E(^  |  yi ,  a)  for  all  a;  it  follows 
that  SL( 0 1  yi ,  y2)  =  SE( 0  [  y  1). 

(v)  =^>  (i)  From  Corollary  5,  the  principal  weakly  prefers  (yi,y2)  to  yj  because 
SL(</>|yi,y2)^XL(0|yi)  and  weakly  prefers'  yi  to  (yi,y2)  because  ^L{(f)\y^  ,y2) 
<  SL(</>  I  yi).  Thus,  the  principal  is  indifferent  between  the  two  information  systems  and  y2  has  no 
incremental  value  over  y^. 

To  conclude,  the  sufficient  statistic  condition  (iv)  implies  that  /i(yi,y2) 
=  </»(ji .  3^2  I  «)  /  4>^y\  I  «)  satisfies  the  Blackwell  garbling  conditions  in  (7).  □ 

APPENDIX  B 

In  this  appendix  we  examine  a  multi-task  setting  in  which  the  agent’s  action  choice  is  discrete 
and,  thus,  the  incentive  constraints  cannot  be  substituted  with  a  first-order  condition.  Our  approach 
in  characterizing  the  optimal  contracts  follows  that  of  Grossman  and  Hart  (1983).  We  show  that 
the  result  in  Proposition  1  still  applies  with  a  slightly  different  definition  of  the  likelihood  ratios. 
Specifically,  let  A  =  {oj  e  R'”}  be  a  finite  set  of  (multi-task)  actions  available  to  the  agent.  As  in  the 
proof  of  Proposition  1,  we  examine  the  principal’s  problem  of  implementing  a  given  action  a  e  A 
with  an  optimal  contract  by  choosing  an  ex  ante  optimal  randomization  strategy  with  probabilities 
(o:,l  —  a)  over  two  information  systems  {rf-  ,7]")  used  to  compensate  the  agent  ex  post  with 


The  Accounting  Review 


September  2010 
American  Accounting  Association 


1572 


Christensen,  §abac,  and  Tian 


contracts  C2  and  Cj,  respectively.  Knowing  the  randomization  probability  a  over  the  two  contracts, 
the  agent’s  expected  utility  is: 

U‘‘{c,a\a)  =  aU^{c2,a\rf')  +  (1  -  a;)t/“(ci,a| 77^),  (Bl) 


U%Ci,a\7f)  =  j  u{ci{yi))k{a)cf)iyi\a,7/)dyi- via).  (B2) 

The  agent’s  incentive  compatibility  constraints  are: 

U°ic,a\a)  >  U‘‘ic,aj\a),  for  all  fly  e  A,  (B3) 

with  associated  non-negative  Lagrange-multipliers  fij. 

The  principal’s  problem  of  inducing  action  a  with  an  optimal  contract  and  an  optimal  ran¬ 
domization  strategy  is  characterized  by  the  following  Lagrangian: 


C  =  aU^ic2,a\7f')  -t-  (1  -  a)U^ici,a\7/^)  +  k[aU‘^ic2,a\if)  +  (1  -  a)Wici,a\rj^)  -  U] 

+  M'j[(x[U‘‘ic2,a\rf')  -  U‘'ic2,aj\'}f)]  +  (1  -  a)[U\cx,a\rj')  -  t/“(ci,fly| 77^)]].  (B4) 

As  in  the  proof  of  Proposition  1,  optimizing  the  Lagrangian  pointwise  with  respect  to  Ci(yi)  and 
^2(^2)  (for  the  common  optimal  multipliers  X  and  jjLj)  yields  the  following  first-order  conditions 
for  the  optimal  compensation  contracts  for  i  =  1,2: 


wiciiyi)) 


I  miyi\a,  rf)  if  miy\a,  rf)  >  w(c) 
[w(c)  otherwise 


where  we  define: 


(B5) 


miy\a,Tf)  =  \kia)+  X  lJij{kiaj)Li^iy^,Tt\a,a^  +  kia)  -  kia^)]. 


(B6) 


L{(f){y^,Tt\a,aj)  = 


i)  -  (l^(yi\ap  i) 

v') 


Differentiating  the  Lagrangian  with  respect  to  a  yields: 


(B7) 


B  =  f/^(c2,fl|  77^)  +  Xf/“(c2,fl|  77^)  +  27^)  -  27^)]  -  V) 

-I-  Xt/"(6'i,fl|  77’)  -f-  7Zy[f/“(C],fl|  77')  -  t/“(Ci,fly|  77')]]  =  j  [Ci(yi) 

-  M(ci(yi))m(yi|fl,  77')]<;6(yilfl,  77')Jyi  -  j  [c2(y2)  “  M(c2(.V2))w(>’2|a,  27^)]0(>’2|^i,  rr)dy2, 

where  for  the  second  equality,  we  substitute  in  the  definitions  of  f/^(c, ,  a  j  t/),  t/“(c,  ,a\rf)  and 
collect  terms. 

Denote  fiia)  =  i/a^jkiaj))a.^A^  U(f>iyi) ,  V  1 «)  =  (LitPiyi) ,  V  I  «  >  (^j))ajBA^  and  A^(fl)  =  (kia) 
-  /c(fly))a.eA-  Then,  the  likelihood  measure  is  m  =  Xk(fl)  -i-  /z(fl)  •  iLi4>iy^  ,  rf\a)  +  Ak(fl))  and: 
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(w  '(m)  if  m  ^  w{c) 
c  otherwise 


(B8) 


With  these  redefinitions  of  the  likelihood  ratio  and  the  likelihood  measure,  we  follow  the  same 
steps  as  in  the  proof  of  Proposition  1  and  we  obtain  its  discrete  equivalent  as  follows. 

Proposition  B.l:  Assume  the  agent’s  utility  is  either  additively  or  multiplicatively  separable 
in  compensation  and  effort.  If  a  is  an  implementable  action  under  two 
information  systems  and  rf',  then  there  exists  a  vector  p(a)  such  that,  if 
the  distribution  function  of  /ji(a)  ■  L{4> ,  \  a)  second-order  stochastically 

dominates  the  distribution  of  /u(a)  •  L(</> ,  77^  |  a),  then  the  principal  prefers 
if  to  ?7^  for  implementing  action  a. 

This  discrete  equivalent  of  Proposition  1  demonstrates  that  the  key  assumption  in  ranking 
information  systems  in  a  multi-task  agency  is  the  second-order  stochastic  dominance  of  p(a)-L  for 
the  likelihood  vector  L  defined  in  (B7).  In  general,  this  likelihood  vector  is  of  the  same  dimension 
as  the  number  of  actions  available  in  the  discrete  set  A.  If  we  eliminate  the  non-binding  incentive 
compatibility  constraints  (for  which  /Hj  =  0),  we  can  restrict  our  attention  to  a  likelihood  vector  of 
dimension  equal  to  the  number  of  binding  incentive  constraints. 

To  obtain  something  even  closer  to  our  Proposition  1,  we  assume  that  only  the  incentive 
compatibility  constraints  for  the  actions  “adjacent”  to  the  desired  one  are  binding.  To  define  the 
actions  adjacent  to  the  action  a  =  («!,...,  a„)  e  R'”  that  the  principal  wants  to  implement,  suppose 
that  the  action  set  A  is  rich  enough  so  that  a^=  (aj , ... ,  a^_i ,  ,  a^.+i , ... ,  a„,)  e  A  for  all  k  = 

1,  ...  ,m;  that  is,  it  contains  all  the  actions  obtained  by  substituting  the  effort  level  on  task  k,  a^,  by 
the  next  less  costly  one  a^,  while  maintaining  the  effort  levels  on  all  the  other  tasks.  The  actions 
dk  are  the  actions  adjacent  to  a  in  the  m  different  “directions”  determined  by  the  m  tasks.  If  we 
assume  that  all  the  incentive  compatibility  constraints  for  aj  e  A\{ai ,  ...,dj  are  not  binding, 
then  the  multiplier  p(a)  in  Proposition  B.l  is  an  m-dimensional  vector  as  in  Proposition  1. 

It  takes  a  number  of  strong  assumptions  to  ensure  that  only  the  adjacent  incentive  constraints 
are  binding  even  in  single-task  settings  (for  example,  spanning  or  convexity  of  distribution  func¬ 
tion  conditions,  such  as  in  Grossman  and  Hart  1983).  In  single-task  settings,  the  conditions 
sufficient  for  the  adjacent  incentive  constraints  to  be  binding  are  very  similar  to  those  that  are 
sufficient  for  the  first-order  approach  to  apply  (Christensen  and  Feltham  2005,  Appendix  17A).  It 
is  beyond  the  scope  of  this  study  to  derive  such  conditions  in  multi-task  settings.  However,  with 
only  two  possible  actions  (such  as  high  and  low),  there  is  only  one  incentive  constraint  and  that  is 
binding  by  assumption. 
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ABSTRACT:  This  study  focuses  on  the  persistence  and  market  value  implications  of  a 
subset  of  nonrecurring  charges  that  are  atypical  due  to  repeated  occurrence.  The  in¬ 
creased  recurrence  of  supposedly  nonrecurring  items  perhaps  reflects  managerial  shift¬ 
ing  of  (more  permanent)  ordinary  expenses  to  a  transitory  category  or,  alternatively, 
may  reflect  an  environment  where  these  items  naturally  occur  more  frequently.  Either 
scenario  suggests  that  these  repetitive  charges  have  future  earnings  implications  dra¬ 
matically  different  from  truly  nonrecurring  events  and  should  therefore  be  valued  more 
like  a  recurring  component  of  earnings.  Consistent  with  this  notion,  we  find  that  as  the 
frequency  of  reporting  negative  special  items  increases  (measured  by  the  presence  of 
multiple  prior  charges),  the  persistence  of  these  items  significantly  increases  with  re¬ 
spect  to  future  earnings.  Our  evidence  also  suggests  that  the  valuation  multiple  on  such 
charges  increases  with  frequency.  That  is,  the  market  values  “recurring  nonrecurring” 
items  more  like  the  other  components  of  recurring  earnings. 

Keywords:  nonrecurring  items;  special  items;  restructuring;  frequency;  persistence; 
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I.  INTRODUCTION 

Burgstahler  et  al.  (2002)  provide  evidence  consistent  with  the  conclusion  that  negative 
special  items  are  largely  transitory.  The  empirical  estimates  from  their  earnings  prediction 
model  confirm  that  negative  special  items  (NSIs)  are  more  transitory  than  the  remaining 
components  of  earnings.  Consistent  with  this  notion,  a  substantial  body  of  empirical  research  finds 
that  the  market  places  a  substantially  lower  valuation  multiple  (response  coefficient)  on  accounting 
income  components  classified  as  nonrecurring  (e.g.,  Strong  and  Walker  1993;  Elliott  and  Hanna 
1996).  This  lower  valuation  is  consistent  with  the  notion  that  investors  view  these  items  as 
finite-horizon  events  and  supports  the  accounting  reporting  policy  that  allows  material  nonrecur¬ 
ring  income  components  to  be  identified  separately  in  the  income  statement.  We  extend  this 
literature  by  investigating  the  persistence  and  market  valuation  of  a  subset  of  “nonrecurring”  items 
that  are  atypical  due  to  repeated  occurrence.’ 

The  financial  press  is  replete  with  articles  addressing  the  increased  incidence  of  repetitive 
“nonrecurring”  charges.  For  example,  according  to  one  Merrill  Lynch  analyst,  “special  items  are 
so  prevalent  now  that  they’re  not  special  anymore”  (Fowler  2006,  Al).  DiMartino  (2002,  Dl) 
writes,  “Corporate  America  has  been  reporting  so  many  nonrecurring  restructuring  charges  that 
it’s  tough  to  see  where  routine  operations  end  and  restructuring  begins”  (emphasis  added).  Barton 
Biggs,  former  chief  global  strategist  for  Morgan  Stanley,  stated,  “One,  maybe  even  two  ‘nonre¬ 
curring  charges’  are  understandable  and  possibly  bullish  for  a  company’s  stock  price,  because  they 
clear  the  decks  for  future  earnings  gains,  but  how  can  repeated  write-offs  be  nonrecurring  or 
extraordinary?  How  can  investors  in  the  future  believe  that  reported  earnings  are  for  real  and 
won’t  be  canceled  by  subsequent  write-offs?”  (Smith  and  Lipin  1996,  Al).  Hence,  it  is  of  particu¬ 
lar  interest  to  improve  our  understanding  of  the  persistence  characteristics  of  these  repetitive 
“nonrecurring”  charges  and,  more  importantly,  how  the  market  values  such  charges.^ 

Empirically,  we  show  that  the  frequency  of  prior  reported  NSIs  is  significantly  associated  with 
the  future  reporting  of  NSIs.  We  find  firms  that  report  no  NSIs  over  the  prior  12  quarters  report 
significantly  fewer  NSIs  over  the  subsequent  12  quarters  than  firms  that  report  more  than  four 
prior  charges.  We  also  estimate  earnings  persistence  models  that  relate  current  earnings  and  NSIs 
to  earnings  in  the  first  and  fourth  subsequent  quarters.  Our  evidence  suggests  that  as  the  frequency 
of  prior  reported  NSIs  increases,  the  persistence  of  these  items  with  respect  to  future  earnings  also 
increases.  Overall,  our  evidence  suggests  that  the  frequency  of  prior  NSIs  has  persistence  impli¬ 
cations  for  future  earnings  and  NSIs. 

Consistent  with  Elliott  and  Hanna  (1996),  we  find  in  a  regression  of  market  returns  on 
unexpected  earnings  and  NSIs  that  the  coefficient  on  unexpected  earnings  declines  as  the  fre¬ 
quency  of  prior  NSIs  increases.  More  importantly  for  our  analysis,  we  find  that  the  coefficient  on 
NSIs  increases  as  the  frequency  of  prior  reported  special  items  increases.  This  result,  contrary  to 
the  findings  of  Elliott  and  Hanna  (1996),  suggests  that  the  market  values  recurring  NSIs  as  more 


'  This  study  focuses  on  the  implications  of  negative  special  items  (NSIs)  rather  than  all  special  items  for  three  reasons. 
First,  most  special  items  are  income-decreasing.  Burgstahler  et  al.  (2002)  report  more  than  twice  as  many  NSIs  in  their 
sample  compared  to  positive  special  items.  Second,  our  examination  directly  extends  Elliott  and  Hanna  (1996),  which 
focuses  solely  on  NSIs.  Third,  in  our  own  analyses  of  positive  special  items,  we  find  no  evidence  of  empirical  regu¬ 
larities  meriting  further  investigation. 

^  In  addition,  standard-setters  have  also  expressed  concern  with  the  clarity  of  financial  statements  with  respect  to  the 
reporting  of  infrequent  or  unusual  items  (nonrecurring  items).  Reporting  of  these  items  has  generally  been  governed  by 
a  relatively  old  standard.  Accounting  Principles  Board  (APB)  Opinion  No.  30,  Reporting  the  Results  of  Operations 
(APB  1973).  While  it  appears  standard- setters  are  not  considering  new  guidance  on  how  these  items  are  repotted,  there 
is  substantial  interest  in  clarifying  the  implications  of  these  items.  For  example,  in  a  joint  project  of  the  FASB  and  lASB 
on  financial  statement  presentation,  the  preliminary  views  document  states,  “An  entity  should  present  as  part  of  that 
schedule  [a  schedule  that  reconciles  the  statement  of  cash  flows  to  the  statement  of  comprehensive  income]  infonnation 
about  amounts  related  to  an  unusual  or  infrequent  event  or  transaction”  (FASB  and  lASB  2008). 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


The  Persistence  and  Market  Valuation  of  Recurring  Nonrecurring  Items 


1579 


persistent  (less  transitory)  than  truly  nonrecurring  charges.  Although  the  coefficient  on  NSIs  in¬ 
creases  with  prior  reporting  frequency,  our  evidence  also  suggests  that  the  coefficient  on  unex¬ 
pected  earnings  uniformly  exceeds  the  special  items  coefficients  in  all  cases.  That  is,  even  if  a 
special  item  is  recurring  in  nature,  the  market  still  values  these  items  as  less  persistent  than  the 
other  components  of  earnings. 

We  further  extend  this  analysis  by  focusing  on  a  specific  type  of  NSI — restructuring  charges. 
Restructuring  charges  are  of  particular  interest  for  several  reasons.  First,  restructuring  charges 
commonly  reflect  a  specific  type  of  decision  that  is  not  only  of  established  interest  within  the 
academic  accounting  and  economics  literatures  (e.g..  Dechow  et  al.  1994;  Dial  and  Murphy  1995; 
Chaney  et  al.  1999;  Adut  et  al.  2003;  Clement  et  al.  2007),  but  are  also  of  broad  interest  in  the 
business  media  (e.g.,  Bleaikley  1995;  DiMartino  2002;  Smith  and  Lipin  1996;  Stewart  2002). 
Second,  because  restructurings  focus  on  firm  downsizing,  these  events  constitute  a  comparatively 
homogeneous  subsample  within  the  highly  heterogeneous  population  of  special  items. 

Consistent  with  the  results  on  special  items,  we  find  that  the  market  attaches  less  weight  to 
restructuring  charges  than  to  earnings  before  restructuring  charges,  the  coefficient  on  unexpected 
earnings  declines  as  the  frequency  of  prior  restructuring  charges  increases,  and  restructuring 
charge  coefficients  increase  monotonically  with  the  number  of  prior  reported  restructuring 
charges.  In  fact,  estimated  restructuring  charge  coefficients  beyond  the  second  in  sequence  become 
indistinguishable  from  the  coefficients  on  earnings.  Overall,  our  evidence  suggests  the  market 
values  repetitive  restructuring  charges  more  like  permanent  components  of  earnings  than  transi¬ 
tory  charges. 

Our  evidence  is  broadly  consistent  with  the  idea  that  managers  find  it  attractive  to  classify 
ordinary  recurring  operating  expenses  as  “special  items.”  The  prior  literature  (e.g.,  Lipe  1986; 
Elliott  and  Hanna  1996;  Francis  et  al.  1996;  Bradshaw  and  Sloan  2002;  McVay  2006)  suggests  that 
the  further  a  line  item  is  from  sales  on  the  income  statement,  the  less  weight  the  market  attaches 
to  the  individual  line  item.  Thus,  the  stockholders  of  the  firm  potentially  benefit  from  the  lower 
valuation  multiple  attached  to  NSIs  (which  are  generally  reported  further  from  sales  than  operating 
expenses).  Our  evidence  also  suggests  that  “nonrecurring”  items  have  become  more  persistent  for 
some  firms.  More  importantly,  the  market  appears  responsive  to  this  phenomenon,  in  that  such 
“recurring  nonrecurring”  charges  are  valued  more  like  ordinary  components  of  income.  In  addi¬ 
tion,  our  results  clearly  show  that,  as  firms  increasingly  report  NSIs  (restructuring  charges),  the 
persistence  of  adjusted  earnings  decreases  and  the  market  responds  to  this  lower  persistence  as 
evidenced  by  the  decreasing  valuation  on  earnings. 

Section  II  discusses  the  prior  related  literature.  Section  III  develops  the  research  design. 
Section  IV  presents  the  sample  selection  and  descriptive  statistics.  Section  V  presents  empirical 
results,  and  Section  VI  briefly  summarizes  and  concludes. 

11.  PRIOR  RELATED  LITERATURE 

The  business  media  and  analysts  typically  treat  restructuring  charges  as  nonrecurring  costs, 
commonly  referring  to  “earnings  before  restructuring  charges”  as  the  primary  measure  of 
earnings.^  Bricker  et  al.  (1995,  547)  report  that  analysts  focus  on  the  '"core  operating  results  of 
companies”  (emphasis  in  original)  and  that  such  analysis  “typically  involve(s)  the  removal  of 
items  from  continuing  operations  that  analysts  judge  are  nonrecurring”  (emphasis  in  the  original). 
Lain  and  Lim  (1997,  76)  state,  “these  so-called  special,  nonrecurring  losses  are  generally  ignored 


^  For  example,  the  New  York  Times  on  September  6,  2002  (Section  C;  Page  4,  emphasis  added)  reported,  “The  Procter  & 
Gamble  Company  said  yesterday  that  it  expected  earnings  before  restructuring  charges  to  increase  by  about  15  percent 
in  the  current  quarter  . . .  That  would  beat  Wall  Street  expectations.  Analysts  surveyed  by  Thomson  First  Call  expected 
core  earnings  per  share  of  $1.09,  up  13.5  percent  from  96  cents  a  share  a  year  ago.” 
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by  both  Wall  Street  analysts  and  shareholders,  they  are  viewed  as  isolated  one-time  events  (even 
when  companies  seem  to  claim  them  every  two  to  three  years).” 

Strong  and  Walker  (1993)  examine  the  association  between  annual  stock  returns  and  reported 
earnings  for  United  Kingdom  industrial  companies.  They  report  that  the  earnings  response  coef¬ 
ficient  on  “pre-exceptional”  earnings  is  significantly  greater  than  the  coefficient  on  “exceptional” 
earnings,  where  exceptional  earnings  are  defined  as  material  unusual  or  infrequent  items.  Simi¬ 
larly,  Parkash  (1995)  demonstrates  that  permanent  unexpected  earnings  exhibit  a  stronger  relation 
with  stock  returns  than  temporary  unexpected  earnings.  Elliott  and  Hanna  (1996)  find  that  large 
NSIs  induce  far  smaller  announcement  period  price  responses  than  do  earnings  before  NSIs.  These 
results  are  consistent  with  Kormendi  and  Lipe  (1987)  and  Collins  and  Kothari  (1989),  who  con¬ 
clude  the  market  response  to  earnings  is  positively  related  to  earnings  persistence. 

The  notion  that  special  items,  which  include  restructuring  charges,  are  transitory  is  central  to 
policy  perspectives  on  how  such  items  should  be  reported.  However,  consistent  with  anecdotal 
reports  (Bleakley  1995;  Stewart  2002;  DiMartino  2002;  Fowler  2006),  empirical  research  provides 
evidence  that  (1)  the  incidence  of  repeat  “nonrecurring”  charges  is  pervasive,  and  (2)  such  charges 
possibly  include  ordinary  income  components.  The  repetitive  nature  of  these  charges  is  evident  in 
at  least  three  prior  studies.  Of  the  91  firms  in  Dechow  et  al.  (1994),  45  report  multiple  charges 
over  the  eight-year  sample  time  period,  26  of  which  report  three  or  more  restructuring  charges. 
Similarly,  22  percent  of  the  large  write-off  firms  (i.e.,  negative  special  item  firms)  in  Elliott  and 
Hanna  (1996)  report  three  or  more  such  charges  over  the  prior  12  quarters.  Approximately  two- 
thirds  of  the  149  firms  in  Adut  et  al.  (2003)  report  a  temporally  proximate  restructuring  charge  (a 
second  charge  within  two  years).  Dechow  et  al.  (1994,  153)  report  the  degree  to  which  compen¬ 
sation  is  shielded  from  the  restructuring  charge  decreases  with  the  frequency  of  such  charges. 
They  interpret  this  finding  as  consistent  with  the  notion  that  such  charges  “are  more  likely  to 
contain  ongoing  operating  costs.”  Adut  et  al.  (2003)  refine  this  aspect  of  the  Dechow  et  al.  (1994) 
analysis  and  find  that  temporally  proximate  restructuring  charges  do  not  result  in  shielding.  That 
is,  repetitive  restructuring  charges  are  treated  like  other  operating  expenses  for  executive  compen¬ 
sation  purposes. 


III.  RESEARCH  DESIGN 

Overview 

In  a  market-based  study,  Elliott  and  Hanna  (1996,  136)  state,  “firms  might  transfer  normal 
components  of  operating  expenses  into  the  special  item”  as  an  underlying  rationale  for  why  such 
items  might  adversely  impact  the  market  valuation  of  income  before  special  items.  While  this 
makes  sense  for  earnings,  the  suggestion  that  repetitive  nonrecurring  charges  are  likely  to  contain 
regular  components  of  operating  expenses  leads  to  the  following  questions  related  to  special  items: 
If  normal  components  of  operating  expenses  are  shifted  into  special  items,  then  does  the  market 
recognize  this  and  attach  a  higher  valuation  to  these  costs?  Similarly,  if  seemingly  nonrecurring 
items  become  recurring  in  nature,  then  does  the  market  recognize  and  value  this  phenomenon 
accordingly?  Conversely,  is  the  market  either  ignoring  or  somehow  fooled  by  the  fact  that  these 
repetitive  costs  are  identified  in  the  financial  statements  as  nonrecurring  items?  The  existing 
empirical  evidence  suggests  the  latter  may  be  the  case.  Specifically,  Elliott  and  Hanna  (1996)  find 
that  the  response  coefficient  on  NSIs  becomes  insignificant  as  these  items  become  more  repetitive. 
Such  a  finding  contradicts  the  proposition  that  the  valuation  of  earnings  components  should  in¬ 
crease  as  these  components  become  more  persistent  (less  transitory).  This  proposition  is  even 
more  plausible  if  repetitive  charges  are  more  likely  to  contain  ongoing  operating  costs,  as  sug¬ 
gested  by  both  Dechow  et  al.  (1994)  and  Elliott  and  Hanna  (1996). 

Importantly,  our  analysis  complements  an  examination  of  how  the  market  valuation  of  NSIs 
varies  with  past  frequency  by  also  evaluating  how  special  item  persistence  varies  with  frequency. 
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If  the  market  is  valuing  NSIs  conditional  on  past  frequency,  then  the  observed  valuations  should 
correspond  with  the  associated  persistence  levels.  For  instance,  if  persistence  increases  with  prior 
frequency,  then  we  would  expect  market  valuation  to  increase  with  frequency  as  well.  A  lack  of 
symmetry  between  the  market  valuation  of  repetitive  NSIs  and  the  associated  persistence  proper¬ 
ties  would  imply  the  market  is  either  missing  or  ignoring  the  persistence  implications  of  special 
items. 

Our  analysis  also  provides  some  indirect  insights  about  the  underlying  circumstances  that  lead 
to  the  reporting  of  NSIs.  Such  reporting  depends  on  two  fundamentals:  (1)  the  occurrence  of  an 
unusual  or  infrequent  event  that  can  be  classified  as  a  NSI,  and  (2)  management  discretion  over  the 
reporting  of  NSIs.  Observed  NSI  frequency  for  a  firm  depends  on  both  of  these  factors.  Ceteris 
paribus,  firms  are  more  likely  to  repetitively  report  special  items  in  operating  environments  where 
such  events  are  common.  Similarly,  management  that  is  more  conservative  in  its  special  item 
designations  is  less  likely  to  report  repetitive  NSIs  than  management  that  is  more  aggressive  in  its 
definition  of  special  items.  Consequently,  observing  that  NSI  market  valuations  and  persistence 
effects  increase  with  past  frequency  is  consistent  with  either  the  notion  that  persistent  special  items 
are  an  underlying  characteristic  of  the  firm’s  operating  environment  or  that  high  frequency  is 
symptomatic  of  discretionary  special  item  reporting  (i.e.,  shifting  ordinary  persistent  operating 
items  into  “transitory”  special  items). 

We  attempt  to  distinguish  between  these  two  underlying  explanations  for  special  item  persis¬ 
tence  in  two  substantive  ways.  First,  we  incorporate  a  number  of  supplemental  control  variables 
that  reflect  operating  environment  differences  between  high-  and  low-frequency  NSI  reporting 
firms.  That  is,  we  explicitly  control  for  operating  environment  factors  shown  to  influence  special 
item  reporting  across  firms.  Second,  we  conduct  a  supplemental  targeted  analysis  of  a  specific  type 
of  special  item — restructuring  charges.  Nominally,  such  charges  should  be  purely  transitory. 
Hence,  there  should  be  minimal  variation  in  persistence  outside  managerial  misclassification. 
Consequently,  restructuring  charges  should  provide  a  cleaner  setting  for  evaluating  whether  man¬ 
ager  discretion  is  an  underlying  driver  of  NSI  persistence  and  its  related  valuation  effects. 

Persistence  of  Repetitive  Special  Items 

We  examine  persistence  by  starting  with  the  basic  persistence  relation: 

=  ao  +  a^ADJJNCu  +  1 ) 

INCii+j  is  quarterly  income  before  extraordinary  items  for  firm  i  in  quarter  t+j  (Compustat  data 
item'  #8),  where;  =  1  or  4,  scaled  by  the  market  value  of  equity  at  the  beginning  of  quarter  t;  RSft 
is  after-tax  quarterly  special  items  (Compustat  data  item  #32)  reported  by  firm  i  in  quarter  t,  SI^ 
is  RSIit  scaled  by  the  market  value  of  equity  at  the  beginning  of  quarter  r,  and  ADJJNCn  is 
quarterly  income  before  extraordinary  items  for  firm  i  in  quarter  t  (Compustat  data  item  #8)  less 
RSIit  scaled  by  the  market  value  of  equity  at  the  beginning  of  quarter  t.  The  SI  coefficient  in 
Equation  (1)  measures  the  average  persistence  of  special  items  related  to  next  quarter  s  earnings 


Our  analysis  cannot  distinguish  whether  such  discretionary  misclassification  is  intentional  or  unintentional.  For  example, 
a  firm  may  discover,  ex  post,  that  recurring  costs  were  mistakenly  reported  as  “nonrecurring”  items  m  pnor  years  when 
the  costs  should  have  been  included  in  SG&A.  Eastman  Kodak  is  a  perfect  example  of  this  notion.  Kodak  has  reported 
a  restructuring  charge  in  11  of  the  past  15  years.  These  costs  were  necessitated  by  the  changing  business  environment 
that  Kodak  operates  in  and  could  have  been  reported  as  operating  expenses.  In  fact,  IBM  expenenced  similar  restruc- 
turing  activity  in  the  1990s  and  ultimately  announced  it  would  no  longer  report  these  costs  as  restructuring  ch^ges 
because  they  represented  recurring  operating  costs  necessitated  by  IBM’s  competitive  environment  (Ziegler  1996). 

5  Pre-tax  values  are  converted  to  after-tax  values  by  multiplying  the  reported  pre-tax  value  by  one  minus  the  quarterly 
effective  tax  rate  measured  as  total  tax  expense  (Compustat  data  item  #6)  divided  by  pretax  income  (Compustat  data 

item  #23). 
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(?+l)  or  earnings  four  quarters  hence  (t+4).  We  also  expand  Equation  (1)  to  include  separate  slope 
coefficients  on  earnings  and  special  items  depending  on  the  frequency  of  prior  reported  special 
items,  as  follows: 

5  5  5 

=  2  IdkPREQJcit  +  2  auADJJNC*FREQ_kit  +  2  \kSI*FREQ_ki, .  (2) 

k=0  k=0  k=Q 

EREQ_kit  equals  1  if  firm  i  reported  k  special  items  in  the  12  quarters  prior  to  quarter  t,  where  k 
takes  on  values  of  0  through  5,  otherwise  0.  When  k  takes  on  the  value  of  5,  this  indicates  that  firm 
i  reported  at  least  five  special  items  in  the  12  quarters  prior  to  quarter  In  all  earnings  persistence 
regressions,  the  t-statistics  we  rely  on  are  based  on  robust- standard  errors  using  two-way  cluster¬ 
ing  by  firm  and  year  to  control  for  both  time-series  and  cross-sectional  dependence  (Gow  et  al. 
2010)7 

We  estimate  Equation  (2)  separately  for  positive  and  negative  special  items,  although  NSIs  are 
our  primary  focus.  We  estimate  separate  SI  slope  coefficients  (X-q  through  \5)  depending  on  the 
number  of  prior  reported  special  items.  If  the  increased  frequency  of  prior  special  items  signals 
that  these  recurring  nonrecurring  items  have  a  level  of  persistence  that  exceeds  truly  nonrecurring 
items,  then  the  coefficient  on  SI  will  be  significantly  greater  when  preceded  by  five  or  more  special 
items  than  when  no  prior  special  items  are  reported  (i.e.,  Xs  >  \q).  We  also  estimate  separate 
ADJ_INC  slope  coefficients  (ag  through  depending  on  the  number  of  prior  reported  special 
items.  If  the  repetitive  reporting  of  special  items  negatively  influences  the  persistence  of  adjusted 
earnings,  then  the  coefficient  on  ADJ_INC  when  preceded  by  five  or  more  special  items  will  be 
smaller  than  the  coefficient  on  ADJ_INC  when  preceded  by  no  prior  special  items  (i.e.,  0:5  <  ao). 

Market  Valuation  of  Negative  Special  Items 

The  valuation  aspect  of  our  analysis  is,  in  part,  an  extension  of  Elliott  and  Hanna  (1996). 
Hence,  we  begin  with  a  replication  of  their  analysis  by  estimating  the  following  equation: 

5  5  5 

EA_RETi,  =  2  PkFREQJiit  +  2  (Pk^INC*FREQ_ki,  +  2  S^NSI^EREQJi^,.  (3) 

^=0  k=0  k=0 

EA_RETn  is  the  two-day  earnings  announcement  period  [-1,0]  market-adjusted  stock  return  (ad¬ 
justed  for  value-weighted  index)  for  firm  i  in  quarter  t;  RNSIn  is  firm  i’s  tax-adjusted  negative 
special  item  (Compustat  data  item  #32);  NSI^  is  RNSIn  scaled  by  the  market  value  of  equity  in 
quarter  t-4;^  INCu^j  is  income  before  extraordinary  items  (Compustat  item  #8)  for  firm  i  in  quarter 
t-j',  and  AAINCii  (unexpected  earnings  before  NSIs)  is  measured  as  {INCn-  RNSIn 
-  INCii_4)  /  MVALii_^,  where  MVAE  is  market  value  of  equity.  All  other  variables  are  as  previously 
defined.  This  formulation  is  a  single  equation  equivalent  to  the  multiple  equation  method  em¬ 
ployed  by  Elliott  and  Hanna  (1996).  Based  on  their  analysis,  we  expect  that  the  earnings  response 
coefficient  is  higher  for  firms  with  no  prior  NSIs  than  for  firms  with  one,  two  or  three  prior  NSIs 
(i.e.,  (po  >  <Pi,  <Po  >  <P2^  and  cpQ  >  We  also  expect  that  NSI  coefficients  (4  through  4)  will  be 
smaller  than  companion  earnings  response  coefficients  (cpo  through  (p^). 


®  The  measurement  of  FREQ  depends  on  whether  the  test  focuses  on  positive  or  negative  special  items.  For  example,  in 
our  tests  of  the  persistence  of  NSIs,  we  measure  FREQ  as  the  number  of  prior  reported  NSIs. 

’  We  thank  Ian  Gow  for  providing  us  with  the  SAS  code  to  perform  the  two-way  clustering  procedure. 

We  define  NSL,  as  any  Compustat  quarterly  data  item  #32  that  exceeds  0.5  percent  of  total  assets  at  the  beginning  of  the 
quarter.  We  perform  sensitivity  analyses  employing  the  1.0  percent  of  total  assets  cutoff  in  Elliott  and  Hanna  (1996)  as 
well  as  all  NSIs  without  limitation.  Our  results  are  robust  to  these  alternative  specifications. 

®  Elliott  and  Hanna  (1996)  find  that  the  earnings  response  coefficient  (ERC)  for  no  write-off  observations  is  significantly 
greater  than  the  ERC  on  one,  two,  or  three  write-off  observations.  However,  they  find  no  significant  difference  between 
the  ERC  on  no  write-off  observations  and  the  ERC  on  four  and  five  or  more  write-off  observations. 
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Elliott  and  Hanna  (1996)  examine  the  two-day  announcement  period  price  response  to  NSls 
consistent  with  their  primary  focus  on  how  NSIs  impact  the  information  content  of  adjusted 
earnings.  On  the  other  hand,  we  are  directly  concerned  with  the  valuation  of  the  special  item. 
Elliott  and  Hanna  (1996,  153)  note,  “firms  reporting  large  write-offs  might  be  providing  more 
information  prior  to  the  earnings  announcement.”  This  notion  is  consistent  with  prior  research 
detailing  how  managers  pre-announce  bad  news  in  an  attempt  to  induce  downward  revision  in 
analyst  forecasts  (e.g.,  Soffer  et  al.  2000).  Material  NSIs  are  often  publicly  disclosed  prior  to  the 
earnings  announcement  date  (e.g.,  Elliott  and  Shaw  1988;  Lopez  2002).  In  an  examination  of 
1,220  restructuring  observations  in  our  sample,  we  find  over  65  percent  were  announced  four  or 
more  days  prior  to  the  earnings  announcement  date.  Such  “bad  news”  pre-announcements  poten¬ 
tially  compromise  the  ability  of  an  eamings-announcement  period  return  design  to  fully  capture 
the  market  response  to  the  charge.  The  use  of  this  short  return  window  potentially  biases  the 
coefficient  on  nonrecurring  charges  toward  zero.  In  order  to  ensure  that  we  capture  the  full 
valuation  impact  of  the  NSI,  our  primary  analysis  incorporates  a  quarterly  return  similar  to  Free¬ 
man  and  Tse  (1992).'^ 

Expanding  the  return  window  from  a  two-day  window  to  a  quarterly  accumulation  period 
weakens  the  ability  of  our  analysis  to  pinpoint  the  earnings  announcement  as  the  direct  source  of 
the  market  price  movement,  but  insulates  against  the  potentially  biasing  effect  of  NSI  pre¬ 
announcements.  We  employ  an  equation  identical  to  Equation  (3)  in  all  respects  except  that  the 
dependent  variable,  QRETfj,  is  defined  as  the  quarterly  market-adjusted  stock  return  for  firm  i 
accumulated  over  the  period  from  one  trading  day  after  the  earnings  announcement  date  for  the 
prior  quarter  (t-1)  through  the  earnings  announcement  date  for  the  current  quarter  (t).  We  modify 
Equation  (3)  as  follows: 

5  5  5 

QRETi,  =  2  (BkPREQ-kit  +  2  <pMINC*EREQ_ki,  +  2  S^NSI^EREQ_ku-  (3a) 

k=0  k=0  k=0 

Since  value-relevant  earnings  information  commonly  impacts  returns  prior  to  the  earnings 
announcement  (Kothari  and  Sloan  1992),  the  estimated  coefficients  for  Equation  (3a)  are  likely  to 
be  higher  than  those  for  Equation  (3).  Consistent  with  Elliott  and  Hanna  (1996),  we  anticipate  that, 
as  the  frequency  of  prior  reported  NSIs  increases,  the  decline  in  the  AAINC  coefficients  should 
remain  intact  (i.e.,  (pQ>  cp^,  (Po^  <P2^  ‘Po  ^  “Ps)  coefficient  on  AAINC  will  be  signifi¬ 

cantly  greater  than  the  coefficient  on  NSI  when  k  =  0  (no  prior  NSIs).  However,  we  have  no  a 
priori  expectation  of  the  relation  between  unexpected  earnings  (AAINC)  and  special  items  (NSI)  as 
the  sequence  of  reported  NSIs  increases.  Finally,  if  the  market  perceives  an  increase  in  special 
items  reporting  frequency  as  an  increase  in  the  persistence  effects  of  these  items,  then  the  coeffi¬ 
cient  on  NSI  will  be  increasing  in  k  (i.e.,  >  (5o). 

Market  Valuation  of  Restructuring  Charges 

Our  final  analysis  focuses  on  a  subset  of  special  items  identified  as  restructuring  charges.  We 
evaluate  how  the  market  values  adjusted  income  and  restructuring  charges  using  the  following 
variations  of  Equations  (3)  and  (3a): 


This  caveat  does  not  apply  to  an  evaluation  of  the  response  to  an  earnings  “surprise  contained  in  the  earnings 
announcement.  If  the  coefficient  on  income  (earnings  surprise)  is  of  primary  importance,  as  is  the  case  m  Elliott  and 
Hanna  (1996),  then  the  announcement  period  analysis  is  highly  appropriate.  ,  ^ 

“  Freeman  and  Tse  (1992)  select  a  quarterly  return  window  in  their  analysis  because  they  did  not  have  the  exact  date  of 
analysts’  forecasts.  They  indicate  that  use  of  a  shorter  window  that  excludes  a  portion  of  the  market  reaction  to  the 
analysts’  forecasts  will  result  in  a  downward  biased  ERC  (Freeman  and  Tse  1992,  189). 
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4 


4 


4 


EA_RETit  =  2  /3kFREQ_ki,  +  2  Ok^AINC_RC*EREQ_kit  +  2  JkRC^PREQJcn 
k=0  k=0  k=0 


(4) 


and 

4  4  4 

QRETit  =  2  l3kFREQ_kit  +  2  OkAAINC_RC*FREQ_kii  +  2  JkRC^FREQJiit.  (4a) 

fc=0  fc=0  Jt=0 

RRCjt  is  firm  i’s  tax-adjusted  restructuring  charge;  ^INC_RCii  is  unexpected  earnings  before 
restructuring  charges  for  firm  i  in  quarter  t,  measured  as  {INCn-  RRCn-  INCit-4)  I  MVALjf_4, 
where  MVAE  is  market  value  of  equity;  RCn  equals  RRC^  scaled  by  the  market  value  of  equity  in 
quarter  t-4;  and  FREQ_kn  equals  1  if  firm  i  reported  k  restructuring  charges  in  the  12  quarters 
prior  to  quarter  t,  where  k  takes  on  values  of  0  through  4.  When  k  takes  on  the  value  of  4,  this 
indicates  that  firm  i  reported  at  least  four  restructuring  charges  in  the  12  quarters  prior  to  quarter 
t.  All  other  variables  are  as  previously  defined. 

The  behavior  of  compensation  committees  with  respect  to  repetitive  restructuring  charges 
documented  in  Dechow  et  al.  (1994)  and  Adut  et  al.  (2003)  implies  that  7o  <  7i  <  72  <  73 
<  74  in  the  estimation  of  Equation  (4a).  That  is,  we  expect  that  the  response  coefficient  on 
restructuring  charges  will  increase  with  observed  frequency  of  prior  reported  restructurings.  For  9q 
through  64,  we  have  no  reason  to  anticipate  any  substantive  departure  from  the  findings  of  Elliott 
and  Hanna  (1996).  Hence,  we  expect  these  coefficient  values  to  be  positive  but  declining  in 
magnitude  as  the  frequency  of  prior  reported  restructuring  charges  increases. 

IV.  SAMPLE  SELECTION  AND  DESCRIPTIVE  STATISTICS 

Our  analysis  employs  three  separate  samples:  (1)  to  estimate  the  persistence  of  special  items, 
(2)  to  estimate  the  market  valuation  of  repetitive  special  items,  and  (3)  to  estimate  the  market 
valuation  of  repetitive  restructuring  charges,  a  more  homogenous  subset  of  NSIs.  For  our  first  two 
analyses,  we  employ  the  Compustat  Industrial  Quarterly  database  to  identify  the  special  item 
sample.  The  sample  of  special  item  observations  consists  of  firms  reporting  at  least  one  special 
item  (Compustat  data  item  #17)  between  1992  and  2003.  From  this  set,  we  identify  all  firm- 
quarters  with  available  Compustat  and  CRSP  data  necessary  to  estimate  the  regression  equations. 
The  earnings  persistence  tests  in  Equation  (2)  require  complete  data  for  17  quarters  surrounding 
quarter  t  (t- 12  through  t+A),  the  quarter  in  which  the  special  item  is  recognized.  We  then  eliminate 
as  an  outlier  any  observation  that  exceeds  the  cutoffs  proposed  in  Belsley  et  al.  (1980)  for  at  least 
three  of  four  outlier  metrics  (hat  matrix,  studentized  residual,  COVRATIO  statistic,  and  dfFITS 
statistic).  Our  data  requirements  and  sample  selection  criteria  result  in  a  test  sample  of  112,721 
firm-quarter  observations  for  the  estimation  of  Equation  (2).  To  perform  our  market  valuation 
tests.  Equations  (3)  and  (3a),  we  identify  all  firm-quarter  observations  from  1992  to  2003  with 
available  Compustat  and  CRSP  data  necessary  to  estimate  the  corresponding  equations  and  elimi¬ 
nate  outliers  consistent  with  our  earnings  persistence  tests  noted  above.  This  process  results  in  test 
samples  of  151,532  and  151,498  firm-quarter  observations  to  estimate  Equations  (3)  and  (3a), 
respectively.'^ 

The  restructuring  observations  in  our  sample  are  all  associated  with  corporate  downsizing. 
While  not  perfectly  homogeneous,  we  limit  our  restructuring  observations  to  downsizings  because 
we  believe  the  similarity  of  an  event  is  essential  when  seeking  to  understand  how  the  market 


The  difference  in  sample  sizes  for  Equations  (3)  and  (3a)  are  due  to  our  method  of  addressing  outliers.  We  also  ran  our 
tests  using  a  1  percent  cutoff  for  all  regression  variables  (resulting  in  the  same  sample  size  for  these  tests)  and  our 
primary  results  are  robust  to  this  alternative  method  of  addressing  outliers. 
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responds  to  a  series  of  sequential  events.  Our  sample  of  restructuring  observations  consists  of 
firms  reporting  at  least  one  restructuring  charge  between  1992  and  2003.  For  years  prior  to  1995, 
restructuring  observations  are  identified  by  searching  the  National  Automated  Accounting  Re¬ 
search  System  (NAARS)  using  the  search  strings  “restruct!,”  “unusual,”  and  “special.”  For  years 
between  1994  and  1998,  the  Lexis-Nexis  Academic  Universe  Business  News  database  is  searched 
using  the  same  search  strings.  We  then  review  the  annual  financial  statements  and  footnotes  of 
each  firm  identified  in  this  search  to  assure  each  firm  actually  reported  a  restructuring  charge 
related  to  downsizing.  This  search  and  review  yields  an  initial  sample  of  642  “restructuring”  firms. 
To  identify  quarterly  restructuring  observations  for  this  sample  of  642  firms,  we  review  the  10-Q 
reports  of  these  firms  for  every  quarter  between  1992  and  2003.  From  this  search,  an  initial  sample 

13 

of  2,278  quarterly  restructuring  observations  is  identified. 

We  estimate  cross-sectional  regression  models  using  all  quarterly  restructuring  charges,  re¬ 
turns,  and  earnings  data  available  for  the  period  from  1992  through  2003  (a  maximum  of  48 
time-series  observations  for  each  firm).  The  firm  must  report  at  least  one  quarterly  restructuring 
charge  from  1992  to  2003  and  have  Compustat  and  CRSP  data  necessary  to  estimate  all  equations. 
These  limitations  result  in  a  test  sample  of  1,220  quarterly  restructuring  observations  taken  by  382 
firms  for  purposes  of  estimating  Equations  (4)  and  (4a)  for  a  total  of  15,533  firm-quarters. 

Panel  A  of  Table  1  reports  descriptive  statistics  for  the  full  special  item  sample.  Mean  (me¬ 
dian)  total  assets  and  market  value  of  equity  are  $2.9  billion  ($215  million)  and  $1.9  billion  ($161 
million),  respectively.  The  mean  (median)  negative  special  item  is  3.2  (0.7)  percent  of  market 
value  while  the  mean  (median)  positive  special  item  is  2.2  (0.4)  percent  of  market  value.  Average 
quarterly  (earnings  announcement)  market-adjusted  return  is  0.37  percent  (0.43  percent).  The 
mean  quarterly  effective  tax  rate  (ETR)  is  25  percent  and  the  mean  operating  margin  is  1.9 
percent.  Einally,  our  sample  firms  have  an  average  of  1.8  reportable  operating  segments. 

Panel  B  of  Table  1  reports  mean  descriptive  statistics  for  those  firms  that  report  a  NSI 
conditioned  on  the  frequency  of  prior  reported  NSIs.  More  than  16  percent  of  our  NSI  observa¬ 
tions  report  no  NSIs  the  prior  12  quarters  (No  Prior  NSI),  while  more  than  a  third  report  four  or 
more  prior  NSIs.  The  No  Prior  NSI  observations  generally  experience  lower  returns  (both  the 
quarterly  and  short  windows,  QRET  and  EA_RET,  respectively),  are  smaller,  and  report  lower 
income  than  observations  with  multiple  prior  NSI  incidences.'^  In  fact,  income  {AINC  and 
AAINC)  and  market  return  (both  QRET  and  EA_RET)  means  for  all  groups  in  Panel  B  are  sig¬ 
nificantly  lower  (not  tabulated)  than  the  overall  sample  means  reported  in  Panel  A,  consistent  with 
prior  research  showing  that  firms  are  more  likely  to  report/experience  special  items  during  periods 
of  poor  performance  (e.g.,  Elliott  and  Shaw  1988;  DeAngelo  et  al.  1994;  Carter  2000).  However, 
Panel  B  suggests  a  performance  improvement  as  prior  NSI  frequency  increases.  That  is,  as  prior 
NSI  frequency  increases,  AINC,  QRET,  and  EA_RET  tend  to  become  less  negative,  while  the  two 
adjusted  income  measures  {AAINC  and  OPMARGIN)  become  positive.  Hence,  while  NSIs  are 
associated  with  poor  performance,  we  find  that  performance  improves  toward  a  more  normal  level 
as  the  frequency  of  reporting  NSIs  increases.  The  improved  (or  less  negative)  performance  level 


13  Our  search  for  restructuring  firms  is  limited  to  the  years  1992  through  1998.  We  then  search  the  10-Qs  for  this  sample 
for  every  quarter  from  Q1  1992  through  Q4  2003  to  identify  quarterly  restructuring  observations  for  the  642  firms 
identified  in  our  initial  search.  The  primary  reason  we  stopped  our  search  at  1998  is  the  significant  amount  of  hand- 
collection  of  data  required. 

In  an  untabulated  analysis,  we  estimate  descriptive  statistics  for  our  restructuring  sample.  In  general,  these  statistics  are 
quantitatively  and  qualitatively  similar  to  those  reported  here.  The  one  noteworthy  difference  is  restructuring  firms  are 
larger.  Mean  (median)  total  assets  for  the  restructuring  sample  is  $5.1  billion  ($482  million). 

Differences  between  the  More  than  four  NSI  means  and  the  No  Prior  NSI  means  in  Panel  B  are  significant  at  the  0.05 
level  or  higher  (not  tabulated)  for  all  variables. 
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for  high-frequency  NSI  firms  is  broadly  consistent  with  the  notion  that  these  charges  reflect 
ongoing  or  more  “normal”  activities  of  the  firms  in  this  category. 

Firm  size  also  increases  with  NSI  frequency  as  mean  assets  and  market  value  of  the  More 
than  4  NSI  group  are  approximately  four  times  those  of  the  No  Prior  NSI  group.  One  reason  for 
such  a  relation  is  that  larger  firms  tend  to  report  more  segments  than  smaller  firms.  In  multi¬ 
segment  firms,  special  item  disclosures  are  often  determined  on  the  basis  of  whether  a  charge  is 
material  at  the  segment  level.  Ceteris  paribus,  the  more  segments  a  firm  reports,  the  more  likely 
the  firm  will  report  a  special  item.  The  increase  in  average  number  of  segments  from  1.825  for  the 
No  Prior  NSI  group  to  2.180  for  the  More  than  4  NSI  group  is  consistent  with  the  number  of 
segments  contributing  to  the  frequency  of  NSI  reporting. 

As  prior  reporting  frequency  increases,  the  average  effective  tax  rate  (ETR)  decreases  mono- 
tonically  from  23.8  percent  down  to  19.1  percent.  Our  main  results  are  based  on  tax-adjusted 
NSIs,  and  may  be  sensitive  to  the  tax  rate  adjustment.  In  sensitivity  analyses,  we  find  our  results 
to  be  robust  to  the  use  of  pre-tax  NSI  values  as  well  as  to  several  alternative  tax  rate  adjustment 
methods.  Finally,  the  magnitude  of  NSIs  (measured  on  either  a  pre-tax  or  tax-adjusted  basis) 
declines  monotonically  as  prior  reporting  frequency  increases  with  the  sharpest  decline  at  the 
More  than  4  prior  NSI  category.  Such  a  decline  in  magnitude  suggests  that  firms  repeatedly 
reporting  NSIs  may  have  a  lower  materiality  threshold  for  NSIs  than  firms  that  report  less  fre¬ 
quently.  In  addition,  it  is  also  consistent  with  the  notion  that  some  firms  that  frequently  report  NSIs 
are  engaging  in  a  planned  multi-period  event  where  only  part  of  the  total  cost  of  an  event  is 
recognized  in  any  one  period. 

Panel  C  of  Table  1  reports  mean  descriptive  statistics  for  those  firms  that  report  a  positive 
special  item  {PSI)  conditioned  on  the  frequency  of  prior  reported  PSIs.  There  are  marked  differ¬ 
ences  between  the  positive  and  negative  special  item  subsamples.  There  are  noticeably  fewer 
recurring  PSIs  as  approximately  39  percent  report  no  PSIs  in  the  prior  12  quarters  while  only  11 
percent  report  four  or  more  prior  PSIs.  The  frequency-related  decrease  noted  in  ETRs  for  NSIs  is 
not  found  as  the  ETR  for  PSIs  remains  relatively  stable  at  25  percent,  regardless  of  prior  reporting 
frequency.  Unlike  the  NSI  sample,  we  find  that  adjusted  income  decreases  with  the  number  of 
prior  reported  PSIs.  Similar  to  the  NSI  sample,  firm  size  increases  with  the  frequency  of  prior 
reported  PSIs,  but  no  such  relationship  is  apparent  for  the  number  of  reportable  segments. 

In  a  supplemental  descriptive  analysis  reported  in  Appendix  A,  we  examine  NSI  properties  for 
a  randomly  selected  subsample  of  100  high-frequency  NSI  observations  (three  or  more  NSIs  over 
the  prior  12  quarters).  This  analysis  indicates  that  restructuring  charges  are  the  most  common,  but 
not  dominant,  type  of  NSI  reported  in  this  subsample  at  34  percent  of  the  total.'®  The  remaining 
reported  items  show  the  considerable  degree  of  heterogeneity  in  the  cross-section  of  high- 
frequency  special  items  that  makes  measurement  of  repeated  SI  effects  somewhat  problematic. 
This  analysis  also  shows  that  a  considerable  degree  of  heterogeneity  exists  within  the  time-series 
of  charges  for  these  firms.  Merger-related  costs  is  the  only  category  where  over  time  frequency 
rates  approach  50  percent  and  these  rates  are  under  30  percent  for  most  other  special  item  types. 

Appendix  B  reports  prior  reporting  frequencies  of  NSIs  by  industry  classification.  Industries 
with  the  greatest  frequency  of  NSI  recognition  are  business  services,  electronic  equipment,  com¬ 
puters,  banking,  and  pharmaceuticals,  respectively.  These  five  industries  represent  over  40  percent 
of  all  quarterly  NSI  observations  in  our  sample.  Because  there  is  evidence  of  substantial  industry 
clustering,  we  control  for  possible  industry  effects  in  our  empirical  testing. 


The  special  item  types  are  restructuring  charges,  merger-related  costs,  in-process  R&D,  asset  write-offs,  litigation  costs, 
goodwill  write-off,  asset  disposition,  and  other. 
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V.  RESULTS 

Persistence  Properties  of  Negative  Special  Items 

To  begin,  we  explore  the  persistence  properties  of  NSIs.  Specifically,  we  investigate  whether 
the  frequency  of  prior  special  item  recognition  relates  to  the  frequency  of  subsequent  recognition. 
We  perform  this  analysis  on  negative  and  positive  special  item  subsamples.  For  the  NSl  sub¬ 
sample,  16,430  observations  report  at  least  one  NSI  during  the  test  period  and  have  complete  data 
for  the  25  quarters  from  t-\l  to  t+12.  For  the  PSI  subsample,  5,294  observations  report  at  least 
one  PSI  during  our  test  period  and  have  complete  data  for  the  25  quarters  from  t-\2  to  t+12. 

Panel  A  of  Table  2  reports  results  for  the  NSI  analysis.  On  average,  sample  observations 
report  2.9  NSIs  over  the  12  quarters  subsequent  to  quarter  t.  For  NSIs,  the  results  suggest  the 
frequency  of  prior  reporting  is  significantly  related  to  the  frequency  of  subsequent  reporting.  The 
mean  number  of  subsequent  NSIs  for  observations  that  are  preceded  by  no  NSIs  over  the  prior  12 
quarters  is  1.9.  On  the  other  hand,  for  observations  preceded  by  five  or  more  prior  reported  NSIs, 
the  mean  number  of  subsequent  charges  is  4.5.  This  2.7  difference  in  subsequent  frequency  is 
significant  at  the  0.01  level.  We  also  investigate  the  magnitude  of  subsequent  NSIs  conditioned  on 
the  frequency  of  prior  reported  charges.  We  find  that,  for  observations  with  no  prior  reported  NSIs, 
the  aggregate  charges  over  the  subsequent  12  quarters  represent  an  average  of  7.6  percent  of  the 
market  value  of  equity  in  quarter  t.  However,  if  the  current  special  item  is  preceded  by  five  or 
more  prior  NSIs,  the  aggregate  charges  over  the  subsequent  12  quarters  represent  an  average  of 
14.9  percent  of  the  market  value  of  equity  in  quarter  t.  This  7.3  percent  difference  is  significant  at 
the  0.01  level.  Finally,  for  observations  with  no  prior  reported  NSIs,  the  aggregate  charges  over  the 
subsequent  12  quarters  represent  198  percent  of  the  charge  taken  in  quarter  t.  However,  if  the 
current  special  item  is  preceded  by  five  or  more  prior  NSIs,  the  aggregate  charges  over  the 
subsequent  12  quarters  represent  357  percent  of  the  charge  reported  in  quarter  t.  This  159  per¬ 
centage  point  difference  is  also  significant  at  the  0.01  level.  ITie  evidence  suggests  that  both  the 
frequency  and  magnitude  of  subsequently  reported  NSIs  are  significantly  associated  with  past 
reporting  frequency. 

Panel  B  of  Table  2  reports  results  for  the  PSI  analysis.  PSI  firms  report  1.5  PSIs  over  the  12 
quarters  subsequent  to  quarter  t,  approximately  half  of  that  reported  for  NSIs.  The  mean  number 
of  subsequent  PSIs  is  1.2  (2.3)  for  observations  that  are  preceded  by  no  (five  or  more)  PSIs  over 
the  prior  12  quarters.  This  difference  of  1.1  is  significant  at  the  0.01  level;  however,  it  is  consid¬ 
erably  smaller  than  the  NSI  difference  of  2.7.  The  aggregate  value  of  PSIs  over  the  subsequent  12 
quarters  represents  3.6  (5.9)  percent  of  the  market  value  of  equity  in  quarter  t  for  observations  that 
are  preceded  by  no  (five  or  more)  PSIs  over  the  pridr  12  quarters.  This  2.3  percent  difference  is 
marginally  significant  (p  <  0.10)  and  less  than  a  third  of  the  NSI  difference  of  7.3.  Finally,  the 
aggregate  value  of  PSIs  recognized  over  the  subsequent  12  quarters  represents  136  (252)  percent 
of  the  PSI  reported  in  quarter  t  for  observations  that  are  preceded  by  no  (five  or  more)  PSIs  over 
the  prior  12  quarters.  This  116  percentage  point  difference  is  significant  at  the  0.01  level,  but  again 
substantially  less  than  the  NSI  difference  of  159  percent.  Regardless  of  sign,  the  evidence  suggests 
that  both  the  frequency  and  magnitude  of  subsequently  reported  special  items  are  significantly 
associated  with  past  reporting  frequency,  but  the  association  is  stronger  for  NSIs. 

Table  3  reports  estimates  of  Equation  (2),  which  measures  the  persistence  of  current  earnings 
and  special  items  related  to  next  quarter  earnings  or  subsequent  seasonal  earnings  (fourth  subse¬ 
quent  quarter).  Panel  A  reports  NSI  estimates,  while  Panel  B  reports  PSI  estimates.  Table  3  also 
provides  estimates  for  an  augmented  version  of  Equation  (2),  which  introduces  controls  for  em¬ 
pirically  identified  determinants  of  repetitive  special  item  frequencies.  The  descriptive  evidence 
reported  in  Table  1,  Panel  B  as  well  as  prior  literature  (e.g.,  Francis  et  al.  1996;  Bens  and  Johnston 
2009)  suggest  that  several  factors  are  significantly  associated  with  the  recognition  of  a  negative 
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special  item/restructuring  charge.  In  a  supplemental  analysis  (not  tabulated),  we  identify  factors 
associated  with  negative  and  positive  special  item  reporting  frequencies.  We  find  that  firm  size, 
accounts  receivable  turnover,  PP&E  turnover,  change  in  ROE,  change  in  ROA,  industry  growth  in 
sales,  change  in  industry  ROA,  prior  year  annual  return,  prior  five-year  annual  return,  average 
five-year  change  in  the  book-to-market  ratio,  average  five-year  change  in  the  industry  book-to- 
market  ratio,  and  an  indicator  variable  for  the  presence  of  a  loss  in  any  of  the  three  prior  years  are 
significant  predictors  of  NSI  frequency.  We  also  find  that  firm  size,  profit  margin,  change  in  ROA, 
industry  growth  in  sales,  change  in  industry  ROA,  prior  year  annual  return,  prior  five-year  annual 
return,  average  five-year  change  in  the  book-to-market  ratio,  average  five-year  change  in  the 
industry  book-to-market  ratio,  and  an  indicator  variable  for  the  presence  of  a  loss  in  any  of  the 
three  prior  years  are  significant  predictors  of  PSI  frequency.  Our  empirical  specification  measures 
these  control  variables  as  decile  ranks  and  includes  them  as  both  main  effects  and  interacted 
variables  with  ADJ_INC  and  either  NSI  (Panel  A)  or  PSI  (Panel  B).^® 

The  third  and  fourth  (fifth  and  sixth)  columns  of  Table  3,  Panel  A  report  estimations  of 
Equation  (2)  with  quarter  t+\  {t+A)  earnings  as  the  dependent  measure,  without  and  with  control 
variables,  respectively.  The  results  without  controls  are  broadly  similar  for  both  future  earnings 
designations  (t-l-1  and  t+A).  The  coefficient  on  ADJ_INC  generally  decreases  as  the  frequency  of 
prior  NSIs  increases.  Eor  t-l-1  earnings,  iho  ADJJINCJJ  coefficient  (no  prior  NSIs)  is  0.608  which 
is  significantly  greater  (p  <  0.01)  than  the  0.580  coefficient  on  ADJ_INC_5.  Eor  t+A  earnings,  the 
ADJ_INC_0  coefficient  of  0.537  is  also  significantly  greater  than  the  ADJ__INC_5  coefficient  of 
0.422  (p  <  0.01).  When  frequency-related  control  variables  are  introduced,  the  income  coefficient 
magnitudes  drop  as  they  now  reflect  marginal  rather  than  total  earnings  effects.  Eor  the  t+\  and 
t+A  earnings  regressions,  the  largest  income  coefficients  eontinue  to  be  ADJ_INC_0  (coefficient 
estimates  of  0.062  and  0.001,  respectively).  In  the  case  of  t+A  earnings,  the  ADJ_INC_5  coeffi¬ 
cient  of  —0.055  is  significantly  smaller  (p  <  0.01)  than  iho  ADJ_INC_0  coefficient.  In  the  case  of 
t+\  earnings  the  difference  in  these  coefficients  lacks  significance  at  conventional  levels  (p  = 
0.142).  However,  the  intervening  coeffieient  estimates  (i.e.,  ADJ_INC_I  through  ADJ_INC_4)  are 
all  significantly  smaller  (p  <  0.01)  than  the  ADJ_INCJ)  coefficient.  Collectively,  these  results 
suggest  that  earnings  beeome  less  persistent  as  the  prior  reporting  frequency  of  NSIs  increases. 

With  respect  to  NSIs,  the  NSIjO  coefficients  are  significantly  smaller  (p  <  0.05  or  better)  than 
the  NSI_5  coefficients  in  all  four  estimations.  In  the  two  no-controls  estimations,  the  NSI_0 
coefficients  are  significantly  negative  (p  <  0.05),  suggesting  that  current  NSIs  not  preceded  by 
prior  NSIs  lead  to  higher  earnings  in  the  subsequent  quarter.  In  contrast,  these  same  eoefficients 
are  positive  and  significant  (p  <  0.01)  in  the  two  estimations  that  include  control  variables.  Thus, 
after  controlling  for  the  determinants  of  NSIs,  the  marginal  effect  of  a  current  NSI  is  a  decrease  in 
future  earnings.  Regardless  of  whether  it  is  a  main  effect  or  marginal  effect,  the  evidence  remains 
consistent  with  the  notion  that  the  persistence  of  NSIs  with  respeet  to  future  earnings  increases 
with  prior  reporting  frequency.'^ 

Table  3,  Panel  B  reports  estimations  of  Equation  (2)  for  PSIs.  The  results  with  respect  to  the 
earnings/frequency  interaetion  terms  when  estimating  without  eontrols  are  generally  consistent 


The  supplemental  analysis  is  available  from  the  second  author  upon  request.  The  variables  included  in  this  analysis  are 
defined  in  Appendix  C. 

We  also  use  continuous  versions  of  these  controls  in  additional  testing  and  the  results  are  quantitatively  and  qualitatively 
similar  to  the  tabulated  results. 

We  estimate  seasonal  earnings  difference  variations  of  Equations  (1)  and  (2).  The  results  of  these  additional  tests  (not 
tabulated)  are  quantitatively  and  qualitatively  similar  to  those  tabulated  in  Table  3.  In  addition,  we  used  two-way 
clustering  by  industry  and  year  to  control  for  the  industry  clustering  we  note  in  Appendix  B.  Our  primary  conclusions 
are  unaffected. 
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with  the  results  reported  in  Panel  A.  That  is,  the  persistence  of  earnings  decreases  with  increased 
reporting  frequency  of  PSIs.  In  general,  once  the  control  variables  are  included,  earnings/ 
frequency  interaction  terms  become  insignificant,  suggesting  that  PSI  reporting  frequency  has  little 
or  no  marginal  effect  on  earnings  persistence.  However,  unlike  the  results  in  Panel  A  for  NSIs,  the 
PSI  results  provide  no  evidence  of  a  relation  between  the  prior  reporting  frequency  of  PSIs  and 
their  persistence  effects  on  future  earnings.  Consistent  with  this  conclusion,  we  find  no  clear 
pattern  in  the  PSIJc  interaction  terms  based  on  frequency.  In  fact,  regardless  of  the  model  we 
estimate  (i.e.,  quarterly  or  seasonal  persistence  with  or  without  control  variables),  the  PSI_k 
variables  are  generally  insignificant  at  conventional  levels. 

Our  results  with  respect  to  frequency  of  prior  NSIs  and  earnings  persistence  triangulate  nicely 
with  the  market  results  of  Elliott  and  Hanna  (1996).  The  persistence  of  earnings  before  NSIs  (our 
result)  and  the  market  valuation  multiple  on  earnings  before  NSIs  (their  result)  both  decrease  with 
the  prior  reporting  frequency  of  NSIs.  In  contrast,  the  Elliott  and  Hanna  (1996)  results  suggest  the 
valuation  multiple  on  NSIs  decreases  with  prior  frequency,  whereas  our  analysis  suggests  that 
future  earnings  implications  of  NSIs  become  more  persistent.  The  increased  persistence  of  NSIs 
equates  to  lower  future  earnings,  since  the  special  items  are  entered  as  negative  amounts.  In 
addition  to  the  persistence  effects,  our  Table  2  results  suggest  that  increased  prior  reporting 
frequency  is  associated  with  a  higher  probability  of  reporting  future  NSIs  (both  in  frequency  and 
magnitude).  In  light  of  the  increased  persistence  characteristics  and  the  future  income-decreasing 
implications  of  repetitive  NSIs  documented  in  Tables  2  and  3,  the  Elliott  and  Hanna  (1996)  results 
with  respect  to  the  market  valuation  of  repetitive  charges  become  difficult  to  reconcile.  Is  the 
market  fooled  by  repetitive  NSIs?  Or  is  the  market  somehow  missing  the  future  earnings  impli¬ 
cations  of  repetitive  NSIs?  Alternatively,  is  there  something  missing  in  the  market  valuation 
models  of  prior  research?  We  investigate  the  latter  possibility  by  considering  several  alternative 
specifications  of  the  Elliott  and  Hanna  (1996)  model. 

Market  Valuation  of  Repetitive  Negative  Special  Items 

Table  4,  Panel  A  reports  regression  results  from  the  estimation  of  Equations  (3)  and  (3a)  for 
NSIs.^°  Panel  B  reports  coefficient  difference  tests  from  the  estimation  of  Equations  (3)  and  (3a). 
We  also  estimate  expanded  versions  of  Equations  (3)  and  (3a),  which  include  controls  for  other 
factors  associated  with  the  reporting  frequency  of  NSIs  (see  Appendix  C  for  control  variable 
definitions).  In  the  Equation  (3)  short- window  earnings  announcement  period  return  analysis  (col¬ 
umn  3,  No  Controls),  our  findings  are  generally  consistent  with  those  reported  by  Elliott  and 
Hanna  (1996).  The  series  of  AAINC_k  coefficients  declines  from  a  high  of  0.233  for  firm-quarters 
with  no  prior  NSIs  (in  the  12  quarters  prior  to  the  current  quarter)  to  0.099  for  firm-quarters  with 
five  or  more  prior  NSIs.  A  similar  pattern  also  holds  after  inclusion  of  the  control  variables  and 
associated  interactions  (column  4).  The  coefficient  on  AAINC_0  (no  prior  NSIs)  is  significantly 
greater  (p  <  0.01)  than  each  of  the  other  AAINC__k  coefficients  in  either  EA_RET  model  (test 
results  not  tabulated). 

With  respect  to  the  short-window  valuation  of  NSIs  without  controls  (Equation  (3)),  the 
coefficients  on  NSIJ)  through  NSI_4  each  significantly  exceed  zero  (p  <  0.06  or  better),  but  the 
coefficient  on  NS1_5  is  insignificant.  The  NSI_0  coefficient  of  0.039  exceeds  the  NS1_5  coefficient 
of  0.001,  consistent  with  a  decline  in  the  weight  the  market  attaches  to  NSIs  as  the  sequence  of 
these  items  increases,  as  reported  by  Elliott  and  Hanna  (1996).  However,  the  difference  between 


Given  the  comparison  to  Elliott  and  Hanna  (1996)  and  our  Table  3  results  that  show  little  or  no  relation  between  the 
prior  reporting  of  PSIs  and  the  earnings  persistence  effects  of  PSIs,  we  focus  the  remainder  of  our  analyses  solely  on 
NSIs. 
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Cready,  Lopez,  and  Sisneros 


these  variables  lacks  statistical  significance  at  conventional  levels  (p  =  0.119,  see  Panel  B). 
Interestingly,  when  the  control  variables  are  introduced  into  the  analysis,  this  pattern  reverses  as 
the  NSI_5  coefficient  significantly  exceeds  the  NSI_0  coefficient  (p  <  0.01).  That  is,  the  marginal 
effect  is  that  the  market  places  greater  weight  on  NSIs  where  there  is  a  prior  history  of  such 
charges  in  the  short  window  around  the  earnings  announcement. 

Columns  5  and  6  report  the  estimation  of  Equation  (3a),  where  we  expand  the  return  accu¬ 
mulation  period  to  a  quarterly  window.  Regardless  of  whether  controls  are  included,  these  results 
reveal  a  declining  pattern  of  AAlNCJc  coefficients  as  frequency  increases,  consistent  with  the 
Equation  (3)  results.  However,  the  NSI_k  coefficient  patterns  from  either  estimation  of  Equation 
(3a)  clearly  differ  from  the  results  reported  in  the  estimation  of  Equation  (3)  that  does  not  include 
controls  (i.e.,  the  replication  of  Elliott  and  Hanna  1996).  In  either  QRET  estimation,  the  largest 
special  item  coefficient  occurs  for  NSI_5  (0.382  for  the  No  Controls  estimation  and  0.614  for  the 
estimation  With  Controls)  and  the  seeond  largest  occurs  for  NSI_4.  In  Panel  B,  the  coefficient  on 
NSI_5  exceeds  that  on  NSI_0  in  either  estimation  of  Equation  (3a)  (significance  levels  of  0.007 
and  0.017,  respectively).  In  spite  of  the  significant  increase  in  the  coefficient  on  NSI  with  fre¬ 
quency,  the  AAINC_0  coefficient  exceeds  (p  <  0.01)  each  of  the  NSIJc  coefficients  in  every 
estimation.  In  addition  to  market  returns  adjusted  for  the  value-weighted  index,  we  estimate 
versions  of  Equations  (3)  and  (3a)  using  size-adjusted  and  beta-adjusted  returns  with  and  without 
controls.  The  results  of  these  additional  tests,  not  tabulated,  are  quantitatively  and  qualitatively 
similar  to  the  tabulated  results.  More  importantly,  our  conclusions  are  robust  to  the  use  of  these 
alternative  return  measures. 

By  evaluating  announcement  period  and  quarterly  return  windows,  we  confirm  the  Elliott  and 
Hanna  (1996)  conclusions  with  respect  to  earnings,  but  present  uniquely  different  evidence  with 
respect  to  the  valuation  of  repetitive  NSIs.  The  results  from  the  quarterly  return  window  clearly 
demonstrate  a  reduction  in  the  earnings  response  coefficient  associated  with  an  increase  in  the 
reporting  frequency  of  NSIs,  which  is  the  primary  focus  of  their  study.  More  important  for  our 
study,  the  quarterly  return  results  as  well  as  the  earnings  announcement  period  results  with  con¬ 
trols  are  clearly  inconsistent  with  the  notion  that  the  valuation  effects  of  NSIs  decline  with 
frequency.  These  results  suggest  the  market  prices  repetitive  NSIs  as  more  persistent  than  non- 
repetitive  charges  consistent  with  the  NSI  persistence  tests  we  report  in  Tables  2  and  3.^^ 

Interestingly,  Table  4  reports  that  the  Equation  (3  a)  NSI_k  coefficients  converge  to  a  magni¬ 
tude  roughly  equal  to  the  corresponding  AAINC_k  coefficients  once  the  number  of  prior  charges 
exceeds  three  (particularly  in  the  no-controls  estimation).  This  raises  the  possibility  that  NSI 
persistence  effects  rise  to  the  same  level  as  the  decreasing  earnings  persistence  effects  in  such 
settings.  On  the  other  hand,  the  Equation  (2)  no-controls  earnings  persistence  analysis  reported  in 
Table  3  does  not  support  such  an  interpretation,  as  the  NSI_k  coefficients  are  all  substantially 
smaller  than  the  associated  earnings  coefficients.  One  likely  explanation  for  at  least  part  of  this 
inconsistency  is  the  substantial  inter-period  expense  transfer  effects  documented  by  Burgstahler  et 
al.  (2002).  They  report  approximately  8.7  and  27.7  percent  of  the  quarter  t  NSI  is  expense 
transferred  from  quarters  t+1  and  t+4,  respectively.  If  NSIs  are  used  to  accelerate  future  period 
expenses  into  a  current  quarter  NSI,  then  this  would  result  in  a  downward  bias  on  the  NSI 


In  our  primary  tests,  we  measure  prior  reporting  frequency  based  on  the  number  of  NSIs  reported  in  the  prior  12 
quarters.  Our  primary  empirical  method  ignores  the  possibility  that  the  recurrence  of  any  special  item  (positive  or 
negative)  provides  information  affecting  the  valuation  of  a  current  special  item.  We  re-estimate  Equation  (3a)  by 
redefining  FREQ  as  the  number  of  prior  reported  special  items  regardless  of  sign.  The  results  of  these  additional  tests, 
not  tabulated,  are  quantitatively  and  qualitatively  similar  to  the  tabulated  results. 

We  ran  positive  special  item  versions  of  Equations  (3)  and  (3a).  The  results  reveal  no  evidence  that  the  market  places 
a  higher  multiple  on  repetitive  positive  special  items. 
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coefficients  in  earnings  persistence  regressions.  That  is,  in  terms  of  earnings  persistence,  one 
dollar  of  future  period  expense  accelerated  into  a  current  period  NSI  will,  ceteris  paribus,  increase 
future  earnings.  Indeed,  the  significantly  negative  NSIjO  coefficients  observed  in  the  no-controls 
estimations  from  Table  3  are  highly  consistent  with  the  presence  of  such  an  inter-period  transfer 
effect  in  our  data.  By  contrast,  the  marginal  persistence  effect  of  a  NSI  is  a  decrease  in  future 
earnings  since  all  of  the  NSIJc  coefficients  in  the  with-controls  estimations  in  Table  3  are  signifi¬ 
cantly  positive.  In  addition,  it  is  also  conceivable  that,  since  the  market  coefficients  in  Equation 
(3a)  are  estimates  of  the  market’s  perception  of  the  persistence  of  recurring  NSIs  relative  to 
changes  in  earnings,  the  lack  of  symmetry  between  the  market  results  from  Equation  (3a)  and  the 
actual  persistence  results  from  Equation  (2)  may  reflect  market  overestimation  of  the  persistence 
of  these  “recurring”  charges. 

Restructuring  Charge  Frequency  Analysis 

Table  5,  Panel  A  reports  regression  results  from  the  estimation  of  Equations  (4)  and  (4a) 
which  focus  on  restructuring  charges  rather  than  special  items.  The  AAINC_RC_k  coefficients  for 
both  equations  loosely  correspond  to  the  patterns  identified  by  Elliott  and  Hanna  (1996)  as  well  as 
in  our  own  Table  4  analysis  (i.e.,  a  decline  as  frequency  increases).  Eurther,  we  find  that  when 
firms  report  three  or  more  restructuring  charges  {AA1NC_RC_3  and  AAINC_RC_4),  the  earnings 
coefficients  lack  statistical  significance.  The  short-window  restructuring  charge  coefficients  from 
Equation  (4)  generally  lack  statistical  significance  regardless  of  reporting  frequency.  These  re¬ 
sults  underscore:  (1)  the  comparatively  small  sample  of  restructuring  events  in  play  here,  and  (2) 
the  notion  that  prior  disclosure  of  these  events  may  be  common. 

The  coefficients  of  central  interest  in  our  analysis,  however,  are  the  series  of  restructuring 
charge  coefficient  estimates  in  Equation  (4a).  With  the  expanded  return  accumulation  window,  we 
are  more  confident  of  capturing  the  value  relevant  information  in  restructuring  charges.  The 
coefficient  on  a  current  restructuring  charge  not  preceded  by  a  prior  charge,  RC_0,  is  -0.019  and 
insignificant.  On  the  other  hand,  the  coefficients  on  the  second,  third,  and  fourth  or  more  restruc¬ 
turing  charges  {RC_2,  RC__3,  and  RC_4)  are  all  positive  and  significantly  greater  than  zero  (p  < 
0.01).  In  addition,  the  coefficients  on  RC__2,  RC_3,  and  RC_4  increase  monotonically  from  0.366 
for  RC_2  to  0.637  for  RC_4. 

Table  5,  Panel  B  reports  tests  of  coefficient  differences  from  the  estimation  of  Equations  (4) 
and  (4a).  Eocusing  on  the  results  from  the  estimation  of  Equation  (4a),  we  find  that  the 
AAINC_RC_0  coefficient  exceeds  (p  <  0.10  or  better)  the  RCJ),  RC_1,  and  RC_2  coefficients. 
However,  we  find  no  statistical  difference  between  AAINC__RCJ)  and  the  RC_3  or  RC_4  coeffi¬ 
cients.  Also,  the  coefficient  on  RC_4,  the  restructuring  with  the  longest  sequence  of  prior  reported 
restructuring  charges,  exceeds  the  RC_0  and  RCJ  coefficients  (p  <  0.01),  but  not  RCJ  or  RCJ. 
Similar  to  the  results  from  the  estimation  of  Equation  (3a),  the  Equation  (4a)  estimates  are  incon¬ 
sistent  with  the  valuation  effect  of  repetitive  “nonrecurring  items”  declining  with  observed 
frequency. 

An  important  reason  for  employing  quarterly  return  windows  in  our  analysis  is  to  address  the 
possibility  that  NSIs  and  restructurings  are  announced  prior  to  the  earnings  announcement  date.  In 


The  sole  exception  is  the  coefficient  on  RC_2.  u  • 

In  untabulated  tests,  we  estimate  Equations  (4)  and  (4a)  with  the  additional  control  vanables  identified  as  being 
significantly  associated  with  NSI  frequency.  The  results  of  these  additional  tests  are  consistent  with  our  tabulated  results 
in  Table  5  In  addition  to  market  returns  adjusted  for  the  value-weighted  index,  we  also  estimate  Equations  (4)  and  (4a) 
using  size-adjusted  and  beta-adjusted  returns.  The  results  of  these  additional  tests,  not  tabulated,  are  quantitatively  and 
qualitatively  similar  to  the  tabulated  results  and  our  conclusions  are  robust  to  the  use  of  these  alternative  return 

measures. 
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fact,  we  find  that  over  65  percent  of  the  restructurings  in  our  sample  are  announced  four  or  more 
days  prior  to  the  earnings  announcement  date.  If  the  two-day  earnings  announcement  period  return 
window  utilized  by  Elliott  and  Hanna  (1996)  and  replicated  by  us  in  Equations  (3)  and  (4)  results 
in  downward  biased  NSI  (RC)  coefficients  as  we  suggest,  then  a  restructuring  announcement  return 
window  should  yield  results  qualitatively  similar  to  our  quarterly  return  results  from  the  estimation 
of  Equations  (3a)  and  (4a).  Accordingly,  we  estimate  the  following  modified  version  of  Equation 

(4): 

4  4 

RC_RETi,  =  2  PkFREQJiit  +  2  ykRC^EREQJcn.  (5) 

k=G  k=0 

RC_RETii  is  the  two-day  restructuring  announcement  period  [—1,0]  market-adjusted  stock  return 
for  firm  i  in  quarter  t,  and  all  other  variables  are  as  defined  previously.  Equation  (5)  excludes 
earnings  variables  because  this  model  focuses  on  the  market  return  around  the  announcement  of 
the  restructuring.  Restructuring  announcement  dates  are  identified  by  searching  Lexis-Nexis  for 
the  public  announcement  of  the  restructuring  for  the  1,220  observations  in  our  sample.  If  no 
separate  announcement  date  was  found,  then  we  assume  the  announcement  date  for  the  restruc¬ 
turing  is  the  earnings  announcement  date. 

Table  6  reports  results  from  estimating  Equation  (5).  The  coefficient  on  a  current  restructuring 
charge  not  preceded  by  a  prior  charge,  RC_0,  is  0.001  and  insignificant.  The  coefficients  on  RC_1 
and  RC_2  are  also  insignificant.  On  the  other  hand,  and  in  contrast  to  the  earnings  announcement 
return  test  results  reported  in  Table  5  (Equation  (4)),  the  coefficients  on  RC_3  and  RC_4  are 
positive  and  significantly  greater  than  0  (p  <  0.10).  In  addition,  rather  than  declining  or  flattening, 
the  RC_k  coefficients  increase  monotonically  from  0.001  for  RC_0  to  0.217  for  RC_4.  Table  6  also 
reports  tests  of  coefficient  differences  from  the  estimation  of  Equation  (5).  Here  we  find  that  the 
coefficient  on  RC_4  is  significantly  greater  than  the  coefficients  on  RC_0,  RCJ,  and  RC_2  (p  < 
0.05  or  better),  but  there  is  no  significant  difference  between  RC_4  and  RC_2.  Overall,  the  results 
from  the  estimation  of  Equation  (5)  suggest  that  the  valuation  effect  of  repetitive  nonrecurring 
items”  increases  with  observed  frequency.  In  addition,  these  results  are  consistent  with  the  notion 
that  use  of  an  earnings  announcement  date  return  window  biases  coefficients  on  restructuring 
{NSI)  variables  toward  zero.^^ 

Supplemental  Analyses 
Tax  Rate  Adjustment 

In  our  tabulated  empirical  tests,  we  adjust  special  items  for  the  firm-specific  quarterly  effec¬ 
tive  tax  rate.  To  assess  the  sensitivity  of  our  results  to  our  choice  of  the  tax  rate,  we  perform 
several  additional  analyses  utilizing  alternative  tax  rate  adjustments.  First,  we  estimate  an  implied 
tax  rate  on  NSIs  using  the  following  empirical  model. 

TaxExpi,  =  <Ao  +  tpxADJ_PTIit  +  ilr2PSlit  +  -t  f^PSI  *  P_EREQu  +  *  N_EREQu 

-t  il/^DJ_PTI  *  P_FREQij  -t  ilf2ADJ_PTI*N_EREQi, .  (6) 

INCit  is  income  before  extraordinary  items  (Compustat  data  item  #8)  for  firm  i  in  quarter  t;  PTf^ 
is  pre-tax  income  (Compustat  data  item  #23)  for  firm  i  in  quarter  t;  RSl^  is  special  items  (Com¬ 
pustat  data  item  #32)  reported  by  firm  i  in  quarter  t;  PSI  equals  RSI  if  RSI  >  0,  otherwise  PSI  = 
0;  NSI  equals  RSI  if  RSI  <  0,  otherwise  NSI  =  0;  ADJ_PTIn  is  PTft  less  RSIf,  N^FREQn 


In  untabulated  tests,  we  re-estimate  Equation  (5)  with  the  additional  control  variables  as  before.  The  results  of  this 
additional  test  are  consistent  with  our  tabulated  results  in  Table  6. 
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TABLE  6 

Cross-Sectional  Regressions  of  Two-Day  Restructuring  Announcement  Period 
Market- Adjusted  Returns  on  Restructuring  Charges  Conditioned  on  Reporting  Frequency 

(n  =  1,220) 


4 


4 


RC_RETu  =  2  PkFREQJcu  +  2  YkRC  *  FREQ_ku 

*=0  fc=0 


Equation  (5) 


Coefficient  Estimates 

Variable  Predicted  Sign  two-tailed  p] 


RCJ) 

? 

0.001 

[0.9802]‘ 

RC_1 

7 

0.010 

[0.7826] 

RC_2 

7 

0.034 

[0.3662] 

RC_3 

-f 

0.091 

[0.0643] 

RC_4 

-b 

0.217 

[0.0006] 

RC_4  -  RCJ) 

74  >  7o 

0.216 

[0.0036] 

RC_4  -  RCJ 

74  >  7i 

0.207 

[0.0043] 

RCJ  -  RCJ 

74>  72 

0.183 

[0.0128] 

RCJ - RCJ 

Adj.  R2 

74>  73 

0.126 

[0.1485] 

*  Two-tailed  p. 

Variable  Definitions: 

RC_RETi,  =  two-day  restructuring  announcement  period  [-1,0]  market-adjusted  stock  remrn  (adjusted  for 
value-weighted  index)  for  firm  i  in  quarter  f, 

RRCii  =  the  amount  of  firm  fs  after-tax  restructuring  charge,  adjusted  at  the  firm’s  quarterly  effective  tax  rate 
in  which  the  charge  is  recognized; 

RCi,  =  RRC^^  scaled  by  the  market  value  of  equity  in  quarter  r-4; 

FREQJiif  =  1  if  firm  i  reported  k  restructuring  charges  in  the  12  quarters  prior  to  quarter  t,  where  k  takes  on 

values  of  0  through  4.  When  k  takes  on  the  value  of  4,  this  indicates  that  firm  i  reported  at  least  four 
restructuring  charges  in  the  12  quarters  prior  to  quarter  r;  and 
RC_ki,  =  RCit  *  FREQ_ki,. 


iP_FREQii)  is  the  number  of  NSIs  (PSIs)  reported  by  firm  i  over  the  12  quarters  prior  to  quarter 
t;  IMP_TEit  is  the  implied  tax  expense  measured  as  PTIn  less  /AQ,;  ACTJCEn  is  actual  tax 
expense  (Compustat  data  item  #6)  for  firm  /  in  quarter  t;  and  TaxExpn  is  ACTJTEj  or  IMPJlEj.  All 
continuous  variables  are  scaled  by  the  market  value  of  equity  in  quarter  t-A. 

The  empirical  evidence  from  estimating  Equation  (6),  not  tabulated,  is  consistent  with  prior 
research  that  finds  positive  and  negative  special  item  observations  are  taxed  at  substantially  dif- 
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ferent  rates.  Specifically,  Beaver  et  al.  (2007)  report  evidence  consistent  with  the  conclusion  that 
negative  special  item  observations  have  effective  tax  rates  close  to  zero,  while  positive  special 
item  observations  have  effective  tax  rates  that  approach  the  highest  marginal  statutory  tax  rates. 
We  find  that  the  tax  rate  on  negative  special  items  implied  by  the  coefficient  on  NSI  ranges 
between  4  and  6  percent  while  the  tax  rate  on  positive  special  items  is  approximately  19  percent. 
In  addition,  the  estimated  tax  rate  declines  as  the  prior  reporting  frequency  of  NSIs  increases  (i.e., 
significantly  negative  coefficient  on  the  NSI  *  N_FREQif  interaction).  From  this  analysis,  we  in¬ 
corporate  an  alternative  tax  rate  adjustment.  Specifically,  we  adjust  positive  special  items  at  the 
average  of  the  coefficient  on  PSI,  19  percent.  For  NSIs,  the  tax  adjustment  is  based  on  the  prior 
reporting  frequency  of  NSIs.  That  is,  we  incorporate  the  specific  tax  rate  conditioned  on  prior 
reporting  frequency.  In  addition  to  this  alternative  adjustment,  we  also  incorporate  two  additional 
tax  rate  adjustments.  First,  we  adjust  negative  and  positive  special  items  using  firm-specific  annual 
marginal  tax  rates  provided  by  Graham  (1996a,  1996b).  Second,  we  adjust  all  special  items  at  the 
highest  statutory  rate,  35  percent  for  our  sample  time  period.  We  re-estimate  all  of  our  empirical 
analyses  using  each  of  these  alternative  tax  rate  adjustments  and  the  results  from  these  alternative 
specifications  are  robust  and  our  conclusions  are  unaffected. 

Expectation  of  Special  Items 

Our  primary  valuation  models  measure  unexpected  earnings  as  the  difference  between  the 
earnings  in  the  current  quarter  and  earnings  in  quarter  t-A.  On  the  other  hand,  we  make  no 
seasonal  or  other  adjustments  to  NSIs.  This  approach  is  consistent  with  the  view  that  investors 
base  expectations  of  continuing  earnings  on  prior  observations,  but,  in  general,  do  not  expect  a 
special  item  to  be  recognized.  There  are  two  reasons  why  this  approach  may  not  be  valid.  First,  it 
seems  unlikely  that  the  same  earnings  news  specification  would  be  appropriate  for  both  the 
short-window  and  long-window  analysis.  Second,  the  underlying  premise  of  our  analysis  is  that 
NSIs  are  likely  to  recur  for  some  companies.  If  so,  it  seems  reasonable  that  under  certain  circum¬ 
stances  investors  will  expect  nonzero  NSIs  and  the  design  should  recognize  this  expected  NSI 

effect.  That  is,  the  amount  of  “news”  in  a  negative  special  item  should  depend  on  whether  the  firm 

26 

recognizes  special  items  frequently. 

To  address  these  issues,  we  ran  two  additional  specifications  of  our  models.  First,  we  ran  a 
levels  version  of  Equation  (3a)  using  current  quarter  earnings  adjusted  for  the  NSI  rather  than 
seasonally  differenced  earnings.  The  results  from  this  additional  test  are  consistent  with  our  tabu¬ 
lated  results.  Specifically,  we  find  that  as  the  prior  sequence  of  NSIs  increases,  the  coefficient  on 
NSI  increases.  Second,  we  added  a  new  “unexpected  NSI”  analysis.  In  this  additional  analysis,  we 
measure  expected  NSIs  as  the  average  NSI  over  the  prior  12  quarters  and  the  unexpected  NSI  as 
the  reported  NSI  less  the  expected  component.  We  then  partition  NSIs  into  two  components 
(expected  and  unexpected)  in  Equation  (3a)  and  rerun  our  analyses.  With  respect  to  the  unexpected 
NSI  tests,  it  appears  that  our  tabulated  results  are  primarily  attributable  to  the  unexpected  com¬ 
ponent  of  NSIs.  In  particular,  we  find  that  the  coefficients  on  unexpected  NSIs  increase  as  the  prior 
reporting  frequency  increases,  while  the  coefficient  on  the  expected  component  is  generally  insig 
nificant  regardless  of  the  prior  reporting  history. 

Cross-Sectional  Determinants  of  ERCs 

The  longer  return  window  estimation  inferences  come  at  a  cost.  The  quarterly  return  accu¬ 
mulation  windows  employed  in  Equations  (3a)  and  (4a)  raise  the  possibility  that  risk-related 
factors  may  confound  the  analysis.  Accordingly,  we  modify  Equations  (3a)  and  (4a)  to  include 


We  thank  an  anonymous  reviewer  for  bringing  both  of  these  issues  to  our  attention. 
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controls  for  cross-sectional  variables  shown  by  prior  research  to  affect  the  relation  between  returns 

and  unexpected  earnings  (e.g.,  Kormendi  and  Lipe  1987;  Collins  and  Kothari  1989;  Easton  and 

Zmijewski  1989).  The  controls  we  use  are  growth  (market-to-book  ratio),  risk  (beta),^^  leverage 

(long-term  debt  divided  by  the  sum  of  long-term  debt  plus  common  stockholders’  equity),  and  firm 

size  (log  of  total  assets).  The  results  from  this  analysis  are  essentially  identical  to  those  reported  in 

28 

Tables  4  and  5  and  do  not  affect  our  conclusions. 

VI.  CONCLUSION 

This  study  examines  the  persistence  and  market  implications  of  what  has  been  referred  to  in 
the  business  media  as  “recurring  nonrecurring”  items  (Stewart  2002).  Our  results  suggest  that  the 
frequency  of  prior  reported  NSIs  is  positively  related  to  both  the  frequency  and  magnitude  of 
subsequently  reported  NSIs.  Our  results  also  suggest  that  the  frequency  of  prior  reported  NSIs  has 
a  significant  effect  on  the  persistence  of  earnings  before  NSIs.  Overall,  our  evidence  suggests  that 
as  the  sequence  of  prior  reported  NSIs  increases,  the  future  earnings  implications  of  a  current  NSI 
become  more  predictable. 

A  prior  related  analysis  by  Elliott  and  Hanna  (1996)  finds  that  the  market  places  greater 
weight  on  initial  NSIs  than  subsequent  NSIs  (in  the  two-day  window  around  the  earnings  an¬ 
nouncement).  That  is,  such  reoccurrences  are  viewed  as  decreasing  in  value  relevance  by  market 
participants.  Our  analysis  confirms  this  basic  finding  for  NSIs  in  the  short  window  around  the 
earnings  announcement  date.  This  result  is  perplexing,  given  the  clear  relation  between  increased 
frequency  of  NSIs  and  the  likelihood  and  magnitude  of  future  NSI  recognition.  However,  when  we 
introduce  control  variables  for  NSI  likelihood  or  expand  the  return  accumulation  window  to  allow 
for  pre-announcement  disclosure  of  such  events,  we  observe  a  valuation  effect  opposite  that 
reported  in  Elliott  and  Hanna  (1996).  Specifically,  we  find  that  as  prior  NSI  frequency  increases, 
the  weight  the  market  attaches  to  such  subsequent  charges  increases  rather  than  decreases.  More¬ 
over,  this  increase  in  valuation  is  consistent  with  the  estimated  persistence  properties  we  document 
for  such  high-frequency  charges. 

Interestingly,  the  decline  in  the  valuation  of  operating  earnings  with  special  items  frequency 
that  is  of  central  interest  in  Elliott  and  Hanna  (1996)  holds  for  the  expanded  return  window.  The 
robustness  of  this  decline  suggests  that  it  is  unlikely  that  one  of  the  three  explanations  they 
identify — heightened  pre-announcement  of  earnings  information  by  large  write-off  firms  (Elliott 
and  Hanna  1996,  153) — is  the  underlying  source  of  this  phenomenon.  That  is,  this  study’s  ex¬ 
panded  return  accumulation  period  includes  the  time  period  over  which  pertinent  pre-disclosure 
earnings  information  is  released,  thereby  capturing  these  information  effects.  Hence,  our  empirical 
method  is  not  sensitive  to  the  specific  date  that  the  market  learns  of  the  nonrecurring  item,  as  long 
as  it  occurs  within  the  quarter-long  accumulation  window  we  employ. 

Overall,  our  empirical  findings  are  important  in  at  least  two  respects.  Eirst,  our  results  suggest 
that  NSIs  may  be  indicative  of  the  firm’s  operating  environment  and  nearly  as  repetitive  as  other 
components  of  income.  Alternatively,  repetitive  write-offs  may  be  reflective  of  a  reporting  strategy 
that  classifies  ordinary  expenses  as  NSIs  to  manipulate  pre-write-off  income  (i.e.,  “classification 


The  firm-specific  beta  is  calculated  using  the  200  trading  days  prior  to  the  beginning  of  quarter  t  with  the  CRSP  equally 
weighted  portfolio  as  the  market  proxy. 

In  addition  to  the  pooled  cross-sectional  return  test  results  that  we  tabulate,  we  use  three  alternative  methods  to  calculate 
t-statistics  in  our  return  tests.  First,  we  re-estimate  all  return  regressions  on  an  annual  basis  using  the  Fama  and  MacBeth 
(1973)  methodology.  We  then  estimate  mean  coefficients  and  t-statistics  from  these  annual  regressions.  Second,  we 
calculate  robust-standard  errors  using  two-way  clustering  by  firm  and  year  to  control  for  both  time-series  and  cross- 
sectional  dependence  (Gow  et  al.  2010).  Third,  we  calculate  robust-standard  errors  using  two-way  clustering  by  industry 
and  year  to  control  for  the  industry  clustering  we  note  in  Appendix  B.  Our  findings  au'e  robust  to  these  alternative 
methods. 
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shifting”  as  McVay  2006;  Fan  et  al.  2010).  Either  interpretation  suggests  the  objective  of  sepa¬ 
rately  reporting  an  item  below  the  ordinary  operating  elements  of  the  income  statement  to  high¬ 
light  its  unusual  or  infrequent  nature  is  being  compromised.  Second,  our  results  suggest  media  and 
regulator  concern  about  the  use  of  repetitive  NSIs  to  fool  the  market  may  be  unfounded.  Specifi¬ 
cally,  our  results  suggest  that  the  market  values  these  “recurring  nonrecurring”  items  more  like  the 
other  components  of  recurring  earnings. 

Our  analysis  provides  limited  insights  about  manager  motivation  to  engage  in  repetitive 
reporting  of  NSIs.  The  inclusion  of  control  variables  (including  interactions  with  NSI  amounts)  for 
factors  associated  with  NSI  reporting  frequency  generally  yields  stronger  rather  than  weaker 
results  in  our  analyses.  To  the  extent  that  this  set  of  control  variables  is  effective  in  controlling  for 
such  factors,  the  results  suggest  that  managerial  discretion  regarding  NSI  reporting  is  an  underly¬ 
ing  driver  of  the  persistence  and  valuation  effects  we  observe.  Moreover,  given  that  our  analysis 
also  suggests  the  market  seems  to  understand  and  unwind  the  repetitive  nature  of  “nonrecurring” 
charges,  the  question  arises  as  to  why  managers  continue  to  record  such  charges.  The  provocative 
answer  to  this  question  is  that  managers  are  irrepressibly  and  irrationally  opportunistic  in  reporting 
earnings.  Perhaps  managers  believe  that  market  participants  value  such  charges  as  transitory,  even 
when  they  become  recurring.  Indeed,  anecdotal  evidence  (e.g.,  Lalli  and  Lim  1997)  supports  this 
notion.  A  more  mundane  alternative  is  that  managers  are  sometimes  slower  than  the  market  to 
respond  to  changing  conditions.  Hence,  managers  continue  to  report  items  as  nonrecurring  based 
on  industry  conventions  or  experience,  failing  to  recognize  that  the  basic  operating  environment 
has  changed  in  such  a  way  that  existing  financial  reporting  conventions  are  no  longer  appropriate. 
That  is,  what  once  was  nonrecurring  or  rare  has  now  become  ordinary  and  expected  and  is  valued 
accordingly. 
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APPENDIX  A 

Type,  Number,  Frequency,  and  Repetitive  Nature  of  Negative  Special  Items^ 


Type  of  Special 

Item 

Number  of 
Observations 
Current 
Quarter 

Percent  of 
Current 
Quarter 
Observations 

Number  of 
the  Same 
Type  SI 
over  the 
Prior  12 
Quarters 

Mean 
Frequency 
of  the 
Same 
Type  SI 
over  the 
Prior  12 
Quarters*’ 

Number 
of  the 
Different 
Type  SI 
over  the 
Prior  12 
Quarters 

Mean 
Frequency 
of  the 
Different 
Type  SI 
over  the 
Prior  12 
Quarters‘s 

Restructuring  Charges 

54 

34 

124 

2.30 

321 

27.8% 

Merger-Related  Costs 

6 

4 

33 

5.50 

36 

47.8% 

In-Process  R&D 

11 

7 

29 

2.64 

45 

39.2% 

Asset  Write-offs 

19 

12 

25 

1.32 

136 

15.5% 

Litigation  Costs 

8 

5 

10 

1.25 

45 

18.2% 

Goodwill  Write-off 

36 

23 

103 

2.86 

239 

30.1% 

Asset  Disposition 

8 

5 

8 

1.00 

38 

17.4% 

Other 

17 

11 

30 

1.76 

104 

22.4% 

^  This  table  was  created  by  directly  examining  the  income  statements  and  footnote  disclosures  for  the  current  and  prior  12 
periods  for  100  randomly  selected  firm-quarters  characterized  by  at  least  three  large  special  items  (greater  than  1  percent 
of  assets)  in  the  prior  12  quarters.  The  number  of  current  quarter  observations  exceeds  100  because  some  firms  reported 
multiple  types  of  special  items  in  a  single  quarter.  From  the  footnote  disclosures,  we  classified  the  cument  NSIs  (5/)  into 
one  of  the  eight  tabulated  types  based  on  the  description  of  the  charge  in  the  footnote.  The  number  over  the  prior  12 
quarters  represents  a  total  of  all  of  the  items  found  in  the  prior  12  quarters,  not  just  those  over  1  percent  of  assets. 
This  number  is  taken  by  dividing  the  total  number  of  observations  in  each  category  from  the  prior  12  quarters  by  the 
number  of  observations  in  the  category.  For  example,  there  were  124  restructuring  charges  reported  in  the  prior  12 
quarters  by  the  54  firms  that  reported  a  restructuring  charge  in  the  sample  quarter  (124/54  =  2.30). 

This  percentage  is  the  number  of  same  type  SI  over  the  prior  12  quarter  divided  by  the  sum  of  the  number  of  same  and 
different  type  Sh. 


APPENDIX  B 

Mean  and  Median  Number  of  Prior  Negative  Special  Items  by  Industry  Group 
Fama-French  Industry  Lower  Upper 


Classification 

n” 

Mean*’ 

Median 

Quartile 

Quartile 

Agriculture 

65 

2.55 

2 

1 

4 

Food  Products 

308 

1.29 

1 

0 

2 

Candy  and  Soda 

33 

0.79 

0 

0 

1 

Alcoholic  Beverages 

66 

1.58 

1 

0 

3 

Tobacco  Products 

29 

1.66 

1 

1 

2 

Recreational  Products 

159 

1.58 

1 

0 

2 

Entertainment 

344 

1.40 

1 

0 

2 

Printing  and  Publishing 

212 

1.87 

1 

1 

2 

Consumer  Goods 

407 

1.97 

1 

0 

2 

Apparel 

312 

1.67 

1 

0 

2 

Healthcare 

427 

1.87 

1 

0 

3 

Medical  Equipment 

727 

1.90 

1 

1 

3 

(continued  on  next  page) 
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Fama-French  Industry 
Classification 

n® 

Mean** 

Median 

Lower 

Quartile 

Upper 

Quartile 

Pharmaceutical  Products 

1,120 

1.89 

1 

1 

3 

Chemicals 

439 

1.79 

1 

1 

3 

Rubber  and  Plastic 

185 

1.79 

1 

0 

3 

Products 

Textiles 

131 

1.51 

1 

0 

2 

Construction  Materials 

349 

1.70 

1 

0 

3 

Construction 

191 

1.36 

1 

0 

2 

Steel  Works,  etc. 

302 

1.60 

1 

0 

2 

Fabricated  Products 

112 

1.20 

1 

0 

2 

Machinery 

785 

1.75 

1 

1 

3 

Electrical  Equipment 

232 

1.65 

1 

0 

2 

Miscellaneous 

236 

1.77 

1 

0 

2 

Automobiles  and 

220 

1.30 

1 

0 

2 

Trucks 

Aircraft 

99 

1.53 

1 

0 

2 

Shipbuilding,  Railroad 

40 

1.67 

1 

0 

3 

Equip 

Defense 

21 

1.24 

1 

1 

2 

Precious  Metals 

93 

1.39 

1 

0 

2 

Nonmetallic  Mining 

52 

0.67 

0 

0 

1 

Coal 

11 

1.18 

1 

1 

1 

Petroleum  and  Natural 

536 

1.04 

1 

0 

2 

Gas 

Utilities 

496 

0.96 

1 

0 

1 

Telecommunications 

767 

1.34 

1 

0 

2 

Personal  Services 

181 

1.50 

1 

0 

2 

Business  Services 

3,868 

2.33 

2 

1 

3 

Computers 

1,468 

2.36 

2 

1 

3 

Electronic  Equipment 

1,761 

2.11 

2 

1 

3 

Measuring  and  Control 

598 

1.77 

1 

1 

Equip 

Business  Supplies 

405 

1.57 

1 

0 

1 

L 

Shipping  Containers 

89 

1.44 

1 

2 

Transportation 

343 

1.19 

1 

0 

2 

Wholesale 

729 

1.52 

1 

0 

2 

Retail 

963 

1.33 

1 

0 

2 

Restaurants,  Hotel, 

514 

1.51 

1 

0 

2 

Motel 

Banking 

1,187 

0.24 

0 

0 

0 

1 

Insurance 

526 

0.84 

0 

0 

Real  Estate 

114 

1.58 

1 

0 

2 

Trading 

803 

0.84 

0 

0 

1 

^  n  =  number  of  quarterly  observations  with  a  negative  special  item. 

This  represents  the  mean  number  of  NSIs  reported  over  the  prior  12  quarters  conditioned  on  the  recognition  of  a  current 
quarter  NSI. 
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Control  Variables 

Variable 

Accounts  Receivable  Turnover 


Inventory  Turnover 


PP&E  Turnover 


Sales  Revenue  per  Employee 


Profit  Margin 


Change  in  ROE 


APPENDIX  C 


Name  Data  Source 


Definition 


AR_TURN 


INVJTURN 


PPEjrURN 


SALE_EMP 


PM 


AROE 


Compustat 


Compustat 


Compustat 


Compustat 


Compustat 


Compustat 


Accounts  receivable  turnover 
measured  as  sales  revenue 
(Compustat  data  item  #12) 
divided  by  receivables— total 
(Compustat  data  item  #2)  for 
firm  i  measured  in  the  year 
prior  to  the  year  in  which 
quarter  t  falls. 

Inventory  turnover  measured  as 
cost  of  goods  sold  (Compustat 
data  item  #41)  divided  by  the 
sum  of  inventories— total 
(Compustat  data  item  #3)  and 
inventory  LIFO  reserve 
(Compustat  data  item  #240)  for 
firm  i  measured  in  the  year 
prior  to  the  year  in  which 
quarter  t  falls. 

PP&E  turnover  measured  as 
sales  revenue  (Compustat  data 
item  #12)  divided  by 
PP&E -total,  net  (Compustat 
data  item  #8)  for  firm  i 
measured  in  the  year  prior  to 
the  year  in  which  quarter  t 
falls. 

Sales  revenue  per  employee 
measured  as  sales  revenue 
(Compustat  data  item  #12) 
divided  by  the  number  of 
employees  (Compustat  data 
item  #29)  for  firm  i  measured 
in  the  year  prior  to  the  year  in 
which  quarter  t  falls. 

Profit  margin  measured  as 
income  before  extraordinary 
items  (Compustat  data  item 
#18)  divided  by  sales  revenue 
(Compustat  data  item  #12)  for 
firm  i  measured  in  the  year 
prior  to  the  year  in  which 
quarter  t  falls. 

Change  in  return  on  equity 
measured  as  the  change  income 
before  extraordinary  items 
(Compustat  data  item  #18) 
scaled  by  shareholder’s  equity 
(Compustat  data  item  #216)  for 
firm  i  measured  in  the  year 
prior  to  the  year  in  which 
quarter  t  falls. 
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Variable 


Change  in  ROA 


Growth  in  Industry  Sales 


One  Year  Return 


Five  Year  Return 


Book-to-Market  Ratio 


Change  in  the  Book-to-Market 
Ratio 


Change  in  the  Industry  BTM 
Ratio 


Firm  Size 


Prior  Year  Loss  Indicator 


Name  Data  Source 

AROA  Compustat 

IND_GROWTH  Compustat 

RETl  CRSP 

RETS  CRSP 

BTM  Compustat 

ABTM  Compustat 

IND_ABTM  Compustat 

SIZE  Compustat 

LOSS  Compustat 


Definition 


The  four-year  average  change 
in  return  on  assets  measured  as 
income  before  extraordinary 
items  (Compustat  data  item 
#18)  scaled  by  total  assets 
(Compustat  data  item  #6)  for 
firm  i  measured  over  the  four 
years  prior  to  the  year  in  which 
quarter  t  falls 
The  four-year  average 
percentage  growth  in  industry 
sales  revenue  (Compustat  data 
item  #12)  for  firm  i  measured 
over  the  four  years  prior  to  the 
year  in  which  quarter  t  falls. 
One-year  raw  market  return  for 
firm  i  measured  over  the  year 
prior  to  the  beginning  of 
quarter  t. 

Four-year  raw  market  return 
for  firm  i  measured  from  year 
-5  through  the  end  of  the  year 
prior  to  the  beginning  of 
quarter  t. 

Book-to-market  ratio  measured 
as  common  equity  (Compustat 
data  item  #60)  divided  by  the 
market  value  of  equity 
(Compustat  data  item  #199 
times  item  #25)  for  firm  i 
measured  in  the  year  prior  to 
the  year  in  which  quarter  t 
falls. 

The  four-year  average  change 
in  the  BTM  for  firm  i 
measured  over  the  four  years 
prior  to  the  year  in  which 
quarter  t  falls. 

The  four-year  average  change 
in  the  industry  BTM  for  firm  i 
measured  over  the  four  years 
prior  to  the  year  in  which 
quarter  t  falls. 

Natural  log  of  total  assets 
(Compustat  data  item  #6)  for 
firm  i  measured  at  the  end  of 
quarter  t—  1 . 

Indicator  variable  equal  to  1  if 
firm  had  a  loss  (Compustat 
data  item  #18  less  than  0)  for 
any  of  the  three  years  prior  to 
the  year  in  which  quarter  t 
falls,  otherwise  0. 
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ABSTRACT:  We  document  that,  when  revising  their  short-term  earnings  forecasts  in 
response  to  management  guidance,  analysts  wishing  to  curry  favor  with  management 
weight  the  guidance  more  heavily  than  predicted,  based  on  the  credibility  and  useful¬ 
ness  of  the  guidance.  This  overweighting  of  guidance  is  present  prior  to  equity  offerings 
and  other  events  that  could  lead  to  investment  banking  business.  Although  analysts 
sacrifice  their  forecast  accuracy  by  overweighting  management  guidance,  they  appear 
to  benefit,  on  average,  by  subsequently  gaining  the  underwriting  business  for  their 
banks.  Thus,  while  analysts  wishing  to  please  managers  are  optimistic  in  their  long¬ 
term  earnings  forecasts,  they  take  their  cue  from  management  when  determining  their 
short-term  earnings  forecasts. 

Keywords:  analyst  incentives;  management  guidance;  management  forecasts;  ana¬ 
lyst  forecasts;  analyst  revisions. 

Data  Availability:  The  data  used  in  this  study  are  publicly  available  from  the  sources 
indicated  in  the  text. 

I.  INTRODUCTION 

Analysts  face  many  competing  incentives:  they  not  only  want  to  be  accurate  to  signal  their 
skill  and  build  their  reputation,  but  they  also  want  to  issue  optimistic  long-term  forecasts 
to  please  the  firms  they  follow  or  to  generate  trading  volume  for  their  brokerage  houses 
(Mikhail  et  al.  1999;  Lim  2001;  Hong  and  Kubik  2003;  Cowen  et  al.  2006;  Libby  et  al.  2008). 
Optimism  is  not  necessarily  desirable,  however,  for  short-term  forecasts  (Matsumoto  2002,  Rich¬ 
ardson  et  al.  2004;  Ke  and  Yu  2006).  Thus,  we  expect  analysts  with  incentives  to  please  the  firms 
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they  follow  to  take  their  cue  from  management  when  determining  their  short-term  forecasts. 
Specifically,  when  revising  their  forecasts  in  response  to  management  guidance,  we  expect  that 
these  analysts  will  weight  the  guidance  more  heavily  than  predicted  based  on  the  perceived 
credibility  and  usefulness  of  the  guidance  (hereafter,  overweight  management  guidance).^  Al¬ 
though  by  overweighting  management  guidance  analysts  bear  the  cost  of  reduced  forecast  accu¬ 
racy,  this  behavior  pleases  management,  as  it  helps  align  the  market’s  expectations  with  those  of 
management  (e.g.,  Givoly  and  Lakonishok  1979;  Ajinkya  and  Gift  1984;  King  et  al.  1990;  Shekel 
1991). 

We  examine  analysts’  incorporation  of  guidance  prior  to  equity  offerings,  where  analysts  have 
been  shown  to  have  strong  incentives  to  curry  favor  with  managers.  The  equity  offering  setting  is 
advantageous  because  the  benefit  to  analysts  is  straightforward  (underwriting  fees  for  their  banks) 
and  we  can  identify  which  investment  banks  underwrote  the  deal,  ex  post.  In  addition,  we  can 
conduct  powerful  cross-sectional  tests  by  analyzing  different  analysts’  incorporation  of  the  same 
guidance,  as  we  expect  only  those  analysts  with  incentives  to  please  management  to  overweight 
management  guidance. 

We  find  that  analysts  overweight  management  guidance  for  firms  that  subsequently  announce 
an  equity  offering.  After  controlling  for  other  variables  that  affect  analysts’  incorporation  of 
guidance,  such  as  the  credibility  and  usefulness  of  the  forecast,  analysts  on  average  incorporate  27 
percent  more  of  the  management  guidance  into  their  revised  short-term  forecasts  for  firms  that 
announce  an  equity  offering  in  the  subsequent  year.  This  overweighting  is  economically  signifi¬ 
cant,  as,  on  average,  analysts  tend  to  incorporate  approximately  50-80  percent  of  management 
guidance  (e.g.,  Baginski  and  Hassell  1990;  see  current  Table  4). 

Although  we  control  for  a  variety  of  determinants  related  to  the  credibility  and  usefulness  of 
management  guidance,  it  is  possible  that  guidance  prior  to  equity  offerings  is  informative  in  ways 
not  captured  by  our  measures  of  usefulness  and  credibility,  leading  to  the  appearance  of  an 
overweighting  of  guidance.  To  address  this  possibility,  we  conduct  two  sets  of  tests  to  better 
distinguish  between  the  expected  incorporation  of  credible  and  useful  guidance  and  the  over¬ 
weighting  of  guidance  to  please  managers  prior  to  equity  offerings. 

First,  we  examine  analysts’  ex  post  forecast  accuracy.  The  incorporation  of  credible  and  useful 
guidance  generally  improves  analysts’  earnings  forecasts.  Thus,  if  the  overweighting  of  guidance 
we  observe  is  in  response  to  some  omitted  credibility  and  usefulness  measures,  then  analysts’ 
forecasts  should  be  more  accurate  as  a  result.  In  contrast,  we  find  that  analysts’  ex  post  forecast 
errors  increase  by  40  percent,  on  average,  as  a  result  of  the  overweighting  of  management  guid¬ 
ance  prior  to  equity  offerings.  This  finding  is  consistent  with  analysts  bearing  significant  accuracy 
costs  in  order  to  please  management. 

Second,  we  investigate  whether  the  weight  placed  on  management  guidance  prior  to  equity 
offerings  varies  with  analyst  independence  and  affiliation.  These  cross-sectional  tests  use  only 
management  guidance  issued  by  firms  that  announce  an  equity  offering  in  the  next  year,  thereby 
mitigating  the  endogeneity  concerns  associated  with  the  equity  offering.  We  hold  variation  in  the 
credibility  and  usefulness  of  the  guidance  constant  by  requiring  that  both  analyst  types  (e.g., 
affiliated  and  unaffiliated)  forecast  earnings  for  a  given  firm,  allowing  us  to  determine  how  dif¬ 
ferent  analysts  respond  to  the  same  guidance. 

Our  first  cross-sectional  test  examines  differences  between  dependent  and  independent  ana¬ 
lysts.  Only  analysts  employed  by  investment  banks  (dependent  analysts)  have  the  potential  to  gain 


'  To  clarify,  we  expect  analysts  to  revise  their  forecasts  either  upward  or  downward  depending  on  management’s  prefer¬ 
ence,  which  is  communicated  via  management  guidance.  We  document  that  analysts  overweight  guidance  in  both 
directions  in  Section  V. 
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the  underwriting  business  for  their  firms.  Thus,  if  the  overweighting  of  guidance  stems  from 
analysts’  incentives  to  gain  underwriting  business,  we  should  find  that  dependent  analysts,  but  not 
independent  analysts,  overweight  management  guidance  prior  to  equity  offerings.  Conversely,  if 
the  underlying  properties  of  firms  issuing  equity  cause  the  overweighting,  we  should  not  observe 
a  systematic  difference  by  analyst  type.  We  find  that  dependent  analysts  weight  the  same  manage¬ 
ment  guidance  more  heavily  than  independent  analysts,  consistent  with  the  overweighting  of 
guidance  representing  a  mechanism  used  by  analysts  to  curry  favor  with  management. 

Our  second  cross-sectional  test  examines  differences  between  affiliated  and  unaffiliated  ana¬ 
lysts.  This  test  allows  us  to  condition  on  the  ex  post  realization  of  gaining  the  underwriting  fees. 
If  managers  reward  analysts  who  overweight  their  guidance,  we  should  find  that  those  analysts 
employed  by  the  banks  subsequently  underwriting  the  offerings  (affiliated  analysts)  weight  the 
firm’s  guidance  more  heavily  than  other  analysts.  Results  are  consistent  with  this  conjecture;  we 

find  that  analysts’  overweighting  of  management  guidance  is  associated  with  obtaining  the  under- 

•  2 

writing  business  of  the  upcoming  equity  offering. 

We  conduct  several  additional  analyses  to  corroborate  these  findings.  First,  to  further  control 
for  the  endogeneity  of  an  equity  offering,  we  model  the  equity  issuance  choice  and  conduct  a 
two-stage  Heckman  analysis.  We  continue  to  find  evidence  that  analysts  overweight  management 
guidance  prior  to  equity  offerings.  We  also  assess  the  likelihood  that  the  inclusion  of  an  omitted 
variable  could  overturn  the  results,  following  Frank  (2000),  and  conclude  that  the  probability  is 
small.^  Second,  we  document  that  analysts  who  overweight  management  guidance  also  tend  to 
issue  strong  buy  recommendations,  a  previously  documented  mechanism  to  please  management 
(e.g.,  Michaely  and  Womack  1999),  consistent  with  analysts  simultaneously  employing  several 
tools  to  curry  favor  with  management.  Third,  we  illustrate  that  our  conclusions  do  not  require 
analysts’  explicit  knowledge  of  an  equity  offering.  We  find  that  analysts  overweight  management 
guidance  for  firms  expected  to  have  equity  offerings.  Moreover,  the  overweighting  of  guidance  is 
more  prevalent  in  the  six  months  prior  to  the  equity  offering,  relative  to  offerings  announced  7-12 
months  in  the  future,  consistent  with  analysts  being  more  responsive  when  the  potential  payoffs 
are  nearer  (and  presumably  more  certain).  Fourth,  we  document  that  the  overweighting  of  guid¬ 
ance  prior  to  announcements  of  equity  offerings  is  present  for  both  upward  and  downward  guid¬ 
ance,  consistent  with  analysts  following  management  in  either  direction  in  order  to  curry  favor. 
Finally,  we  also  find  some  evidence  that  analysts  overweight  management  guidance  prior  to  other 
events  that  provide  analysts  an  incentive  to  please  management:  i.e.,  prior  to  debt  offerings  and 
M&A  activity,  where  debt  underwriting  fees  and  M&A  advisory  fees  might  be  gained  by  the 
analyst’s  employer. 

To  summarize,  we  find  that  analysts  overweight  short-term  management  guidance  when  they 
have  an  incentive  to  please  management— preceding  an  equity  offering.  This  overweighting  does 
not  appear  to  be  a  function  of  analysts  incorporating  useful  information,  but  rather  results  in  less 
accurate  forecasts.  The  overweighting  of  guidance  is,  on  average,  present  only  when  the  analysts 
are  able  to  gain  the  underwriting  business  and  the  analysts  appear  to  benefit  from  this  behavior  by 
increasing  their  chances  of  gaining  the  subsequent  underwriting  business  for  their  banks. 


^  An  alternative  explanation  for  this  result  is  that  the  subsequent  underwriter  is  also  the  past  underwriter,  and  the  analysts 
are  attempting  to  please  management  to  maintain  the  underwriting  business.  For  example,  Michaely  and  Womack 
( 1 999)  find  that  brokerage  analysts  are  more  optimistic  in  their  recommendations  for  companies  their  firm  has  recently 
taken  public.  This  plausible  alternative  does  not  alter  our  main  conclusion.  Analysts  under  either  scenario  overweight 
management  guidance  in  order  to  please  management.  Moreover,  Krigman  et  al.  (2001)  find  that  approximately  one- 
third  of  lead  underwriters  change  from  deal  to  deal;  thus,  the  past  underwriters  cannot  automatically  assume  they  will 
be  the  subsequent  underwriters,  further  heightening  their  incentive  to  please  management. 

^  Though  we  explore  this  endogeneity  concern  in  detail  in  Section  V,  we  cannot  completely  rule  out  the  possibility  that 
the  endogenous  nature  of  equity  offerings  drives  our  results. 
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Our  study  contributes  to  the  understanding  of  how  incentives  to  please  management  affect 
analysts’  short-term  earnings  forecasts.  Although  analysts  wishing  to  please  managers  are  overly 
optimistic  in  their  long-term  earnings  forecasts  and  recommendations,  we  document  that  they  take 
their  cue  from  management  when  determining  their  short-term  earnings  forecasts.  The  desire  to 
please  management,  therefore,  can  result  in  systematic  errors  in  analysts’  short-term  earnings 
forecasts.  Although  we  use  the  setting  of  equity  offerings  to  document  analysts’  overweighting  of 
guidance  to  please  management,  the  implications  of  our  findings  are  much  broader.  This  mecha¬ 
nism  can  be  used  by  any  analyst  wishing  to  curry  favor  with  management,  conditional  on  the 
issuance  of  management  guidance,  whether  the  analyst  wishes  to  gain  bank  fees  (the  focus  of  the 
current  study),  to  receive  preferred  treatment  in  conference  calls  (Libby  et  al.  2008;  Mayew  2008), 
or  to  please  management  for  other  reasons  (e.g.,  Chen  and  Matsumoto  2006;  Ke  and  Yu  2006). 

11.  MOTIVATION  OF  HYPOTHESIS 

In  general,  analysts  have  an  incentive  to  issue  accurate  earnings  forecasts.  Hong  and  Kubik 
(2003)  find  that  the  least  accurate  analysts  are  relegated  to  less  desirable  brokerage  houses,  while 
more  accurate  analysts  are  more  likely  to  move  up  to  high-status  brokerage  houses.  Lim  (2001) 
concludes  that  a  positive  forecast  bias  is  the  outcome  of  rational  analysts  trying  to  minimize  their 
forecast  error  while  also  trying  to  please  management.  Consistent  with  this  reasoning,  Chan  et  al. 
(2003a)  find  that  analysts  consistently  exceed  realized  growth  when  estimating  long-term  growth 
in  earnings,  and  Dechow  et  al.  (2000)  find  that  the  most  optimistic  long-term  growth  forecasts  are 
for  equity  issuers.^  This  positive  bias  also  extends  to  stock  recommendations.  For  example,  Dugar 
and  Nathan  (1995)  document  that  analysts  employed  by  investment  banks  issue  more  optimistic 
stock  recommendations  than  do  analysts  employed  by  non-investment  banks,  and  Lin  and  McNi- 
chols  (1998)  find  that  those  analysts  subsequently  employed  as  the  underwriters  (affiliated  ana¬ 
lysts)  issue  more  optimistic  recommendations  and  long-term  growth  forecasts  than  do  unaffiliated 
analysts. 

Although  optimistic  recommendations  and  long-term  forecasts  are  arguably  Pareto-optimal, 
this  is  not  the  case  for  short-term  forecasts.  Blanket  optimism  for  short-term  earnings  forecasts  is 
costly  if  the  firm  misses  the  benchmark  (e.g.,  Skinner  and  Sloan  2002)  and,  consistent  with  this, 
prior  research  has  not  found  systematic  evidence  of  optimism  in  short-term  forecasts  (e.g.,  Lin  and 
McNichols  1998).  There  is  no  clear  strategy  for  analysts  to  garner  management’s  favor  with  their 
short-term  forecasts.  Some  managers  will  be  pleased  by  optimistic  forecasts,  for  example,  if  they 
plan  to  issue  equity  or  sell  their  shares  before  earnings  are  reported,  if  they  are  in  danger  of  losing 
their  positions,  or  if  the  firm  is  in  danger  of  a  debt-rating  downgrade  or  bankruptcy.  Other  man¬ 
agers  would  prefer  pessimistic  forecasts,  for  instance  if  they  face  high  litigation  risk,  if  then- 
bonuses  depend  on  meeting  the  analyst  forecast,  or  if  they  plan  to  issue  equity  or  sell  their  shares 
after  earnings  are  reported  and  want  to  positively  surprise  the  market  to  maximize  the  sale  price 
(Rogers  and  Stocken  2005). 

In  the  face  of  these  varying  incentives,  managers  can  use  their  guidance  to  move  expectations 
to  the  desired  levels  (e.g.,  Ajinkya  and  Gift  1984;  King  et  al.  1990;  Hirst  et  al.  2008).  Thus,  rather 
than  observing  systematic  optimism  or  pessimism  in  shorter-term  analyst  forecasts,  we  expect  that, 
to  curry  favor  with  managers,  analysts  will  take  their  cue  from  management  and  overweight 
management  guidance  when  revising  their  short-term  earnings  forecasts,  even  though  the  over¬ 
weighting  will  likely  result  in  less  accurate  analyst  forecasts. 


""  It  is  also  possible  that  analysts  tend  to  follow  firms  that  they  expect  to  perform  well  and  choose  not  to  follow  firms  they 
expect  to  perform  poorly,  leading  to  the  appearance  of  ex  post  optimism  (Hayes  1998;  Bradshaw  et  al.  2006). 
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Several  conditions  must  exist  for  analysts  to  overweight  guidance  to  please  managers.  First, 
managers  must  want  analysts’  forecasts  to  closely  mirror  their  own  guidance.  This  desire  on  the 
part  of  managers  is  reasonable  because,  when  analysts  incorporate  guidance,  it  aids  in  the  align¬ 
ment  of  market  expectations  with  those  of  management  (e.g.,  Givoly  and  Lakonishok  1979;  Lys 
and  Sohn  1990;  Shekel  1991).  We  therefore  assume  that  managers  will  desire  analysts  to  be 
responsive  to  their  guidance. 

Second,  managers  must  be  able  to  identify  and  reward  analysts  who  overweight  their  guid¬ 
ance.  We  assume  that  a  manager  can  infer  each  analyst’s  weighting,  relative  to  an  unbiased 
weighting  of  the  guidance,  based  on  the  manager’s  knowledge  of  his  or  her  own  reputation, 
forecast  credibility,  and  forecast  precision.  Managers  can  reward  the  overweighting  by  giving  the 
analyst  (or  associated  investment  bank)  special  consideration,  for  example,  in  the  underwriter- 
selection  process.^  Finally,  analysts  must  have  an  incentive  to  curry  favor  with  management.  We 
use  upcoming  equity  offerings  to  proxy  for  analysts’  incentives  to  please  management.  During  the 
time  period  examined,  underwriting  departments  were  known  to  award  significant  bonuses  to 
analysts  who  help  attract  new  underwriting  business  (Cowen  et  al.  2006,  122).  We  expect  analysts 
employed  by  an  investment  bank  to  have  incentives  to  please  management  in  this  setting  in  order 
to  gain  the  underwriting  business  for  their  firm. 

In  sum,  we  expect  analysts  wishing  to  please  management  to  overweight  management  guid¬ 
ance  when  revising  their  short-term  earnings  forecasts,  at  the  expense  of  their  ex  post  accuracy,  in 
order  to  benefit  from  management’s  special  consideration — for  example,  by  increasing  the  likeli¬ 
hood  of  gaining  underwriting  business  for  their  investment  bank. 

Hypothesis:  Analysts  overweight  management  guidance  when  revising  their  short-term  earn¬ 
ings  forecasts,  at  the  expense  of  their  ex  post  forecast  accuracy,  to  curry  favor  with  manage¬ 
ment. 

We  conduct  several  analyses  that  jointly  test  our  hypothesis.  First,  we  examine  whether 
analysts  overweight  management  guidance  when  revising  their  short-term  earnings  forecasts  for 
firms  announcing  an  equity  offering  in  the  next  year.  Second,  we  examine  whether  this  over¬ 
weighting  of  guidance  leads  to  larger  analyst  forecast  errors.  Third,  we  conduct  cross-sectional 
tests  to  determine  whether  only  analysts  who  could  feasibly  gain  the  underwriting  business  over¬ 
weight  management  guidance  and  if  this  behavior  appears  to  result  in  gaining  the  underwriting 
business. 


III.  DATA  AND  DESCRIPTIVE  STATISTICS 

Data  and  Sample 

We  use  data  from  1994-2005,  obtaining  quarterly  management  guidance,  analyst  forecasts, 
and  actual  (reported)  earnings  from  the  First  Call  database.  We  acquire  information  regarding  the 
timing  and  underwriter  of  the  equity  offering  from  SDC  Platinum.  We  obtain  the  necessary  data 
for  the  control  variables  from  the  2005  Quarterly  and  Annual  Compustat  File.  We  outline  the 
sample  selection  procedure  in  Table  1.  As  Figure  1  illustrates,  we  consider  management  guidance 
for  quarter  q  that  was  issued  after  the  earnings  announcement  for  quarter  q-l  and  before  the 
earnings  announcement  for  quarter  q.  ^Ve  examine  the  analyst  forecast  revisions  for  quarter  q 
around  the  corresponding  management  guidance.  In  this  short-term  setting,  analysts  cannot  be 
systematically  optimistic  in  their  forecasts  as  some  managers  will  prefer  pessimistic  forecasts. 
Thus,  analysts  must  take  their  cue  from  management’s  guidance  when  determining  what  short- 


^  Although  our  focus  is  on  underwriting  fees,  this  special  consideration  could  also  include  other  types  of  consulting  fees, 
inside  information,  and  conference  call  favoritism.  Clearly,  managers  will  also  consider  deal  terms,  underwriter  quality, 
and  other  criteria  when  determining  whom  to  select  as  underwriters.  These  additional  features  of  the  choice  of  under¬ 
writer  should  add  noise  to  the  tests,  thereby  biasing  against  finding  evidence  consistent  with  our  hypothesis. 
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TABLE  1 

Management  Guidance  Sample  Selection 
1994-2005 


Number 

of 

Number 

Forecasts 

of  Firms 

Total  Forecasts 

43,885 

5,983 

Less: 

Non-EPS  forecasts 

1,291 

137 

Non-USD  currency  forecasts 

295 

132 

Forecasts  unduly  affected  by  mergers  or  accounting  changes 

169 

21 

Forecasts  issued  before  1994 

58 

13 

EPS  forecasts  made  after  1994 

42,072 

5,680 

Less: 

Qualitative  forecasts 

8,278 

911 

Point  and  range  forecasts 

33,794 

4,769 

Less: 

Duplicate  forecasts 

33 

1 

Forecasts  missing  actual  earnings  data 

4,131 

203 

Forecasts  with  actual  earnings  data 

29,630 

4,565 

Less: 

Forecasts  without  CUSIPs 

60 

32 

Forecasts  with  missing  Compustat  earnings  announcement  dates 

4,089 

370 

Forecasts  issued  before  the  earnings  announcement  date  of  the  prior 

12,655 

358 

quarter 

Forecasts  issued  after  the  earnings  announcement  date  of  the 

236 

44 

current  quarter 

Final  management  guidance  sample 

12,590 

3,761 

Less: 

Observations  without  analysts’  preexisting  forecasts 

5,912 

1,445 

Management  guidance  with  analysts’  forecasts 

6,678 

2,316 

Less: 

Observations  without  stock  price  data 

974 

554 

Management  guidance  with  stock  price  as  a  scalar 

5,704 

1,762 

Less: 

Multiple  guidance  issuances  (keeping  the  most  recent  guidance) 

515 

0 

The  top  and  bottom  one  percent  of  management  guidance,  analysts’ 

204 

54 

revisions,  and  managerial  reputation 

Final  Sample 

4,985 

1,708 
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FIGURE  1 

Timeline  and  Variable  Measurement 


REVISE 


Analyst  Forecast 
Error  (AFE) 


V 


MGUIDE 


Preexisting  analyst 

Management 

T 

Analysts  revise  their 

Quarterly  earnings 

forecast  available 

guidance  is 

forecasts  over  the  next 

announcement. 

2-30  days  prior  to 

provided. 

15  days;  if  there  is  no 

management 

revision,  preexisting 

guidance. 

forecast  is  carried 

forward. 

Equity  offering  is  announced  in  the  12  months  following 
management  guidance  for  some  firms. 


term  earnings  forecast  will  curry  favor.^  If  a  firm  provides  multiple  forecasts  for  the  same  quarter, 
then  we  keep  the  most  recent  (last-issued)  guidance.^  The  final  sample  contains  4,985  firm-quarter 
observations  consisting  of  1,708  unique  firms. ^ 

Variable  Definitions  and  Descriptive  Statistics 

All  variables  used  in  the  analysis  are  defined  in  Table  2,  with  corresponding  descriptive 
statistics  in  Table  3.  Our  research  design  requires  that  we  measure  management’s  “suggested” 
revision  (based  on  their  public  guidance)  and  analysts  subsequent  forecast  revisions  to  their 
short-term  earnings  forecast.  The  higher  the  correlation  between  management  and  analyst  revi¬ 
sions,  the  greater  the  “weighting”  of  guidance  by  analysts.  In  the  following  section,  we  introduce 
a  number  of  control  variables  that  are  expected  to  affect  the  weight  placed  on  the  management 
guidance,  thereby  allowing  us  to  focus  on  the  “overweighting”  of  guidance  by  analysts  prior  to 
equity  offerings. 

Management’s  suggested  revision  {MGUIDE)  is  the  measure  of  how  far  the  management 


®  We  include  pre- announcements  in  the  sample  because,  in  our  setting,  it  is  not  clear  that  analysts  incentives  would 
change  within  the  window  leading  up  to  the  earnings  announcement.  Specifically,  we  expect  analysts  to  wish  to  please 
management  throughout  the  quarter,  including  during  the  “pre-announcement”  period.  Our  results  do  not  change  quali¬ 
tatively  when  we  exclude  pre-announcements  from  our  sample  (not  tabulated). 

We  delete  515  instances  of  subsequently  revised  management  guidance  for  a  given  firm-quarter,  keeping  only  the  most 
recently  issued  guidance  in  a  given  firm-quarter.  It  is  possible  that  firms  issuing  multiple  forecasts  in  a  given  quarter  are 
systematically  different  from  the  rest  of  the  sample.  We  find  no  statistical  difference  in  analysts  incorporation  of 
guidance  for  these  515  firm-quarters  in  the  multivariate  tests  (not  tabulated). 

^  After  the  sample  constraints,  we  retain  approximately  29  percent  of  the  unique  firms  from  the  initial  population.  We 
compare  the  size,  performance,  and  expected  growth  of  the  final  sample  to  the  initial  population  from  which  the  sample 
was  collected.  We  find  that  our  sample  is  larger,  more  profitable,  and  has  higher  expected  growth,  consistent  with 
analysts  following  these  types  of  firms  (Bhushan  1989;  McNichols  and  O  Brien  1997). 
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TABLE  2 

Variable  Definitions 

Calculation  (Compustat  data  item  numbers  in 


Variable 

Description 

parentheses) 

SEO 

If  the  firm  issued  equity 

An  indicator  variable  that  is  set  equal  to  1  if  the  firm 
announces  an  equity  offering  in  the  year  following  the 
management  guidance,  and  0  otherwise. 

MGUIDE 

The  magnitude  of  the 
revision  suggested  by 
the  manager 

The  difference  of  the  management  earnings  forecast  and 
the  preexisting  median  consensus  analyst  forecast,  scaled 
by  stock  price  at  the  beginning  of  the  quarter.  The 
preexisting  consensus  analyst  forecast  is  the  most  recent 
consensus  before  the  management  guidance  (within  2-30 
days).  The  midpoint  of  the  range  is  used  for  all 
management  guidance  that  is  provided  as  a  range 
forecast. 

REVISE 

The  magnitude  of  the 
analyst  forecast  revision 

The  difference  of  the  revised  consensus  analyst  forecast 
and  the  preexisting  consensus  analyst  forecast,  scaled  by 
stock  price  at  the  beginning  of  the  quarter.  The 
preexisting  consensus  analyst  forecast  is  the  most  recent 
consensus  before  the  management  guidance  (within  2-30 
days).  The  revised  consensus  analyst  forecast  is  the 
updated  consensus  forecast  issued  within  15  days  of  the 
management  guidance.  If  there  is  not  a  revised  consensus 
analyst  forecast,  then  REVISE  is  0. 

FREQUENCY 

The  number  of 
management  forecasts 

The  number  of  quarterly  management  forecasts  issued 
over  the  previous  three  years. 

EPS 

Earnings  per  share 

Earnings  per  share  scaled  by  the  stock  price  at  the 
beginning  of  the  quarter. 

\AEE\ 

The  absolute  ex  post 
analyst  forecast  error 

The  absolute  value  of  the  median  consensus  analyst 
forecast  error  15  days  following  management  guidance, 
scaled  by  the  stock  price  at  the  beginning  of  the  quarter. 

AFEILIATED 

Affiliated  analyst 

An  indicator  variable  that  is  set  equal  to  1  if  the  analyst 
is  employed  by  the  investment  bank  that  subsequently 
underwrites  the  equity  offering,  and  0  otherwise. 

DEPENDENT 

Analyst  employed  by 
investment  bank 

An  indicator  variable  that  is  set  equal  to  1  if  the  analyst 
is  employed  by  an  investment  bank,  and  0  otherwise. 

DOWN 

Downward  management 
guidance 

An  indicator  variable  that  is  set  equal  to  1  if  the 
management  guidance  falls  below  the  preexisting 
consensus  analyst  forecast,  and  0  otherwise. 

REPUTATION 

Historical  reputation  of 
the  manager 

The  average  accuracy  of  the  preceding  management 
guidance  issued  in  the  prior  three  years,  where  the 
accuracy  is  equal  to  1 ,  0  and  —  1 ,  respectively,  if  the 
absolute  value  of  the  preexisting  analyst  forecast  error  is 
greater  than,  equal  to,  or  less  than  the  absolute  value  of 
the  management  forecast  error. 

AGREE 

Congruent  signs  on 
management  guidance 
and  market  response 

An  indicator  variable  that  is  set  equal  to  1  if  the 
three-day  abnormal  (market-adjusted)  return  around  the 
management  guidance  (-1,  -1-1)  has  the  same  sign  as  the 
management  revision,  and  0  otherwise.  The  abnormal 
return  is  equal  to  the  difference  between  the  firm  return 
and  the  value-weighted  market  return. 

(continued  on  next  page) 
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TABLE  2  (continued) 


Variable 

Description 

Calculation  (Compustat  data  item  numbers  in 
parentheses) 

HORIZON 

Management  guidance 
horizon 

The  log  of  the  number  of  days  between  the  issuance  of 
management  guidance  and  the  earnings  announcement. 

RANGE 

Management  guidance 
precision 

The  range  of  the  forecast  scaled  by  the  stock  price  at  the 
beginning  of  the  quarter  for  a  range  forecast,  and  0  for  a 
point  forecast. 

ANALYSTS 

Analyst  following 

The  log  of  the  number  of  analyst  forecasts  used  to  form 
the  consensus  analyst  forecast  following  management 
guidance. 

\AEPS\ 

Earnings  shocks 

The  absolute  change  in  earnings  per  share  from  First 

Call,  scaled  by  the  stock  price  at  the  beginning  of  the 
quarter. 

LOSS 

Loss  indicator 

An  indicator  variable  that  is  set  equal  to  1  if  earnings  per 
share  from  First  Call  is  negative,  and  0  otherwise. 

SPECIAL  ITEMS 

Special  items 

Special  items  from  Compustat  (#32),  scaled  by  the 
market  value  of  common  stock  at  the  beginning  of  the 
quarter. 

EIRM  SIZE 

Firm  size 

The  log  of  the  market  value  of  conunon  stock  at  the 
beginning  of  the  quarter. 

ANALYST 

DISPERSION 

Analyst  disagreement 

The  standard  deviation  of  the  individual  analyst  forecasts 
used  to  generate  the  consensus  analyst  forecast  following 
management  guidance,  scaled  by  the  stock  price  at  the 
beginning  of  the  quarter. 

DEBT 

If  the  firm  issued  debt 

An  indicator  variable  that  is  set  equal  to  1  if  the  firm 
announces  a  debt  offering  in  the  year  following  the 
management  guidance,  and  0  otherwise. 

M&A 

If  the  firm  had  an  M&A 

An  indicator  variable  that  is  set  equal  to  1  if  the  firm 
announces  a  merger  or  acquisition  in  the  year  following 
the  management  guidance,  and  0  otherwise. 

earnings  forecast  deviates  from  the  preliminary  earnings  expectations,  the  management  earnings 
forecast  less  the  preexisting  median  consensus  analyst  forecast  (the  last  consensus  available  in  the 
2-30  days  prior  to  the  guidance),  scaled  by  price  at  the  beginning  of  the  quarter.  The  analyst 
revision  (REVISE)  is  the  revised  consensus  analyst  forecast  (the  last  revised  consensus  available  in 
the  15  days  following  the  management  guidance),  less  the  preexisting  median  consensus  analyst 
forecast,  scaled  by  price  at  the  beginning  of  the  quarter.  In  other  words,  REVISE  is  how  much  the 
consensus  analyst  forecast  moves  after  management  guidance  is  issued.®  The  magnitude  of  the 
mean  analyst  revision  (REVISE',  —0.0029)  falls  just  below  that  of  the  suggested  management 


®  We  consider  the  consensus  analyst  forecast,  rather  than  each  individual  analyst  revision,  to  provide  direct  evidence  that 
the  consensus  forecasts  can  be  moved  by  the  overweighting  of  guidance.  We  consider  the  median,  rather  than  the  mean 
consensus  analyst  forecast  because  the  mean  consensus  could  be  more  easily  moved  by  one  analyst’s  overweighting  of 
guidance  to  please  management  and,  thus,  examining  the  mean  forecast  could  overstate  the  economic  significance  of  our 
findings.  Finally,  we  examine  analyst  revisions  in  the  15  days  following  management  guidance  to  better  capture 
revisions  made  by  analysts  in  response  to  management  guidance,  rather  than  other  news. 
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revision  {MGUIDE\  —0.0031),  on  average  (see  Table  3). 

We  measure  equity  offerings  {SEO)  as  an  indicator  variable  that  equals  1  if  the  firm  an¬ 
nounces  an  equity  offering  in  the  year  following  the  issuance  of  the  management  guidance,  and 
equals  0  otherwise.  We  consider  the  announcement  date  instead  of  the  actual  offering  date  for  two 
reasons.  First,  analysts  cannot  revise  their  forecasts  following  the  announcement  of  equity  offer¬ 
ings  due  to  SEC-mandated  quiet  periods  (e.g.,  see  Lin  and  McNichols  1998).^°  Second,  firms  have 
usually  selected  their  underwriters  by  the  time  of  the  announcement.  Thus,  it  is  before  the  an¬ 
nouncement  that  analysts  will  be  trying  to  gain  the  underwriting  business.  Approximately  4  per¬ 
cent  of  the  sample  issues  equity  through  a  secondary  equity  offering  in  the  year  following  man¬ 
agement  guidance  (Table  3).^^ 

We  compare  guidance  that  is  issued  in  the  year  prior  to  an  equity  offering  announcement 
(SEO)  to  guidance  issued  during  all  other  times  (non-SEO)  in  Table  3.  The  frequency  of  guidance 
is  higher  among  non-SEO  observations  relative  to  SEO  observations,  and  the  absolute  value  of 
analyst  forecast  errors  is  greater  among  SEO  observations,  though  we  explicitly  link  this  error  to 
the  overweighting  of  guidance  in  the  following  section.  Interestingly,  we  find  no  difference  in  the 
propensity  of  losses,  the  magnitude  of  special  items,  firm  size,  or  analyst  dispersion  between  SEO 
and  non-SEO  observations;  thus,  the  results  do  not  suggest  that  the  larger  forecast  errors  are  a 
result  of  greater  uncertainty  prior  to  SEOs.  Overall,  there  is  no  systematic  evidence  of  one  of  these 
two  groups  clearly  dominating  the  other  with  respect  to  the  usefulness  or  credibility  of  guidance. 
Non-SEO  observations  tend  to  have  more  downward  guidance  (which  is  deemed  more  credible), 
while  guidance  among  the  SEO  observations  tends  to  be  made  by  managers  with  slightly  higher 
reputations. 


IV.  TEST  DESIGN  AND  RESULTS 

Do  Analysts  Overweight  Management  Guidance  Prior  to  Equity  Offerings? 

To  determine  if  analysts  wishing  to  please  management  overweight  management  guidance 
when  revising  their  short-term  earnings  forecasts,  we  estimate  the  following  regression  equation; 


REVISE  =aQ+  aiMGUIDE  +  a2MGUIDE  X  SEO  +  a^MGUIDE  X  DOWN 

+  a^M GUIDE  X  REPUTATION  +  a^MGUIDE  X  AGREE  F  af^MGUIDE 
X  HORIZON  +  ajMGUIDE  X  RANGE  +  a^MGUIDE  X  ANALYSTS  F  a<)SEO 
F  a^oDOWNF  a^^REPUTATION F  a^2^GREEF  a^^HORIZON F  a^^RANGE 
Fay^NALYSTSFe.  (1) 


The  results  are  robust  to  several  alternative  specifications.  First,  our  results  do  not  change  qualitatively  when  we 
use  the  mean  consensus  analyst  forecasts  instead  of  the  median  (not  tabulated).  Second,  instead  of  analyst  forecast 
revisions  in  15  days,  we  examine  the  revisions  in  30  days  and  up  to  the  day  before  the  earnings  announcement.  Our 
conclusions  remain  unchanged  (not  tabulated).  Third,  we  include  all  analysts  in  the  revised  consensus  figure;  as  an 
additional  robustness  check,  we  drop  from  this  calculation  analysts  who  did  not  revise  their  forecast;  results  are  slightly 
stronger  after  the  exclusion  of  these  analysts  (not  tabulated).  Finally,  we  also  consider  controlling  for  the  age  of  the 
preexisting  forecast,  which  is  between  two  and  30  days.  The  age  of  the  preexisting  consensus  is  not  a  significant 
explanatory  variable  and  does  not  change  the  results  (not  tabulated). 

Consistent  with  the  interactions  between  managers  and  analysts  preceding  the  announcement  date,  in  our  sample  we  find 
only  38  instances  where  management  guidance  is  issued  after  the  announcement  of  the  equity  offering  but  before  25 
days  following  the  actual  offering  (not  tabulated). 

”  As  a  robustness  check,  we  follow  Bradshaw  et  al.  (2006)  and  consider  a  second  measure  of  equity  financing  that 
incorporates  a  broader  set  of  equity  financing  (18  percent  of  the  sample).  Results  are  similar  using  this  alternative 
financing  measure  (not  tabulated). 
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test. 

There  are  a  maximum  of  4,985  firm-quarter  observations  from  1994-2005;  some  variables  have  fewer  observations  due  to  missing  data.  The  untransformed  variables  are  presented 
for  both  HORIZON  and  ANALYSTS  for  ease  of  interpretation;  the  logged  values  are  included  in  the  following  regression  analyses. 

Variables  are  defined  in  Table  2. 
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In  general,  analysts  place  weight  on  (or  incorporate)  management  guidance  when  revising 
their  forecasts;  thus,  without  control  variables,  we  expect  to  be  positive  and  significant.  To 
determine  if  analysts  overweight  guidance  prior  to  equity  offerings,  we  must  first  control  for 
expected  variation  in  the  incorporation  of  guidance.  We  expect  analysts  to  weight  guidance  more 
heavily  if  it  is  useful  and  credible  (Hassell  et  al.  1988;  Baginski  and  Hassell  1990;  Williams  1996). 
We  interact  six  measures  of  usefulness  and  credibility  with  MGUIDE  to  control  for  the  expected 
incremental  incorporation  of  guidance.  The  main  effects  of  these  variables  are  expected  to  be 
largely  unassociated  with  analysts’  revisions;  thus,  we  include  them  for  completeness  but  do  not 
tabulate  them.  Because  a  firm  can  appear  multiple  times  in  the  regression,  we  cluster  the  standard 
errors  by  firm  following  Petersen  (2009)  in  this  and  all  subsequent  tests.  All  reported  p-values  are 
two-tailed. 

We  first  control  for  two  measures  of  credibility.  DOWN  is  an  indicator  variable  that  equals  1 
if  the  manager’s  suggested  revision  lowers  expectations,  which  is  generally  viewed  as  more 
credible  (Hassell  et  al.  1988;  Cotter  et  al.  2006).  REPUTATION  is  the  average  management 
guidance  accuracy  relative  to  analyst  forecasts  over  the  preceding  three  years  (Williams  1996; 
Hutton  and  Stocken  2009).  A  higher  value  indicates  that  prior  management  guidance  was  more 
credible — i.e.,  had  analysts  incorporated  the  guidance,  they  would  have  reported  a  lower  forecast 
error  than  if  they  had  retained  their  original  forecast.  We  expect  analysts  to  weight  the  guidance 
more  heavily  when  it  is  more  credible. 

The  next  three  controls  proxy  for  the  usefulness  of  the  guidance.  The  first,  AGREE,  is  an 
indicator  variable  that  equals  1  if  the  three-day  abnormal  return  around  the  management  guidance 
has  the  same  sign  as  the  revision  implied  by  the  management  guidance;  this  reflects  the  market’s 
assessment  of  the  usefulness  of  the  guidance  (e.g.,  Baginski  and  Hassell  1990).  The  second, 
HORIZON,  is  the  log  of  the  number  of  days  prior  to  the  earnings  announcement  that  the  guidance 
was  issued.  We  expect  guidance  issued  earlier  in  the  quarter  to  be  m.ore  useful  to  analysts.  Finally, 
we  include  a  measure  of  the  precision  of  the  guidance  (RANGE),  where  we  expect  more  precise 
guidance  to  be  more  useful  (Baginski  et  al.  1993).  We  expect  analysts  to  weight  useful  guidance 
more  heavily  when  revising  their  short-term  earnings  forecasts.  Because  the  measures  in  the 
analysis  are  based  on  the  consensus  analyst  forecast,  we  also  control  for  the  log  of  the  number  of 
analysts  (ANALYSTS)  following  the  firm.’^ 

We  predict  that,  prior  to  equity  offerings,  analysts  will  overweight  management  guidance 
when  revising  their  short-term  earnings  forecasts.  Therefore,  we  expect  0^2  to  be  positive  and 
significant. 

Table  4  presents  the  results  from  estimating  Regression  (1).  Referring  to  the  first  column  of 
results,  which  does  not  include  control  variables  and  thus  includes  all  202  firms  with  subsequent 
announcements  of  equity  offerings,  is  positive  and  significant  (0.83;  p  <  0.01),  consistent  with 
analysts  incorporating  a  significant  portion  of  management  guidance  into  their  short-term  fore¬ 
casts.  This  magnitude  is  significantly  higher  than  that  documented  in  the  early  work  by  Baginski 


Bhojraj  et  al.  (2008)  find  that  managers  who  are  frequent  forecasters  also  tend  to  be  more  accurate,  and  analysts  tend  to 
incorporate  more  of  their  guidance.  Because  our  measure  of  reputation  does  not  take  forecasting  frequency  into  account, 
as  a  sensitivity  check  we  create  the  variable  #FORECASTS  that  is  equal  to  the  number  of  forecasts  issued  by  the  firm 
over  the  three-year  period  that  REPUTATION  is  measured.  Consistent  with  Bhojraj  et  al.  (2008),  we  find  that  the 
interaction  term  on  MGUIDE  X  REPUTATION  X  ^FORECASTS  is  positive  and  significant  (a  =  0.0422;  p  <  0.01). 
Our  inferences  do  not  change;  the  coefficient  on  the  interaction  of  MGUIDE  and  SEO  is  0.2765  (p  <  0.01),  which  is 
similar  to  the  main  results,  presented  in  Table  4. 

We  do  not  have  a  predicted  sign  on  the  interaction  term  of  ANALYSTS  and  MGUIDE.  On  one  hand,  because  more 
analysts  would  need  to  revise  their  forecasts  to  move  the  consensus  when  there  are  more  analysts  following  the  firm,  it 
is  possible  that  the  guidance  is  incorporated  less  when  there  are  more  analysts.  On  the  other  hand,  analyst  following  is 
not  exogenous,  and  it  is  likely  that  more  analysts  follow  larger  and  relatively  more  important  firms,  which  might  suggest 
a  greater  degree  of  incorporation  of  management  guidance. 
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TABLE  4 

Revision  Magnitude  on  Equity  Offerings 

Dependent  Variable 

Predicted  Coefficient 

Independent  Variables  Sign  (t-statistic) 

=  REVISE 

Coefficient 

(t-statistic) 

Intercept 

-0.0003*** 

(-6.77) 

-0.0012* 

(-1.80) 

MGUIDE 

-f- 

0.8303*** 

(36.43) 

-0.1347 

(-0.31) 

MGUIDE  X  SEO 

-f 

0.1053*** 

(3.14) 

0.2718*** 

(3.68) 

MGUIDE  X  DOWN 

-f 

0.1473** 

(1.98) 

MGUIDE  X  REPUTATION 

-f 

0.1442*** 

(2.92) 

MGUIDE  X  AGREE 

+ 

0.1585*** 

(2.96) 

MGUIDE  X  HORIZON 

-h 

0.1005 

(0.99) 

MGUIDE  X  RANGE 

— 

-7.7507** 

(-2.53) 

MGUIDE  X  ANALYSTS 

? 

0.1151*** 

(2.88) 

Main  Effects 

Included 

Included 

Number  of  SEO  Observations 

202 

87 

Total  Number  of  Observations 

4,985 

3,025 

Adjusted 

0.8204 

0.8803 

***  Indicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 

The  regressions  use  all  available  observations.  See  Table  1  for  sample  selection  and  Table  2  for  data  definitions.  All 
t-statistics  are  based  on  robust  standard  errors  clustered  at  the  firm  level. 


and  Hassell  (1990).  Recently,  however,  management  guidance  has  become  more  precise,  more 
accurate,  and  is  often  supplemented  with  additional  information,  consistent  with  guidance  issued 
during  our  time  period  being  more  credible  and  useful  (Baginski  et  al.  2008).  Consistent  with  our 
expectations,  the  coefficient  on  the  interaction  between  management  guidance  and  equity  offer¬ 
ings,  £^2,  is  positive  and  significant  (0.11;  p  <  0.01),  indicating  that  analysts  place  a  greater  weight 
on  management  guidance  when  the  firm  subsequently  announces  an  equity  offering. 

In  the  next  column  we  interact  each  of  the  six  control  variables  with  management  guidance, 
as  management  guidance  may  be  more  useful  or  credible  prior  to  equity  offerings.  The  main 
effect  of  MGUIDE  becomes  insignificant,  consistent  with  the  bulk  of  the  analyst  revision  being 
explained  by  the  credibility  and  usefulness  of  the  management  guidance.  In  initial  support  of  our 


The  sample  is  significantly  reduced  at  43  (61)  percent  of  the  original  sample  for  SEO  (all)  firm-year  observations.  This 
reduction  is  almost  entirely  attributable  to  the  requirement  of  REPUTATION,  where  firms  offering  equity  tend  to  have 
issued  less  guidance  in  the  past  (see  Table  3)  and  thus  are  missing  this  variable  more  often.  Results  are  similar  if  we 
exclude  this  control  variable  from  the  analysis  (not  tabulated). 
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hypothesis,  a2  remains  positive  and  significant  (0.27;  p  <  0.01).  In  economic  terms,  analysts 
incorporate  27  percent  more  of  the  management  guidance  into  their  short-term  earnings  forecasts 
prior  to  equity  offerings.  This  overweighting  is  economically  significant,  as,  on  average,  analysts 
tend  to  incorporate  approximately  80  percent  of  management  guidance. 

The  coefficients  on  the  control  variables  are  consistent  with  our  expectations.  Analysts  incor¬ 
porate  more  of  the  management  guidance  when  it  is  downward  or  issued  by  a  more  reputable 
manager  (when  it  is  more  credible),  as  well  as  when  the  market  reaction  is  in  the  same  direction 
as  the  guidance  and  when  the  precision  of  the  guidance  is  higher  (when  it  is  more  useful).  Finally, 
analysts  tend  to  weight  the  guidance  more  heavily  when  more  analysts  follow  the  firm.  In  sum, 
while  analysts  weight  guidance  more  heavily  when  it  is  credible  and  useful,  they  also  place  greater 
weight  on  guidance  received  prior  to  the  announcement  of  an  equity  offering.  Thus,  results  are 
consistent  with  our  hypothesis  that  analysts  with  incentives  to  please  management  overweight 
management  guidance  when  revising  their  short-term  earnings  forecasts. 

Do  Analysts  Sacrifice  Accuracy  to  Overweight  Guidance  Prior  to  Equity  Offerings? 

It  is  possible  that  management  guidance  prior  to  equity  offerings  is  more  informative  in  ways 
not  captured  by  our  measures  of  usefulness  and  credibility,  leading  to  the  appearance  of  an 
overweighting  of  guidance.  To  help  distinguish  between  the  expected  incorporation  of  credible  and 
useful  guidance  (Hassell  et  al.  1988;  Baginski  and  Hassell  1990;  Williams  1996;  Cotter  et  al. 
2006)  and  the  overweighting  of  guidance  associated  with  attempting  to  please  managers  prior  to 
equity  offerings  (our  hypothesis),  we  examine  ex  post  analyst  forecast  errors. 

If  analysts  weight  guidance  more  heavily  because  it  is  more  informative  before  equity  offer¬ 
ings,  or  because  analysts  face  more  uncertainty  regarding  the  growth  or  profitability  of  firms  about 
to  issue  equity,  then  the  greater  reliance  on  management  guidance  should  reduce  analyst  forecast 
errors.  Alternatively,  as  suggested  by  Lim  (2001)  and  Hong  and  Kubik  (2003),  when  competing 
incentives  are  introduced,  analysts  are  willing  to  sacrifice  accuracy.'^  We  predict  that  analysts 
overweight  management  guidance  in  order  to  increase  their  chances  of  gaining  the  underwriting 
business  and,  thus,  this  increased  reliance  on  management  guidance  should  reduce  forecast  accu¬ 
racy. 

In  the  following  set  of  tests,  we  explicitly  link  the  overweighting  of  guidance  prior  to  equity 
offerings  {SEO  REVISION  below)  with  larger  analyst  forecast  errors  in  a  multivariate  estimation 
(Regression  (2)  below).  First,  we  generate  an  estimate  of  the  incremental  weight  placed  on  guid¬ 
ance  prior  to  equity  offerings  from  Regression  (1)  by  decomposing  the  total  analyst  revision  into 
three  components: 

(1)  that  associated  with  the  upcoming  equity  offering: 

SEO  REVISION  =  a2MGUIDE  X  SEO  +  agSEO; 

(2)  that  associated  with  other  known  factors  affecting  analyst  responsiveness: 

STANDARD  REVISION  =  +  a^MGUIDE+  a^MGUIDE  X  DOWN  +  a^MGUIDE 

X  REPUTATION  +  a^MGUIDE  X  AGREE  +  a^MGUIDE 
X  HORIZON  F  a-jMGUIDE  X  RANGE  +  a^MGUIDE 


Ke  and  Yu  (2006)  examine  the  intertemporal  characteristics  of  analysts’  earnings  forecasts,  documenting  that  analysts 
switching  from  initially  optimistic  forecasts  to  ex  post  pessimistic  forecasts  just  before  the  earnings  announcement  are 
more  likely  to  receive  inside  information  from  managers  (prior  to  Regulation  Fair  Disclosure)  and  more  likely  to  be 
retained  by  their  brokerage  houses.  Similarly,  in  an  experimental  setting,  Libby  et  al.  (2008)  find  that  analysts  are  more 
likely  to  switch  from  initially  optimistic  forecasts  to  ex  post  pessimistic  forecasts  to  gain  favored  conference  call 
participation  and  information  access.  These  analysts  ex  post  have  lower  analyst  forecast  errors  (as  they  gain  inside 
information)  and,  thus,  this  setting  differs  from  ours  in  that  we  examine  incentives  to  overweight  public  guidance 
resulting  in  higher  analyst  forecast  errors,  ex  post. 
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X  ANALYSTS  +  a^QDOWN  +  a^iREPUTATION  +  ai2AGREE 
+  a^^tiORIZONy  a^^RANGE  +  a^^NALYSTS',  and 

(3)  the  residual  from  the  estimation: 

RESIDUAL  REVISION  =  REVISION  -  SEO  REVISION-  STANDARD  REVISION. 

Second,  we  regress  the  absolute  value  of  the  ex  post  analyst  forecast  error  on  the  absolute  value  of 
the  decomposed  revision  components  to  test  if  analyst  forecast  errors  are  larger  as  a  result  of  the 
increased  reliance  on  management  guidance  prior  to  equity  offerings: 

\AEE\  =  ao  +  ai\SEO  REVISION]  +  a2\STANDARD  REVISION]  +  a^^ESIDUAL  REVISION] 

+  a4]NEPS]  +  a^EOSS  +  a(,SPECIAL  ITEMS  +  a-jANALYSTS  +  a^HORIZON 

2005 

+  agRANGE  +  a^oANALYST  DISPERSION  +  anEIRM  SIZE  +  of,  2  YEAR;  +  s 

/=1994 

(2) 

where  \AEE\  is  the  absolute  value  of  the  revised  consensus  analyst  forecast  error  (the  difference 
between  actual  earnings  and  the  revised  analyst  consensus  forecast),  scaled  by  the  stock  price  at 
the  beginning  of  the  quarter/^ 

We  consider  eight  variables  to  control  for  the  innate  difficulty  in  estimating  earnings,  which 
might  vary  systematically  for  firms  issuing  equity.  We  also  include  yearly  indicator  variables, 
since  analyst  forecast  errors  have  changed  over  time  (Brown  2001).  To  control  for  unexpected 
earnings  shocks,  we  include  lAEP^I,  the  absolute  change  in  earnings  per  share  from  First  Call, 
scaled  by  the  stock  price  at  the  beginning  of  the  quarter.  Abarbanell  and  Lehavy  (2002)  document 
that  forecast  errors  are  larger  for  firms  with  losses  and  special  items.  Thus,  we  include  LOSS,  an 
indicator  variable  that  equals  1  if  earnings  per  share  from  First  Call  is  negative,  and  0  otherwise, 
and  SPECIAL  ITEMS  from  Compustat  (#32),  scaled  by  the  market  value  of  common  stock  at  the 
beginning  of  the  quarter.  We  control  for  ANALYSTS,  the  log  of  the  number  of  analyst  forecasts 
used  to  form  the  revised  consensus  analyst  forecast,  as  the  number  of  analysts  is  expected  to 
reduce  the  ex  post  analyst  forecast  error  (Brown  1997).  The  next  control  is  HORIZON,  the  log  of 
the  number  of  days  between  the  issuance  of  management  guidance  and  the  earnings  announce¬ 
ment.  (Recall  that  the  consensus  is  measured  within  15  days  of  this  event;  thus,  HORIZON  based 
on  management  guidance  is  highly  correlated  with  the  age  of  the  consensus  forecast.)  The  earlier 
in  the  quarter  the  analyst  forecast  is  issued,  the  greater  is  the  expected  error  (as  there  is  more  time 
for  innovations  in  earnings  to  occur).  We  include  two  proxies  for  uncertainty.  First,  RANGE  is 
based  on  the  precision  of  the  management  guidance;  the  greater  the  RANGE,  the  greater  the 
inferred  uncertainty  on  the  part  of  management,  and  the  larger  the  expected  forecast  error.  Second, 
ANALYST  DISPERSION  represents  the  uncertainty  among  analysts  about  forecasted  earnings.  This 
variable  is  the  standard  deviation  of  the  individual  analyst  forecasts  used  to  generate  the  revised 
consensus  analyst  forecast,  scaled  by  the  stock  price  at  the  beginning  of  the  quarter.  Finally, 


By  taking  the  absolute  value  of  the  revision,  we  focus  on  the  degree  of  incorporation,  the  variable  of  interest,  instead  of 
the  direction  of  incorporation.  Consider  two  firms.  The  manager  of  Firm  A  issues  guidance  that  is  ten  cents  below  the 
analyst  consensus,  and  the  analysts  revise  the  consensus  forecast  downward  by  0.08.  The  manager  of  Firm  B  issues 
guidance  that  is  ten  cents  above  the  analyst  consensus,  and  the  analysts  revise  the  consensus  upward  by  0.08.  REVISE 
is  -0.08  and  0.08  for  Firm  A  and  Firm  B,  respectively;  however,  in  both  instances  the  analysts  incorporate  80  percent 
of  the  guidance. 
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because  larger  firms  tend  to  have  more  disclosure  in  general,  we  control  for  FIRM  SIZE,  the  log 
of  market  value  of  common  stock  at  the  beginning  of  the  quarter. 

A  positive  coefficient  on  SEO  REVISION  would  indicate  that  the  increased  incorporation  of 
guidance  by  analysts  prior  to  equity  offerings  results  in  larger  ex  post  forecast  errors.  We  expect 
the  coefficient  on  STANDARD  REVISION  to  be  negative,  as,  in  general,  management  guidance  has 
been  shown  to  lower  analyst  forecast  errors  (e.g.,  Hassell  and  Jennings  1986).  We  do  not  predict 
the  sign  of  the  coefficient  on  the  residual  analyst  revision. 

Table  5  presents  the  results  from  estimating  Regression  (2).  Consistent  with  the  increased 
weighting  of  management  guidance  prior  to  equity  offerings  representing  a  tool  to  please  man¬ 
agement,  it  is  associated  with  higher  analyst  forecast  errors,  as  the  coefficient  on  SEO  REVISION 
is  positive  and  significant  with  and  without  control  variables.  In  terms  of  economic  magnitude,  the 
overweighting  of  management  guidance  prior  to  equity  offerings  increases  ex  post  forecast  errors 
by  40  percent,  on  average.'^  STANDARD  REVISION,  the  revision  associated  with  known  factors, 
is  positive  and  weakly  significant  when  omitting  control  variables,  but  negative  and  significant 
when  control  variables  are  included.'*  The  latter  result  is  consistent  with  management  guidance 
generally  improving  analyst  forecast  accuracy.  Finally,  RESIDUAE  REVISION  is  positively  asso¬ 
ciated  with  analyst  forecast  errors  across  both  specifications.  This  finding  suggests  that  other 
incentives,  in  addition  to  underwriting  business,  might  lead  analysts  to  vary  their  incorporation  of 
management  guidance  at  the  cost  of  ex  post  accuracy;  we  examine  additional  incentives  in  Section 
V. 

The  coefficients  on  the  control  variables  are  largely  consistent  with  expectations.  Large 
shocks  to  earnings  decrease  forecast  accuracy.  Losses  and  special  items  are  not  significant,  al¬ 
though  special  items  are  generally  excluded  from  core  earnings  determined  by  forecast  data 
providers  (Philbrick  and  Ricks  1991).  The  more  analysts  following  the  firm,  the  greater  the 
forecast  accuracy,  consistent  with  Brown  (1997).  When  the  management  guidance  is  older  or  less 
precise,  the  ex  post  analyst  forecast  errors  are  larger,  and  the  errors  are  also  larger  when  there  is 
less  of  a  consensus  among  analysts  (i.e.,  a  higher  standard  deviation  of  the  individual  forecasts). 
Finally,  firm  size  is  not  significant,  which  is  reasonable  given  that  the  sample  is  restricted  to  firms 
that  provide  guidance,  which  tend  to  be  larger  firms. 

Overall,  we  find  that  analysts’  abnormally  high  weighting  of  management  guidance  preceding 
equity  offerings  is  associated  with  an  increase  in  analyst  forecast  errors.  Analysts  appear  to  be 
sacrificing  their  forecast  accuracy  by  overweighting  management  guidance  prior  to  equity  offer¬ 
ings. 

Cross-Sectional  Tests  of  Analysts’  Overweighting  of  Management  Guidance 

Thus  far,  we  use  consensus  forecasts  issued  by  all  analysts.  Not  all  analysts,  however,  are 
vying  for  investment  bank  business.  In  this  section,  we  examine  whether  this  behavior  differs  by 
analyst  type.  Specifically,  we  expect  dependent  analysts  who  can  acquire  the  investment  banking 
business  to  overweight  management  guidance  when  revising  their  short-term  forecasts  prior  to  an 
equity  offering.  We  do  not  expect,  however,  to  observe  independent  analysts  overweighting  guid- 
anee  in  a  similar  fashion.  We  also  expect  the  overweighting  of  guidance  to  be  associated  with  the 


To  determine  the  economic  magnitude  of  the  increased  forecast  error  associated  with  the  overweighting  of  guidance 
prior  to  equity  offerings,  we  multiply  the  average  SEO  REVISION  for  SEO  firms  (0.0004;  not  tabulated)  with  the 
coefficient  on  SEO  REVISION  (1.2065),  which  indicates  an  increase  in  the  analyst  forecast  error  of  0.00048  on  average, 
or  40  percent  of  the  mean  analyst  forecast  error  (0.0012;  see  Table  3). 

The  coefficient  on  SEO  REVISION  (1.2065)  appears  to  be  much  larger  than  the  coefficient  on  STANDARD  REVISION 
(0.0368).  In  terms  of  economic  significance,  however,  the  importance  of  STANDARD  REVISION  is  about  double  that  of 
SEO  REVISION,  where  we  measure  economic  significance  as  the  change  in  \AFE\  when  the  REVISION  variable 
increases  by  one  standard  deviation. 
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TABLE  5 

Ex  Post  Analyst  Forecast  Accuracy 


Dependent  Variable  =  [AFE\ 


Independent  Variables 

Predicted 

Sign 

Coefficient 

(t-statistic) 

Coefficient 

(t-statistic) 

Intercept 

0.0006*** 

(10.52) 

-0.0038*** 

(-4.89) 

\SEO  REVISION] 

H- 

1.3837** 

(2.22) 

1.2065** 

(2.26) 

\STANDARD  REVISION 

— 

0.0198 

(1.57) 

-0.0368** 

(-2.53) 

\RESIDUAL  REVISION 

? 

0.6219*** 

(8.86) 

0.4278*** 

(5.60) 

\AEPS\ 

-b 

0.0183*** 

(4.56) 

LOSS 

-b 

0.0001 

(0.96) 

SPECIAL  ITEMS 

+ 

-0.0003 

(-0.31) 

ANALYSTS 

— 

-0.0002*** 

(-3.05) 

HORIZON 

-b 

0.0012*** 

(6.85) 

RANGE 

-b 

0.1018** 

(2.54) 

ANALYST  DISPERSION 

+ 

0.2489*** 

(4.10) 

FIRM  SIZE 

— 

-0.0000 

(-1.26) 

Year  Indicators 

Not  Included 

Included 

Number  of  SEO  Observations 

87 

60 

Total  Number  of  Observations 

2,997 

2,342 

Adjusted 

0.1500 

0.2762 

Indicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 

\AFE\  is  the  absolute  value  of  the  revised  analyst  forecast  error.  We  decompose  the  total  analyst  revision,  from  the 
estimation  of  Regression  (1),  into  three  components;  that  associated  with  the  upcoming  equity  offering 
[SEO  REVISION  =  a2SEO+ aciMGUIDEX  SEO),  that  associated  with  other  known  factors  affecting  analyst  revisions 
[STANDARD  REVISION  =  Oq  +  a^MGUIDE+  a^MGUIDE  X  DOWN  +  d^MGUIDE  X  REPUTATION  +  a^MGUIDE 
X  AGREE  +  a(,MGUIDE  X  HORIZON  +  d^MGUIDE  X  RANGE  +  a^MGUIDE  X  ANALYSTS  -t  a^oDOWN 
+  auREPUTATION+auAGREE+a^2HORIZON+ai^RANGE+ai^NALYSTS),  and  the  residual  from  the  estimation 
[RESIDUAL  REVISION  =  REVISE  -  SEO  REVISION  -  STANDARD  REVISION).  All  t-statistics  are  based  on  robust 
standard  errors  clustered  at  the  firm  level. 

Additional  variables  are  defined  in  Table  2. 


ex  post  selection  of  the  underwriter  and,  thus,  we  expect  those  analysts  whose  employers  are  ex 
post  selected  to  underwrite  the  deal  to  have  incorporated  the  greatest  proportion  of  guidance  into 
their  short-term  earnings  forecasts,  after  controlling  for  the  credibility  and  usefulness  of  the  guid¬ 
ance.  For  these  analyses,  we  limit  the  sample  to  guidance  issued  in  the  year  prior  to  the  announce- 
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ment  of  an  equity  offering  (SEO  observations).  As  a  result,  these  cross-sectional  tests  also  help  to 
allay  concerns  that  some  underlying  difference  between  SEO  and  non-SEO  observations  drives  the 
higher  weighting  of  guidance  among  SEO  observations. 

Eor  each  analyst  following  the  202  firms  that  issue  management  guidance  and  announce  an 
equity  offering  in  the  subsequent  year,  we  identify  (1)  whether  they  are  employed  by  an  invest¬ 
ment  bank,  and  (2)  whether  their  employer  is  on  the  subsequently  formed  team  of  underwriters;  in 
these  instances,  we  set  DEPENDENT  and  AEEILIATED,  respectively,  equal  to  1.^^  Approximately 
90  (25)  percent  of  the  analysts  in  the  sample  are  dependent  (affiliated).  To  be  included  in  the 
sample,  the  firm  must  be  followed  by  both  dependent  and  independent  (affiliated  and  unaffiliated) 
analysts.  This  design  choice  rules  out  the  possibility  that  different  types  of  analysts  tend  to  follow 
different  types  of  firms  (and  therefore  different-quality  management  guidance).  This  requirement 

results  in  35  (172)  usable  management  guidance  observations  for  the  dependent  (affiliated) 
20 

analyst-type  tests. 

Eor  each  instance  of  management  guidance,  we  have  two  observations,  one  for  each  analyst 
group  (i.e.,  dependent  and  independent),  resulting  in  a  total  of  70  (344)  observations  for  the 
dependent  (affiliated)  test.  For  each  of  these  subgroups,  we  generate  MGUIDE  and  REVISE  using 
the  corresponding  median  consensus  forecast  that  includes  only  the  relevant  analysts  (dependent/ 
independent  and  affiliated/unaffiliated).  The  reason  for  this  approach  is  that  when  we  examine 
affiliated/unaffiliated  or  dependent/independent  analysts,  each  of  these  four  sub-groups  will  have  a 
potentially  different  pre-existing  and  revised  consensus,  as  each  analyst  group’s  consensus  should 
be  made  up  of  only  that  particular  analyst  type  (e.g.,  there  will  be  an  affiliated  analyst  consensus, 
which  might  differ  from  the  unaffiliated  analyst  consensus).  Partitioning  in  this  manner  allows  us 
to  determine  whether  the  main  movement  in  the  consensus  (documented  in  Table  4)  is  by  self- 
interested  (dependent  or  affiliated)  analysts,  supporting  our  hypothesis,  or  by  all  analysts,  support¬ 
ing  the  existence  of  a  correlated  omitted  variable  or  self-selection  concerns. 

We  replicate  the  estimation  of  Regression  (1),  but  instead  of  investigating  variation  based  on 
SEO,  we  hypothesize  that  there  will  be  variation  based  on  analyst  type  (recall  that  the  test  is 
limited  to  the  SEO  firms).  Table  6  presents  the  results  from  this  estimation,  with  dependent/ 
independent  analysts  in  the  first  column  and  affiliated/unaffiliated  in  the  second  column. 

Turning  first  to  the  dependent/independent  partition,  we  find  that,  conditional  on  an  upcoming 
announcement  of  an  equity  offering,  those  analysts  who  can  gain  the  underwriting  business  incor¬ 
porate  significantly  more  of  the  management  guidance  into  their  short-term  earnings  forecasts  than 
do  independent  research  analysts.  Specifically,  the  coefficient  on  MGUIDE  X  DEPENDENT  is 
positive  and  significant,  while  the  coefficient  on  MGUIDE  is  not  significantly  different  from  zero. 
Unlike  dependent  analysts,  who  have  an  incentive  to  gain  the  subsequent  underwriting  business, 
independent  analysts  do  not  appear  to  incorporate  any  management  guidance  beyond  that  dictated 
by  the  credibility  and  usefulness  of  the  guidance.  Thus,  the  overweighting  of  management 
guidance  preceding  equity  offerings  that  we  document  in  Table  4  is,  on  average,  present  only 
among  analysts  with  greater  incentives  to  please  management — those  who  can  actually  obtain  the 


To  determine  if  the  analysts’  employers  could  feasibly  gain  the  underwriting  business,  we  use  the  data  from  Gu  and  Xue 
(2008),  who  separate  full  service  investment  banks  and  brokerage  firms  from  independent  research  firms;  we  thank 
Zhaoyang  Gu  for  providing  us  with  these  data.  Ideally,  we  would  like  to  focus  solely  on  those  analysts  employed  by 
banks  that  offer  underwriting  services  (as  some  brokerage  firms  do  not).  We  are  limited,  however,  to  this  broader 
partition,  which  biases  against  finding  results  consistent  with  our  hypothesis.  We  use  SDC  Platinum  to  identify  the 
subsequent  underwriting  team.  Analysts  are  affiliated  if  they  are  employed  by  any  investment  bank  involved  in  the  deal, 
rather  than  solely  by  the  lead  underwriter. 

To  maximize  the  number  of  usable  observations,  we  set  REPUTATION  equal  to  0  if  it  is  missing. 

If  we  exclude  the  control  variables,  the  coefficient  on  the  main  effect  of  MGUIDE  is  positive  and  significant,  consistent 
with  independent  research  analysts  incorporating  management  guidance  into  their  earnings  forecasts,  on  average. 
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TABLE  6 

Revision  Magnitude  by  Analyst  Type 

Dependent  Variable  =  REVISE 


Analyst  Type 


Dependent  Analysts 

Affiliated  Analysts 

Predicted 

Coefficient 

Coefficient 

Independent  Variables 

Sign 

(t-statistic) 

(t-statistic) 

Intercept 

-0.0006 

-0.0029* 

(-0.44) 

(-1.85) 

MGUIDE 

-b 

1.4482 

0.7540 

(1.32) 

(0.61) 

MGUIDE  X  ANALYST  TYPE 

+ 

0.2387* 

0.3355* 

(1.70) 

(1.74) 

MGUIDE  X  DOWN 

-b 

-0.3762* 

-0.2403 

(-1.88) 

(-1.28) 

MGUIDE  X  REPUTATION 

-b 

-0.0846 

-0.0446 

(-0.81) 

(-0.39) 

MGUIDE  X  AGREE 

+ 

0.5982*** 

-0.0235 

(2.90) 

(-0.11) 

MGUIDE  X  HORIZON 

-b 

-0.2661 

-0.1785 

(-1.23) 

(-0.58) 

MGUIDE  X  RANGE 

— 

24.2946 

-5.5979 

(1.10) 

(-0.32) 

MGUIDE  X  ANALYSTS 

7 

-0.1454 

0.4167*** 

(-1.17) 

(3.60) 

Main  Effects 

Included 

Included 

Analyst  Type  Equal  to  1 

35 

172 

Total  Number  of  Observations 

70 

344 

Adjusted 

0.7945 

0.7487 

***  Indicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 

Only  firms  issuing  equity  in  the  next  year  are  included  in  these  estimations.  We  define  DEPENDENT  {AFFILIATED)  as  an 
indicator  variable  that  is  set  equal  to  1  if  the  analyst  is  employed  by  a  full-service  investment  bank  or  a  brokerage  house 
(the  subsequent  underwriter),  and  0  otherwise.  We  recalculate  MOUIDE  and  REVISE,  defined  in  Table  2,  for  both  the 
dependent/affiliated  and  independent/unaffiliated  analysts’  initial  forecasts  and  subsequent  revisions  (based  on  the  median 
consensus  analyst  forecast  for  that  specific  type  of  analyst;  we  examine  only  firm-quarters  that  have  both  types  of  analyst 
forecasts).  All  t-statistics  are  based  on  robust  standard  errors  clustered  at  the  firm  level. 

Additional  variables  are  defined  in  Table  2. 


underwriting  business  for  their  investment  bank.  This  finding  is  consistent  with  analysts  trying  to 
please  management  by  overweighting  their  guidance. 

Turning  next  to  the  affiliated/unaffiliated  partition,  we  again  find  that  the  coefficient  on 
MGUIDE  is  insignificant,  while  the  coefficient  on  MGUIDE  X  AEEILIATED  is  positive  and 
significant.  These  results  suggest  that  the  analysts  who  incorporate  the  greatest  amount  of  the 
management  guidance  are  affiliated  analysts — those  who  work  for  the  investment  banks  housing 
the  subsequent  underwriters  of  the  equity  offering.  These  results  are  consistent  with  analysts 
benefiting  from  overweighting  management  guidance.  Specifically,  although  the  overweighting  of 
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guidance  is  associated  with  lower  analyst  forecast  accuracy  (Table  5),  it  is  also  associated  with 

22 

future  underwriting  business. 

Overall,  the  results  are  consistent  with  our  central  hypothesis:  analysts  appear  to  overweight 
management  guidance  when  revising  their  short-term  earnings  forecasts  in  order  to  please  man¬ 
agement.  This  behavior  precedes  the  announcements  of  equity  offerings,  leads  to  larger  ex  post 
analyst  forecast  errors,  is  present  only  among  those  analysts  who  can  gain  the  investment  banking 
business,  and  is  associated  with  the  ex  post  receipt  of  the  underwriting  services.  The  following 
section  presents  additional  analyses  to  corroborate  our  conclusions. 

V.  ADDITIONAL  ANALYSES 
Self-Selection  of  an  Equity  Offering 

Although  we  attempt  to  control  for  the  underlying  economics  of  each  firm  in  each  of  the  tests, 
and  the  cross-sectional  tests  help  alleviate  the  endogeneity  concerns,  in  this  section  we  explore  the 
issue  of  endogeneity  further.  First,  we  employ  the  “omitted  variables”  variant  of  the  Hausman 
(1978,  1983)  test  to  determine  if  the  main  treatment  variable,  MGUIDE  X  SEO,  exhibits  evidence 
of  endogeneity  after  we  control  for  the  variables  presented  in  Table  4.  The  test  is  unable  to  reject 
the  null  condition  of  no  endogeneity  (p  =  0.31;  not  tabulated). 

Second,  we  control  for  self-selection  bias  using  a  two-stage  approach  in  which  the  first  stage 
is  a  probit  regression  of  SEO  on  the  hypothesized  determinants  of  equity  offerings  (presented  in 
the  Appendix).  This  first-stage  regression  uses  the  following  expected  determinants  (with  annual 
data):  profitability,  R&D  and  capital  expenditure  investments,  the  book-to-market  ratio,  leverage, 
firm  size,  cash  reserves,  tangible  assets,  annual  stock  returns,  financial  stress,  and  equity  offering 
history  (Rajan  and  Zingales  1995;  Jung  et  al.  1996;  Loughran  and  Ritter  1997;  Frank  and  Goyal 
2003;  Dittmar  and  Thakor  2007;  Kadan  et  al.  2009).  From  this  first-stage  regression,  which 
identifies  the  likelihood  of  a  firm  choosing  to  issue  equity,  we  calculate  the  inverse  Mills  ratio  (see 
Heckman  1979;  Leuz  and  Verrecchia  2000)  and  include  it  in  Regression  (1).  Including  this  ratio 
helps  control  for  the  likelihood  of  self-selecting  into  the  equity  issuer  group.  The  results  are 
similar  to  those  in  Table  4  when  we  include  the  inverse  Mills  ratio  (not  tabulated). 

Finally,  following  Frank  (2000)  and  Larcker  and  Rusticus  (2010),  we  describe  the  character¬ 
istics  an  omitted  variable  must  have  in  order  to  overturn  the  OLS  results.  The  unidentified  omitted 
variable  must  be  correlated  with  both  REVISE  (the  dependent  variable)  and  MGUIDE  X  SEO  (the 
treatment  variable)  in  order  to  cause  spurious  positive  correlations  between  REVISE  and  MGUIDE 
X  SEO.  Specifically,  the  product  of  two  partial  correlations,  the  correlation  between  the  omitted 
variable  and  REVISE,  and  the  correlation  between  the  omitted  variable  and  MGUIDE  X  SEO, 
needs  to  be  at  least  0.0415  to  overturn  the  positive  association  between  REVISE  and  MGUIDE  X 
SEO.  To  put  this  number  in  perspective,  the  highest  product  of  the  partial  correlations  with 
REVISE  and  MGUIDE  X  SEO  based  on  the  identified  control  variables  is  0.0299  (for  MGUIDE  X 
REPUTATION).  Thus,  we  would  need  an  omitted  variable  with  an  impact  39  percent  greater  than 
that  of  any  of  the  control  variables  to  change  the  results,  suggesting  that  it  is  unlikely  that  a 
confounding  omitted  variable  would  overturn  the  positive  association  between  REVISE  and 
MGUIDE  X  SEO. 


It  is  possible  that  affiliated  analysts  are  simply  more  diligent  at  following  the  firm  and  incorporating  new  information 
about  the  firm  (along  the  lines  of  Jacob  et  al.  2008).  This  diligent  incorporation,  however,  should  not  be  associated  with 
larger  ex  post  analyst  forecast  errors.  Alternatively,  some  analysts  might  always  wish  to  please  management.  We  also, 
therefore,  examine  the  incorporation  of  guidance  by  affiliated  versus  unaffiliated  analysts  in  all  years  that  do  not  precede 
an  equity  offering.  Unlike  in  the  year  prior  to  the  equity  offering,  we  do  not  find  a  difference  in  the  incorporation  of 
guidance  between  these  two  types  of  analysts  when  guidance  is  issued  at  other  times  (not  tabulated),  suggesting  that 
analysts  appear  to  change  their  behavior  when  equity  offerings  are  imminent. 
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To  summarize,  we  are  unable  to  reject  the  null  of  no  endogeneity  based  on  the  omitted 
variables  variant  of  the  Hausman  test,  results  are  similar  after  including  the  inverse  Mills  ratio,  and 
the  likelihood  that  our  OLS  results  are  driven  by  an  omitted  variable  is  low.  Regardless,  we  cannot 
completely  rule  out  the  possibility  that  the  endogenous  nature  of  equity  offerings  drives  our 
results. 

Alternative  Mechanisms  to  Please  Management 

We  document  a  tool  that  analysts  appear  to  use  to  please  management:  the  overweighting  of 
management  guidance  by  analysts  revising  their  short-term  earnings  forecasts.  Prior  research  has 
found  evidence  that  analysts  issue  overly  optimistic  stock  recommendations  (e.g.,  Dugar  and 
Nathan  1995;  Lin  and  McNichols  1998;  Michaely  and  Womack  1999)  for  firms  they  wish  to 
please.^^  It  is  likely  that  analysts  use  a  portfolio  of  tools  to  curry  favor  with  management.  To 
investigate  the  interplay  of  these  tools,  we  identify  the  analysts  in  the  sample  who  issue  a  stock 
recommendation,  and  partition  those  issuing  a  strong  buy  recommendation  from  those  issuing  any 
other  recommendation  (buy,  hold,  or  sell).  To  be  included  in  the  test,  each  guidance  observation 
must  have  at  least  one  strong  buy  recommendation  and  one  other  recommendation.  As  with  the 
analyst-type  tests,  we  re-estimate  REVISE  and  MGUIDE  for  both  groups  of  analysts.  We  estimate 
Regression  (1)  separately  for  the  two  groups  (strong  buy  and  other),  presented  in  Table  7.  We  find 
that  only  analysts  who  issue  a  strong  buy  recommendation  overweight  management  guidance  prior 
to  an  equity  offering  (0:2  — 0.18,  p  =  0.08).  In  contrast,  among  analysts  not  issuing  a  strong  buy 
recommendation,  there  is  no  evidence  of  analysts  overweighting  guidance  prior  to  an  equity 
offering  for  the  same  management  guidance  observations  («2  =  -0.04,  p  >  0.50).  These  findings 
suggest  that  analysts  please  managers  by  jointly  using  both  optimistic  stock  recommendations  and 
the  overweighting  of  guidance. These  findings  also  further  establish  that  analysts  with  different 
incentives  respond  differently  to  the  same  guidance,  which  is  more  consistent  with  the  overweight¬ 
ing  of  guidance  representing  a  mechanism  used  to  curry  favor  with  management  and  less  consis¬ 
tent  with  alternative  explanations  such  as  the  endogeneity  of  equity  offerings. 

Knowledge  of  the  Equity  Offering 

An  assumption  underlying  our  study  is  that  analysts  know  which  firms  will  be  issuing  equity 
in  the  upcoming  year  and  thus  actively  vie  for  the  underwriting  business.  We  conduct  two  tests  to 
determine  the  robustness  of  the  results  to  this  assumption.  First,  we  replace  actual  equity  an¬ 
nouncements  from  SDC  with  an  expected  probability  that  the  firm  will  issue  equity  in  the  year  to 
come  (see  the  Appendix).  We  find  that  analysts  incorporate  more  guidance  for  firms  that  are 
expected  to  have  an  equity  offering,  suggesting  that  analysts  make  overtures  to  all  management 
teams  that  could  be  undertaking  an  equity  offering  in  the  near  future  (not  tabulated). 

Second,  we  examine  the  sensitivity  of  the  results  to  the  length  of  the  horizon  period.  Specifi¬ 
cally,  we  examine  announcements  that  occur  in  the  six-month  period  following  the  management 
guidance  relative  to  those  that  occur  in  the  7-12-month  period  following  the  management  guid¬ 
ance.  If  the  incentive  to  gain  underwriting  business  is  driving  the  results,  then  we  expect  stronger 
results  when  the  horizon  is  shorter — i.e.,  when  equity  offerings  are  both  more  certain  and  have 
more  timely  payoffs.  Alternatively,  if  firms  that  tend  to  issue  equity  also  tend  to  provide  more 
useful  guidance,  we  would  not  expect  to  find  stronger  results  in  the  shorter  time  honzon. 


Research  has  also  documented  overly  optimistic  long-term  growth  forecasts  as  a  mechanism  to  please  managen^nt  (e.g., 
Lin  and  McNichols  1998;  Dechow  et  al.  2000);  however,  this  variable  is  more  difficult  to  identify,  ex  ante.  Thus,  we 

focus  the  cross-sectional  test  on  stock  recommendations.  .  .  ,  ,  u  .  1 

Results  are  similar  if  we  partition  only  those  firms  announcing  equity  offenngs  m  the  subsequent  year  by  stock 
recommendation.  MGUIDE  is  positive  and  significant  only  among  the  strong  buy  partition  (not  tabulated). 
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TABLE  7 

Revision  Magnitude  on  Equity  Offerings  Based  on  Stock  Recommendation  and  Horizon 

Dependent  Variable  =  REVISE 


Analyst 

Recommendation  Management  Guidance  Issued 


Independent  Variables 

Predicted 

Sign 

Buy,  Hold, 
or  Sell 

Strong  Buy 

In  the  Six 
Months  before 
the  SEO 
Announcement 

In  the  7-12 
Months  before 
the  SEO 
Announcement 

Coefficient 

(t-statistic) 

Coefficient 

(t-statistic) 

Coefficient 

(t-statistic) 

Coefficient 

(t-statistic) 

Intercept 

0.0001 

0.0011 

-0.0012* 

-0.0014** 

(0.16) 

(1.05) 

(-1.75) 

(-1.97) 

MGUIDE 

-b 

-0.3200 

-0.2230 

-0.1627 

-0.2287 

(-0.68) 

(-0.53) 

(-0.37) 

(-0.50) 

MGUIDE  X  SEO 

-b 

-0.0365 

0.1790* 

0.3287*** 

0.0431 

(-0.29) 

(1.75) 

(4.29) 

(0.72) 

MGUIDE  X  DOWN 

-b 

0.1082 

0.0512 

0.1413* 

0.1430* 

(1.27) 

(0.72) 

(1.88) 

(1.87) 

MGUIDE  X  REPUTATION 

-b 

0.1239***  0.0835* 

0.1498*** 

0.1494*** 

(2.83) 

(1.80) 

(3.04) 

(3.08) 

MGUIDE  X  AGREE 

4- 

0.1866***  -0.0169 

0.1571*** 

0.1711*** 

(2.73) 

(-0.27) 

(2.98) 

(3.15) 

MGUIDE  X  HORIZON 

-b 

0.0365 

0.0062 

0.1077 

0.1183 

(0.27) 

(0.06) 

(1.05) 

(1.12) 

MGUIDE  X  RANGE 

- 

16.5235** 

19.5753* 

-8.3088*** 

-7.4736** 

(2.21) 

(1.80) 

(-2.61) 

(-2.38) 

MGUIDE  X  ANALYSTS 

? 

0.2210** 

*  0.1485***  0.1155*** 

0.1172*** 

(4.22) 

(3.01) 

(2.88) 

(2.92) 

Main  Effects 

Included 

Included 

Included 

Included 

Number  of  SEO 

59 

59 

45 

42 

Observations 

Total  Number  of 

2,133 

2,133 

2,983 

2,978 

Observations 

Adjusted 

0.7374 

0.3548 

0.8806 

0.8733 

*,  **,  ***  Indicate  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 
All  t-statistics  are  based  on  robust  standard  errors  clustered  at  the  firm  level. 

Additional  variables  are  defined  in  Table  2 


Table  7  presents  the  results  in  the  final  two  columns,  with  the  shorter  time  horizon  announce¬ 
ments  in  the  first  column  and  the  7- 12-month  time  horizon  announcements  in  the  second  column. 
Consistent  with  our  expectations,  analysts  weight  management  guidance  more  heavily  when  the 
equity  offerings  are  nearer  term  in  nature  (a2  =  0-33;  p  <  0.01).  When  the  equity  offering  is  not 
imminent,  there  is  no  evidence,  on  average,  of  analysts  overweighting  management  guidance 
(q;2  =  0.04;  p  =  0.47),  consistent  with  analysts  having  less  certainty  about  the  possibility  of  an 
offering  or  placing  a  lower  weight  on  the  less  timely  benefits. 
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Upward  versus  Downward  Guidance 

Chan  et  al.  (2003b),  Ke  and  Yu  (2006),  Baik  and  Yi  (2007),  and  Malmendier  and  Shanthiku- 
mar  (2007)  generally  find  that  affiliated  analysts  tend  to  issue  more  pessimistic  short-term  fore¬ 
casts,  thereby  allowing  managers  to  meet  the  analyst  forecast,  and  Richardson  et  al.  (2004)  docu¬ 
ment  a  “walk  down”  of  analyst  forecasts  that  is  more  pervasive  prior  to  equity  offerings.  It  is 
likely  that  the  overweighting  of  guidance  we  document  is  one  way  that  managers  walk  down 
analysts  to  ex  post  pessimistic  analyst  forecasts  (e.g.,  Cotter  et  al.  2006).  However,  we  posit  that 
managers  will  also,  at  times,  wish  analysts  to  increase  their  forecasts. Generally,  we  expect 
analysts  to  take  their  cue  from  management,  regardless  of  the  direction  of  the  management  guid¬ 
ance. 

We  investigate  this  directly  by  examining  the  direction  of  the  guidance  in  Table  8.  Of  the  87 
instances  of  management  guidance  issued  in  the  year  before  the  announcement  of  the  equity 
offering  with  all  necessary  control  variables,  31  are  upward,  42  are  downward,  and  14  are  con¬ 
firmations.  We  re-estimate  Regression  (1)  separately  for  upward  and  downward  guidance  and  find 
that  analysts  overweight  management  guidance  prior  to  equity  offerings  in  both  groups.  Moreover, 
the  magnitude  of  the  overweighting  appears  similar  for  upward  and  downward  forecasts  (0.31 
versus  0.33),  suggesting  that  analysts  are  willing  to  overweight  management  guidance  by  an 
economically  significant  amount  in  either  direction  to  please  management.  Thus,  our  findings  go 
beyond  those  documenting  a  “walk  down”  of  analysts,  as  we  find  analysts  are  also  walked  up. 

Additional  Settings 

Though  we  examine  the  specific  setting  of  equity  offerings,  the  overweighting  of  management 
guidance  is  also  a  viable  tool  for  analysts  wishing  to  curry  favor  with  management  in  the  face  of 
other  incentives.  As  a  final  robustness  check,  we  examine  two  additional  incentives,  the  desire  to 
gain  debt  underwriting  fees  and  M&A  advisory  fees.  These  incentives  are  present  in  the  year 
following  management  guidance  for  6.3  and  4.0  percent  of  the  sample  firms,  respectively.  We 
examine  all  three  incentives  {SEO,  DEBT,  and  M&A)  in  the  first  column  of  Table  9,  and  we  create 
a  joint  indicator  variable  (INCENTIVES)  that  equals  1  if  any  of  these  three  incentives  are  present 
in  the  second  column  of  Table  9.  Turning  first  to  the  disaggregated  estimation,  we  see  that  the 
coefficient  on  MGUIDE  X  SEO  is  0.23  (p  <  0.01),  similar  to  the  results  in  Table  4.  Analysts  also 
overweight  management  guidance  prior  to  debt  offerings;  the  coefficient  on  MGUIDE  X  DEBT  is 
0.13  (p  =  0.02).  Finally,  there  is  some  evidence  that  analysts  overweight  guidance  when  there  are 
prospective  M&A  advisory  services;  the  coefficient  on  MGUIDE  X  M&A  is  0.11  (p  =  0.12). 
Thus,  the  results  are  not  limited  to  the  incentives  of  equity  offerings,  but  rather  extend  to  addi¬ 
tional  settings  where  analysts  might  gain  fees  for  their  banks.  When  we  aggregate  all  of  the 
incentives,  342  firm-year  observations  (11.3  percent  of' the  sample)  have  at  least  one  of  these  three 
incentives  in  the  following  year.  The  coefficient  on  MGUIDE  X  INCENTIVES  is  0.16  (p  —  0.02), 
consistent  with  the  premise  that  analysts  with  incentives  to  please  management  overweight  man¬ 
agement  guidance  when  revising  their  short-term  earnings  forecasts. 

In  summary,  we  consider  the  endogeneity  of  equity  offerings,  the  interaction  of  different 
mechanisms  to  curry  management  favor  (e.g.,  stock  recommendations),  the  knowledge  of  the 
offering,  the  direction  of  the  management  guidance,  and  alternative  measures  of  analysts’  incen¬ 
tives  (e.g.,  debt  offerings). Each  of  these  results  supports  our  conclusion  that  analysts  wishing  to 
please  management  overweight  management  guidance  when  revising  their  short-term  earnings 

forecasts. 


Evidence  of  downward  management  guidance  is  not  direct  evidence  of  ex  post  pessimism;  however,  finding  that 
managers  provide  upward  guidance  suggests  that  they  do  not  desire  pessimistic  forecasts  alone  (or  they  would  simply 

allow  the  analysts  to  underestimate  earnings).  ....  .  i 

Results  are  also  similar  if  we  limit  the  sample  to  firms  that  announce  an  equity  offenng  at  least  once  dunng  the  sample 
period  (not  tabulated).  We  also  split  the  sample  on  whether  the  management  guidance  occurred  before  or  after  Regu- 
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TABLE  8 

Revision  Magnitude  on  Upward  and  Downward  Management  Guidance 


Predicted 


Independent  Variables  Sign 

Intercept 

MGUIDE  + 

MGUIDE  X  SEO  + 

MGUIDE  X  REPUTATION  + 

MGUIDE  X  AGREE  + 

MGUIDE  X  HORIZON  + 

MGUIDE  X  RANGE 
MGUIDE  X  ANALYSTS  ? 

Main  Effects 


Number  of  SEO  Observations 
Total  Number  of  Observations 
Adjusted 


Dependent  Variable  =  REVISE 

Coefficient 

Coefficient 

(t-statistic) 

(t-statistic) 

Upward  Management 

Downward  Management 

Guidance 

Guidance 

-0.0001 

-0.0037 

(-0.08) 

(-1.45) 

0.0020 

-0.0853 

(0.00) 

(-0.14) 

0.3059** 

0.3269*** 

(2.14) 

(3.50) 

0.0621 

0.1792*** 

(0.45) 

(2.93) 

0.3882** 

0.1138* 

(2.37) 

(1.67) 

0.0769 

0.1134 

(0.32) 

(0.84) 

-24.8156** 

-6.4814* 

(-2.29) 

(-1.76) 

0.0487 

0.1397*** 

(0.51) 

(2.59) 

Included 

Included 

31 

42 

865 

1,514 

0.6202 

0.8503 

*,  **,  ***  Indicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 
Management  guidance  is  upward  if  the  guidance  is  greater  than  the  preexisting  consensus  analyst  forecast  and  downward 
if  the  guidance  is  less  than  the  preexisting  consensus  analyst  forecast.  All  t-statistics  are  based  on  robust  standard  errors 
clustered  at  the  firm  level. 

Additional  variables  are  defined  in  Table  2. 


VI.  CONCLUSION 

We  predict  and  find  that  analysts  who  wish  to  please  the  firms  they  follow  overweight  man¬ 
agement  earnings  guidance  when  revising  their  short-term  earnings  forecasts,  even  at  the  expense 
of  ex  post  accuracy.  We  also  document  evidence  of  the  benefit  to  analysts — subsequent  underwrit¬ 
ing  business  for  their  firms.  Specifically,  we  show  that  analysts  who  are  affiliated  with  the  ex  post 
underwriters  of  the  subsequent  equity  offerings  weight  guidance  more  heavily  than  other  analysts. 


lation  Fair  Disclosure  in  October  2000,  and  the  Global  Research  Analyst  Settlement  in  April  2003.  In  results  not 
tabulated,  we  find  a  significantly  stronger  association  between  management  guidance  and  analysts’  incorporation  of  that 
guidance  preceding  equity  offerings  in  the  post-Regulation  Fair  Disclosure  era,  when  other  mechanisms  of  guidance 
were  restricted,  and  we  find  an  economically  weaker  relation  between  upcoming  equity  offerings  and  analysts’  revisions 
following  the  settlement,  a  period  when  analysts  faced  increased  scrutiny  by  regulators. 
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TABLE  9 

Analyst  Revision  Magnitude  on  Multiple  Incentives:  Equity  Offerings,  Debt  Offerings,  and 

M&A  Activity 


Predicted 

Independent  Variables  Sign 


Intercept 

MGUIDE 

MGUIDE  X  SEO  + 

MGUIDE  X  DEBT  + 

MGUIDE  X  M&A  + 

MGUIDE  X  INCENTIVES  + 

MGUIDE  X  DOWN  + 

MGUIDE  X  REPUTATION  + 

MGUIDE  X  AGREE  + 

MGUIDE  X  HORIZON  + 

MGUIDE  X  RANGE 

MGUIDE  X  ANALYSTS  ? 

Main  Effects 


Number  of  SEO  Observations 
Number  of  DEBT  Observations 
Number  of  M&A  Observations 
Number  of  Incentives  Observations 
Total  Number  of  Observations 
Adjusted 


Dependent  Variable 

=  REVISE 

Coefficient 

Coefficient 

(t-statistic) 

(t-statistic) 

-0.0012* 

-0.0011* 

(-1.78) 

(-1.68) 

-0.1722 

-0.0746 

(-0.39) 

(-0.14) 

0.2293*** 

(3.96) 

0.1327** 

(2.30) 

0.1062 

(1.57) 

0.1439* 

0.1609** 

(2.40) 

0.1516** 

(1.93) 

(2.04) 

0.1418*** 

0.1380*** 

(2.89) 

(2.71) 

0.1685*** 

0.1576*** 

(3.12) 

(2.82) 

0.1076 

0.0871 

(1.05) 

(0.86) 

-7.3614** 

-6.6200** 

(-2.48) 

(-2.22) 

0.1155*** 

0.1130*** 

(2.88) 

(2.80) 

Included 

Included 

87 

87 

175 

175 

115 

115 

342 

342 

3,025 

3,025 

0.8810 

0.8789 

jj)(jicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 
INCENTIVES  is  an  indicator  variable  that  is  set  equal  to  1  if  there  is  an  equity  offering,  debt  offering,  or  M&A  activity  in 
the  year  following  the  management  guidance,  and  0  if  none  of  these  events  occur  in  the  subsequent  year.  Standard  errors 
are  clustered  by  firm.  All  t-statistics  are  based  on  robust  standard  errors  clustered  at  the  firm  level. 

Additional  variables  are  defined  in  Table  2. 


Our  Study  has  limitations.  First,  equity  offerings  are  endogenous,  and  characteristics  of  firms 
issuing  equity  might  lead  analysts  to  incorporate  more  guidance  prior  to  equity  offerings  for 
reasons  other  than  gaining  underwriting  business.  To  minimize  this  limitation,  we  explicitly  link 
the  overweighting  of  guidance  prior  to  equity  offerings  to  larger  ex  post  forecast  errors,  mitigating 
the  possibility  that  the  overweighting  is  in  response  to  more  informative  guidance  prior  to  offer- 
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ings.  We  also  examine  different  analysts’  responses  to  the  same  guidance  and  find  that  only 
analysts  with  an  incentive  to  please  management  weight  the  management  guidance  more  than 
predicted  based  on  the  credibility  and  usefulness  of  the  guidance.  In  addition,  we  conduct  a 
Hausman  test  that  does  not  reject  the  null  of  no  endogeneity.  Finally,  we  estimate  a  Heckman 
two-stage  regression  to  further  control  for  the  endogenous  choice  to  provide  equity  and  continue 
to  find  similar  results.  A  second  limitation  is  that  we  do  not  model  the  manager’s  selection  of  an 
underwriting  team.  Clearly,  managers  will  consider  deal  terms,  underwriter  quality,  and  other 
criteria  when  determining  whom  to  select  as  underwriters.  This  analysis  is  beyond  our  scope. 
These  additional  criteria,  however,  would  likely  add  noise  to  the  analysis.  A  third  limitation  is  that 
all  of  the  main  tests  and  inferences  surround  a  single  incentive;  upcoming  equity  offerings.  We 
therefore  conduct  a  preliminary  analysis  on  both  debt  issues  and  M&A  activity,  where  analysts 
might  wish  to  vie  for  the  debt  underwriting  business  or  the  M&A  advisory  fees.  We  find  evidence 
of  analysts  overweighting  guidance  prior  to  these  events  as  well. 

Overall,  our  findings  suggest  that  analysts  weight  management  guidance  more  heavily  than 
expected  when  faced  with  incentives  to  please  management.  Future  research  might  examine  other 
settings  where  analysts  might  overweight  management  guidance,  such  as  pre-Regulation  Fair 
Disclosure  selective  disclosure  (Chen  and  Matsumoto  2006;  Ke  and  Yu  2006)  or  favored  confer¬ 
ence  call  participation  (Mayew  2008).  Future  research  might  also  investigate  whether  investors 
anticipate  that  analysts  overweight  guidance  in  certain  settings,  whether  analysts’  reputations  are 
hurt  by  this  behavior,  or  whether  analysts  appear  to  understand  individual  managers’  incentives 
and  incorporate  guidance  differentially  based  on  these  perceived  incentives. 

APPENDIX 

We  form  an  SEO  prediction  model  to  test  two  of  our  robustness  checks:  the  choice  to  issue 
equity  and  the  knowledge  of  the  upcoming  equity  offering  versus  the  expectation  of  such  an  event 
(both  presented  in  Section  V).  Our  model  is  as  follows: 

=  ao  +  oc^PROFITABILITY,  +  a2^SALES,  +  a^INVESTMENT,  +  a^BIM  RATIO, 

-I-  a^LEVERAGE,  +  a^SlZE,  +  a^CASH,  +  a^TANGIBLE  ASSETS, 

+  agANNUAL  RETURN,  +  aioEINANCIAL  STRESS,  +  ai^SEO  HISTORY 

+  Industry  Indicators  +  Year  Indicators  -i-  c^.  (Al) 

Each  of  the  variables  is  defined  in  Table  10.  We  are  predicting  actual  equity  offerings  in  year  t+ 1 
and  include  the  following  expected  determinants  from  year  t,  based  on  prior  research  (e.g.,  Rajan 
and  Zingales  1995;  Jung  et  al.  1996;  Loughran  and  Ritter  1997;  Erank  and  Goyal  2003;  Dittmar 
and  Thakor  2007;  Kadan  et  al.  2009). 

Eirst,  we  include  firm  profitability.  Generally,  the  more  profitable  the  firm,  the  more  likely  it 
is  to  issue  equity,  relative  to  debt.  However,  we  are  predicting  an  equity  offering  relative  to  no 
equity  offering  (instead  of  the  use  of  debt).  Thus,  it  could  be  that  more  profitable  firms  have  a 
lower  requirement  for  additional  funds  and,  thus,  we  have  no  prediction  on  the  sign  of  profitability. 
Our  next  variable,  change  in  sales  {AS ALES),  is  a  proxy  for  growth,  and  we  expect  growth  firms 
to  be  more  likely  to  issue  equity.  INVESTMENT  is  the  amount  of  expenditures  on  R&D  and  capital 
investments  in  year  r;  generally,  the  higher  these  expenditures,  the  higher  the  expected  probability 
of  an  equity  offering.  We  expect  the  book-to-market  ratio  {B/M  RATIO)  to  be  negatively  related  to 
equity  offerings;  again,  higher  growth  firms  are  expected  to  have  a  higher  probability  of  issuing 
equity.  Leverage  is  expected  to  increase  a  firm’s  likelihood  of  issuing  equity  over  debt,  but  as 
noted  above,  we  are  predicting  the  likelihood  of  issuing  equity  versus  not  issuing  equity  and,  thus, 
we  have  no  prediction  for  the  associated  variable.  We  include  SIZE  (the  log  of  sales),  as  larger 
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TABLE  10 

SEO  Prediction  Model 


Independent  Variables  Predicted  Sign 


Intercept 

PROFITABILITY 

? 

ASALES 

-b 

INVESTMENTS 

-b 

B/M  RATIO 

- 

LEVERAGE 

? 

SIZE 

- 

CASH 

- 

TANGIBLE  ASSETS 

- 

ANNUAL  RETURN 

+ 

FINANCIAL  STRESS 

-b 

SEO  HISTORY 

-b 

Industry  Indicators 

Year  Indicators 

Number  of  SEO  Observations 

Total  Number  of  Observations 

Pseudo 


Dependent  Variable  =  SEO 

Coefficient  (;^'^-statistic) 

-1.5554*** 

(22.45) 

-0.2069** 

(4.21) 

0.2309*** 

(38.67) 

0.4918*** 

(11.90) 

-0.3305*** 

(37.20) 

0.9946*** 

(81.33) 

-0.0696*** 

(39.18) 

-0.1662*** 

(13.76) 

0.1453** 

(3.85) 

0.2831*** 

(170.08) 

0.1490** 

(5.06) 

0.2878*** 

(48.76) 

Included 

Included 

1,104 

14,407 

0.1435 


***  Indicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  a  two-tailed  test. 


Variable  Definitions: 

SEO,+i  = 

PROFITABILITY t  = 
ASALESt  = 
INVESTMENT,  = 

B/M  RATIO,  = 
LEVERAGE,  = 
SIZE,  = 
CASH,  = 
TANGIBLE  ASSETS,  = 
ANNUAL  RETURN,  = 
FINANCIAL  STRESS  = 

SEO  HISTORY  = 


indicator  variable  that  is  set  equal  to  1  if  the  firm  issued  equity  in  year  t+\,  and  0 
otherwise; 

profitability  ((Annual  Compustat  #13-1-  #62)Aagged  #6); 
change  in  sales  ((#12  -  lagged  #12)/lagged  #12); 

investments  in  research  and  development  and  capital  expenditures  in  year  t  (#128  -I-  #46)/ 
lagged  #6; 

book-to-market  ratio  (#60/ (#54  X  #199)); 
debt  leverage  (#9/#6); 

firm  size,  measured  using  the  log  of  sales  (#12); 
cash  reserves  (#l/#6); 

measure  of  collateral  or  tangible  assets  (#8/lagged  #6); 
market-adjusted  fiscal  year  return  for  year  r, 

sum  of  cash  dividends,  net  investments,  and  change  in  working  capital  less  cash  inflows 
from  operations  in  year  t  (see  Frank  and  Goyal  2003);  and 
1  if  a  firm  issued  equity  in  previous  three  years,  and  0  otherwise. 
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firms  are  expected  to  have  lower  costs  of  debt  and  thus  may  be  more  likely  to  issue  debt.  Firms 
with  more  cash  are  expected  to  be  less  likely  to  issue  equity,  while  firms  with  greater  tangible 
assets  are  expected  to  be  more  likely  to  issue  debt  than  equity.  We  expect  firms  with  higher  returns 
over  the  past  year  to  be  more  likely  to  issue  equity;  ANNUAL  RETURN  is  the  market-adjusted 
fiscal  year  return  for  year  t.  We  include  a  variable  for  financial  stress,  following  Frank  and  Goyal 
(2003),  who  argue  that  demand  for  capital  is  closely  tied  to  the  firm’s  financing  deficit.  FINAN¬ 
CIAL  STRESS  is  a  measure  of  excess  cash.  Finally,  we  include  SEO  HISTORY,  which  is  an 

27 

indicator  variable  showing  whether  the  firm  issued  equity  in  the  prior  three  years. 

Table  10  reports  results.  Generally,  the  predicted  signs  are  consistent  with  existing  theory, 
although,  as  noted  above,  our  setting  differs,  in  that  we  are  predicting  the  difference  between 
equity  offering  firms  and  all  other  firms;  we  are  not  conditioning  on  the  need  for  additional 
financing.  Overall,  the  estimated  pseudo-R^  is  approximately  14  percent. 

We  form  the  inverse  Mills  ratio  following  Heckman  (1979).  To  form  the  predicted  probability 
of  an  upcoming  equity  offering,  we  estimate  Regression  (Al)  in  a  pooled  regression  (as  presented 
in  Table  10)  and  retain  the  predicated  value  for  each  firm-year.  We  then  merge  the  annual  predic¬ 
tion  of  an  equity  offering  with  guidance  quarters  in  that  year  to  avoid  look-ahead  bias,  as  analysts 
would  need  year  t  values  to  predict  year  f+1  equity  offerings  and,  thus,  would  not  be  able  to  form 
their  predictions  until  year  t-i-1. 
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ABSTRACT:  This  study  reports  the  result  of  an  experiment  examining  two  aspects  of 
the  audit  context  that  auditors  likely  do  not  suspect  can  influence  audited  account 
balances:  the  magnitude  of  an  audit  difference  and  the  presence  of  a  prior  client  con¬ 
cession.  Negotiation  theory  shows  that  negotiators’  initial  positions  (e.g.,  clients’  unau¬ 
dited  balances)  as  well  as  feelings  of  reciprocity  created  by  prior  negotiations  serve  to 
create  expectations  for  the  current  negotiation  and,  in  turn,  affect  the  outcomes  of  such 
negotiations.  Our  results  show  that  the  magnitude  of  an  audit  difference  involving  an 
estimate  (i.e.,  difference  between  client’s  account  balance  and  the  auditor’s  indepen¬ 
dent  estimate)  as  well  as  the  presence  of  a  prior  client  concession  influence  auditors’ 
negotiation  expectations.  Specifically,  auditors  proposed  smaller  adjustments  when  the 
magnitude  of  the  audit  difference  was  high  and  when  the  client  conceded  on  an  audit 
issue  prior  to  resolving  the  difference  in  estimates.  These  manipulations  similarly  influ¬ 
ence  the  negotiated  outcome,  and  this  influence’is  fully  mediated  by  the  auditor’s  initial 
negotiation  position. 

Keywords:  suditor  nsgotidtioni  dudit  sdjustmGntS!  accounting  GStimotBS,  sudit 
diffGrGncGs;  cliGnt  concGssions. 
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Data  Availability:  The  data  used  in  this  study  are  avaiiabie  upon  request. 

1.  INTRODUCTION 

Negotiations  between  auditors  and  clients  are  quite  common  (Gibbins  et  al.  2001),  and  they 
directly  affect  values  reflected  in  the  audited  financial  statements  (e.g.,  see  Brown  and 
Wright  2008).  Consequently,  the  financial  statements  can  be  viewed  as  a  product  of  the 
joint  effort  of  clients  and  auditors  (Antle  and  Nalebuff  1991),  making  these  negotiations  a  crucial 
link  between  the  quality  of  the  audit  and  the  quality  of  the  financial  statements.  In  this  study,  we 
examine  two  common  characteristics  of  the  auditor’s  negotiation  environment:  the  magnitude  of 
the  difference  between  an  unaudited  account  balance  and  the  auditor’s  independent  estimate  of 
that  account  balance  (i.e.,  the  magnitude  of  an  audit  difference)  and  prior  concessions  by  client 
management.  While  auditors  likely  do  not  expect  these  two  characteristics  of  the  negotiation 
context  to  result  in  different  balances  in  the  audited  financial  statements,  prior  negotiation  litera¬ 
ture  implies  that  these  two  characteristics  likely  will  significantly  influence  auditor-client  negotia¬ 
tion  outcomes. 

Prior  negotiation  research  demonstrates  that  the  magnitude  of  the  opening  move  (i.e.,  initial 
negotiation  position)  by  a  negotiator’s  counterpart  is  important  in  determining  how  the  negotiator 
will  set  his/her  negotiation  goals,  which  in  turn  significantly  influence  the  outcome  of  the  ensuing 
negotiation  (e.g.,  Rubin  and  DiMatteo  1972).  Based  on  this  perspective,  we  assert  that  the  client’s 
reported  account  balances  in  the  unaudited  financial  statements  (e.g.,  a  significant  accounting 
estimate)  acts  as,  either  purposefully  or  not,  their  “opening  move”  with  regard  to  subsequent 
discussions  regarding  those  accounts,  although  it  is  unlikely  that  auditors  view  it  this  way.  Based 
on  relevant  negotiation  theory,  we  predict  that  auditors’  negotiation  goals  and  limits,  as  well  as 
their  initial  negotiation  positions,  will  be  farther  from  the  audit  team’s  preliminary  estimate  when 
the  magnitude  of  the  difference  between  the  client’s  account  balance  and  the  auditor’s  independent 
estimate  is  larger. 

In  addition,  audits  often  include  multiple  issues  that  clients  and  auditors  must  resolve,  and 
about  half  the  time  they  resolve  these  issues  sequentially  rather  than  simultaneously  (Gibbins  et  al. 
2001).  From  an  auditing  perspective,  resolution  of  a  subjective  issue,  such  as  an  estimate,  should 
not  be  influenced  by  whether  the  auditor  and  client  have  previously  resolved  a  prior  issue  (e.g.,  a 
known  misstatement  in  the  financial  statements).  However,  the  social  norm  of  reciprocity  has  been 
found  to  influence  the  negotiation  process  as  well  as  subsequent  outcomes  (e.g.,  Pruitt  and 
Camevale  1993;  Sanchez  et  al.  2007;  Hatfield  et  al.  2008).  Consistent  with  reciprocity,  when  a 
client  agrees  to  an  auditor’s  request  to  correct  a  known  misstatement  in  the  financial  statements 
(i.e.,  a  prior  client  concession),  the  auditor  is  likely  to  feel  the  need  to  reciprocate  the  client’s 
concession  in  the  subsequent  negotiation  of  a  more  subjective/contentious  issue. 

To  investigate  these  issues,  we  conduct  a  computer-based  experiment  in  which  audit  partner 
and  manager  participants  resolve  two  audit  issues  with  a  hypothetical  client.  We  manipulate  two 
independent  variables,  each  at  two  levels.  The  first  independent  variable  is  the  relative  magnitude 
of  an  audit  difference  (allowance  for  obsolete  inventory).  The  audit  team’s  independent  estimate  is 
held  constant  while  the  client’s  unaudited  account  balance  varies,  resulting  in  high-  or  low- 
difference  conditions.  The  second  independent  variable  is  the  presence  of  a  prior  client  concession 
(i.e.,  whether  a  significant,  but  quantitatively  immaterial,  known  misstatement  was  resolved  prior 
to  or  after  the  inventory  issue).  We  measured  various  aspects  of  the  auditor’s  expectations  with 
respect  to  an  ensuing  negotiation  with  the  client  and  asked  participants  to  engage  in  a  simulated 
negotiation  concerning  the  inventory  issue. 

Our  results  are  consistent  with  our  predictions.  Auditors’  pre- negotiation  positions  differed 
more  from  the  audit  team’s  independent  estimate  when  the  magnitude  of  the  audit  difference  is 
high.  Additionally,  participants’  initial  negotiation  positions  reflected  smaller  adjusted  account 
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balances  in  the  presence  of  a  prior  client  concession.  Both  independent  variables  influenced  the 
negotiated  outcome  and,  consistent  with  expectations,  this  effect  was  fully  mediated  by  the  par¬ 
ticipants’  initial  negotiation  position  (i.e.,  initial  offer  in  the  negotiation).  Additional  analyses  show 
that  more  auditors  take  initial  negotiation  positions  and  negotiate  account  balances  materially 
below  the  audit  team’s  estimate  when  the  magnitude  of  the  audit  difference  is  high,  and  in  the 
presence  of  a  prior  client  concession.  These  results  add  to  the  growing  body  of  literature  demon¬ 
strating  that  various  aspects  of  the  audit  negotiation  context  have  a  direct  impact  on  audit  and 
financial  statement  quality. 

Kayla  Gillan  (then  a  PCAOB  board  member)  stated  in  a  speech  concerning  PCAOB  inspec¬ 
tions  that  human  nature  (i.e.,  unconscious  biases),  rather  than  ethical  issues,  influences  auditor 
judgments  and  contributes  to  audit  deficiencies  (Gillan  2007).  She  asserted  that  making  auditors 
aware  of  these  unintended  influences  is  essential  to  minimizing  their  effect  on  auditor  judgments. 
While  such  biases  may  be  difficult  to  eliminate  (e.g.,  Bazerman  et  al.  2002),  our  results  should  be 
of  interest  to  researchers  as  they  consider  how  to  improve  negotiated  audit  outcomes  and  practi¬ 
tioners  as  they  develop  negotiation  training.^  For  example,  can  explicit  consideration  of  the  mag¬ 
nitude  of  an  audit  difference  or  the  presence  of  a  prior  client  concession  mitigate  the  biases 
demonstrated  in  our  study? 

Our  mediation  results  show  that  the  auditor’s  initial  negotiation  position  is  a  critical  factor  in 
auditor/client  negotiations.  This  result  validates  prior  auditor/client  negotiation  literature  that  relies 
on  pre-negotiation  measures  such  as  first  counter-offer  as  primary  dependent  variables  (e.g., 
Bame-Aldred  and  Kida  2007;  Hatfield  et  al.  2008;  Trotman  et  al.  2009).  Finally,  with  the  increased 
importance  placed  on  estimates  involving  fair-value  accounting,  issues  such  as  those  addressed  in 
this  study  (i.e.,  discussions  between  auditors  and  clients  over  appropriate  estimates)  will  be  en¬ 
countered  increasingly  in  practice. 

Section  II  discusses  auditing  standards  related  to  accounting  estimates  and  summarizes  nego¬ 
tiation  research  relevant  to  our  study  as  the  basis  for  developing  hypotheses.  Section  III  describes 
the  research  design,  and  Section  IV  reports  results  of  our  analyses.  Section  V  concludes  with  a 
summary  of  the  findings  and  their  implications  for  auditors  and  audit  research. 

IL  BACKGROUND  AND  HYPOTHESIS  DEVELOPMENT 
Auditing  Accounting  Estimates 

SAS  No.  57,  Auditing  Accounting  Estimates  (AICPA  1989),  provides  guidance  related  to 
auditing  clients’  accounting  estimates.^  Management  is  responsible  for  generating  estimates  in¬ 
cluded  in  the  financial  statements  based  on  its  consideration  of  relevant  factors;  ultimately  these 
estimates  are  based  on  management’s  judgment.  Auditors  are  responsible  for  determining  the 
reasonableness  of  management’s  estimates.  SAS  No.  57  (AICPA  1989,  AU  342.04)  recognizes 
that; 

Even  when  management’s  estimation  process  involves  competent  personnel  using  relevant  and 
reliable  data,  there  is  potential  for  bias  in  the  subjective  factors.  Accordingly,  when  planning  and 
performing  procedures  to  evaluate  accounting  estimates,  the  auditor  should  consider,  with  an  atti¬ 
tude  of  professional  skepticism,  both  the  subjective  and  objective  factors. 


*  While  auditor-client  negotiations  are  critically  important  to  financial  statement  values,  only  28  percent  of  auditor 
participants  we  surveyed  receive  formal  negotiation  training.  To  date,  only  Trotman  et  al,  (2005)  have  examined  how  to 
improve  auditor  negotiations,  finding  that  pre-negotiation  consideration  of  the  client  s  position  improves  negotiated 

audit  outcomes.  ....  ... 

^  SAS  No.  57  specifically  cites  “net  realizable  value  of  inventories  where  future  selling  prices  and  future  costs  are 
involved,”  the  subjective  accounting  issue  we  examine,  as  an  example  of  a  potentially  significant  accounting  estimate. 
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Professional  guidance  requires  auditors  to  gain  an  understanding  of  how  management  devel¬ 
ops  estimates  and  evaluate  the  reasonableness  of  estimates  based  on  a  combination  of  approaches, 
which  can  include  developing  an  independent  expectation.  SAS  No.  107  (AICPA  2006a)  states 
that  auditors  should  treat  differences  between  the  auditor’s  independent  estimate  and  manage¬ 
ment’s  estimate  as  likely  misstatements,  even  if  management’s  estimate  is  not  explicitly  deemed 
um-easonable.  Consequently,  the  magnitude  of  the  initial  difference  between  management’s  esti¬ 
mate  and  the  auditor’s  independent  estimate  should  not  affect  the  amount  that  the  auditor  ulti¬ 
mately  believes  should  be  included  in  the  financial  statements. 

Auditor-Client  Negotiation  Literature 

While  management  bears  ultimate  responsibility  for  the  financial  statements,  the  financial 
statements  can  be  viewed  as  a  joint  effort  of  both  management  and  the  auditor,  with  account 
balances  often  depending  on  the  auditor’s  approach  to  negotiating  audit  differences  (Antle  and 
Nalebuff  1991).  Gibbins  et  al.  (2001)  report  that  67  percent  of  audit  partners  surveyed  enter  into 
negotiations  with  more  than  half  of  their  clients,  and  that  all  partners  negotiate  with  at  least  some 
clients.  Therefore,  the  link  between  audit  quality  and  the  quality  of  the  financial  statements  clearly 
is  dependent  on  the  nature  of  negotiations  between  auditors  and  clients  to  resolve  proposed  audit 
differences. 

Research  concerning  the  disposition  of  proposed  audit  adjustments  initially  did  not  consider 
explicitly  the  underlying  negotiation  process.  Such  research  was  concerned  more  with  character¬ 
istics  of  the  proposed  adjustment  or  the  audit  engagement  that  influence  the  likelihood  of  waiving 
proposed  adjustments.  This  research  found  that  adjustments  are  more  likely  to  be  waived  when, 
for  example,  they  are  immaterial  or  income-increasing  (Wright  and  Wright  1997;  Nelson  et  al. 
2002);  the  salience  of  factors  related  to  qualitative  materiality  is  low  (Ng  and  Tan  2007);  the  client 
is  small  or  internal  controls  are  strong  (Joe  et  al.  2008);  or  the  risk  of  waiving  the  adjustment  is 
low  (Braun  2001). 

Survey  research  provides  a  practical  and  descriptive  perspective  on  auditor-client  negotiations 
that  allows  researchers  to  develop  an  understanding  of  the  experiences  and  incentives  affecting  the 
negotiating  parties  (e.g.,  Gibbins  et  al.  2001;  Gibbins  et  al.  2005;  Gibbins  et  al.  2007).  A  growing 
body  of  experimental  research  examines  the  auditor-client  negotiation  process.  This  research  finds 
that  negotiated  outcomes  and  aspects  of  the  negotiation  process  are  influenced  by  factors  such  as 
engagement  risk  (Johnstone  et  al.  2002);  audit  committee  effectiveness  or  existence  of  authorita¬ 
tive  guidance  (Ng  and  Tan  2003);  auditor  experience  (Brown  and  Johnstone  2005);  and  process 
variables  such  as  altering  the  auditor’s  strategy/approach  to  the  negotiation  process  (Trotman  et  al. 
2005;  Sanchez  et  al.  2007;  Hatfield  et  al.  2008;  Tan  and  Trotman  2009). 

While  we  extend  the  research  on  contextual  factors  that  affect  auditor-client  negotiations,  we 
also  complement  and  extend  prior  studies  that  measure  potential  negotiation  outcomes  through  the 
use  of  pre-negotiation  measures  (e.g.,  Bame-Aldred  and  Kida  2007;  Hatfield  et  al.  2008;  Trotman 
et  al.  2009).^  These  studies  are  based  on  the  premise  that  pre-negotiation  measures  (e.g.,  the 
auditor’s  initial  negotiation  position,  negotiation  goal,  and  negotiation  limit)  are  predictive  of  the 
negotiated  outcomes.  However,  this  premise  has  not  been  established  in  accounting/auditing  where 
environmental  characteristics  such  as  regulations  and  professional  concerns  can  moderate  this  link. 


^  There  are  three  pre-negotiation  measures  that  comprise  the  negotiation  range.  The  negotiator’s  limit  represents  the 
lowest  bound  (in  the  audit  context,  the  lowest  amount  allowed  without  qualifying  the  opinion).  The  initial  negotiation 
position  is  the  negotiator’s  starting  point  in  the  negotiation  and  represents  the  upper  bound  of  the  range.  The  negotiator’s 
goal  is  a  pre-negotiation  measure  of  the  amount  the  negotiator  hopes  or  expects  to  achieve  as  an  outcome. 
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Accordingly,  our  study  elicits  these  pre-negotiation  measures  in  a  computer- simulated  negotiation 
to  provide  evidence  on  the  effects  of  our  independent  variables.  We  also  provide  a  measurable  link 
between  these  pre-negotiation  measures  and  negotiation  outcomes. 

Unaudited  Financial  Statement  Balances:  Client’s  Opening  Move? 

When  viewed  through  a  negotiation  lens,  the  client’s  unaudited  account  balances  represent  the 
opening  move  (i.e.,  initial  negotiation  position)  in  any  subsequent  negotiations."^  For  example, 
suppose  that  the  auditor  develops  an  independent  estimate  for  the  allowance  for  obsolete  inventory 
of  $14  million.  Although  the  magnitude  of  any  audit  adjustment(s)  should  be  based  on  the  audi¬ 
tor’s  estimate  and  should  not  be  affected  by  the  magnitude  of  the  difference  between  the  auditor’s 
estimate  and  the  client’s  unaudited  balance,  negotiation  research  implies  that  the  audited  balance 
will  be  influenced  by  the  magnitude  of  the  client’s  balance  (e.g.,  an  unaudited  balance  of  $8 
million  versus  an  unaudited  allowance  of  $11.2  million). 

Pruitt  (1998)  finds  that,  in  negotiation  settings,  one  party’s  initial  negotiation  position  (or 
opening  move)  is  a  key  determinant  of  the  other  party’s  actions  in  subsequent  negotiations.  For 
example,  Pruitt  and  Drews  (1969)  show  that  the  initial  position  accounts  for  67  percent  of  the 
variance  in  subsequent  demands.  Therefore,  unaudited  client  account  balances  (i.e.,  the  client’s 
initial  positions)  are  predictive  of  their  posturing  regarding  subsequent  discussions  about  those 
amounts  (including  levels  of  concession  and  negotiation  goals).  Negotiators  often  alter  their  ne¬ 
gotiation  behavior  based  on  their  counterpart’s  initial  position  in  an  effort  to  make  an  ultimate 
resolution  more  likely  (Rubin  and  Brown  1975).  For  example,  Rubin  and  DiMatteo  (1972)  find 
that,  when  a  counterpart’s  initial  demands  are  high,  negotiators  tend  to  lower  their  own  expecta¬ 
tions  regarding  the  outcome  of  the  negotiation  and,  thus,  adopt  negotiation  positions  closer  to  their 
counterpart’s. 

Consider  our  audit  example  regarding  the  allowance  for  obsolete  inventory,  where  the  audi¬ 
tor’s  independent  estimate  is  $14  million.  Negotiation  research  suggests  that  an  unaudited  client 
account  balance  of  $8  million  can  cause  the  auditor  to  adopt  negotiation  goals  and  limits  closer  to 
the  client’s  to  make  resolution  more  likely.  That  is,  getting  the  client  to  move  $6  million  from  its 
unaudited  balance  could  be  perceived  as  quite  difficult,  causing  the  auditor  to  lower  his/her 
negotiation  goals  and  limits.  However,  if  the  unaudited  balance  is  $11.2  million,  getting  the  client 
to  move  $2.8  million  could  be  seen  as  less  difficult,  resulting  in  the  auditor  maintaining  higher 
negotiation  goals  and  limits.  Specifically,  we  expect  auditors  to  reduce  their  aspiration  levels  to  a 
greater  degree  when  the  difference  between  their  preferred  outcome  and  the  client’s  initial  position 
is  larger  (i.e.,  when  the  magnitude  of  the  initial  audit  difference  is  larger;  Pruitt  and  Camevale 
1993).  This  yields  the  following  hypotheses,  stated  in  the  alternative  form: 

Hla:  The  magnitude  of  the  auditor’s  goals  and  limits  in  a  negotiation  will  be  inversely 
related  to  the  magnitude  of  the  initial  audit  difference. 

Hlb:  The  magnitude  of  the  auditor’s  initial  position  in  a  negotiation  wifi  be  inversely  related 
to  the  magnitude  of  the  initial  audit  difference. 

We  note  that  our  predictions  for  magnitude  of  the  audit  difference  (Hypothesis  la  and  lb)  are 
consistent  with  anchoring  theory,  which  would  suggest  that  auditors  will  insufficiently  revise  their 
negotiation  positions  from  the  anchor  provided  by  the  client  (i.e.,  unaudited  account  balance). 
Negotiation  research  has  considered  the  influence  of  anchoring  in  negotiation  settings  (e.g.,  Ga- 


^  The  client’s  balance  is  not  necessarily  meant  by  the  client  to  be  nor  interpreted  by  the  auditor  as  an  opening,  strategic 
move  in  an  ensuing  negotiation.  However,  once  it  becomes  apparent  that  a  negotiation  will  take  place,  the  client’s  initial 
account  balance  acts  as  an  initial  negotiation  position. 
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linsky  and  Mussweiler  2001).^  However,  research  in  other  domains  indicates  that  our  setting  is  not 
conducive  to  anchoring.  For  example,  Cotter  and  Henley  (2008)  found  that  only  inexperienced 
negotiators  were  susceptible  to  anchoring;  our  participants  are  highly  experienced  auditors  (pri¬ 
marily  senior  managers  and  partners).  Perhaps  the  key  characteristic  of  our  setting  that  mitigates 
anchoring  is  the  existence  of  an  alternative  to  the  potential  anchor.  Research  finds  that  the  exis¬ 
tence  of  alternatives  to  anchors  (i.e.,  independent  estimates  and/or  anchor-inconsistent  informa¬ 
tion)  reduces  or  eliminates  anchoring  effects  (e.g..  Lord  et  al.  1984;  Wansink  et  al.  1998;  Kns- 
tensen  and  Garling  2000).  Because  the  audit  team’s  independent  estimate  serves  as  a  separate, 
inconsistent  alternative  to  the  client’s  unaudited  balance,  anchoring  could  work  against  our  ex¬ 
pectations  related  to  audit  difference  magnitude,  which  are  based  on  negotiation  theory. 

Multiple  Negotiation  Issues 

Prior  negotiation  literature  finds  that  outcomes  differ  when  multiple  issues  are  negotiated 
sequentially  versus  simultaneously  (e.g.,  Rubin  and  Brown  1975).  Generally,  research  finds  greater 
integration  (i.e.,  win-win)  outcomes  with  simultaneous  discussions.  However,  the  audit  setting  is 
unique  in  that  issues  can  arise  at  different  times  during  the  course  of  the  audit,  resulting  in  a 
greater  likelihood  of  sequential  issue  resolution.  This  characteristic  is  likely  magnified  by  shorter 
filing  periods,  which  have  resulted  in  increased  interim  procedures  (i.e.,  pushing  toward  a  more 
continuous  audit  approach;  e.g.,  Lambert  et  al.  2009).  In  these  settings,  auditors  and  clients  often 
resolve  issues  without  knowing  whether  there  will  be  additional  audit  issues  to  resolve;  conse¬ 
quently,  while  audits  likely  involve  sequential  issue  resolution,  many  issues  are  resolved  as  if  they 
are  the  only  issue  requiring  resolution.^ 

Audit  issues  can  be  objective  or  subjective  and  material  or  immaterial.  Subjective  items,  such 
as  estimates,  represent  issues  for  which  the  auditor  and  client  can  conceivably  negotiate  over  the 
amount  because  there  is  no  clearly  correct  amount.  Objective  items,  such  as  miscalculations,  are 
amounts  identified  by  the  auditor  that  are  clearly  incorrect  in  the  unaudited  financial  statements. 
For  objective  items,  the  amount  would  either  have  to  be  adjusted  in  the  financial  statements  or 
waived  by  the  auditor  (if  immaterial)  and  reported  to  the  audit  committee.  That  is,  the  correct 
amount  is  not  in  dispute — the  question  is  whether  it  should  be  adjusted. 

This  study  focuses  on  the  resolution  of  a  material,  subjective  audit  difference.  Negotiations 
over  such  issues  are  at  least  somewhat  contentious  (Wright  and  Wright  1997)  because  auditors  and 
clients  are  likely  to  differ  in  their  perceptions  of  what  the  “correct”  amount  should  be.  The 
disposition  of  objective  items  tends  to  be  less  contentious  because  the  correct  treatment  of  the  item 
is  clear  and  auditors  are  better  able  to  convince  their  clients  to  make  adjustments  related  to 
objective  items  than  subjective  items  (Wright  and  Wright  1997).  Thus,  if  a  significant  objective 
item  is  discovered  during  an  audit,  auditors  are  highly  likely  to  insist  that  the  item  be  corrected.^ 
In  such  settings,  clients  are  likely  to  readily  concede  to  auditors’  requests  to  make  adjustments 
(Wright  and  Wright  1997). 


^  Much  of  this  research  does  not  provide  alternatives  to  the  manipulated  anchor  and  primarily  examines  the  “first 
mover’s”  advantage,  which  occurs  because  the  first  negotiator  to  make  an  offer  potentially  sets  the  range  for  the  ensuing 
negotiation.  For  example,  when  the  sticker  price  on  a  car  is  $30,000,  the  buyer  is  anchored  to  that  price  and  will  base 
the  counter-offer  on  the  sticker  price.  This  is  different  from  the  anchoring  literature  in  accounting  where  new  informa¬ 
tion  is  discounted  in  the  presence  of  an  anchor  (e.g.,  Ashton  and  Ashton  1988).  Research  shows  that  first  mover’s 
advantage  is  mitigated  when  participants  explicitly  consider  anchor-inconsistent  knowledge  (Mussweiler  et  al.  2000)  or 
features  inconsistent  with  the  anchor  (Chapman  and  Johnson  1999). 

®  This  assertion  assumes  that  auditors  do  not  hold  onto  all  such  issues  and  resolve  them  at  the  end  of  the  audit. 
Discussions  with  auditors,  as  well  as  additional  survey  evidence  we  collected,  suggest  that  auditors  are  very  unlikely  to 
wait  to  resolve  these  issues.  For  example,  in  a  scenario  similar  to  that  used  in  the  experimental  instrument,  87.9  percent 
of  auditors  we  surveyed  would  resolve  a  significant  audit  issue  when  it  occurred  rather  than  hold  it  until  later. 

^  SAS  No.  107  (AICPA  2006a)  es.sentially  requires  that  auditors  make  this  type  of  adjustment  (i.e.,  a  significant  known 
misstatement).  Specifically,  Paragraph  45  states  that  “the  auditor  should  request  management  to  record  the  adjustment 
needed  to  correct  all  known  misstatements  ...  other  than  those  that  the  auditor  believes  are  trivial.” 
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The  social  norm  of  reciprocity,  as  applied  to  negotiation,  asserts  that,  when  one  negotiating 
party  makes  a  concession,  the  other  party  is  expected  to,  and  tends  to,  reciprocate  the  concession 
(e.g.,  Pruitt  and  Camevale  1993;  Sanchez  et  al.  2007;  Hatfield  et  al.  2008).  Reciprocity  holds  even 
in  situations  in  which  negotiators  would  not  have  conceded  had  it  not  been  for  the  presence  of  the 
perceived  need  to  reciprocate.  In  fact,  people  often  take  actions  that  they  normally  would  dislike 
in  the  presence  of  a  perceived  need  to  reciprocate  (Cialdini  2001).  Further,  relatively  small  con¬ 
cessions  can  result  in  relatively  large  reciprocations  (Cialdini  and  Ascani  1976). 

Consequently,  an  auditor’s  reaction  to  a  client’s  agreement  to  record  an  objective  adjustment 
as  if  it  is  a  concession  could  create  an  environment  in  which  the  auditor  feels  the  need  to 
reciprocate  the  client’s  concession  when  negotiating  other  audit  issues  (e.g.,  a  material  subjective 
item).  It  is  important  to  note  that  the  influence  of  reciprocity  will  be  manifested  as  a  social  need 
to  concede  in  the  negotiation;  we  do  not  expect  it  to  influence  pre-negotiation  aspirations  (e.g., 
goals  and  limits).  Thus,  we  expect,  based  on  reciprocity  theory,  that  a  prior  client  concession  on 
the  objective  issue  creates  social  pressure  for  the  auditor  to  reduce  the  magnitude  of  his/her  initial 
negotiation  position  (i.e.,  concede  on  his/her  initial  counter-offer),  yielding  the  following  hypoth¬ 
esis: 

H2:  In  the  presence  of  a  prior  client  concession  to  the  auditor’s  request  to  record  a  significant 
objective  item,  the  auditor’s  initial  negotiation  position  for  a  subsequent  material  sub¬ 
jective  issue  will  be  lower  than  in  the  absence  of  this  client  concession. 

The  above  discussion  proposes  that  the  magnitude  of  the  audit  difference  for  the  subjective 
issue,  as  well  as  a  prior  client  concession,  will  influence  the  auditor’s  initial  position  in  the  ensuing 
negotiation  over  the  subjective  issue.  Recall  that  a  general  conclusion  in  negotiation  research  is 
that  higher  initial  positions  result  in  higher  negotiation  outcomes  (e.g.,  Rubin  and  Brown  1975). 
We  test  this  relationship  because  unique  aspects  of  the  audit  setting  could  influence  this  relation¬ 
ship.  For  example,  Bame-Aldred  and  Kida  (2007)  and  Ng  and  Tan  (2003)  find  that  auditors  are 
less  likely  than  other  negotiators  to  make  concessions  during  the  negotiation  process,  most  likely 
because  of  professional  concerns  that  cause  auditors  to  perceive  that  it  is  inappropriate  to  propose 
one  value  in  an  effort  to  achieve  a  different  value.  Thus,  in  an  auditing  context,  the  auditor  s  initial 
position  is  likely  to  be  a  particularly  predictive  value,  possibly  even  the  final  negotiated  outcome 
(i.e.,  no  concession).  Consequently,  we  expect  that  the  characteristics  of  the  audit  context  exam¬ 
ined  in  this  study  will  affect  the  negotiated  outcome  and  that  this  effect  will  be  caused,  in  large 
part,  by  the  auditor’s  initial  position.  This  leads  to  the  following  mediation  hypothesis: 

H3:  The  magnitude  of  the  audit  difference  as  well  as  the  occurrence  of  a  prior  client  con¬ 
cession  will  affect  the  negotiation  outcome  through  their  effect  on  the  auditor’s  initial 
negotiation  position. 


III.  METHOD 

Task  and  Procedure 

The  102  participants  for  which  we  report  results  include  40  partners,  60  managers,  and  two 
seniors  from  public  accounting  firms  ranging  in  size  from  regional  to  Big  4.  On  average,  partici¬ 
pants  had  14.1  years  of  experience  (S.D.  =  8.4),  and  self-assessed  their  level  of  experience 
resolving  audit  differences  with  clients  as  extensive  (mean  =  9.6,  S.D.  =  1.9,  with  1  =  No 
Experience  and  11  =  Extensive  Experience).  Seventy-four  participants  (73  percent)  are  male  and 
28  female.  Participants  took  an  average  of  approximately  18  minutes  to  complete  the  case. 


We  include  the  two  seniors  in  all  analyses  because 
Patterns  of  cell  means  and  statistical  inferences  are 


both  indicated  that  they  had  experience  negotiating  audit  issues, 
the  same  as  those  reported  if  these  two  participants  are  excluded 
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We  distributed  and  collected  cases  with  assistance  from  contact  partners  at  public  accounting 
firms,  primarily  in  the  southeastern  United  States.  Each  partner  provided  us  with  the  number  of 
partner  and  manager  participants  whom  they  thought  would  complete  the  case.  We  sent  a  corre¬ 
sponding  number  of  USB  drives  containing  the  case  to  the  respective  offices.^  Completed  cases 
were  returned  to  the  contact  partner,  who  returned  the  USB  drives  to  us.  The  four  versions  of  the 
case  were  randomly  distributed  to  participants  and  contact  partners  did  not  know  which  version  of 
the  case  was  included  on  any  given  USB  drive.  We  received  108  cases  and  excluded  six  partici¬ 
pants  from  the  analyses,  yielding  a  sample  of  102  used  in  the  analyses. 

The  Case 

Each  participant  received  a  cover  letter  and  a  USB  drive  containing  the  case  materials.  We 
asked  participants  to  assume  the  role  of  manager  on  the  audit  of  the  2006  financial  statements  for 
Wareham  Electronics,  a  publicly  traded  (on  the  New  York  Stock  Exchange)  manufacturer  of 
electronic  products.  Participants  then  viewed  the  unaudited  2006  and  audited  2005  balance  sheet 
and  income  statement.  Participants  could  choose  to  review  the  financial  statements  at  virtually  any 
time  while  completing  the  case  (see  the  Appendix  for  details  and  examples  of  screens  from  the 
case). 

Participants  then  learned  of  the  existence  of  an  audit  issue  (either  the  subjective  inventory 
valuation  issue  or  a  quantitatively  immaterial  objective  issue,  depending  on  the  condition).  For 
each  issue,  participants  were  notified  that  Wareham’s  controller  requested  a  meeting  to  discuss  it; 
to  help  the  manager  prepare  for  the  meeting,  the  audit  team  prepared  a  memo  to  summarize  the 
issue.  It  is  important  to  note  that,  when  participants  were  addressing  the  first  audit  issue,  they  were 
unaware  that  a  second  issue  would  arise.  After  proceeding  to  the  second  audit  issue,  participants 
could  not  revise  their  responses  to  the  first  audit  issue  and,  after  completing  the  second  audit  issue, 
participants  proceeded  directly  to  the  manipulation  check  questions  and  could  not  alter  any  pre¬ 
vious  responses. 

Objective  Audit  Issue 

The  objective  audit  difference  concerned  a  professional  services  invoice  received  in  early 
2007  that  was  related  to  outsourced  internal  audit  services  provided  in  2006.  The  amount, 
$711,000,  should  have  been  accrued  as  an  expense  in  2006  (the  year  in  which  services  were 
rendered),  but  because  of  a  clerical  error  by  an  accounts  payable  clerk  it  was  recorded  as  an 
expense  in  2007.  The  magnitude  of  the  issue,  $711,000,  was  intended  to  be  considered  immaterial. 


from  the  analyses.  Further,  because  Trotman  et  al.  (2009)  and  McCracken  et  al.  (2008)  suggest  differences  between 
partners’  and  managers’  negotiation  abilities/strategies,  we  consider  whether  experience  affects  our  results.  Two  mea¬ 
sures  of  participant  experience,  position  (i.e.,  partner/manager)  and  years  of  audit  experience,  while  correlated  with  each 
other,  are  not  correlated  with  any  of  the  other  measures.  In  addition,  we  included  position  and  years  of  experience  as 
independent  variables  in  supplemental  analyses  to  test  whether  they  affected  the  results;  there  was  no  evidence  that 
either  experience  measure  was  related  to  any  of  the  dependent  measures,  nor  were  the  statistical  inferences  involving  the 
manipulated  variables  different. 

®  We  sent  extra  USB  drives  to  each  contact  partner,  some  of  which  may  not  have  been  distributed;  consequently  we  do  not 
have  an  exact  response  rate.  However,  at  least  90  percent  of  the  case  materials  distributed  was  completed. 

Two  participants  failed  at  least  one  manipulation  check  and  three  took  less  than  two  minutes  to  complete  the  case  (one 
participant  did  both).  We  considered  two  minutes  as  unreasonable  given  the  average  of  over  18  minutes  to  complete  the 
case  and  that,  aside  from  these  three  participants,  none  spent  less  than  five  minutes  to  complete  the  case.  Of  these  four 
participants,  three  provided  enough  data  for  analyses;  patterns  of  cell  means  and  statistical  inferences  are  the  same  if 
they  are  included  in  any  analyses.  Additionally,  we  omit  two  participants  who  provided  materiality  assessments  (the 
covariate  in  our  analyses)  identified  as  extreme  outliers  (i.e.,  more  than  two  standard  deviations  from  the  mean)  based 
on  a  box  plot  of  responses.  If  these  participants’  responses  are  included  in  the  analyses,  patterns  of  cell  means  and 
statistical  Inferences  are  the  same  as  those  reported. 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


The  Effect  of  Magnitude  of  Audit  Difference 


1655 


but  significant/'^  We  intended  to  create  a  scenario  in  which  participants  would  be  inclined  to  make 
the  adjustment  (to  be  consistent  with  SAS  No.  107;  AICPA  2006a). 

Participants  were  provided  with  the  journal  entry  that  would  be  necessary  to  correct  the  error 
and  then  were  asked  to  indicate  their  preference  for  whether  the  adjustment  should  be  recorded 
(with  higher  numbers  indicating  a  greater  preference  for  adjustment).  To  ensure  that  all  partici¬ 
pants  knew  that  the  auditor  recommended  the  adjustment  and  that  the  client  conceded,  when  this 
issue  was  presented  first,  they  were  told  that  prior  to  the  meeting  with  Wareham’s  controller,  “you 
and  the  audit  engagement  partner  ...  agreed  that  your  firm  should  recommend  that  Wareham 
correct  this  error,”  that  Wareham’s  controller  agreed  to  make  the  correcting  entry,  and  that  the 
financial  statements  were  adjusted  to  reflect  the  effects  of  the  journal  entry.  Overall,  only  18 
percent  of  participants  (18  participants)  responded  below  the  scale  midpoint. 

Subjective  Audit  Issue 

The  subjective  issue  involved  the  estimate  for  the  allowance  for  obsolete  inventory  (see  the 
Appendix  for  a  detailed  description).  Because  a  competitor  introduced  a  technologically  superior 
and  less  expensive  product,  Wareham  needed  to  increase  its  existing  allowance.  The  audit  team 
developed  an  independent  estimate  ($14  million)  that  was  materially  larger  than  the  client’s  al¬ 
lowance,  substantiated  by  a  memo  from  the  audit  team.  Participants  provided  expectations  regard¬ 
ing  the  upcoming  discussions  (i.e.,  their  own  goals  and  limits  and  their  perceptions  of  the  client  s 
goals  and  limits)  and  then  completed  a  simulated  negotiation  with  the  client. 

In  the  ensuing  negotiation,  participants  were  reminded  of  the  magnitude  of  the  audit  team’s 
estimate  and  Wareham’s  balance  and  were  asked  to  make  one  of  three  choices:  (1)  accept  Ware¬ 
ham’s  balance;  (2)  insist  that  Wareham  adjust  the  balance  to  $14  million;  or  (3)  propose  an 
adjustment  somewhere  between  the  two  amounts.  If  participants  chose  option  1,  they  were  asked 
to  confirm  that  they  decided  to  waive  the  adjustment;  they  then  moved  to  either  the  second  audit 
issue  or  the  manipulation  check  questions,  depending  upon  the  condition.  If  participants  chose 
option  2  or  3,  they  were  told  that  Wareham  responded  by  offering  to  increase  its  account  balance. 

Each  negotiation  round  started  with  participants  being  offered  the  same  three  options  as  on 
the  first  negotiation  screen  (with  option  2  being  equal  to  the  auditor’s  previous  proposed  amount). 
When  a  participant  contended  (i.e.,  did  not  move  from  the  balance  stated  in  the  prior  round),  we 
programmed  Wareham’s  response  as  an  increase  of  10  percent  of  the  difference  between  the 
auditor’s  and  Wareham’s  balance.  If  the  auditor  contended  for  three  rounds,  the  negotiation  ended, 
and  the  auditor  required  an  adjusting  entry  based  on  the  their  balance. 

As  participants  proposed  account  balances,  they  viewed  the  adjusting  journal  entry  that  would 
result.  If  the  auditor  proposed  a  new  adjustment  (optibn  3  above),  then  the  client’s  response  was 
programmed  based  on  prior  negotiation  research  finding  that  concessions  by  one  party  are  likely  to 
be  matched  by  the  other  party  (Pruitt  1998).  In  our  case,  the  client  engages  in  tracking  (or 
concession  matching)  behavior,  a  common  negotiation  strategy  in  which  a  negotiator  makes  con- 


Arguably,  the  most  commonly  used  quantitative  measure  of  materiality  is  5  percent  of  net  income  (c-g->  SEC  1999). 
Based  on’ this  measure,  quantitative  materiality  in  our  case  is  approximately  $1.8  million.  We  define  significant  as  large 
enough  to  be  added  to  a  list  of  unadjusted  audit  differences  that  are  aggregated  to  assess  whether  their  cumulative  effect 
is  material.  Additionally,  significant  items,  if  waived,  would  need  to  be  reported  to  the  audit  committee  (SAS  No.  114; 

AICPA  2006b).  u 

Most  of  these  18  participants  were  near  the  scale  midpoint  of  6  (i.e.,  between  3  and  5),  suggesting  that  these  participants 

only  mildly  agreed  with  the  statement  that  the  item  could  be  waived.  More  participants  in  the  objective  item  first 
condition  fell  below  the  scale  midpoint  (i.e.,  would  prefer  to  waive  the  adjustment).  This  is  consistent  with  the  idea  that 
participants  who  received  the  objective  item  second  and  had  already  made  concessions  in  the  negotiation  involving  the 

subjective  issue  were  less  likely  to  concede  further.  ,  r  j  \ 

While  negotiating,  participants  were  not  aware  of  when  negotiations  would  end  (i.e.,  the  maximum  number  of  rounds). 
In  addition,  during  negotiations,  the  auditor’s  new  proposed  balance  was  included  in  all  subsequent  negotiation  screens. 
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cessions  similar  in  amount  as  their  counterparts’  (Pruitt  Specifically,  Wareham  increased 

its  previous  balance  by  either  110  or  90  percent  of  the  amount  that  the  auditor  moved  from  his/her 
previous  offer  (we  alternated  between  110  and  90  percent  to  simulate  a  negotiation  in  which  client 
counter-offers  did  not  obviously  match  the  auditor’s  movement). 

To  ensure  that  the  case  took  a  reasonable  amount  of  time  to  complete,  the  ensuing  negotiation 
could  last  up  to  five  rounds  and  would  end  at  any  time  if  the  difference  between  the  auditor  and 
client  balances  was  less  than  $200,000.  At  the  end  of  the  negotiation,  participants  were  shown  the 
journal  entry  that  Wareham  would  make  and  were  told  that  the  financial  statements  were  adjusted 
to  reflect  the  effects  of  this  entry  (participants  had  the  opportunity  to  view  the  adjusted  financial 
statements).  Upon  resolution  of  the  second  audit  issue,  participants  completed  manipulation  check 
questions  related  to  the  two  independent  variables,  provided  their  assessment  of  materiality  (based 
on  the  unadjusted  financial  statements),  and  answered  several  questions  related  to  the  case  and 
themselves  (including  demographic  questions).  In  our  statistical  analyses,  we  include  materiality 
assessments  as  a  covariate  because  we  believe  that  the  extent  to  which  individuals  differ  with 
regard  to  materiality  assessments  could  influence  the  size  of  preferred  audit  adjustments.  Conse¬ 
quently,  our  analyses  control  for  random  variation  in  the  dependent  variables  caused  by  differ¬ 
ences  in  individuals’  materiality  assessments.'^ 

Independent  Variables 

To  test  the  hypotheses,  we  utilize  a  2  X  2  between-subjects  experimental  design  that  manipu¬ 
lated  (1)  the  magnitude  of  the  audit  difference;  and  (2)  whether  the  client  conceded  on  an  objec¬ 
tive,  immaterial  but  significant,  issue  prior  to  resolving  the  inventory  valuation  issue  (i.e.,  prior 
client  concession).  We  manipulated  the  magnitude  of  the  audit  difference  by  varying  the  client’s 
unaudited  inventory  valuation  balance  as  either  $8  million  (i.e.,  high  difference)  or  $11.2  million 
(low  difference).  Recall  that  the  audit  team  estimate  was  $14  million,  so  the  magnitude  of  either 
audit  difference  (i.e.,  $6  million  or  $2.8  million)  is  always  material.  We  operationalized  the  second 
factor,  prior  client  concession,  by  altering  the  order  of  the  two  issues.  Half  of  the  participants 
resolved  the  objective  issue  first  and  half  resolved  the  subjective  inventory  valuation  issue  first. 

To  validate  our  assertion  that  these  issues  are  addressed  sequentially  in  practice,  in  a  supple¬ 
mental  data  collection  (detailed  in  the  “Additional  Experimentation’’  section)  we  presented  par¬ 
ticipants  with  a  scenario  identical  to  the  objective  issue  and  asked  them  when  they  would  resolve 
it  (1  =  Definitely  resolve  now;  11  =  Definitely  resolve  at  end  of  audit).  The  mean  response  is 
2.89;  of  the  33  responses,  15  equal  1  and  29  (87.9  percent)  are  less  than  5,  which  we  interpret  as 
strong  evidence  that  auditors  would  resolve  this  issue  sequentially  (i.e.,  at  the  time  it  arose). 

Dependent  Variables 

The  primary  dependent  variables  used  to  test  the  hypotheses  are  those  elicited  during  consid¬ 
eration  of  the  subjective  inventory  issue,  including  the  ensuing  negotiation.  Specifically,  we  ob¬ 
tained  pre-negotiation  amounts  by  asking  participants  to  provide  their  negotiation  goal  and  lowest 


Tracking,  or  concession  matching,  is  a  strategy  by  which  a  negotiator  will  concede  a  similar  amount  as  his/her  coun¬ 
terpart.  For  example,  when  a  seller  comes  down  $10,000  on  the  value  of  a  home,  a  buyer  using  the  matching  strategy 
likely  will  increase  their  last  offer  by  around  $10,000. 

While  a  covariate  is  best  measured  prior  to  any  manipulations  to  ensure  that  the  manipulations  do  not  influence  the 
covariate,  we  measured  materiality  assessments  at  the  end  of  the  case  because  we  were  concerned  that  such  a  question 
might  influence  participants’  negotiation  behavior.  Because  materiality  assessments  could  have  been  associated  with  the 
independent  variables,  we  examined  the  correlations  between  the  independent  variables  and  materiality  assessments. 
Prior  to  the  removal  of  two  extreme  outliers  (discussed  in  footnote  10),  Prior  Concession  was  marginally  correlated  with 
materiality  assessments;  however,  this  correlation  became  insignificant  when  these  observations  were  removed,  suggest¬ 
ing  that  materiality  is  an  appropriate  covariate. 
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acceptable  balance  (i.e.,  negotiation  limit)  for  the  allowance  account.  Such  pre-negotiation  mea¬ 
sures  are  employed  in  audit  negotiation  research  as  they  are  indicative  of  the  resulting  negotiated 
outcomes  (e.g.,  Bame-Aldred  and  Kida  2007;  Trotman  et  al.  2009).  At  the  start  of  the  negotiation 
process,  participants  indicated  their  initial  negotiation  position,  which  serves  as  a  pre-negotiation 
measure  and  the  mediating  variable  for  the  test  of  H3.  Finally,  the  outcome  that  results  at  the 
conclusion  of  the  negotiation  is  used  as  the  dependent  variable  for  H3. 

To  provide  a  supplemental  measure  of  practical  significance,  we  considered,  for  each  inde¬ 
pendent  variable,  whether  participants’  goals,  limits,  initial  negotiation  positions,  and  negotiation 
outcomes  were  materially  lower  than  the  audit  team’s  estimate.  For  this  analysis,  we  subtract  the 
appropriate  materiality  figure  from  the  audit  team  estimate  and  determine  the  frequency  with 
which  the  respective  dependent  variables  fall  below  this  figure.  We  first  use  a  common  quantitative 
assessment  of  materiality,  5  percent  of  net  income  (e.g.,  SEC  1999;  Messier  et  al.  2008).  For 
robustness,  we  also  consider  two  measures  of  account  level  materiality  (i.e.,  tolerable  misstate¬ 
ment),  75  and  50  percent  of  overall  materiality.^®  These  tests  provide  evidence  indicating  how 
often  the  manipulated  variables  result  in  economically  and  practically  significant  differences  rela¬ 
tive  to  the  audit  team’s  independent  estimate. 

TV.  RESULTS 

Magnitude  of  the  Audit  Difference  (Hla  and  Hlb) 

Hypotheses  la  and  lb  predict  that  the  magnitude  of  the  auditor’s  goals  and  limits  as  well  as 
their  initial  negotiation  position  will  be  inversely  related  to  the  magnitude  of  the  initial  audit 
difference  for  the  inventory  valuation  issue.  Table  1  reports  the  means  for  the  dependent  variables 
and  a  summary  of  results  of  t-tests  that  examine  the  effects  of  prior  concession  and  magnitude  of 
the  audit  difference  on  the  dependent  variables.'^  Hypothesis  la  predicts  that  auditors’  pre¬ 
negotiation  positions  will  be  lower  when  the  magnitude  of  the  audit  difference  is  relatively  high 
than  when  it  is  relatively  low  (higher  positions  reflect  more  conservative  account  balances).  The 
auditors’  goal  when  the  magnitude  of  the  audit  difference  was  high  ($13,283,205)  was  signifi¬ 
cantly  less  than  when  the  magnitude  was  low  ($13,644,021;  p  =  0.02).  Further,  auditors’  limit 
when  the  magnitude  of  the  audit  difference  was  high  ($12,469,147)  was  significantly  less  than 
when  the  magnitude  was  low  ($13,163,508;  p  <  0.01),  supporting  Hla.  These  results  suggest  that 
auditors’  aspirations  regarding  the  negotiation  outcome  are  lower  when  the  magnitude  of  the 
difference  is  higher  (i.e.,  reflecting  a  less  conservative  account  balance).  Auditors’  initial  negotia¬ 
tion  positions  are  also  significantly  lower  when  the  magnitude  of  the  audit  difference  is  high 
($12,929,204)  compared  to  when  the  magnitude  is  Iqw  ($13,437,903;  p  <  0.01),  supporting  Hlb. 

Table  2  reports  the  results  of  one-tailed  Fisher’s  Exact  Tests  that  provide  additional  evidence 
regarding  economic  (i.e.,  practical)  significance.  Specifically,  we  examine  the  proportion  of  audi¬ 
tors  whose  initial  negotiation  position  is  materially  below  the  audit  team’s  estimate.  Consistent 
with  Hlb,  when  the  magnitude  of  the  difference  is  low,  only  4.1  percent  of  participants  have  an 


Our  review  of  two  large  international  accounting  firms’  audit  manuals,  conversations  with  Big  4  audit  partners,  and 
various  audit  texts  (e.g.,  Messier  et  al.  2008;  Puncel  2008)  reveal  that  account  balance  materiality  (i.e.,  tolerable  error) 
typically  is  between  50-75  percent  of  overall  materiality  (with  50  percent  being  the  most  conservative  measure). 

We  first  conducted  ANCOVAs  for  the  full  2X2  design  for  all  dependent  variables  described  in  the  “Method”  section 
(preceded  by  a  MANGO VA  because  the  dependent  variables  are  highly  correlated).  We  report  results  for  HI  and  H2 
using  t-tests  that  take  the  covariate  into  account  by  using  the  mean  squared  error  (i.e.,  MSB)  from  the  ANCOVA  to 
compute  single  degree  of  freedom  contrasts.  As  all  hypotheses  are  directional,  associated  p-values  are  one-tailed  and, 
because  we  base  our  analyses  on  ANCOVA,  we  report  adjusted  means.  The  ANCOVA  results,  not  tabulated,  are 
consistent  with  the  results  of  these  tests,  and  there  were  no  significant  interactions  in  any  of  the  analyses.  We  also 
conducted  the  analyses  based  on  converting  our  dependent  variables  to  ranked  data  because  there  is  some  evidence  that 
our  dependent  variables  violate  the  normality  assumption.  The  results  based  on  ranked  data  yield  patterns  of  cell  means 
and  statistical  inferences  that  are  the  same  as  those  reported  in  Table  1. 
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TABLE  1 
Hypothesis  Tests 


Dependent  Variables 

Auditor’s  Negotiation  Goal 

Auditor’s  Negotiation  Limit 

Auditor’s  Initial  Negotiation 
Position 

Negotiation  Outcome 


Magnitude  of  Difference 
Mean 


(Std.  Dev.)® 

High 

Low 

n  =  53 

n  =  49 

13,283,205 

13,644,021 

(1,045,099) 

(587,311) 

12,469,147 

13,163,508 

(1,104,601) 

(868,177) 

12,929,204 

13,437,903 

(1,017,842) 

(644,677) 

12,740,492 

13,196,996 

(992,340) 

(742,461) 

Prior  Concession 
Mean 


(Std. 

Dev.) 

Yes 

No 

n  =  52 

n  =  50 

13,347,938 

13,579,288 

(928,609) 

(825,837) 

12,758,763 

12,873,892 

(1,001,422) 

(1,158,092) 

13,040,122 

13,326,985 

(928,490) 

(862,288) 

12,787,392 

13,150,096 

(1,001,904) 

(1,008,523) 

Statistical  Tests 
t-statistic 
(p-value) 
[Hypothesis] 


Magnitude 

of 

Difference 

Prior 

Concession 

2.08 

1.36 

(0.02) 

(0.18) 

[HI  a] 

[NA] 

3.56 

0.60 

(<0.01) 

(0.55) 

[Hla] 

[NA] 

2.91 

1.67 

(<0.01) 

(0.049) 

[Hlb] 

[H2] 

2.34 

1.89 

(0.01) 

(0.03) 

[H3] 

[H3] 

“  All  reported  means  are  adjusted  means  from  an  ANCOVA  with  Magnitude  of  the  Audit  Difference  and  Prior  Concession 
as  the  independent  variables  and  participants’  materiality  assessments  as  the  covariate. 

Because  all  hypotheses  are  directional,  the  reported  p-values  for  hypothesis  tests  are  one-tailed.  The  results  are  reported 
using  t-tests  following  an  ANCOVA  that  use  the  MSB  from  the  ANCOVA  to  compute  a  single  degree  of  freedom 
contrast. 


TABLE  2 

Tests  of  Economic  Significance 
Significance  Levels  for  Fisher’s  Exact  Tests® 


Materiality  75%  Materiality  50%  Materiality 


Dependent  Variables 

Magnitude 

of 

Difference 

Prior 

Concession 

Magnitude 

of 

Difference 

Prior 

Concession 

Magnitude 

of 

Difference 

Prior 

Concession 

Auditor’s  Negotiation  Goal 

<0.01 

0.76 

<0.01 

0.61 

0.03 

0.09 

Auditor’s  Negotiation  Limit 

<0.01 

1.00 

<0.01 

1.00 

<0.01 

0.31 

Auditor’s  Initial 

<0.01 

0.11 

0.03 

0.05 

<0.01 

0.06 

Negotiation  Position 
Negotiation  Outcome 

<0.01 

0.10 

0.09 

0.03 

0.04 

0.02 

Numbers  in  table  are  p-values  based  on  Fisher’s  Exact  Test.  All  p-values  one-tailed  (consistent  with  hypothesis  tests) 
except  for  the  tests  of  Prior  Concession  for  Auditor’s  Negotiation  Goal  and  Auditor’s  Negotiation  Limit  (which  are 
two-tailed),  because  we  do  not  provide  related  hypotheses.  Significant  p-values  indicate  that  participants’  goals,  limits, 
initial  negotiation  positions,  as  well  as  negotiated  outcomes  were  more  likely  to  be  materially  below  the  audit  team’s 
estimate  when  the  magnitude  of  the  audit  difference  is  high  or  if  there  is  a  prior  client  concession. 
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initial  negotiation  position  materially  below  the  audit  team’s  estimate,  as  compared  to  30.2  percent 
when  the  magnitude  of  the  difference  is  high  (p  <  0.01).  Results  are  similar  (p  =  0.03  and  p  < 
0.01)  when  using  the  endpoints  of  the  range  for  tolerable  misstatement  as  the  basis  of  comparison 
(i.e.,  75  and  50  percent  of  materiality,  respectively).  Consistent  with  Hla,  we  find  similar  results 
for  the  auditor’s  goal  (limit).  Specifically,  for  goal  (limit),  the  low-magnitude  audit  difference 
results  in  2.0  percent  (16.0  percent)  of  participants  providing  values  materially  below  the  audit 
team  estimate,  as  compared  to  18.9  percent  (51.9  percent)  when  the  magnitude  of  the  difference  is 
high  (all  p-values  <  0.01).  Inferences  are  the  same  when  considering  either  measure  of  tolerable 
misstatement.  These  tests  provide  evidence  that  auditors  are  more  likely  to  move  materially 
toward  the  client’s  unaudited  balance  when  the  magnitude  of  the  audit  difference  is  high. 

Prior  Concession  (H2) 

Hypothesis  2  predicts  that  when  a  client  makes  a  concession  prior  to  the  negotiation,  the 
auditor  will  feel  the  need  to  reciprocate  the  concession,  resulting  in  a  lower  initial  negotiation 
position.  Results  reported  in  Table  1  indicate  that,  in  the  presence  of  a  prior  client  concession, 
auditors’  mean  initial  negotiation  position  ($13,040,122)  is  lower  than  in  the  absence  of  a  prior 
concession  ($13,326,985;  p  <  0.05),  providing  support  for  H2.  While  the  difference  in  mean 
responses  ($286,863  for  initial  negotiation  position  and  $362,704  for  negotiation  outcome)  does 
not  rise  to  the  level  of  materiality  or  even  tolerable  misstatement,  misstatements  of  this  amount 
(about  15  percent  of  materiality)  would  likely  be  large  enough  to  be  listed  in  a  summary  of 
unadjusted  audit  differences  and,  if  waived,  disclosed  to  the  audit  committee  (e.g.,  SAS  No.  114; 
AICPA  2006b). 

Further  analysis  in  Table  2  to  provide  evidence  of  economic  significance  shows  that  23.1 
percent  of  auditors  in  the  prior  concession  condition  had  an  initial  negotiation  position  materially 
below  the  audit  team’s  estimate,  as  compared  to  12.0  percent  in  the  absence  of  a  concession  (p  = 
0.11;  one-tailed  Fisher’s  Exact  test).  When  using  tolerable  misstatement  as  the  basis  for  compari¬ 
son,  results  are  stronger  (p  =  0.05  and  p  =  0.06  for  75  percent  and  50  percent  of  materiality, 
respectively).  In  addition,  statistical  inferences  are  the  same  for  the  negotiation  outcome  (p  = 
0.10,  0.03,  and  0.02  for  the  three  levels  of  materiality,  respectively).  These  tests  provide  additional 
evidence  that  auditors  are  more  likely  to  move  materially  toward  the  client’s  unaudited  balance  in 
the  presence  of  a  prior  client  concession. 

Negotiated  Outcome:  Mediating  Role  of  Initial  Demand  (H3) 

The  dependent  variables  examined  in  the  first  two  hypotheses  are  based  on  prior  research 
demonstrating  that  negotiators’  pre-negotiation  positions  (goals  and  limits)  as  well  as  their  initial 
negotiation  positions  are  strongly  correlated  with  negotiators’  eventual  success  in  the  negotiation. 
That  is,  higher  goals,  limits,  and  initial  demands  result  in  higher  outcomes  for  a  negotiator  (Rubin 
and  Brown  1975).  However,  to  obtain  further  confidence  that  the  independent  variables  considered 
in  this  study  influence  the  negotiation,  we  simulate  a  negotiation,  via  computer  interaction,  with 
the  client’s  controller.  As  discussed  above,  the  client  negotiator  is  programmed  with  a  matching 
strategy,  which  is  a  predominant  negotiation  style  in  this  context  (Pruitt  1998).  Using  the  outcome 
of  the  negotiation  as  the  dependent  variable,  both  magnitude  of  the  audit  difference  and  prior 
concession  have  a  significant  effect  on  the  negotiation  outcome  (one-tailed  p-values  =  0.01  and  = 
0.03,  respectively;  see  Table  1),  with  the  pattern  of  means  and  statistical  inferences  consistent  with 
those  reported  above. 


For  example,  one  large  international  firm  we  consulted  has  guidelines  stating  that  its  Audit  Difference  Posting  Threshold 
is  3  to  5  percent  of  materiality,  and  another  indicated  that  its  threshold  is  5  percent  of  materiality,  both  well  below  the 
differences  we  observe. 
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Hypothesis  3  predicts  that  both  manipulations  will  affect  the  negotiation  outcome  and  that 
these  effects  are  attributable  to  their  influence  on  the  auditor’s  initial  negotiation  position.  We  test 
this  prediction,  diagramed  in  Figure  1,  with  mediation  analyses  based  on  Baron  and  Kenny  (1986). 
The  mediating  relationship  will  be  demonstrated  if  three  conditions  are  met:  (1)  magnitude  of 
audit  difference  and  prior  concession  influence  negotiation  outcome  (paths  Cl  and  C2  in  Figure 
1),  (2)  magnitude  of  audit  difference  and  prior  concession  significantly  influence  initial  negotiation 
position  (paths  A1  and  A2  in  Figure  1),  and  (3)  when  the  initial  negotiation  position  is  included  in 
the  model,  the  effect  of  the  manipulated  variables  become  insignificant.  The  resulting  beta  coef¬ 
ficients  of  these  regressions  and  their  significance  levels  are  included  in  the  paths  of  the  mediating 
model  in  Figure  1. 

Results  indicate  that  condition  1  is  met,  as  the  coefficients  for  paths  Cl  and  C2  are  significant 
when  the  mediator  is  not  included  in  the  model.  Further,  condition  2  is  met,  as  paths  A1  and  A2 
are  significant.  Finally,  full  mediation  is  indicated,  as  the  effects  of  both  magnitude  of  audit 
difference  and  prior  concession  (paths  Cl  and  C2)  become  insignificant  when  the  mediating 
variable  (i.e.,  initial  negotiation  position)  is  added  to  the  model  (condition  3)  and  path  B  is 
significant.  These  results  indicate  that  the  effect  of  our  two  independent  variables  on  the  negotia¬ 
tion  outcome  is  attributable  to  their  influence  on  auditors’  initial  negotiation  positions  (i.e.,  medi- 


FIGURE  1 

Mediating  Relationship 
(Beta  Coefficients  for  Paths) 


*,  **,  ***  p  <  0.10,  p  <  0.05,  and  p  <  0.01,  respectively. 

Equation  (1);  Initial  Negotiation  Position  =  A1  (magnitude  of  estimate  difference)  +  A2  (prior  concession) 
Equation  (2):  Negotiation  Outcome  =  B  (Initial  Negotiation  Position) 

Equation  (3):  Negotiation  Outcome  =  Cl  (magnitude  of  estimate  difference)  +  C2  (prior  concession) 

Equation  (4):  Negotiation  Outcome  =  C 1  (magnitude  of  estimate  difference)  +  C2  (prior  concession)  +  C3 
(Initial  Negotiation  Position) 

“  We  coded  Magnitude  of  Audit  Difference  as  1  if  high,  and  0  if  low.  Prior  Concession  is  coded  as  1  in  the 
presence  of  a  prior  concession,  and  0  in  its  absence.  For  ease  of  interpretation,  the  analyses  in  this  figure  do 
not  include  Materiality  as  a  control  variable;  if  materiality  is  included,  it  is  not  significant  in  any  of  the 
equations,  p-values  and  Beta  coefficients  generally  are  virtually  the  same  as  those  reported,  and  full  media¬ 
tion  still  occurs. 
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ated  paths  A1  and  A2,  and  path  B).  While  it  is  possible  that  the  mechanical  nature  of  the  computer 
tracking  strategy  drove  the  mediation  finding,  a  closer  examination  of  the  data  suggests  that  the 
auditors’  use  of  a  contending  strategy  possibly  drives  the  mediation  results.  Seventy-six  percent  of 
participants  stick  to  their  initial  offer  without  conceding  (same  percentage  across  conditions).  This 
level  of  contention  demonstrates  a  difference  from  the  audit  and  psychology  negotiation  literature 
and  appears  to  result  in  a  particularly  strong  link  between  initial  negotiation  position  and  nego¬ 
tiation  outcome. 

Additional  Experimentation 

We  gathered  additional  data  to  rule  out  potential  alternative  explanations  for  each  independent 
variable.  First,  while  we  use  negotiation  theory  as  a  basis  for  predicting  the  findings  related  to  the 
magnitude  of  the  audit  difference,  a  possible  alternative  explanation  is  that  participants  were  less 
confident  in  the  accuracy  of  the  audit  team’s  estimate  when  it  differs  more  from  the  client’s 
balance.  Consequently,  we  test  whether  auditors’  confidence  in  the  audit  team’s  estimate  differs 
based  on  the  magnitude  of  the  audit  difference.  Second,  while  we  base  our  prediction  for  prior 
concession  on  reciprocity  theory,  participants  could  view  a  prior  concession  as  an  indication  that 
the  client  is  an  “easy  negotiator,”  and  thus  start  the  negotiation  closer  to  the  client’s  balance,  which 
yields  the  same  directional  prediction.  Accordingly,  we  test  whether  auditors’  expectations  of  the 
difficulty  of  the  negotiation  vary  based  on  whether  the  client  has  made  a  prior  concession. 

Participants  in  the  additional  experiment  included  33  audit  partners  and  managers  from  large 
accounting  firms.  Compared  to  the  main  experiment,  participants  included  a  higher  proportion  of 
partners  (61  percent  versus  39  percent  in  the  main  experiment;  Fisher  Exact  Test  p  =  0.04)  and 
had  more  work  experience  (17.5  years  versus  14.1  in  the  main  experiment;  p  =  0.04).  However, 
as  in  the  main  experiment,  when  we  control  for  these  measures  of  experience,  statistical  inferences 
are  the  same  as  those  reported.  Participants’  level  of  experience  resolving  audit  differences  with 
clients  did  not  differ  significantly  across  the  two  experiments  (means  of  9.5  and  9.6,  with  11 
representing  extensive  experience,  for  the  additional  and  main  experiments,  respectively;  p  = 
0.80).  To  examine  the  potential  competing  explanations  for  each  of  the  independent  variables,  we 
conducted  two  1X2  analyses. 

Confidence  in  the  Audit  Team  Estimate 

The  first  analysis  varies  the  magnitude  of  the  audit  difference  to  consider  whether  larger 
differences  led  participants  to  believe  that  the  audit  team’s  estimate  was  of  lower  quality.  We 
provided  participants  with  information  almost  identical  to  that  in  the  main  experiment;  however, 
we  conducted  this  experiment  on  paper,  precluding  us  from  simulating  the  negotiation.^'’  We 
measured  the  auditor’s  goal,  limit,  and  initial  negotiation  position.  Results  reported  in  Table  3 
demonstrate  that  Hla  and  Hlb  are  supported  in  the  supplemental  data  collection.  In  addition, 
results  not  tabulated  indicate  that  statistical  inferences  from  tests  of  economic  significance  iden¬ 
tical  to  those  reported  in  Table  2  are  the  same  as  for  the  main  experiment. 


Of  the  78  participants  who  did  not  move  from  their  initial  position,  43  (55  percent)  proposed  an  account  balance  below 
the  audit  team’s  estimate.  The  mean  and  median  for  these  participants’  initial  positions  (and  outcome,  since  they  did  not 
move  from  their  initial  position)  was  approximately  $12.6  million.  It  is  interesting  to  note  that,  on  an  overall  basis,  these 
participants’  initial  positions  did  not  differ  significantly  from  those  who  did  move  from  their  initial  position  (mean  of 
$12.9  million;  t  =  1.27,  p  =  0.21).  For  participants  who  did  move  from  their  initial  position,  the  negotiation  lasted  an 
average  of  3.42  rounds,  with  no  differences  between  conditions. 

The  only  difference  from  the  main  test  of  HI  conducted  in  the  primary  experiment  is  that  we  use  $11  million  rather  than 
$11.2  million  as  the  initial  client  balance  to  rule  out  the  possibility  that  an  estimate  of  $11.2  was  viewed  as  more  precise 
than  $8.0  or  $14.0,  and  thus  was  given  more  weight. 
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TABLE  3 

Summary  of  Results  for  Additional  Experimentation 

Magnitude  of  Difference 
Mean 
(Std.  Dev.) 


Dependent  Variables 

High 
n  =  15 

Low 
n  =  18 

t-statistic 

Auditor’s  Negotiation  Goal 

13,136,960 

13,630,687 

1.52* 

(1,109,215) 

(600,245) 

Hla 

Auditor’s  Negotiation  Limit 

10,949,312 

12,050,688 

2  54*** 

(1,659,568) 

(884,008) 

Hla 

Auditor’s  Initial  Negotiation 

11,788,670 

12,791,885 

2.38** 

Position 

(1,626,968) 

(738,534) 

Hlb 

Confidence  in  Audit  Team’s 

7.60 

7.06 

0.88 

Estimate 

(1.24) 

(2.10) 

*,  **,  ***  Indicates  p  <  0.10,  p  <  0.05,  and  p  <  0.01,  respectively.  The  p-value  for  test  of  Confidence  in  Audit  Team’s 
Estimate  is  two-tailed,  as  we  do  not  make  a  directional  prediction. 

All  other  p-values  are  one-tailed,  consistent  with  the  hypotheses  (relevant  hypotheses  are  noted  in  the  table).  Results  for 
hypothesis  tests  are  based  on  one-tailed  t-tests  following  an  ANCOVA  with  Magnitude  of  the  Audit  Difference  as  the 
independent  variable  and  participants’  materiality  assessments  as  the  covariate.  These  t-tests  use  the  MSE  from  the 
ANCOVA  to  compute  a  single  degree  of  freedom  contrast.  Because  we  base  our  analyses  for  auditor’s  negotiation  goal, 
negotiation  limit,  and  initial  negotiation  position  on  ANCOVA,  means  for  these  variables  are  adjusted  means. 

The  p-value  for  Confidence  in  the  Audit  Team’s  Estimate  is  based  on  a  two-sample  t-test.  Confidence  in  the  Audit  Team’s 
Estimate  is  measured  on  an  11 -point  scale  with  higher  numbers  indicating  greater  confidence.  All  other  variables  measured 
in  dollars. 


A  potential  alternative  to  negotiation  theory  is  that,  in  our  setting,  auditors  are  less  confident 
in  the  accuracy  of  their  audit  team’s  estimate  when  the  difference  from  the  client’s  balance  is 
larger.  Thus,  it  could  be  an  explicit  consideration  of  the  quality  of  the  information  provided  to  the 
participant  rather  than  the  influence  of  the  client’s  initial  position,  as  suggested  by  negotiation 
theory.  We  address  this  alternative  explanation  by  asking  participants  to  respond  to  the  following 
question  after  they  were  presented  with  the  client’s  balance  and  the  audit  team’s  independent 
estimate:  “How  confident  are  you  in  the  accuracy  of  your  audit  team’s  independent  estimate  of  the 
allowance  for  obsolete  inventory?”  (1  =  Not  at  all  confident;  11  =  Very  confident).  Results 
indicate  that  confidence  in  the  audit  team’s  estimate  is  virtually  the  same  across  condition  (7.60 
and  7.06  for  high  and  low  magnitude  of  the  audit  difference,  respectively;  two-tailed  p  =  0.38). 
Consequently,  it  appears  that  participants’  confidence  in  the  audit  team’s  estimate  does  not  drive 
the  observed  difference  in  negotiation  behaviors. 

Expected  Negotiation  Difficulty 

Participants  could  view  a  prior  concession  as  indicative  of  the  difficulty  of  subsequent  nego¬ 
tiations.  That  is,  auditors  could  perceive  the  concession  as  a  signal  that  the  client  is  an  easy 
negotiator,  which  would  cause  auditors  to  expect  easier  negotiations  to  get  the  desired  outcome, 
and  therefore  take  an  initial  negotiation  position  closer  to  the  client’s  balance.  This  view  provides 


In  addition,  confidence  in  the  estimate  did  not  approach  significance  as  a  control  variable  in  analyses  including  any  of 
the  negotiation  measures.  Confidence  in  the  estimate  exceeded  the  scale  midpoint  for  both  conditions  (p  <  0.01), 
suggesting  that  auditors  were  confident  in  the  accuracy  of  the  audit  team’s  estimate. 
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a  similar  directional  prediction  as  our  argument  that  the  result  is  attributable  to  reciprocity.^^  To 
address  this  counter-explanation,  we  conducted  a  second  1X2  analysis  by  presenting  the  same  33 
auditors  with  a  separate  case  in  which  they  viewed  one  of  two  scenarios  intended  to  replicate  the 
prior  concession  condition  in  the  main  experiment.^^ 

In  one  condition,  we  described  the  resolution  of  an  objective  issue  (the  same  issue  and  the 
same  dollar  amount  as  in  the  main  experiment)  and  then  provided  a  subjective  issue;  specifically, 
we  told  participants  that  the  audit  team  developed  a  balance  for  the  allowance  for  bad  debts 
significantly  higher  than  the  client’s,  and  that  the  participant  would  meet  with  the  client  to  resolve 
the  issue  (dollar  amounts  were  slightly  different  from  the  primary  study  to  ensure  that  participants 
viewed  this  issue  as  distinct  from  the  inventory  issue).  In  the  other  condition,  we  included  only  the 
subjective  allowance  issue.  We  asked  all  participants  to  assess  “How  do  you  expect  the  negotia¬ 
tions  with  the  client  to  be  when  you  try  to  resolve  this  issue?”  (1  =  Very  easy;  11  =  Very 
difficult).  Results  (not  tabulated)  demonstrate  virtually  no  difference  between  perceived  difficulty 
of  the  negotiation  when  only  the  subjective  issue  was  present  (mean  =  7.44,  n  =  16)  than  when 
it  was  preceded  by  the  objective  issue  (mean  =  7.53,  n  =  17;  p  =  0.87). 

V.  DISCUSSION  AND  CONCLUDING  REMARKS 

This  study  reports  the  results  of  an  experiment  that  considers  two  aspects  of  the  auditor’s 
negotiation  environment  that  auditors  possibly  do  not  suspect  will  result  in  different  balances  in 
the  audited  financial  statements;  the  magnitude  of  the  difference  between  a  client’s  unaudited 
balance  for  a  subjective,  material  accounting  issue  relative  to  the  audit  team’s  independent  esti¬ 
mate  and  the  occurrence  of  a  prior  concession  by  management  on  a  significant  objective  item.  We 
find  that  both  factors  affect  auditors’  negotiation  expectations  and  behavior  and,  ultimately,  the 
amounts  reflected  in  the  financial  statements — in  fact,  the  negotiation  outcome  is  fully  mediated 
by  auditors’  pre-negotiation  behavior.  We  also  report  results  suggesting  that  these  differences  are 
economically  significant.  Our  findings  are  consistent  with  the  premise  from  the  negotiation  litera¬ 
ture  that  a  negotiation  counterpart’s  initial  position  (e.g.,  a  client’s  unaudited  balance)  as  well  as 
feelings  of  reciprocity  created  by  prior  concessions  set  expectations  for  the  current  negotiation, 
shaping  the  outcomes. 

These  findings  highlight  important  considerations  for  auditors  who  resolve  audit  differences 
with  their  clients.  The  greater  a  difference  between  the  client’s  unaudited  account  balance  and  the 
auditor’s  independent  estimate  for  that  account,  the  less  likely  it  is  that  the  auditor  will  adjust  that 
account  balance  close  to  the  auditor’s  estimate.  These  results  are  important  because  (1)  ceteris 
paribus,  the  auditor’s  proposed  adjustment  should  be  unaffected  by  the  magnitude  of  the  audit 
difference  when  the  auditor  believes  its  independent  estimate  is  appropriate;  and  (2)  management, 
in  many  cases,  has  incentives  to  bias  estimates  in  the  financial  statements,  and  its  unaudited 
balances  (especially  in  accounts  involving  subjectivity,  such  as  estimates)  likely  reflect  its  incen¬ 
tives  to  bias.  This  “non-normative”  effect  is  particularly  highlighted  by  the  result  that  auditors 
materially  alter  the  point  at  which  they  would  qualify  the  financial  statements  between  conditions 
(i.e.,  negotiation  limit). 

Our  prior  concession  results  suggest  that  the  order  in  which  multiple  issues  in  a  negotiation 
are  addressed  also  can  affect  negotiation  outcomes.  With  regard  to  situations  in  which  issues  differ 
in  importance,  researchers  are  divided  on  whether  the  more  or  less  important  issue  should  be 


While  we  assert  that  our  result  is  attributable  to  reciprocity,  we  did  not  elicit  from  participants  a  perceived  need  to 
reciprocate.  Consequently,  we  cannot  directly  address  that  process,  representing  a  limitation  of  our  study. 

Because  participants  in  the  supplemental  data  collection  responded  to  both  experimental  scenarios,  we  counterbalanced 
the  scenarios  and  manipulations,  resulting  in  four  different  versions  of  the  case.  Patterns  of  cell  means  and  statistical 
inferences  are  the  same  regardless  of  scenario  order  or  differing  manipulations  in  the  other  scenario. 
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addressed  first  (e.g.,  Rubin  and  Brown  1975).  We  demonstrate  that,  in  an  audit  setting,  the  ultimate 
disposition  of  audit  issues  can  differ  based  on  the  order  in  which  they  are  addressed.  Our  results 
imply  that  auditors  should  consider,  if  possible,  the  order  in  which  they  address  audit  issues. 

We  also  extend  audit  negotiation  research  by  simulating  a  negotiation.  By  doing  so,  we 
provide  evidence  that  pre-negotiation  measures  used  in  several  prior  audit  studies  (e.g.,  Bame- 
Aldred  and  Kida  2007;  Hatheld  et  al.  2008;  Trotman  et  al.  2009)  are  indicative  of  negotiation 
outcomes.  While  our  results  also  support  statements  by  Ng  and  Tan  (2003)  and  Bame-Aldred  and 
Kida  (2007),  who  hnd  that  auditors  are  less  likely  than  other  negotiators  to  make  concessions 
during  the  negotiation  process,  care  should  be  taken  when  drawing  inferences  from  these  results 
due  to  the  limitations  based  on  the  mechanistic  nature  of  simulated  negotiations. 

Finally,  in  a  broader  sense,  we  add  to  research  concerning  the  link  between  audit  quality  and 
financial  statement  quality.  Audit  quality  has  been  the  focus  of  several  streams  of  research  as  well 
as  the  object  of  recent  and  substantial  changes  in  audit  regulation.  However,  financial  statement 
quality  ultimately  is  contingent  upon  how  the  discussions  and  negotiations  between  auditors  and 
client  management  affect  adjustments  to  the  financial  statements.  By  demonstrating  that  the  size  of 
audit  adjustments  are  a  function  of  two  aspects  of  negotiation  settings  to  which  they  should  be 
invariant,  we  demonstrate  that,  even  when  auditors  plan  and  execute  quality  audits,  financial 
statement  quality  can  ultimately  be  diminished  by  contextual  features  that  lead  to  biases  such  as 
those  warned  against  by  Gillan  (2007). 

This  study  is  subject  to  several  limitations,  some  of  which  we  have  already  mentioned.  For 
example,  prior  research  demonstrates  that,  in  certain  contexts,  gender  affects  negotiation  behavior 
(e.g.,  Barron  2003;  Flynn  and  Ames  2006),  and  our  sample  is  primarily  comprised  of  males  (73 
percent).  We  find  that  gender  is  not  significant  in  any  analyses,  nor  does  it  interact  with  any 
variables.  Further,  our  experimental  environment  lacks  incentives  present  in  typical  audit  environ¬ 
ments  (e.g.,  no  consequence  to  waiving  adjustments  or  disagreeing  with  clients). 

Also,  as  in  most  experimental  research,  we  are  only  able  to  examine  a  particular  piece  of  the 
audit  (negotiation)  without  consideration  of  other  aspects  of  the  audit  occurring  before  and  after 
any  negotiations  take  place.  Other  aspects  of  the  audit  potentially  influence  the  negotiation.  For 
example.  Tan  (1995)  finds  that  prior  involvement  with  a  going-concern  decision  leads  auditors  to 
pay  more  attention  to  evidence  consistent  with  that  prior  finding.  Consequently,  future  research 
could  examine  whether  prior  involvement  at  other  points  during  the  audit  (e.g.,  auditor  evaluation 
of  the  client’s  process  to  develop  its  estimate,  or  involvement  in  developing  the  audit  team’s 
estimate)  influences  judgments  in  subsequent  negotiations.  Further,  in  practice,  it  is  likely  difficult 
for  auditors  to  ascertain  whether  an  audit  difference  is  large  or  small,  and  it  is  also  likely  difficult 
to  manage  the  order  in  which  issues  are  addressed.  However,  auditors  should  be  aware  that  these 
biases  exist  and  should  perhaps  receive  negotiation  training  intended  to  minimize  undesirable 
outcomes.  Yet,  the  auditors  we  surveyed  indicated  that  the  availability  of  formal  negotiation 
training  is  limited.  We  propose  that  audit  firms  consider  developing  formal  negotiation  training 
that  addresses  issues  addressed  in  this  study  as  well  as  other  issues  influencing  the  effectiveness  of 
auditor  negotiation  behavior. 


APPENDIX 

With  respect  to  the  inventory  valuation  issue,  participants  learned  that  “because  of  frequent 
technological  advances  and  associated  price  reductions  in  competitors’  products,  it  is  important 
that  Wareham  constantly  monitor  its  competitors’  new  product  development  and  pricing  to  ensure 
that  inventory  is  properly  valued  at  lower  of  cost  or  market.”  In  the  audit  issue  memo  prepared  by 
the  audit  team  in  anticipation  of  the  meeting  with  Wareham’ s  controller,  participants  then  were 
told  that,  late  in  2006,  one  of  Wareham’s  competitors  introduced  a  new  product  line  that  was 
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technologically  superior  to  and  less  expensive  than  Wareham’s  similar  product  offerings  and  that 
the  competitors’  products  likely  would  reduce  demand  and  prices  for  Wareham’s  products.  Be¬ 
cause  Wareham  believed  that  some  of  the  inventory  would  be  sold  at  lower  prices  and  some  would 
never  be  sold,  the  company  increased  the  balance  in  its  allowanee  for  obsolete  inventory  from  its 
previous  balance  of  $7.5  million. 

The  audit  team  developed  an  independent  estimate  for  the  allowance  for  obsolete  inventory 
and  proposed  an  adjusting  entry  based  on  this  estimate.  The  experimental  materials  stated  that  the 
audit  team  believed  that  management’s  estimate  was  made  in  good  faith,  but  that  the  audit  team 
gathered  additional  evidence  to  develop  an  independent  estimate  and  concluded  that  the  allowance 
for  obsolete  inventory  should  be  increased  to  $14  million.  We  worded  the  case  materials  so  that, 
consistent  with  SAS  No.  57  (AICPA  1989),  the  difference  between  Wareham’s  balance  and  the 
auditor’s  independent  estimate  should  be  considered  as  a  likely  misstatement.  Below  are  examples 
of  the  two  of  the  negotiation  screens  included  in  the  research  case,  the  second  of  which  was  seen 
if  participants  chose  to  propose  a  different  adjustment  amount.  Note  that  the  amounts  for  the 

24 

adjusting  entry  changed  with  changes  to  the  proposed  balance. 


Rb 


Yolu-  Mtetiiig  with  Wstrrfiam’s  Controller 

Assume  that  you  are  now  in  the  meeting  with  Wai  eham’s  conh  oller  to  resolve  the  inveutoiy  issue.  Recall  the  following; 
Wareham’s  balance  in  tlie  allowance  for  obsolete  inveutoiy  is  $8,000,000. 

Yoin  team's  proposed  balance  for  the  allowance  for  obsolete  inveutoiy  is  $14,000,000. 


You  must  now  deckle  how  to  proceed, 

(Please  indicate  your  decision  by  clicking  on  the  appropriate  button  below) 

1.  AcceiJt  the  amoiuit  cmxentlj'  in  tlie  finaiidal  statements  (i.e,,  $8,000,000)  Waive  AOjustment  j  or, 

2.  Insist  that  Wareliam  adjust  tlie  accmmt  balance  to  $14.000,000  Require  Full  Adjustment  |  or, 

3.  Propose  an  acl|ustment  somewhere  between  tliese  two  amoimts  Propose  New  Adjestment  | 


^  tieview  Audit  Issue  Memo 


Review  Rn|inclsl  Statements 


Numbers  change  as  the  negotiation  continues  and  the  client  counters  with  new  adjustment  amounts.  In  this  example,  the 
auditor  counters  the  client’s  initial  balance  with  an  offer  of  $13  million. 
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File 


Pleas:e  iiidicate  the  new  amoiuit  that  you  propose  for  the  allowance  for  obsolete  inventoiy.  To  adjust 
your  audit  team's  proposed  balance  of  $14,000,000,  click  on  the  arrow's  below  to  increase  or  decrease 
the  balance,  \\lien  you  are  finished,  please  click  "Continue”  to  proceed  to  the  next  screen. 

Recall  that  Wareham's  balance  for  ob.solete  inventoiy  currently  is  $8,000,000 


Jour  revtsert  amount  for  the  nttmmnee for  obsolete  inventory: 


Proposed  Account  Balance 


$13,000,000 


TIii.s  proBosed  balaiif  e  worild  result  m  an  adjusting  enO-y  of: 


^  .  Dr,  Cost  of  Sales  5,000,000 

^  I  A.'  i  00,000  Cr.  Allowance  for  obsolete  inventory’  5,000,000 

i<  I  »  I  +/-  $500,000 

<«  I  »>  I  +/-  $1,000,000 


Review  Audit  Issue  Memo 


Review  Financial  Statements  j 


Continue 
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ABSTRACT:  Performance-based  compensation  contracts  can  affect  productivity  both 
by  motivating  effort  and  by  attracting  workers  whose  abilities  align  best  with  the  offered 
contracts.  Using  an  experiment  in  which  participants  design  “rebus  puzzles,”  this  study 
tests  whether  the  incremental  benefits  of  contract  self-selection  extend  to  a  multi¬ 
dimensional  performance  environment  in  which  participants  choose  between  a  contract 
that  rewards  both  quantity  and  creativity  versus  a  contract  that  rewards  quantity  only. 
Findings  indicate  that  participants  who  choose  a  creativity-weighted  pay  scheme  per¬ 
ceive  themselves  to  have  greater  creative  potential  than  those  who  choose  a  quantity- 
only  scheme.  This  perceived  creativity  advantage  manifests  itself  in  the  superior  initial 
creativity  ratings  of  such  participants’  productive  output.  However,  in  the  aggregate, 
participants  paid  only  for  quantity  eventually  surpass  the  creativity-weighted  productiv¬ 
ity  scores  of  participants  paid  for  creativity-weighted  productivity,  whether  compensa¬ 
tion  contracts  are  self-selected  or  randomly  assigned.  Thus,  the  implications  of  contract 
selection  on  creativity-weighted  productivity  hinge  on  the  importance  of  the  initial  ad¬ 
vantage  enjoyed  by  participants  who  self-select  a  creativity-weighted  contract. 
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I.  INTRODUCTION 

In  addition  to  motivating  effort  toward  desired  output  characteristics,  performance-based  com¬ 
pensation  schemes  can  potentially  attract  desired  characteristics  by  appealing  most  to  indi¬ 
viduals  whose  abilities  align  best  with  the  terms  of  the  contract  (Demski  and  Feltham  1978; 
Sprinkle  and  Williamson  2007,  §2).  Although  several  studies  have  used  the  tools  of  experimenta¬ 
tion  to  demonstrate  contract-selection  benefits  of  performance-based  pay  by  allowing  participants 
to  choose  among  competing  contracts  in  laboratory  production  exercises  (e.g.,  Chow  1983;  Waller 
and  Chow  1985;  Shields  and  Waller  1988;  Dillard  and  Fisher  1990;  Cadsby  et  al.  2007),  those 
experiments  have  been  restricted  to  a  single  performance  dimension — typically  the  quantity  pro¬ 
duced.  The  objective  of  the  current  study  is  to  investigate  whether  contract-selection  benefits 
extend  to  a  multi-dimensional  performance  environment  in  which  the  firm  desires  both  quantity 
and  creativity. 

Our  interest  in  the  contract-selection  effects  of  creativity-weighted  compensation  is  motivated 
by  the  importance  of  creativity  to  success  in  contemporary  business  settings  (Chang  and  Birkett 
2004;  Fallon  and  Senn  2006).  An  added  benefit  of  our  focus  on  creativity  is  that  it  enables  us  to 
build  on  recent  related  work  by  Kachelmeier,  Reichert,  and  Williamson  (2008);  hereafter,  KRW), 
who  investigate  the  effort-influencing  effects  of  creativity-based  compensation  in  an  environment 
in  which  contracts  are  randomly  assigned.  KRW  reach  the  provocative  conclusion  that  creativity- 
weighted  pay  actually  lowers  creativity-weighted  productivity,  insofar  as  experimental  participants 
who  are  paid  only  for  quantity  produce  as  much  high-creativity  output  as  their  creativity-weighted 
counterparts,  while  producing  significantly  more  output  overall.  KRW’s  conclusion  is  limited, 
however,  by  the  fact  that  their  experimental  participants  are  randomly  assigned  to  contracts, 
thereby  preventing  the  creativity-weighted  contract  from  attracting  those  with  more  creative  abil¬ 
ity.  To  the  extent  that  both  effort  and  ability  drive  creativity,  addressing  this  limitation  could 
potentially  reverse  KRW’s  conclusion,  making  it  important  to  consider  the  effects  of  endogenous 
contract  selection. 

To  pursue  this  objective,  we  adapt  the  “rebus  puzzle”  task  used  in  KRW,  in  which  experi¬ 
mental  participants  design  puzzles  that  use  words,  symbols,  and  pictures  to  represent  a  familiar 
term  or  phrase.  Rebus  puzzles  serve  as  an  experimental  analog  to  real-world  settings  involving  the 
“production  of  ideas,”  for  which  firms  benefit  from  both  the  quantity  of  production  and  its  creative 
value.  Prototypical  examples  of  such  settings  include  professional  service  firms  (Chang  and  Bir¬ 
kett  2004)  and  universities  (Dewett  and  Denisi  2004),  among  others.  Within  such  a  setting,  our 
experiment  manipulates  (1)  whether  participants  are  compensated  based  on  the  raw  quantity  of 
puzzles  produced  or  on  quantity  weighted  by  creativity  scores  from  a  panel  of  creativity  raters,  and 
(2)  whether  these  two  compensation  contracts  are  assigned  randomly  or  are  chosen  by  the  partici¬ 
pants. 

Findings  indicate  that,  relative  to  the  random-assignment  control  condition,  self-selection 
does  indeed  attract  the  most  creative  participants  to  a  creativity-weighted  contract,  both  in  terms  of 
self-perceived  creative  potential  and  in  terms  of  the  actual  creativity  of  participants’  initial  pro¬ 
duction.  However,  notwithstanding  the  benefit  of  self-selected  creativity-weighted  compensation 
that  we  observe  in  initial  creativity,  results  from  aggregate  production  reveal  that  participants  paid 
only  for  quantity  eventually  catch-up  with  and  surpass  the  creativity-weighted  scores  of  those  paid 
for  creativity-weighted  productivity,  whether  contracts  are  self-selected  or  randomly  assigned. 
Thus,  creativity-weighted  compensation  does  not  maximize  (eventual)  creativity-weighted  produc¬ 
tivity.  We  interpret  the  aggregate  results  as  reflecting  the  eventual  net  effect  of  two  opposing 
forces.  Namely,  while  self-selection  allows  those  who  choose  a  creativity-weighted  contract  to 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


Attracting  Creativity:  The  Initial  and  Aggregate  Effects  of  Contract  Selection 


1671 


enjoy  a  significant  initial  advantage,  the  dysfunctional  effort-influencing  effects  of  creativity- 
weighted  compensation  eventually  dominate. 

While  it  might  seem  that  the  eventual  dominance  of  quantity-only  compensation  demonstrates 
the  robustness  of  KRW’s  similar  conclusions  to  contract  selection,  we  submit  that  the  initial 
benefit  of  self-selected  creativity-weighted  compensation  is  important  in  its  own  right.  Our  self- 
selected  quantity-only  participants  had  the  opportunity  to  make  up  their  initial  deficit  by  outper¬ 
forming  their  creativity-weighted  counterparts  in  aggregate  production,  but  real-world  environ¬ 
ments  are  often  not  as  forgiving.  That  is,  in  firms  and  organizations,  resources  often  flow  to  those 
who  achieve  initial  success,  such  that  initial  advantages  can  govern  long-term  opportunities.  For 
this  reason,  we  believe  that  our  results  add  a  significant  caveat  to  KRW’s  conclusions  about  the 
superiority  of  paying  only  for  quantity  even  when  both  quantity  and  creativity  are  desired. 
Namely,  when  employees  can  choose  the  contractual  opportunities  that  suit  them  best,  those  who 
prefer  creativity-based  contracts  could  be  faster  “out  of  the  gate.” 

We  describe  the  background  of  our  study  and  develop  hypotheses  in  Section  II.  Section  III 
explains  the  method  and  design.  Section  IV  presents  results,  and  Section  V  concludes. 

n.  BACKGROUND,  THEORY,  AND  HYPOTHESES 
Background:  Creativity-Weighted  Productivity 

Accountants’  interest  in  performance-based  pay  schemes  stems  from  the  assumption  that  firms 
cannot  directly  observe  employees’  (1)  effort  or  (2)  abilities  (type).  Basing  compensation  on 
output  measures  mitigates  both  sources  of  information  asymmetry,  insofar  as  employees  can  work 
harder  to  attain  rewarded  measures,  thereby  mitigating  the  moral-hazard  problem  of  hidden  effort, 
and  can  also  pursue  employment  contracts  that  align  best  with  innate  skills,  thereby  mitigating  the 
adverse- selection  problem  of  hidden  abilities  (Demski  and  Feltham  1978).  Sprinkle  and  William¬ 
son  (2007,  §2)  review  several  management  accounting  experiments  that  demonstrate  these  ben¬ 
efits.  However,  the  studies  they  review  are  generally  limited  to  one  performance  dimension — 
typically  the  quantity  produced.  Among  others,  Kaplan  and  Norton  (1996)  argue  that  the  full 
potential  of  performance-based  measurement  can  be  captured  only  by  generalizing  to  multiple 
dimensions. 

As  a  step  toward  this  objective,  KRW  extend  the  experimental  study  of  performance-based 
compensation  to  include  creativity  in  addition  to  quantity.  Their  interest  in  creativity  is  motivated 
in  part  by  an  apparent  paradox;  despite  the  clear  importance  of  creativity  to  contemporary  business 
(e.g.,  Fallon  and  Senn  2006),  Ittner  et  al.  (1997)  report  that  very  few  firms  include  explicit 
measures  of  creativity  or  innovation  in  actual  performance-based  compensation  schemes.  One 
plausible  reason  why  firms  are  reluctant  to  reward  creativity  is  that  creativity  is  difficult  to  mea¬ 
sure,  but  KRW  assert  a  more  fundamental  reason:  the  effort-directing  effects  of  creativity-based 
pay  could  do  more  harm  than  good.  In  an  experimental  setting  that  uses  a  panel  of  independent 
raters  to  ensure  reliable  measures  of  creativity,  KRW  find  that  creativity-weighted  productivity 
(i.e.,  quantity  weighted  by  creativity  ratings)  is  significantly  higher  among  participants  compen¬ 
sated  only  for  quantity  than  among  participants  compensated  for  creativity-weighted  productivity. 
By  way  of  explanation,  KRW  draw  on  the  cognitive  difficulty  people  face  when  making  trade-offs 
between  conflicting  incentives  (Payne  et  al.  1993,  Ch.  2).  A  common  decision  heuristic  in  such 
settings  is  to  prioritize  on  one  criterion  as  a  condition  that  must  be  met  first,  thereby  leading  to 
suboptimal  trade-offs.  Extended  to  the  trade-off  between  creativity  and  quantity,  this  reasoning 
suggests  that  creativity-weighted  compensation  leads  participants  to  restrict  production  to  ideas 


'  KRW  also  test  a  fixed-pay  control  condition  and  a  condition  that  rewards  average  creativity  only  (i.e.,  no  reward  for 
quantity),  finding  that  creativity-weighted  productivity  under  these  contracts  is  even  lower. 
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that  they  feel  will  be  highly  creative.  However,  the  literature  on  creativity  in  psychology  and 
management  (e.g.,  Amabile  1996)  documents  that  creativity  does  not  necessarily  emerge  simply 
from  trying  harder.  Accordingly,  while  the  creativity-focused  effort  induced  by  a  creativity- 
weighted  pay  scheme  significantly  lowers  overall  production,  it  does  not  enhance  creative  output. 

KRW’s  finding  is  provocative,  but  is  restricted  to  the  effort-influencing  effects  of  creativity- 
weighted  compensation.  We  draw  on  KRW  as  a  baseline,  replicating  their  use  of  randomly  as¬ 
signed  contracts  as  a  benchmark  for  considering  the  possibly  countervailing  effects  of  contract 
selection.  Proceeding  in  this  manner  enables  us  to  isolate  the  incremental  effects  of  contract 
selection  under  ceteris  paribus  conditions. 

Initial  Creativity:  Does  Creativity-Weighted  Compensation  Attract  the  Most  Creative  Work¬ 
ers? 

Random  assignment  to  treatment  conditions  is  the  cornerstone  of  experimental  design,  allow¬ 
ing  researchers  to  separate  treatment  effects  from  the  potentially  confounding  influence  of  self¬ 
selection  biases  (Shadish  et  al.  2002).  However,  for  the  interest  management  accountants  have  in 
performance-based  compensation,  self-selection  is  more  than  a  “confound.”  Of  specific  relevance 
to  our  setting,  even  if  creativity  is  unresponsive  to  increased  effort,  the  possibility  of  different 
creative  abilities  could  serve  as  a  beneficial  consequence  of  creativity-weighted  pay,  if  more 
creative  individuals  are  more  attracted  than  others  to  a  contract  that  rewards  creativity. 

Waller  and  Chow  (1985)  model  the  self-selection  benefits  of  performance-based  compensa¬ 
tion  schemes  as  a  function  of  prospective  employees’  perceived  abilities  and  preferences.  Consis¬ 
tent  with  this  model,  several  experiments  have  demonstrated  contract-selection  benefits  for  differ¬ 
ent  ways  to  reward  quantity  (e.g.,  Chow  1983;  Waller  and  Chow  1985;  Shields  and  Waller  1988; 
Dillard  and  Fisher  1990;  Cadsby  et  al.  2007).  We  are  unaware,  however,  of  any  extensions  to  a 
multi-dimensional  environment.  For  contract-selection  benefits  to  extend  to  creativity  in  particular, 
two  assumptions  must  be  met.  First,  individuals  must  be  aware  of  meaningful  variation  in  their 
creative  potential.  Second,  more  creative  individuals  must  be  more  willing  than  others  to  choose  a 
contract  that  rewards  creativity  explicitly. 

Redmond  et  al.  (1993)  provide  some  evidence  in  support  of  the  first  assumption,  insofar  as 
they  find  that  individuals  have  reasonable  self-insight  into  their  creativity.  Their  evidence  is  from 
a  single  creative  task  that  may  or  may  not  generalize  to  our  setting.  Still,  the  available  evidence 
(also  see  Amabile  1996)  suggests  that  creative  abilities  and  corresponding  self-insights  differ 
sufficiently  to  afford  a  potential  for  contract  selection  benefits  to  occur. 

We  are  unaware  of  any  direct  evidence  regarding  the  second  assumption  that  more  creative 
individuals  would  be  more  inclined  to  select  a  creativity-weighted  contract.  More  generally,  how¬ 
ever,  Vaughn  (1999)  provides  experimental  evidence  that  individuals  employ  a  variant  of  the 
“availability  heuristic”  in  forming  self-assessments  of  likely  success  (also  see  Tversky  and  Kah- 
neman  1973;  Kahneman  and  Frederick  2002;  Schwarz  and  Vaughn  2002).  Vaughn’s  (1999)  rea¬ 
soning  suggests  that  individuals  contemplating  the  benefits  of  a  creativity-weighted  contract  would 
likely  reflect  on  how  easily  some  creative  ideas  come  to  mind.  If  so,  we  should  see  some  differ¬ 
ence  in  initial  creativity  that  favors  the  creativity-weighed  contract,  reflecting  the  initial  ideas  that 
lead  individuals  to  choose  such  a  contract.  Moreover,  if  such  an  effect  reflects  self-selection  on 
ability  rather  than  contractually  induced  effort,  we  should  not  observe  a  similar  difference  under 
randomly  assigned  contracts.  We  test  this  reasoning  as  our  first  hypothesis: 

HI:  Relative  to  any  difference  observed  under  randomly  assigned  contracts,  when  individu¬ 
als  are  allowed  to  choose  between  contracts,  those  choosing  a  creativity-weighted  pay 
scheme  will  evidence  greater  initial  creativity  than  will  those  choosing  a  scheme  that 
rewards  quantity  only. 
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Aggregate  Creativity-Weighted  Productivity 

Importantly,  HI  predicts  a  contract- selection  effect  in  initial  creativity,  when  selection  effects 
are  most  likely  to  be  observable.  Over  time,  any  selection  benefits  of  a  creativity-weighted  con¬ 
tract  become  intermixed  with  the  effort-directing  effects  of  such  a  contract.  That  is,  even  if 
individuals  of  different  types  are  attracted  to  different  contracts,  those  contracts  subsequently 
motivate  different  kinds  of  effort  that  can  exacerbate,  offset,  or  even  reverse  the  initial  differences. 
As  summarized  earlier,  KRW  find  evidence  from  a  randomized  experiment  that  a  creativity- 
weighted  compensation  contract  generates  lower  creativity-weighted  productivity  than  a  quantity- 
only  scheme,  defining  “creativity-weighted  productivity”  as  the  sum  of  the  creativity  ratings 
assigned  to  all  output  produced.^  They  assert  that  this  effect  arises  because  the  creativity-weighted 
contract  leads  individuals  to  fixate  on  creativity  to  the  detriment  of  overall  production  (and  without 
realizing  gains  in  high-creativity  production).  Hence,  the  harmful  effect  of  a  creativity- weighted 
contract  on  aggregate  production  effort,  as  documented  by  KRW,  works  in  the  opposite  direction 
of  the  initial  selection  benefit  of  a  creativity- weighted  contract  that  we  predict  in  HI.  Ex  ante,  it  is 
unclear  which  effect  would  dominate  over  the  course  of  aggregate  production  in  a  setting  with 
self-selected  contracts,  motivating  a  two-tailed  test  of  the  following  null  hypothesis: 

H2  (null  form):  In  aggregate  production  and  with  self-selected  compensation  contracts, 
creativity-weighted  productivity  will  not  differ  between  individuals  who 
select  a  creativity-weighted  pay  scheme  and  individuals  who  select  a 
quantity-only  scheme. 

Stating  H2  in  terms  of  “creativity-weighted  productivity”  implies  that  all  output  “counts,” 
although  output  of  higher  creativity  counts  more.  However,  some  real-world  settings  may  value 
only  high-creativity  output."^  Thus,  to  corroborate  and  generalize  our  findings,  we  also  examine  the 
effects  of  our  experimental  treatments  on  the  production  of  high-creativity  efforts  only,  defined 
(similarly  to  KRW)  as  output  in  the  overall  top  quartile  of  creativity  ratings. 

III.  METHOD 

Participants  and  Experimental  Task 

In  each  of  10  sessions,  8-10  volunteers  participated  in  a  75-minute  experiment,  yielding  a 
total  of  90  participants.  Similar  to  KRW,  we  recruited  student  participants  from  undergraduate 
core  business  classes,  reflecting  a  variety  of  business  majors.  We  also  adapted  KRW’s  basic  task, 
subject  to  some  modifications  that  we  describe  shortly.  The  task  involved  designing  “rebus 
puzzles,”  defined  in  the  instructions  as  a  “kind  of  riddle  in  which  words  and/or  diagrams  are  used 
to  represent  a  familiar  term  or  phrase,”  with  11  examples  given  as  illustrations.  Also  consistent 
with  KRW,  the  instructions  common  to  all  conditions  observed  that  “while  we  do  not  place  any 
rules  on  the  kinds  of  rebus  puzzles  you  can  submit,  we  value  both  the  number  of  different  puzzles 
you  construct  (i.e.,  quantity)  and  the  creativity  of  those  puzzles  (i.e.,  puzzles  that  are  original 
ideas,  innovative,  and  clever).”  This  instruction  is  important  because  our  interest  is  in  the  incre- 


^  Arithmetically,  “creativity- weighted  productivity”  is  equivalent  to  the  total  quantity  produced  multiplied  by  the  average 
creativity  rating  of  that  quantity.  Experimental  participants  in  our  study  and  in  KRW  were  provided  with  the  simpler 
definition  based  on  the  sum  of  the  creativity  scores  for  all  output  produced. 

^  Our  statement  of  H2  in  the  null  form  does  not  imply  that  we  are  predicting  the  null.  Rather,  we  are  testing  the  null 
against  a  two-tailed  alternative,  given  the  ex  ante  reasoning  that  a  difference  could  arise  in  either  direction. 

For  example,  paralleling  the  two  conditions  examined  in  this  study,  part  of  the  Hollywood  writers’  strike  of  late  2007 
involved  writers’  objections  to  television  studios’  proposals  to  “only  pay  for  jokes  that  are  funny,  [thus  departing  from] 
the  longstanding  tradition  of  paying  comedy  writers  by  the  word”  (Queenan  2007,  Alb).  In  that  setting,  a  joke  that  is  not 
funny  (insufficiently  creative)  might  indeed  add  very  little  (or  even  negative)  value  at  the  margin. 
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mental  effects  of  performance-based  compensation,  not  the  more  basic  goal-setting  effects  from 
knowing  what  matters  to  the  experimenter  (Shalley  1991;  Bonner  and  Sprinkle  2002). 

Participants  worked  independently  at  separate  tables,  designing  puzzles  on  3  X  5  inch  index 
cards  by  drawing  the  puzzle  on  one  side  and  its  solution  on  the  other  side.  Participants  placed  each 
completed  puzzle  in  an  individual  output  box,  subject  to  the  understanding  that  once  in  the  box,  a 
puzzle  could  not  be  removed.^  This  rule  maintained  control  over  the  sequence  of  submissions, 
which  is  important  to  our  goal  of  separately  measuring  initial  and  aggregate  creativity-weighted 
productivity. 

Design 

We  employ  a  2  X  2  between-subjects  experimental  design  that  manipulates  two  factors;  (1) 
contract  type,  basing  performance-contingent  compensation  on  quantity  only  or  on  creativity- 
weighted  productivity,  and  (2)  contract  determination,  with  contracts  either  randomly  assigned  or 
chosen  by  the  participants. 

Our  first  experimental  factor,  contract  type,  holds  constant  the  dependence  of  both  compen¬ 
sation  contracts  on  quantity,  but  varies  whether  the  contract  also  rewards  (i.e.,  weights)  creativity. 
We  favor  this  approach  over  other  possibilities,  such  as  offering  a  contract  that  rewards  average 
creativity  ratings,  for  two  reasons.  First,  the  scope  of  our  study  is  restricted  to  an  environment  in 
which  creativity  is  important,  but  not  at  the  expense  of  overall  productivity  (Chang  and  Birkett 
2004;  Dewett  and  Denisi  2004).  As  discussed  by  KRW,  without  quantity  objectives,  workers  can 
improve  average  creativity  by  pursuing  the  “denominator  strategy”  of  reducing  overall  production 
without  increasing  high-creativity  production.  Such  a  strategy  would  be  dysfunctional  from  the 
firm’s  perspective  unless  less  creative  output  is  actually  harmful  at  the  margin,  so  we  provide 
quantity  incentives  in  all  conditions.  Second,  from  an  experimental  design  perspective,  we  wanted 
to  simplify  the  participants’  choice  by  highlighting  creativity  as  the  only  construct  differing  be¬ 
tween  the  two  contracts.  Notwithstanding  this  rationale,  we  recognize  that  some  firms  may  place 
little  value  on  low-creativity  output,  such  that  we  conduct  additional  analyses  to  examine  the 
sensitivity  of  our  results  to  measures  of  high-creativity  output  only.  Our  second  experimental 
factor,  contract  determination,  addresses  our  primary  objective  of  examining  self-selected  con¬ 
tracts,  while  replicating  KRW’s  use  of  randomly  assigned  contracts  as  a  control  condition.® 

Participants  in  all  10  sessions  first  read  the  same  information  about  both  incentive  contracts. 
The  instructions  specify  that  those  using  the  quantity-only  contract  will  receive  compensation 
based  on  a  rate  per  puzzle  that  results  in  $5.00  for  the  participant  (among  all  participants  using  this 
contract)  submitting  the  fewest  puzzles  and  $45.00  for  the  participant  submitting  the  most  puzzles, 
thereby  yielding  an  average  compensation  around  $25.00.  The  alternative,  creativity-weighted 
contract  specifies  compensation  based  on  the  “creativity-weighted  total  score  of  all  rebus  puzzles” 
constructed,  determined  by  adding  the  1-to-lO  creativity  ratings  assigned  to  those  puzzles  by  “a 
group  of  students  from  a  different  class.”  The  instructions  in  this  version  observe  that  “each  puzzle 
you  submit  helps  your  total  score,  but  higher  rated  puzzles  count  more.”  Similar  to  the  quantity- 
only  payment  scheme,  the  creativity-weighted  scheme  anchors  total  compensation  at  $5.00  and 


^  Participants  were  allowed  to  hold  on  to  a  puzzle  as  long  as  they  wished  before  submitting  it  to  the  output  box,  but 
substantially  all  participants  submitted  puzzles  to  the  box  sequentially  as  they  finished  them,  with  the  few  instances  of 
discarded  output  reflecting  incomplete  efforts.  This  behavior  is  reasonable,  given  that  each  completed  puzzle  strictly 
benefits  compensation  under  either  a  quantity-only  or  a  creativity-weighted  productivity  pay  scheme. 

Our  random  assignment  condition  is  not  an  exact  replication  of  KRW.  Unlike  their  smdy,  we  provide  participants  in  all 
conditions  with  (1)  a  practice  phase,  and  (2)  an  explicit  description  of  both  contracts  prior  to  determining  one  of  these 
contracts  for  payment  (randomly  or  by  participant  choice).  These  modifications  facilitate  participants’  ability  to  select 
among  the  available  contracts,  so  we  keep  these  features  constant  in  our  random  assignment  condition  for  ceteris 
paribus  comparability. 
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$45.00  for  the  lowest  and  highest  creativity- weighted  total  scores,  respectively,  yielding  an  aver¬ 
age  payment  around  $25.00.^ 

To  operationalize  the  between-subjects  manipulation  of  whether  participants  are  randomly 
assigned  to  one  of  the  contracts  or  are  allowed  to  choose,  we  conduct  five  experimental  sessions 
within  each  condition,  in  a  counterbalanced  order.  In  the  random-assignment  condition,  partici¬ 
pants  draw  a  wooden  ball  that  is  marked  “A,”  representing  the  quantity-only  contract,  or  “B,” 
representing  the  creativity-weighted  contract.  As  noted  in  the  instructions,  an  equal  number  of 
balls  of  each  type  ensures  that  the  probability  of  being  assigned  to  either  contract  is  50  percent.  In 
the  self-selection  condition,  each  participant  reads  the  same  information  about  both  contracts  and 
writes  in  a  checkmark  next  to  the  preferred  contract  when  ready.  While  contract  selection  does  not 
provide  any  assurance  of  similar  sample  sizes,  our  participants  were  nearly  evenly  split,  with  24 
(21)  of  the  45  self- selection  participants  choosing  the  quantity-only  (creativity-weighted)  contract. 

Procedure 

After  describing  the  task  and  the  statement  about  valuing  both  quantity  and  creativity,  but 
before  describing  the  compensation  schemes,  the  instructions  announced  a  five-minute  practice 
period  “to  help  you  gauge  your  quantity  and  creativity  in  constructing  rebus  puzzles.”  The  intent 
of  the  practice  period  is  to  familiarize  participants  with  the  task  before  choosing  between  con¬ 
tracts,  so  we  provide  the  same  opportunity  in  the  random-assignment  condition  for  ceteris  paribus 
comparability. 

After  the  five-minute  practice  period,  the  experimenters  collected  all  rebus  puzzles  con¬ 
structed  during  this  period  and  distributed  new  instructions  about  the  compensated  phase  of  the 
experiment.  These  new  instructions  informed  participants  that  they  would  now  “design  rebus 
puzzles  for  an  additional  20  minutes  for  money.”  To  avoid  ambiguity  about  duplicating  ideas  from 
the  practice  phase,  the  instructions  informed  participants  that  “you  may  reuse  any  of  your  ideas 
from  the  previous  five-minute  phase  if  you  choose  to  do  so,  but  you  must  prepare  a  new  card  for 
each  such  idea.”  Participants  then  read  descriptions  of  both  compensation  schemes  and  were  either 
randomly  assigned  to  one  of  these  contracts  or  indicated  the  preferred  contract,  as  described 
above.  Next,  participants  completed  a  pre-experimental  questionnaire  to  confirm  their  understand¬ 
ing  of  the  pay  schemes  and  to  elicit  various  self-perceptions  (described  later). 

The  20-minute  compensated  production  phase  of  the  experiment  followed.  Participants  then 
completed  a  post-experimental  questionnaire,  followed  by  a  risk-preference  exercise  adapted  from 
Boylan  and  Sprinkle  (2001).  The  risk-preference  ipstrument  required  participants  to  make  15 
choices  between  a  certain  payment  of  $5.00  or  a  lottery  that  paid  $10  with  probability  p  and  $0 
with  probability  1  -  p,  with  p  decreasing  in  5-percent  increments  from  85  percent  to  15  percent.  As 
a  measure  of  risk  preferences,  we  recorded  the  point  at  which  each  participant’s  choice  shifted 
from  preferring  the  lottery  to  preferring  the  certain  $5  (participants  were  allowed  to  shift  only 
once).  Earlier  shifts  are  recorded  as  lower  scores,  reflecting  lower  tolerance  for  risk.  After  partici¬ 
pants  completed  the  questionnaire,  we  played  out  the  lottery  for  one  of  the  15  choices,  selected  at 
random.  We  then  added  either  the  lottery  outcome  or  the  certain  $5.00  to  each  participant’s 
compensation,  depending  on  that  participant’s  choice. 


’  Anchoring  the  low  and  high  payments  for  both  contracts  at  $5  and  $45,  respectively,  allows  us  to  vary  the  basis  of 
compensation  while  holding  constant  the  expected  average  magnitude  of  compensation.  Similar  to  KRW,  we  achieve 
this  equivalence  by  basing  the  low  and  high  payments  on  the  lowest  and  highest  performers  within  each  contract. 
However,  these  contracts  are  not  “tournament”  schemes  as  defined  by  Bonner  et  al.  (2000),  because  with  the  sole 
exception  of  the  highest-scoring  participant,  every  other  participant  can  earn  greater  compensation  by  increasing  the 
compensated  performance  measure,  irrespective  of  ordinal  ranking. 
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Approximately  two  weeks  after  the  experimental  sessions,  we  paid  participants  privately  in 
accordance  with  their  task  performance,  corresponding  compensation  scheme,  and  any  additional 
earnings  from  the  risk-preference  exercise.  As  announced  in  the  instructions,  the  delay  in  pay¬ 
ments  was  necessitated  by  the  need  to  determine  quantity  counts  and  creativity  ratings  as  the  basis 
for  compensation.  Payments  were  determined  exactly  as  described,  without  deception  of  any  form. 

Determining  Creativity  Ratings 

Undergraduate  student  volunteers  (n  =  40)  from  a  different  class  rated  the  creativity  of  the 
rebus  puzzles.  In  each  of  four  three-hour  rating  sessions,  10  different  raters  evaluated  a  subset  of 
the  2,178  puzzles  produced  by  the  experimental  participants  (324  puzzles  during  the  practice 
phase  and  1,854  during  the  compensated  phase).  Raters  were  paid  $50  for  their  time. 

To  construct  the  subset  of  rebus  puzzles  evaluated  by  each  cohort  of  raters,  we  first  removed 
all  practice-phase  puzzles  that  were  reused  by  the  participants  during  the  compensated  phase  of  the 
experiment.  This  procedure  ensured  that  participants  would  not  be  penalized  for  reusing  an  idea 
from  the  practice  period,  as  was  explicitly  allowed  in  the  instructions.  For  analysis  purposes,  we 
assign  the  same  rating  to  each  removed  practice  puzzle  as  the  rating  of  the  corresponding  com¬ 
pensated  puzzle  that  reused  the  idea.  This  process  reduced  the  total  number  of  evaluated  rebus 
puzzles  from  2,178  to  1,968. 

For  rating  purposes,  we  randomized  the  order  of  the  1,968  puzzles.  To  reduce  rater  fatigue  and 
make  the  task  manageable,  each  of  the  four  rating  sessions  processed  one-fourth  of  the  randomly 
ordered  puzzles.  We  have  no  reason  to  suspect  any  systematic  differences  between  rating  sessions, 
and  even  if  such  differences  existed,  they  would  be  unlikely  to  interact  with  the  treatment  factors 
distributed  within  each  rating  session.  Nevertheless,  to  assess  reliability  across  rating  sessions  and 
to  give  each  session  the  same  starting  point  for  calibration,  we  began  each  rating  session  with  the 
same  60  puzzles,  in  the  same  order.  Correlations  between  the  average  rating  within  each  rating 
session  and  the  average  across  the  other  three  rating  sessions  are  significantly  positive  for  the  first 
60  puzzles,  ranging  from  0.79  to  0.89.  We  therefore  believe  that  the  creativity  ratings  are  reason¬ 
ably  reliable  and  consistent  across  sessions.^ 

Each  rating  session  took  place  in  a  classroom,  in  which  raters  used  radio-frequency  response 
device  keypads  to  assign  a  1  (lowest)  to  10  (highest)  creativity  rating  to  each  puzzle,  projected  one 
at  a  time  using  a  digital  document  camera.  An  assistant  transcribed  the  solution  to  each  puzzle  on 
the  bottom  of  each  card  beforehand,  such  that  the  puzzle  solutions  were  projected  along  with  the 
puzzles.  The  response-device  software  informed  us  when  all  ratings  had  been  entered  into  a 
computer  database,  upon  which  one  of  the  experimenters  projected  the  next  puzzle.  To  ensure 
independent  ratings,  we  spaced  raters  far  apart  and  asked  them  to  refrain  from  comments  or 
laughter.  All  raters  first  read  the  task  instructions  given  to  the  experimental  participants,  including 
the  same  instructional  examples.  Raters  were  unaware,  however,  of  the  different  compensation 
schemes  or  the  assignment  of  participants  to  those  schemes. 


Our  use  of  undergraduate  raters  differs  from  KRW’s  use  of  doctoral  students.  We  felt  that  undergraduate  peer  ratings 
would  be  just  as  reliable  as  doctoral-student  ratings.  The  use  of  undergraduates  allowed  us  to  recruit  a  larger  number  of 
raters,  given  the  need  to  evaluate  approximately  2,000  puzzles  for  creativity.  We  have  no  reason  to  believe  that  the 
undergraduate  ratings  from  our  study  differ  systematically  from  the  doctoral-student  ratings  in  KRW,  but  the  more 
important  point  is  that  we  hold  the  rater  pool  constant  across  both  the  random-assignment  and  self-selection  conditions, 
such  that  if  we  replicate  KRW’s  findings  in  the  random-assignment  condition,  we  gain  comfort  that  our  contract- 
selection  results  are  not  simply  an  artifact  of  different  creativity  raters. 

^  For  tests  of  hypotheses,  we  obtain  a  creativity  rating  for  the  first  60  puzzles  rated  in  all  four  sessions  by  randomly 
assigning  15  of  these  puzzles  to  each  of  the  four  sessions. 

Unobtrusive  identification  codes  on  the  back  of  each  puzzle  were  not  projected  to  raters. 
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Within  each  rating  session,  correlations  between  each  rater  and  the  average  of  the  other  nine 
raters  are  significantly  positive  (ranging  from  0.20  to  0.73),  with  the  exception  of  one  rater  who 
exhibits  an  insignificant  negative  correlation  (r  =  —0.06).  To  enhance  reliability,  we  drop  the 
lowest  correlated  rater  within  each  rating  session.  Cronbach’s  Alpha  is  0.79  under  this  approach, 
which  exceeds  typical  reliability  thresholds  (Peterson  1994). 

As  illustrations.  Figure  1  provides  examples  of  six  rebus  puzzles  from  our  experiment,  with 
four  examples  of  high-creativity  ratings  (Panels  A  through  D),  one  example  of  a  puzzle  at  the 
median  of  the  creativity  distribution  (Panel  E),  and  one  immediately  above  the  bottom  quartile 
(Panel  F).^^  For  the  entire  distribution  of  puzzles,  average  ratings  (nine  raters  per  puzzle)  range 
from  1.44  to  8.89,  reflecting  a  wide  variation  in  creativity. 

IV.  RESULTS 

Necessary  Conditions  for  Contract  Selection  Benefits 

Before  testing  our  hypotheses,  we  first  explore  several  necessary  conditions  for  any  potential 
benefit  of  contract  self-selection.  First,  participants  must  perceive  meaningful  differences  in  their 
potential  to  produce  highly  creative  puzzles,  the  criterion  that  differs  between  contracts,  and  select 
accordingly.  Second,  we  must  observe  meaningful  variation  in  actual  creative  production,  such 
that  potential  self- selection  benefits  are  detectable.  Third,  participants’  self-assessed  creativity 
potential  should  correlate  positively  with  their  actual  creative  production.  Below,  we  test  each 
condition  within  our  contract  self- selection  subsample.  Table  1  provides  descriptive  statistics, 
including  means,  standard  deviations,  and  ranges  by  condition  for  the  dependent  measures. 

Self-Assessed  Creative  Potential  and  Basis  of  Selection 

We  elicited  participants’  self-assessments  of  potential  relative  to  others  to  produce  highly 
creative  rebus  puzzles  in  a  questionnaire  administered  after  the  practice  session  but  before  the 
compensated  phase  of  the  experiment.  Participants  responded  on  a  nine-point  Likert  scale,  with 
the  low  and  high  endpoints  labeled  “much  worse  than  others’’  and  “much  better  than  others,” 
respectively.  As  Table  1  highlights,  participants’  responses  varied  widely,  ranging  over  the  entire 
nine-point  scale. 

As  noted  earlier,  nearly  half  of  the  participants  in  the  contract-selection  condition  (21  of  45) 
selected  the  creativity-weighted  contract.  Univariate  comparisons  of  means  reveal  that,  relative  to 
those  selecting  the  quantity-only  contract,  participants  choosing  the  creativity-weighted  contract 
perceive  themselves  to  have  significantly  more  potential  to  produce  highly  creative  puzzles  (t  = 
5.26;  two-tailed  p  <  0.01).  Moreover,  participants  rating  their  creativity  potential  above  the 
midpoint  on  this  scale  (the  median  score)  predominately  select  the  creativity-weighted  contract 
over  the  quantity  contract  (16  creativity- weighted  versus  3  quantity-only),  whereas  participants  at 
or  below  the  midpoint  predominately  select  the  quantity-only  contract  (5  creativity-weighted  ver¬ 
sus  21  quantity-only).  These  relative  frequencies  are  significantly  different  (x^  =  18.62;  p  <  0.01). 

To  corroborate  our  univariate  results,  we  also  perform  a  logistic  regression  analysis  (untabu¬ 
lated)  of  the  propensity  to  choose  the  creativity-weighted  contract,  regressed  on  self-reported 
creative  potential,  the  supplemental  measure  from  the  post-experimental  task  we  use  to  elicit  risk 
preferences,  and  on  the  interaction  between  these  two  independent  measures. Controlling  for  risk 


“  The  relatively  low-rated  puzzle  in  Panel  F  of  Figure  1,  “crossing  paths,”  follows  a  pattern  similar  to  one  of  the 
instructional  examples,  “cross  roads.”  KRW  observe  that  creativity  raters  tend  to  assign  lower  ratings  to  puzzles  that  are 
patterned  after  one  of  the  instructional  examples.  Our  experiment  uses  the  same  instructional  examples  as  illustrated  in 
KRW  (2008,  369-370). 

We  control  for  risk  preferences  because  the  subjectivity  of  creativity  ratings  implies  greater  uncertainty  under  the 
creativity-weighted  pay  scheme  than  under  the  quantity-only  scheme.  In  general,  individuals  with  greater  tolerance  for 
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FIGURE  1 

Examples  of  Actual  Rebus  Puzzles 


risk  are  more  likely  to  prefer  noisier  compensation  schemes  (Cadsby  et  al.  2007).  Consistent  with  this  reasoning,  the 
propensity  of  participants  to  select  the  creativity-weighted  contract  is  positively  associated  with  risk  preferences  (p  = 
0.05).  Also,  the  creative  potential  X  risk  preference  interaction  is  significantly  negative  (p  =  0.05),  suggesting  that  the 
combined  effect  of  the  creative  potential  and  risk-preference  measures  is  subadditive:  a  greater  score  on  either  one  of 
these  measures  dampens  (but  does  not  eliminate)  the  effect  of  the  other  measure  on  the  propensity  to  select  the 
creativity-weighted  contract. 
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TABLE  1 


Descriptive  Statistics 


Self-Selected  Contracts 

Randomly  Assigned  Contracts 

Mean  (Standard  Deviation)  {Range} 

Mean  (Standard  Deviation)  {Range} 

Quantity-Only 

Creativity- Weighted 

Quantity-Only 

Creativity- Weighted 

n  =  24 

n  =  21 

n  =  19 

n  =  26 

Creativity  Potential 

3.92 

6.14 

5.26 

5.15 

(1.25) 

(1.59) 

(1.85) 

(1.89) 

{2-7} 

{3-9} 

{3-9} 

{1-8} 

Creativity  Rating 

4.61 

5.09 

4.80 

4.45 

First  Puzzle 

(1.16) 

(1.34) 

(1.22) 

(1.17) 

{2.11-7.22} 

{3.33-7.44} 

{2.78-6.78} 

{3.00-7.44} 

Average  Creativity 

4.72 

5.09 

4.98 

4.87 

Rating  First  Eight 

(0.59) 

(0.44) 

(0.62) 

(0.51) 

Puzzles 

{3.67-5.82} 

{4.42-5.89} 

{4.06-6.01} 

{3.54-5.83} 

Number  of  High 

1.75 

2.52 

2.21 

2.08 

Creativity  Puzzles 

(1.36) 

(1.21) 

(1.62) 

(1.29) 

through  the 

Eighth  Unit 
of  Production 

{0-5} 

{0-5} 

{0-5} 

{0-5} 

Aggregate  Creativity- 

113.32 

77.31 

122.05 

83.88 

Weighted 

(57.12) 

(29.92) 

(40.98) 

(27.07) 

Productivity 

{34.56-254.22} 

{31.00-141.00} 

{57.56-186.78} 

{35.00-140.11} 

Aggregate  Number  of 

4.33 

4.48 

5.95 

4.69 

High  Creativity 

(2.84) 

(2.06) 

(4.01) 

(3.06) 

Puzzles 

{0-11} 

{1-10} 

{0-15} 

{1-12} 

Quantity 

25.13 

16.85 

25.74 

16.85 

(15.41) 

(6.06) 

(10.22) 

(5.42) 

{7-76} 

{6-29} 

{12-54} 

{7-29} 

Self-Perceived  Task 

4.92 

4.86 

5.26 

5.19 

Eamiliarity 

(2.60) 

(2.87) 

(2.81) 

(3.21) 

{1-9} 

{1-9} 

{1-9} 

{1-9} 

preferences,  logit  results  confirm  that  the  propensity  of  participants  to  select  the  creativity- 
weighted  contract  is  positively  associated  with  self-perceived  creative  potential  (p  <  0.01). 

Meaningful  Variation  in  High-Creativity  Production 

To  assess  whether  our  participants  exhibit  meaningful  variation  in  creative  production,  we 
examine  the  production  of  high-creativity  puzzles,  defined  similarly  to  KRW  as  the  number  of 
puzzles  each  participant  produces  that  lie  in  the  overall  top  quartile  of  creativity  ratings,  which  in 
our  distribution  requires  a  rating  of  5.7  or  higher. Figure  2  graphs  the  overall  distribution  of 


We  observe  no  significant  influence  of  the  five-minute  practice  phase  on  contract  preferences,  and  do  not  detect  any 
significant  differences  in  quantity,  average  creativity,  or  creativity-weighted  productivity  during  practice.  Participants 
likely  needed  the  practice  time  to  learn  the  task  and  experiment  with  different  types  of  puzzles.  Results  are  also 
insensitive  to  participant  gender,  with  55  percent  (52  percent)  of  females  (males)  choosing  the  quantity-only  contract. 
One  might  conjecture  that  average  creativity  ratings  could  serve  as  an  alternate  measure  of  creative  production,  but  that 
measure  is  confounded  by  what  KRW  termed  a  “denominator  effect.”  Namely,  if  two  participants  (hypothetically) 
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FIGURE  2 

Distribution  of  High-Creativity  Output  by  Participant  in  the  Self-Selection  Condition 


high-creativity  output  for  participants  in  the  self-selection  condition,  collapsing  across  both  pos¬ 
sible  contracts.  As  Figure  2  indicates,  the  variation  of  high-creativity  production  is  quite  dispersed, 
with  several  participants  producing  two  or  less  such  puzzles,  while  others  produce  ten  or  more.*^ 

Correlation  between  Creative  Potential  and  High-Creativity  Output 

Although  participants  choose  contracts  based  on  self-perceived  creativity,  and  although  we 
observe  significant  variation  in  high-creativity  production,  we  do  not  observe  a  significant  corre¬ 
lation  between  self-assessed  creativity  potential  and  the  aggregate  number  of  high-creativity 
puzzles  produced  (r  =  0.10;  p  =  0.51).  It  appears  that  participants  were  unable  to  map  their 
selection  preferences  to  their  aggregate  capacity  to  produce  high-creativity  output. 

Notwithstanding  the  absence  of  correlation  between  self-perceived  creative  potential  and 
aggregate  high-creativity  production,  we  obtain  a  quite  different  result  when  we  restrict  the  analy¬ 
sis  to  initial  production,  defined  as  the  number  of  high-creativity  puzzles  generated  within  the  first 
eight  puzzles  produced  by  each  participant  (eight  is  half  the  median  quantity  produced  by  partici¬ 
pants  in  the  self-selection  condition).  Here  the  correlation  between  self-perceived  creative  poten¬ 
tial  and  initial  high-creativity  production  is  much  larger,  positive,  and  statistically  significant  (r  = 
0.30;  p  =  0.05). 

By  way  of  explanation,  recall  that  the  theory  underlying  HI  suggests  that  self-assessed  cre¬ 
ativity  potential  draws  on  initial  ideas  (Vaughn  1999).  Hence,  we  observe  a  significantly  positive 
correlation  between  self-perceived  creativity  and  a  measure  of  initial  creative  production,  although 


produce  the  same  number  of  high-creativity  puzzles,  the  participant  who  produces  more  puzzles  overall  will  have  a 
lower  average  creativity  score,  even  though  that  participant  is  no  less  “creative”  in  an  absolute  sense,  and  in  fact  is  more 
productive  overall.  We  therefore  prefer  the  raw  count  of  high-creativity  puzzles  as  a  cleaner  measure  of  creative 
production. 

Using  the  distribution  in  Figure  2,  we  also  construct  two  “pseudo  samples”  by  dividing  the  participants  in  our  self¬ 
selection  conditions  into  those  who  produced  high-creativity  output  above  or  below  the  median  (deleting  participants  at 
the  median).  The  above-median  group  significantly  exceeds  the  below-median  group  both  in  terms  of  (1)  creativity- 
weighted  productivity  and  (2)  production  of  high-creativity  puzzles  (p  <  0.05;  two-tailed).  This  analysis  confirms  that 
we  have  enough  statistical  power  to  discern  creativity-based  advantages  in  aggregate  high-creativity  production,  if 
participants  in  the  self-selection  condition  have  the  foresight  to  select  contracts  that  match  their  actual  differences  in 
creative  production. 
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that  correlation  dissipates  in  aggregate  creative  production,  likely  reflecting  the  offsetting  effort- 
inducing  effects  of  the  two  contracts  over  time.  This  conclusion  is  consistent  with  the  results  of 
our  hypothesis  testing,  as  presented  next. 

Tests  of  Hypotheses 

In  the  results  presented  below,  we  use  analysis  of  covariance  (ANCOVA)  to  investigate  the 
main  and  interactive  effects  of  contract  type  (quantity-only  versus  creativity-weighted  productivity 
incentives)  and  contract  determination  (self-selection  versus  random  assignment)  on  both  initial 
creativity  (HI)  and  aggregate  creativity- weighted  productivity  (H2).  The  sole  covariate  in  all 
analyses  presented  is  a  measure  of  participants’  self-perceived  familiarity  with  rebus  puzzles,  as 
captured  from  nine-point  Likert  scale  responses  to  the  post-experimental  question,  “I  was  familiar 
with  rebus  puzzles  before  the  experiment,”  with  the  low  and  high  endpoints  labeled  “strongly 
disagree”  and  “strongly  agree,”  respectively.  This  covariate  is  statistically  significant  in  all  analy¬ 
ses  reported  below.  Task  familiarity  is  possible  in  this  experiment  from  exposure  to  rebus  puzzles 
in  the  popular  media  (e.g.,  Morris  1983).  Extracting  the  variation  from  this  covariate  enhances  our 
statistical  ability  to  isolate  creativity  effects  apart  from  any  pre-experimental  exposure  to  the  task 
domain.  However,  task  familiarity  does  not  contaminate  our  ability  to  isolate  contract  effects, 
insofar  as  a  separate  ANOVA  with  task  familiarity  as  the  dependent  variable  (untabulated)  reveals 
no  significant  main  or  interaction  effects  of  either  contract  type  or  contract  determination  (highest 
F  =  0.31;  all  p  >  0.50).  Hence,  task  familiarity  appears  evenly  distributed  across  cells,  including 
within  the  self-selection  conditions. 

HI:  Initial  Creativity 

As  explained  in  the  motivation  of  HI  and  consistent  with  the  correlation  analysis  reported 
above  on  participants’  self-perceived  creativity,  the  selection  benefits  of  a  creativity-weighted 
contract  are  most  likely  to  be  manifest  in  initial  creativity,  before  the  possibly  countervailing 
effects  of  differential  effort  can  emerge  across  aggregate  production  (as  examined  later  in  H2).  To 
illustrate  the  differences  we  observe  across  time.  Figure  3  shows  separate  plots  for  the  self¬ 
selection  and  random-assignment  conditions  of  the  cumulative  difference  in  creativity-weighted 
productivity  scores  between  those  operating  under  the  creativity-weighted  contract  and  those  paid 
for  quantity  only,  at  each  unit  of  production.  Positive  (negative)  values  on  the  vertical  axis  indicate 
that  the  creativity-weighted  contract  generates  higher  (lower)  cumulative  creativity-weighted  pro¬ 
ductivity  than  the  quantity-only  contract  through  the  unit  of  production  reflected  on  the  horizontal 
axis.  Thus,  Figure  3  is  essentially  a  running  “horse  race”  of  which  contract  is  generating  higher 
cumulative  creativity-weighted  productivity  scores  at  each  unit  of  production. 

Consistent  with  HI,  Figure  3  indicates  that  individuals  selecting  the  creativity-weighted  pro¬ 
ductivity  contract  enjoy  an  initial  advantage,  outperforming  their  quantity-only  counterparts  by  a 
margin  that  increases  in  early  production,  before  starting  to  reverse  after  the  eighth  unit  of  pro¬ 
duction.  Statistically,  the  difference  in  favor  of  the  creativity-weighted  contract  within  the  self¬ 
selection  condition  is  significant  for  the  first  eight  units  (t  =  2.19;  one-tailed  p  =  0.02).  The 
interaction  between  contract  type  and  contract  determination  is  also  significant  for  cumulative 
creativity-weighted  productivity  scores  through  the  eighth  unit  of  production  (F  =  4.82;  one-tailed 
p  =  0.02),  indicating  that  the  difference  favoring  creativity-weighted  pay  within  the  self-selection 
condition  for  these  early  puzzles  statistically  exceeds  the  difference  observed  in  the  random- 
assignment  condition. 
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FIGURE  3 

Cumulative  Creativity  Weighted-Productivity  by  Order  Submitted 


—  ■  Random 
Selection 


For  both  the  random  assignment  and  self-selection  conditions,  this  graph  presents  the  difference  in  cumulative 
creativity-weighted  productivity  between  individuals  with  creativity-weighted  productivity  incentives  and  those 
with  quantity-only  incentives  by  order  of  puzzle  submitted. 


Across  the  first  eight  puzzles,  “creativity-weighted  productivity”  is  essentially  a  scalar  mul¬ 
tiple  of  average  creativity  ratings,  as  nearly  all  of  our  participants  produce  at  least  eight  puzzles.'® 
For  simplicity,  we  therefore  shed  further  insight  on  HI  by  examining  average  creativity  ratings 
within  initial  production,  noting  that  tests  of  initial  creativity-weighted  productivity  for  the  same 
quantities  would  yield  the  same  inferences.  Starting  with  the  creativity  rating  of  the  very  first  unit 
of  production,  Table  2  presents  ANCOVA  results  (Panel  B),  simple  effects  (Panel  C),  and  a 
graphical  depiction  of  the  means  and  the  significant  interaction  detected  in  the  ANCOVA  (Panel 
A).  Table  3  provides  similar  information  for  the  average  creativity  ratings  of  participants’  output 
across  the  first  eight  puzzles  produced. 

The  first-puzzle  results  are  subject  to  the  noise  inherent  in  creativity  ratings  for  just  one  puzzle 
per  participant,  but  even  for  the  first  puzzle  (Table  2),  we  see  a  statistically  significant  interaction 


Only  three  of  the  90  participants,  each  in  a  different  experimental  condition,  produced  less  than  eight  puzzles,  and  no 
participant  produced  fewer  than  six  puzzles. 
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TABLE  2 


ANCOVA  for  the  Effect  of  Contract  Type  and  Contract  Determination  on  the  Creativity 

Rating  of  the  Eirst  Pnzzle  Produced 


Panel  A:  Mean  Creativity  Rating  of  First  Puzzle 


Panel  B:  Analysis  of  Covariance  on  Creativity  Rating  of  First  Puzzle 

Sum  of 

Factor 

Df 

Squares 

F 

p-value‘ 

Contract  Type 

1 

0.13 

0.10 

>0.50 

Contract  Determination 

1 

1.77 

1.38 

0.24 

Contract  Type  X  Determination 

1 

3.76 

2.92 

0.05 

Task  Familiarity  (covariate) 

1 

18.81 

14.62 

<0.01 

Error 

85 

Panel  C:  Simple  Effects 

Sum  of 

Factor 

Df 

Squares 

F 

p-value' 

Effect  of  Contract  Type  within  Self-Selection 

1 

1.24 

2.09 

0.08 

Effect  of  Contract  Type  within  Random  Assignment 

1 

2.69 

0.97 

0.33 

Reported  p-values  are  two-tailed  unless  testing  a  one-tailed  prediction,  as  signified  by  bold  face. 


between  contract  type  and  contract  determination  (F  =  2.92;  p  =  0.045)  and  at  least  marginally 
significant  evidence  that  participants  under  the  creativity-weighted  contract  exhibit  higher  creativ¬ 
ity  than  those  under  the  quantity-only  contract  when  participants  self-select  the  contract  they 
prefer  (F  =  2.09;  p  =  0.076,  one-tailed),  but  not  when  contracts  are  randomly  assigned  (F  =  0.97; 
p  =  0.33,  two-tailed).  We  obtain  stronger  evidence  for  average  creativity  ratings  across  the  first 
eight  units  of  production,  in  which  the  interaction  is  highly  significant  (F  =  4.90;  p  =  0.02)  and 
the  simple  effects  are  clearly  consistent  with  HI:  average  creativity  ratings  under  the  creativity- 
weighted  contract  are  higher  than  those  under  the  quantity-only  contract  only  when  participants 
self-select  the  contract  they  prefer  (F  =  6.00;  p  <  0.01,  one-tailed),  not  when  contracts  are 
randomly  assigned  (F  =  0.49;  p  =  0.49,  two-tailed). 

To  be  sure,  defining  “initial  creativity”  as  average  creativity  ratings  across  the  first  eight  units 
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TABLE  3 


ANCOVA  for  the  Effect  of  Contract  Type  and  Contract  Determination  on  Average 
Creativity  Rating  of  the  First  Eight  Puzzles 

Panel  A:  Mean  Average  Creativity  Rating  of  the  First  Eight  Puzzles 


Panel  B:  Analysis  of  Covariance  on  Average  Creativity  Rating  of  the  First  Eight  Puzzles 

Sum  of 


Factor 

df 

Squares 

F 

p-value“ 

Contract  Type 

1 

0.39 

1.50 

0.24 

Contract  Determination 

1 

<0.01 

<0.01 

>0.50 

Contract  Type  X  Determination 

1 

1.26 

4.90 

0.02 

Task  Familiarity  (covariate) 

1 

3.22 

12.50 

<0.01 

Error 

85 

Panel  C:  Simple  Effects 

Sum  of 

Factor 

df 

Squares 

F 

p-value‘ 

Effect  of  Contract  Type  within  Self-Selection 

1 

1.55 

6.00 

<0.01 

Effect  of  Contract  Type  within  Random  Assignment 

1 

0.13 

0.49 

0.49 

“  Reported  p-values  are  two-tailed  unless  testing  a  one-tailed  prediction,  as  signified  by  bold  face. 


of  production  is  somewhat  ad  hoc,  but  we  obtain  similar  results  defining  initial  creativity  as  the 
average  creativity  ratings  of  the  first  “n”  puzzles,  for  all  n  between  1  and  8.  That  is,  we  find  a 
statistically  significant  advantage  in  favor  of  the  creativity-weighted  contract  when  participants 
self-select  (all  F-statistics  >  2.76;  all  p  <  0.05),  but  not  when  contracts  are  randomly  assigned  (all 
F  <  0.42;  all  p  >  0.50).  Overall,  these  results  support  HI;  relative  to  the  random  assignment 
control  condition,  self-selection  enabled  those  choosing  the  creativity-weighted  contract  to  exhibit 
significantly  higher  initial  creativity. 

The  above  analysis  reflects  the  creativity  of  all  puzzles  generated  in  early  production.  How¬ 
ever,  to  the  extent  that  many  business  settings  place  primary  (or  exclusive)  value  on  high- 
creativity  output,  it  is  important  to  corroborate  our  findings  with  an  analysis  of  high-creativity 
output  only.  Table  4  graphs  means  (Panel  A),  presents  ANCOVA  results  (Panel  B),  and  reports 
simple  effects  within  the  random-assignment  and  self-selection  conditions  (Panel  C)  for  the  num- 
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TABLE  4 

ANCOVA  for  the  Effect  of  Contract  Type  and  Contract  Determination  on  the  Number  of 


High-Creativity  Puzzles  in  the  First  Eight  Puzzles  Produced 
Panel  A:  Mean  Number  of  High-Creativity  Puzzles  in  the  First  Eight  Puzzles  Produced 


Panel  B:  ANCOVA  on  Number  of  High-Creativity  Puzzles  in  the  First  Eight  Puzzles  Produced 


Factor 

df 

Sum  of 
Squares 

F 

p-value' 

Contract  Type 

1 

2.44 

1.51 

0.22 

Contract  Determination 

1 

0.07 

0.04 

>0.50 

Contract  Type  X  Determination 

1 

4.54 

2.81 

0.05 

Task  Familiarity  (covariate) 

1 

23.34 

14.44 

<0.01 

Error 

85 

Panel  C:  Simple  Effects 
Factor 

Effect  of  Contract  Type  within  Self-Selection 
Effect  of  Contract  Type  within  Random  Assignment 


df 

Sum  of 
Squares 

F 

p-value' 

1 

6.92 

4.28 

0.02 

1 

0.17 

0.10 

>0.50 

Reported  p-values  are  two-tailed  unless  testing  a  one-tailed  prediction,  as  signified  by  bold  face. 


her  of  high-creativity  (i.e.,  overall  top  quartile)  puzzles  across  the  first  eight  units  of  production. 
As  in  Tables  2  and  3,  the  contract  type  X  contract  determination  interaction  in  Table  4,  Panel  B  is 
statistically  significant  (F  =  2.81;  p  =  0.05).  As  Panel  C  indicates,  participants  using  the 
creativity-weighted  contract  submitted  more  high-creativity  puzzles  through  the  eighth  unit  of 
production  than  quantity-only  participants  when  participants  self-select  the  contract  they  prefer  (F 
=  4.28;  p  =  0.02,  one-tailed),  but  not  when  contracts  are  randomly  assigned  (F  =  0.10;  p  >  0.50, 
two-tailed). 

To  facilitate  interpretation.  Figure  4  depicts  the  pattern  of  high-creativity  production  across 
time.  For  self-selected  contracts,  those  preferring  the  creativity-weighted  scheme  are  “faster  out  of 
the  gate,”  producing  high-creativity  puzzles  at  an  initially  higher  rate  than  their  quantity-only 
counterparts  (Figure  4,  Panel  A).  In  the  random-assignment  condition,  by  contrast,  the  rate  of 
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FIGURE  4 

Cumulative  Production  of  High-Creativity  (Overall  Top  Quartile)  Puzzles 
Panel  A:  Self-Selection  Condition 


N. 

in 

a> 

> 

o 

n 


0) 

0) 

N 

N 

3 


'  Quantity-Only 
■Creativity-Weighted 


Puzzle  by  Order  Submitted 

Panel  B:  Random-Assignment  Condition 


h- 

in 

o 

> 

o 

n 

< 

(/) 

0) 

N 

N 

3 

Q. 

**- 

O 


■  Quantity-Only 
•Creativity-Weighted 


Puzzle  by  Order  Submitted 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


Attracting  Creativity:  The  Initial  and  Aggregate  Effects  of  Contract  Selection 


1687 


initial  high-creativity  production  is  nearly  identical  between  the  creativity-weighted  and  quantity- 
only  contracts  (Figure  4,  Panel  B),  suggesting  that  the  temporal  pattern  in  Panel  A  of  Figure  4  is 
a  contract-selection  effect,  consistent  with  HI. 

H2:  Aggregate  Creativity-Weighted  Productivity 

Figures  3  and  4  illustrate  that,  despite  their  initial  success,  self-selected  creativity-weighted 
participants  eventually  lose  steam.  That  is,  quantity-only  participants  eventually  dominate  in  over¬ 
all  creativity-weighted  productivity  (Figure  3),  and  catch-up  even  in  the  number  of  high-creativity 
puzzles  (Figure  4).  To  test  H2,  this  section  reports  analyses  for  creativity-weighted  productivity 
and  high-creativity  output  aggregated  over  the  entire  production  period. 

Table  5  reports  an  ANCOVA  with  aggregate  creativity-weighted  productivity  as  the  dependent 
variable.  The  only  significant  effect  from  this  ANCOVA  is  the  main  effect  of  contract  type,  with 
quantity-only  participants  significantly  outperforming  their  creativity-weighted  counterparts  (F  = 
19.06;  p  <  0.01,  two-tailed).  The  interaction  between  contract  type  and  contract  determination 
(self-selected  versus  random)  is  negligible  (F  =  0.02;  p  >  0.50).  Thus,  in  aggregate  creativity- 
weighted  production,  null  hypothesis  H2  must  be  rejected  in  favor  of  the  quantity-only  contract, 
irrespective  of  whether  contracts  are  self-selected  or  randomly  assigned. 

To  be  sure,  unlike  the  earlier  tests  of  HI  in  which  production  quantity  is  held  constant, 
“creativity-weighted  productivity”  increases  in  both  creativity  and  quantity  across  aggregate  pro¬ 
duction.  Accordingly,  the  superior  creativity-weighted  productivity  scores  of  the  quantity-only 
participants  even  in  the  self-selection  condition  could  plausibly  be  consistent  with  such  partici¬ 
pants  “flooding  the  market”  with  sufficient  quantities  of  low-creativity  output  to  offset  their  initial 
disadvantage  in  creativity  ratings  as  detected  earlier  in  HI.  We  address  this  possibility  with  two 
tests.  First,  we  address  the  influence  of  low-creativity  output  on  our  results  by  repeating  the 
aggregate  creativity-weighted  productivity  ANCOVA  after  deleting  all  output  in  the  overall  bottom 
quartile  of  creativity  ratings.  Results  (untabulated)  remain  similar  to  those  reported  above,  with  the 
main  effect  of  contract  type  continuing  to  evidence  greater  creativity- weighted  productivity  among 
the  quantity-only  participants  (F  =  12.85;  p  <  0.01,  two-tailed),  with  no  main  effect  of  contract 


TABLE  5 


ANCOVA  for  the  Effect  of  Contract  Type  and  Contract  Determination  on 
Creativity- Weighted  Productivity 


Factor  df 


Contract  Type  1 

Contract  Determination  1 

Contract  Type  X  Determination  1 

Task  Familiarity  (covariate)  1 

Error  85 


Sum  of 
Squares 

F 

p-value 

(two-tailed) 

30,174.76 

19.06 

<0.01 

961.48 

0.61 

0.44 

24.81 

0.02 

>0.50 

6,920.83 

4.37 

0.04 

Because  both  creativity  and  quantity  vary  in  aggregate  production,  we  can  no  longer  simplify  the  analysis  by  focusing 
on  average  creativity  ratings  as  in  testing  HI.  Average  creativity  ratings  do  not  even  serve  as  a  reliable  measure  of 
“creativity”  when  quantity  varies,  due  to  the  “denominator  effect”  that  results  in  lower  average  creativity  ratings  with 
more  overall  production  even  if  the  production  of  high-creativity  puzzles  remains  constant.  Our  subsequent  tests  in  this 
section  attempt  to  distinguish  the  effects  of  creativity  and  quantity  on  creativity-weighted  productivity. 
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determination  (F  =  1.09;  p  =  0.30)  and  no  interaction  between  contract  type  and  determination  (F 
=  0.15;  p  >  0.50). 

A  second  approach  is  to  analyze  high-creativity  production  only — that  is,  the  production  of 
puzzles  that  fall  in  the  overall  top  quartile  of  creativity  ratings.  ANCOVA  results  (untabulated) 
show  that  neither  contract  type,  contract  determination,  nor  the  interaction  between  the  two  sig¬ 
nificantly  influences  the  production  of  high-creativity  puzzles  in  the  aggregate  (all  p  >  0.20).^^ 
Figure  5  illustrates  this  near  equivalence,  splitting  the  total  quantity  produced  within  each  experi¬ 
mental  condition  into  high-creativity  puzzles  and  all  other  puzzles.  This  figure  shows  that 
quantity-only  participants  (both  self-selected  and  randomly  assigned)  did  indeed  produce  a  much 
greater  quantity  overall,  but  did  not  do  so  at  the  expense  of  less  high-creativity  production.  Thus, 
it  would  be  an  oversimplification  to  attribute  the  H2  results  to  quantity-only  participants  “flooding 
the  market”  with  low-creativity  efforts.  As  long  as  less  creative  output  is  not  actually  harmful  at 
the  margin,  the  quantity-only  participants  enjoy  substantial  gains  in  overall  production  while  also 
producing  just  as  many  high-creativity  efforts  as  those  paid, for  creativity- weighted  productivity. 


FIGURE  5 

Number  of  Puzzles  with  a  Composite  Creativity  Rating  Above  and  Below  5.7  by  Experimental 

Condition 


Random  Assignment  Random  Assignment  Self-Selection 

Quantity-Only  Creativity-Weighted  Quantity-Only 


Self-Selection 

Creativity-Weighted 


□  Average  Rating  Belo\w  5.7 
■  Average  Rating  Atxjve  5.7 


18 

Table  1  suggests  that  the  high-creativity  production  in  the  quantity-only,  random-assignment  cell  (5.95)  could  be  higher 
than  that  of  the  other  three  cells  (4,69,  4.33,  and  4.48).  However,  none  of  the  pairwise  comparisons  between  individual 
cells  (untabulated)  are  statistically  significant  at  conventional  levels  (highest  t  =  1.13;  two-tailed  p  =  0.27).  Hence,  the 
nominally  higher  production  in  the  quantity-only,  random-assignment  cell  could  simply  reflect  chance  variation. 
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As  indicated  earlier  (see  Figure  2),  it  is  also  unlikely  that  these  results  reflect  a  “ceiling  effect”  in 
high-creativity  production,  as  that  production  exhibits  considerable  overall  variance  around  the 
means  depicted  in  Figure  5. 

Returning  to  Figure  4,  it  becomes  apparent  how  our  H2  results  reeoncile  to  the  earlier  test 
showing  that  self-seleeted  ereativity-weighted  participants  produce  significantly  more  high- 
creativity  puzzles  in  initial  produetion  (HI).  While  these  participants  are  indeed  “faster  out  of  the 
gate,”  Figure  4  shows  that  self-seleeted  quantity-only  participants  continue  to  produce  high- 
creativity  puzzles  over  time,  while  their  ereativity-weighted  counterparts  appear  to  “burn  out.”  We 
interpret  this  pattern  as  consistent  with  the  opposing  effects  of  two  phenomena.  Namely,  the 
contract-selection  treatment  does  appear  to  attract  the  most  (initially)  creative  participants  to  the 
creativity-weighted  contract,  but  over  time,  the  effort-inducing  effects  of  the  two  contracts  lead 
even  the  participants  who  self-select  the  creativity-weighted  contract  to  try  too  hard  to  be  creative 
and  too  little  to  be  productive. 


V.  CONCLUSIONS 

In  what  we  believe  is  the  first  experimental  study  to  investigate  contract  selection  in  a  multi¬ 
dimensional  setting,  we  examine  the  effects  on  creativity-weighted  productivity  of  allowing  par¬ 
ticipants  to  choose  between  a  contract  that  rewards  creativity-weighted  productivity  and  one  that 
rewards  quantity  only.  We  find  that  participants  who  choose  the  creativity-weighted  eontract 
indicate  higher  self-perceptions  of  creative  potential  than  do  those  who  choose  the  quantity-only 
scheme.  These  self-perceptions  are  manifest  in  the  initial  creativity  of  participants’  productive 
output,  consistent  with  the  premise  that  a  ereativity-weighted  eontract  can  attract  relatively  cre¬ 
ative  workers.  To  the  extent  that  initial  creative  success  can  influence  subsequent  career  paths  and 
resources,  this  result  qualifies  KRW’s  conclusion  from  an  experiment  with  randomly  assigned 
contracts  that  creativity-weighted  compensation  impairs  creativity-weighted  productivity. 

Despite  the  benefit  of  a  self-selected  creativity-weighted  pay  scheme  that  we  observe  in  initial 
creativity,  we  find  that  quantity-only  participants  eventually  surpass  the  creativity-weighted  pro¬ 
ductivity  scores  of  their  creativity-weighted  counterparts.  Thus,  in  settings  in  which  all  workers 
have  enough  opportunity  to  “catch  up,”  our  results  suggest  that  even  those  who  self-select  into  a 
quantity-only  contract  can  eventually  produce  just  as  many  high-creativity  efforts  as  their 
creativity-weighted  counterparts,  along  with  several  other  efforts  of  more  modest  creativity.  Over¬ 
all,  the  potential  benefits  of  contract  selection  in  a  multi-dimensional  setting  that  values  both 
quantity  and  creativity  appear  to  depend  on  which  is  most  important:  a  fast  start  out  of  the  gate 
(initial  creativity)  or  sustainable  production  (aggregate  ereativity-weighted  productivity).  As  we 
can  envision  environments  of  each  variety,  our  results  suggest  a  more  complex  answer  to  the  costs 
and  benefits  of  a  creativity-weighted  pay  scheme  than  is  provided  in  earlier  research  by  KRW. 

These  results  contribute  to  the  contract-selection  literature  (e.g.,  Chow  1983;  Waller  and 
Chow  1985;  Shields  and  Waller  1988;  Dillard  and  Fisher  1990;  Bouwens  and  van  Lent  2006; 
Cadsby  et  al.  2007)  in  two  important  ways.  First,  our  findings  suggest  a  more  subtle  interpretation 
of  the  assumption  implicit  in  many  agency-based  eontract  selection  models  that  individuals  have 
perfect  insight  into  their  abilities.  For  softer  performance  dimensions  such  as  creativity,  we  find 
that  individuals  have  reasonable  self-insight  into  their  initial  creative  abilities,  but  cannot  neces¬ 
sarily  anticipate  the  effort-inducing  effects  of  the  selected  contracts  in  mapping  this  self-insight  to 
sustainable  aggregate  performance.  Thus,  the  descriptive  validity  of  agency-based  eontract  selec¬ 
tion  models  is  likely  to  depend  on  the  nature  of  the  ability  organizations  wish  to  attract.  Second, 
our  study  highlights  the  challenges  of  achieving  sustainable  selection  benefits  in  a  multi¬ 
dimensional  performance  setting.  Organizations  must  anticipate  how  contract  design  affects  not 
only  the  attraction  of  high-skilled  individuals,  but  also  the  (potentially  suboptimal)  trade-offs  that 
the  selected  contraets  motivate  among  multiple  performance  dimensions. 
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Our  results  also  build  on  KRW’s  efforts  to  gain  a  better  understanding  of  why  firms  are 
reluctant  to  reward  creativity  explicitly,  despite  calls  to  do  so  (Ittner  et  al.  1997).  Certainly  other 
elements  besides  contract  selection  could  shed  additional  insights  toward  this  end.  For  example, 
although  we  detect  differences  in  initial  versus  aggregate  production  even  in  a  20-minute  produc¬ 
tion  exercise,  truly  creative  efforts  can  often  take  days,  weeks,  or  even  years.  A  long-term  experi¬ 
ment  of  this  genre  would  be  more  difficult  to  control.  Still,  our  results  suggest  that  an  internally 
valid  longitudinal  analysis  of  creativity-weighted  productivity  could  shed  important  insights.  Simi¬ 
lar  comments  extend  to  the  potential  moderating  effects  of  feedback  (e.g.,  see  Sprinkle  2008), 
albeit  limited  by  the  inherent  delays  that  characterize  reliable  creativity  feedback  both  in  the 
laboratory  and  in  practice.  Finally,  the  results  of  a  laboratory  experiment  with  students  producing 
“rebus  puzzles”  do  not  necessarily  generalize  to  real-world  environments  with  experienced  pro¬ 
fessionals  (such  as  marketing  personnel)  undertaking  more  elaborate  creative  tasks  (such  as  de¬ 
signing  advertising  campaigns).  Advertising,  for  example,  might  require  a  more  unique  approach 
to  different  client  settings  than  is  the  case  for  designing  different  rebus  puzzles.  While  it  is  not 
clear  how  any  such  differences  would  interact  with  our  treatment  conditions,  theory-driven  inves¬ 
tigations  of  the  effects  of  expertise  or  task  characteristics  on  the  efficacy  of  creativity  incentives  in 
a  multi-dimensional  performance  setting  could  reveal  important  insights.  We  leave  further  explo¬ 
ration  of  these  issues  to  future  studies. 

In  closing,  we  echo  KRW’s  call  for  expanding  the  experimental  accounting  literature  on 
performance-based  incentives  beyond  the  traditional  interest  in  quantity.  Like  KRW,  our  focus  is 
on  blending  productivity  with  creativity,  which  we  believe  to  be  of  particular  importance  in 
contemporary  business  practice.  As  a  different  example,  Farrell  et  al.  (2008)  design  an  experiment 
to  examine  the  quantity-quality  trade-off,  in  which  “quality”  captures  elements  of  accurate  align¬ 
ment  with  customer  specifications.  Whether  creativity,  quality,  or  other  dimensions,  we  believe 
that  accounting  researchers  can  and  should  incorporate  “softer”  elements  of  performance  into  the 
study  of  performance-based  compensation.  Our  study  evidences  that  laboratory  methods  are  well- 
suited  to  such  inquiries  for  both  the  moral-hazard  effects  of  hidden  actions  and  the  adverse- 
selection  effects  of  hidden  information. 
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ABSTRACT:  Using  confidential  tax  shelter  and  tax  return  data  obtained  from  the  Inter¬ 
nal  Revenue  Service,  this  study  develops  and  validates  an  expanded  model  for  infer¬ 
ring  the  likelihood  that  a  firm  engages  in  a  tax  shelter.  Results  show  that  tax  shelter 
likelihood  is  positively  related  to  subsidiaries  located  in  tax  havens,  foreign-source  in¬ 
come,  inconsistent  book-tax  treatment,  litigation  losses,  use  of  promoters,  profitability, 
and  size,  and  negatively  related  to  leverage.  Supplemental  tests  show  that  total  book- 
tax  differences  (BTDs)  and  the  contingent  tax  liability  reserve  are  significantly  related  to 
tax  shelter  usage,  while  discretionary  permanent  BTDs  and  long-run  cash  effective  tax 
rates  are  not.  Finally,  the  model  is  weaker,  yet  still  significant,  in  the  FIN  48  disclosure 
environment.  This  research  provides  investors  and  policymakers  with  an  extended, 
validated  measure  to  calculate  the  presence  of  extreme  cases  of  corporate  tax  aggres¬ 
siveness.  Such  information  could  also  aid  analysts  and  other  tax  and  non-tax  research¬ 
ers  in  assessing  the  benefits  and  risks  of  firm  behavior. 

Keywords:  tax  shelter;  FIN  48;  book-tax  differences;  tax  cushion;  ROC  Curve. 

Data  Availability:  Financial  data  used  in  this  study  are  available  from  public  sources. 
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load  Identification  Office  (RWI),  and  the  Office  of  Tax  Shelter  Analy¬ 
sis  (OTSA),  and  are  not  publicly  available. 
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I.  INTRODUCTION 

Tax  shelters  are  transactions  or  arrangements  that  generate  tax  losses  without  incurring 
economic  losses  or  risk  (U.S.  Department  of  the  Treasury  1999,  6;  hereafter  Treasury). 
Their  main  goal  is  to  lower  the  corporate  tax  liability  by  exploiting  discontinuities  in  the 
tax  law.^  Tax  shelters  are  illegal  when  they  do  not  exhibit  “economic  substance”  or  a  “business 
purpose” — that  is,  when  they  are  created  for  the  sole  aim  of  evading  tax  rather  than  filling  a  non- 
(or  pre-)  tax  economic  need.  From  a  research  and  tax  administration  standpoint,  tax  shelters  are 
transactions  that  are  classified  as  “Listed”  or  Non-Listed  “Reportable”  transactions  by  the  U.S. 
Treasury  Department  (see  Graham  and  Tucker  2006;  Hennig  et  al.  2008).  These  transactions  are 
described  in  specific  Internal  Revenue  Service  (IRS)  notices,  rulings,  and  guidance.  Inferring  tax 
shelter  use  from  financial  statements  is  the  main  focus  of  this  study. 

The  impact  of  tax  shelters  is  substantial.  The  Treasury  (1999)  concluded  that  corporate  tax 
shelters  cost  the  federal  government  approximately  $10  billion  annually.  In  response  to  such  tax 
base  erosion,  the  IRS  in  2000  established  the  Office  of  Tax  Shelter  Analysis  (OTSA),  whose  sole 
task  is  to  detect  and  track  tax  sheltering  activity.  Recent  tax  shelter  litigation  against  KPMG 
challenged  $2.5  billion  in  tax  benefits  (or  $11  billion  in  sham  tax  losses)  marketed  and  sold  to 
clients  from  1997  to  2001  (IRS  2005). 

In  light  of  these  economic  considerations,  this  study’s  main  focus  is  to  extend  prior  empirical 
work  on  inferring  tax  shelter  participation  from  financial  statement  information  (e.g.,  Wilson 
2009).  I  use  a  conceptual  framework  from  the  Treasury  (1999)  white  paper  on  tax  shelters  to 
extend  the  number  and  types  of  variables  that  can  be  used  to  infer  corporate  tax  shelter  use.  Unlike 
previous  studies,  I  am  able  to  employ  a  relatively  large,  confidential  tax  shelter  data  set  obtained 
from  the  IRS,  allowing  for  the  examination  of  64  categories  of  tax  shelters,  with  267  tax  shelter 
observations  reported  in  the  relatively  short  period  from  2000  to  2004.^  This  analysis  allows 
researchers,  investors,  and  administrators  to  utilize  clues  in  the  financial  statements  to  identify 
corporate  tax  shelter  transactions. 

In  supplemental  analyses,  I  examine  the  extent  to  which  tax  sheltering  behavior  is  related  to 
four  alternative  proxies  of  tax  avoidance  and  tax  aggressiveness  that  have  recently  emerged  in  the 
archival  tax  literature.  In  particular,  I  test  whether  tax  shelters  are  related  to  discretionary  perma¬ 
nent  book-tax  differences  (Frank  et  al.  2009),  long-run  cash  effective  tax  rates  (Dyreng  et  al. 
2008),  total  book-tax  differences  (Mills  1998),  and  the  contingent  tax  liability  reserve,  or  tax 
cushion  (Gleason  and  Mills  2002).  I  also  test  the  reported  model’s  usefulness  in  gleaning  tax 
sheltering  behavior  during  the  new  regime  of  Financial  Interpretation  No.  48  (FIN  48,  FASB 
2006),  in  which  firms  must  disclose  unrecognized  tax  benefits.  I  conduct  this  analysis  because  it  is 
unclear  whether  the  new  FIN  48  disclosures  deter  the  use  of  corporate  tax  shelters,  thus  potentially 
reducing  the  power  of  the  model  developed  here. 

The  ability  to  infer  the  presence  of  tax  shelters  is  important  for  several  reasons.  Beyond  lost 
tax  revenues  to  the  Treasury  and  related  issues  of  equity  in  the  tax  system,  identifying  tax  shelter 
firms  can  provide  useful  information  about  managerial  risk-taking  behavior  (see  Desai  and  Dhar- 
mapala  2006).  The  results  of  this  study  can  also  aid  investors,  analysts,  and  other  researchers 


'  Graetz  (1999,  Al)  more  colorfully  defines  a  tax  shelter  as  a  “deal  done  by  very  smart  people  that,  absent  tax  consid¬ 
erations,  would  be  very  stupid.” 

^  While  the  term  “tax  shelters”  is  used  in  the  media  to  capture  numerous  types  of  general  tax-saving  strategies  more 
commonly  referred  to  in  academic  literature  as  “tax  shields,”  such  as  depreciation  and  deferred  compensation,  the  IRS’s 
use  of  the  term  is  more  specific.  See  Sections  II  and  III  for  details. 

^  This  study  does  not  claim  to  have  access  to  the  entire  population  of  public  U.S.  firms  engaging  in  tax  shelters — only  the 
firms  whose  tax  shelters  have  been  identified  by  IRS  through  audits  or  “voluntary”  disclosures.  There  are  obviously 
some  tax  shelters  whose  existence  is  yet  to  be  discovered  by  the  IRS. 
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interested  in  inferring  tax  shelter  activities  with  respect  to  valuation  (e.g.,  Hanlon  and  Slemrod 
2009).  Given  greater  regulatory  scrutiny  and  significant  penalties  that  could  be  imposed  on  firms, 
as  well  as  new  required  financial  disclosures  for  uncertain  tax  positions,  knowledge  of  tax  shelter 
use  is  increasingly  relevant  for  investors  and  analysts  as  they  assess  the  risk-reward  trade-offs  of 
investments;  for  auditors  as  they  grapple  with  client  reporting  risks;  for  policymakers  as  they 
assess  the  current  disclosure  and  compliance  environment;  and  for  tax  administrators  as  they  seek 
to  supplement  current  compliance  risk-assessment  models.  Furthermore,  analyzing  the  extent  to 
which  empirical  proxies  for  tax  avoidance  and  aggressiveness  represent  underlying  tax  shelter 
transactions  answers  the  call  in  Hanlon  and  Heitzman  (2009)  for  researchers  to  develop  a  better 
understanding  on  the  usefulness  of  such  measures  in  various  contexts. 

The  main  results  show  that  the  likelihood  of  corporate  tax  shelter  utilization  is  positively 
related  to  the  presence  of  corporate  subsidiaries  located  in  tax  havens,  foreign-source  income, 
inconsistent  book-tax  treatment,  litigation  losses,  use  of  promoters,'^  profitability,  and  size.  Tax 
shelter  utilization  is  also  negatively  related  to  leverage.  The  model  developed  in  this  study  in¬ 
creases  the  explanatory  power  of  the  Wilson  (2009)  model  and  corroborates  the  result  in  Graham 
and  Tucker  (2006)  of  the  negative  relation  between  debt  and  tax  shelters.  I  also  develop  a  metric 
called  TaxShelterScore,  which  is  derived  from  this  study’s  main  model  parameter  estimates.  Tax- 
ShelterScore  represents  the  estimated  probability  that  a  firm  engages  in  a  tax  shelter,  and  can  be 
used  to  infer  corporate  tax  shelter  usage.  However,  it  cannot  identify  the  dollar  size  of  tax  shelter 
benefits.  Nevertheless,  tests  using  TaxShelterScore  show  that  the  parameters  have  strong  discrimi¬ 
natory  power  in  inferring  corporate  tax  shelter  use.  Applying  TaxShelterScore  to  a  holdout  sample 
of  firms,  the  area  under  the  Receiver  Operator  Characteristic  (ROC)  Curve  is  0.88.  In  other  words, 
the  model  yields  an  88  percent  chance  that  a  firm  with  a  tax  shelter  has  a  higher  predicted 
probability  (or  TaxShelterScore)  than  a  firm  without  a  tax  shelter,  for  random  pairs  of  firms  with 
and  without  shelters.  This  result  supports  the  model’s  use  as  a  reliable  measure  for  identifying  the 
most  severe  form  of  tax  aggressiveness — tax  shelters. 

To  probe  the  relative  benefits  of  the  model’s  refinements  in  measuring  tax  shelter  behavior  in 
particular,  supplemental  analyses  test  the  univariate  link  between  tax  shelters  and  four  other 
proxies  for  tax  avoidance  and  aggressiveness.  Results  show  a  strong  positive  relation  between  tax 
shelter  usage,  and  both  total  book-tax  differences  and  the  contingent  tax  liability  reserve,  or  tax 
cushion,  while  there  is  no  significant  association  between  tax  shelter  usage  and  either  discretionary 
permanent  book-tax  differences  or  long-run  cash  effective  tax  rates.  Meanwhile,  the  TaxShelter¬ 
Score  developed  herein  provides  the  strongest  link  to  actual  tax  shelter  incidence.^  Finally,  supple¬ 
mental  tests  show  that  TaxShelterScore  is  weaker  in  identifying  tax  sheltering  in  the  FIN  48 
disclosure  environment  than  in  a  pre-FIN  48  period. 

The  study  is  organized  as  follows.  Section  II  presents  a  brief  background  on  tax  shelters,  tax 
aggressiveness  proxies,  and  FIN  48.  Section  III  explains  the  OTSA  tax  shelter  data  and  develops 
proxies  for  tax  shelter  characteristics.  Section  IV  explains  the  sample  and  specifies  the  empirical 
model.  Section  V  reports  main  results  for  the  tax  shelter  likelihood  model.  Section  VI  reports 
supplemental  analyses  on  the  tax  aggressiveness  proxies  and  usefulness  of  the  model  during  the 
FIN  48  disclosure  period.  Section  VII  concludes. 


The  term  “promoter”  is  used  to  represent  the  suppliers,  marketers,  and  financiers  of  tax  shelters. 

^  These  results  speak  only  to  the  various  proxies’  link  to  actual  tax  shelter  incidence,  but  does  not  purport  that  the  other 
measures  do  not  capture  more  general  (non-tax  shelter)  tax  avoidance  or  tax  aggressiveness. 
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II.  BACKGROUND 

Inferring  Tax  Shelters 

The  primary  problem  with  tax  shelters  from  research,  investment,  tax  administration,  and 
enforcement  perspectives  is  detecting  them.  Because  publicly  traded  corporate  taxpayers  disclose 
only  their  financial  statements  and  not  tax  returns,  it  is  difficult  to  calculate  a  firm’s  actual  taxable 
income  or  tax  burden  with  any  degree  of  accuracy  (Hanlon  2003),  let  alone  identify  the  possible 
use  of  tax  shelters.®  McGill  and  Outslay  (2004)  detail  the  limitations  of  using  only  publicly 
available  financial  statements  in  detecting  tax  shelters.  They  analyze  Enron’s  financial  statements 
in  an  attempt  to  glean  tax  shelter  activity,  but  acknowledge  that  it  is  extremely  difficult  to  discern 
tax  shelters  by  examining  financial  statements  alone.^ 

Earlier  research  has  examined  patterns  in  financial  statement  information  to  infer  tax  shelter 
usage.  For  example,  Plesko  (2000)  asserts  that  tax  shelters  are  likely  one  reason  why,  in  aggregate, 
pretax  book  income  has  grown  at  a  faster  rate  than  current  or  deferred  tax  expense  from  1994  to 
1998.  Desai  (2003)  finds  that,  in  1998,  less  than  half  of  book-tax  differences  are  attributable  to 
their  main  determinants  of  international  operations,  stock  options,  and  depreciation  accounting. 
Using  a  deductive  approach,  he  suggests  that  the  unexplained  portion  of  book-tax  differences  is  an 
indicator  of  general  tax  sheltering  activity.  Similarly,  Desai  and  Dharmapala  (2006)  use  a  deduc¬ 
tive  approach  to  model  management  incentives  to  engage  in  tax  shelters.  This  study  proposes  an 
inductive  approach  in  identifying  tax  sheltering  behavior. 

More  recent  research  identifies  specific  tax  shelters  using  public  sources,  such  as  Tax  Court 
cases  and  media  news  releases.  Graham  and  Tucker  (2006)  use  a  sample  of  43  firms  identified 
from  public  records  as  having  engaged  in  tax  shelters  over  a  26-year  period  (1975-2000).  They 
find  that  firms  engaging  in  tax  shelters  use  less  debt,  suggesting  a  substitution  effect  in  which  both 
vehicles  can  be  used  to  lower  taxable  income.  They  acknowledge  that  their  sample,  which  is  based 
on  public  records,  likely  suffers  from  self-selection  issues,  and  focuses  on  large  cases  that  might 
not  generalize  to  a  broader  population. 

Wilson  (2009)  extends  Graham  and  Tucker’s  (2006)  sample  to  2002  and  develops  a  profile  of 
firms  that  are  likely  to  be  engaged  in  tax  shelters.  He  identifies  59  such  firms  over  a  28-year 
period,  with  a  total  of  215  tax  shelter  firm-year  observations.  Wilson  (2009)  finds  evidence  that  tax 
shelter  participation  is  associated  with  book-tax  differences,  performance-matched  discretionary 
accruals,  size,  return  on  assets  (profitability),  and  foreign  income. 

My  research  directly  extends  Wilson  (2009)  by  developing  an  expanded  profile  of  tax  shelter 
participants.  I  supplement  his  model  with  variables  developed  from  a  conceptual  framework  in  the 
Treasury  (1999)  white  paper  on  tax  shelters.  In  designing  the  main  tests,  I  use  267  tax  shelter-year 
observations,  as  identified  by  the  IRS’s  Office  of  Tax  Shelter  Analysis,  over  a  relatively  short 
five-year  period  of  2000-2004.^ 


^  Prior  literature  also  explains  the  limitations  of  using  financial  statements  to  estimate  a  corporation’s  general  tax  status 
(e.g.,  Hanlon  and  Shevlin  2002;  McGill  and  Outslay  2002;  Mills  and  Plesko  2003).  Using  IRS  tax  return  data,  Lisowsky 
(2009)  develops  an  empirical  model  for  inferring  a  corporation’s  tax  burden  from  financial  data. 

^  Tax  return  information  that  is  not  publicly  disclosed,  but  that  can  help  in  identifying  tax  shelter  usage  includes  infor¬ 
mation  on  Form  8886  and  the  Schedule  M-3,  which  are  used  later  in  supplemental  tests  that  encompass  a  more  recent 
time  period.  However,  identifying  tax  shelter  transactions  from  tax  returns  alone  can  still  remain  elusive  if  the  taxpayer 
is  generally  noncompliant  in  its  tax  filings  to  begin  with. 

®  One  advantage  of  using  the  OTSA  data  relative  to  the  Graham  and  Tucker  (2006)  sample  is  that  entry  into  the  OTSA 
sample  is  not  dependent  on  observations  being  discovered  through  tax  litigation  or  public  documents.  One  disadvantage 
of  using  tax  return  data  is  that  differences  in  financial  and  tax  consolidation  rules  can  include  or  exclude  certain  entities 
or  transactions  for  one  set  of  books  but  not  for  another  (Lisowsky  and  Trautman  2007;  and  Boynton  et  al.  2008). 
Therefore,  as  in  Wilson  (2009),  I  also  include  a  broad  measure  for  book-tax  differences. 
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Alternative  Measures  of  Tax  Aggressiveness 

A  demand  currently  exists  for  a  broadly  accepted  definition  and  measure  of  corporate  tax 
aggressiveness  (see  Hanlon  and  Heitzman  2009).  The  difficulty  is  that  tax  aggressiveness  is  a 
matter  of  judgment,  degree,  and  scope.  In  terms  of  judgment,  firms,  tax  administrators,  and  even 
courts,  must  develop  parameters  around  which  transactions  are  deemed  to  exhibit  economic  sub¬ 
stance  or  serve  a  business  purpose.  In  terms  of  degree  and  scope,  tax  minimization  can  be  con¬ 
ceptualized  as  a  continuum,  from  avoidance  to  aggressiveness  to  sheltering. 

I  propose  a  broad  definition  of  tax  aggressiveness  as  behavior  in  which  tax  burden  reduction 
is  of  first-order  importance,  with  pre-  or  non-tax  effects  viewed  as  second-order,  or  marginal.  Tax 
shelters,  then,  would  be  the  most  severe  form  of  tax  aggressiveness,  as  those  transactions  are 
clearly  prohibited  by  the  IRS.  To  illustrate,  tax  benefits  from  items  such  as  depreciation  or  stock 
options  are,  in  expectation,  of  second-order  (or  marginal)  importance  to  the  first-order  capital 
allocation  and  managerial  incentive  alignment  effects,  respectively.  These  items  can  be  viewed  as 
benign  tax  avoidance.  Tax  aggressiveness  can  differ  from  tax  avoidance  in  terms  of  scope  as  well; 
tax  aggressive  transactions  (e.g.,  aggressive  transfer-pricing  policies)  create  tax  avoidance,  but  tax 
avoidance  effects  are  not  necessarily  due  to  tax  aggressiveness  (e.g.,  depreciation).  Even  within 
the  realm  of  tax  aggressiveness,  there  are  degrees  to  which  aggressiveness  is  relatively  low  (e.g., 
paying  employee  bonuses  right  before  rather  than  right  after  fiscal  year-end)  or  high  (e.g.,  tax 
shelters).  In  this  study,  I  focus  on  modeling  the  most  extreme  part  of  the  tax  minimization 
continuum — tax  sheltering. 

The  recent  archival  literature  has  developed  and  used  several  measures  for  corporate  tax 
avoidance  and  aggressiveness.  First,  Frank  et  al.  (2009)  measure  tax  aggressiveness  as  the  residual 
from  a  regression  of  permanent  book-tax  differences  (BTDs)  on  its  known  components,  such  as 
state  tax  expense,  intangibles,  and  other  items.  Their  rationale  is  that  tax  shelters,  which  are 
discretionary  transactions,  derive  value  as  permanent  BTDs  that,  by  reducing  current  taxable 
income  or  generating  tax  credits,  can  understate  the  effective  tax  rate,  boosting  financial  earnings. 

Second,  Dyreng  et  al.  (2008)  develop  a  measure  of  long-run  tax  avoidance  as  the  ten-year  sum 
of  cash  taxes  paid  divided  by  the  ten-year  sum  of  pretax  income  less  special  items.  They  appro¬ 
priately  note  that  tax  avoidance  might  not  pinpoint  tax  aggressiveness;  avoidance  can  include 
legitimate  deductions,  while  aggressiveness  is  a  subset  of  avoidance  because  aggressiveness  can 
also  include  prohibited  activities  such  as  tax  sheltering.  Nevertheless,  several  academic  studies 
have  used  this  measure  as  a  proxy  for  tax  aggressiveness. 

Third,  Mills  (1998)  finds  that  firms  with  greater  BTDs  are  positively  related  to  audit  adjust¬ 
ments,  implying  that  BTDs,  at  least  in  part,  represent  general  tax  risk.  Prior  literature  also  shows 
that  BTDs  capture  financial  earnings  management  (Hanlon  2005).  To  extend  the  analysis  and 
examine  whether  information  in  BTDs  are  also  informative  of  tax  sheltering,  I  test  for  the  relation 
between  BTDs  and  tax  shelters. 

Finally,  Gleason  and  Mills  (2002)  find  that  the  contingent  tax  liability  reserve,  or  tax  cushion, 
is  increasing  in  expected  tax  losses.  One  explanation  is  that  a  nontrivial  portion  of  the  tax  cushion 
likely  reflects  tax  sheltering  transactions  (also  see  Lisowsky  et  al.  2010). 

To  understand  the  extent  to  which  various  proxies  for  tax  avoidance  and  tax  aggressiveness 
measure  tax  sheltering  activities  in  particular,  the  first  supplemental  analysis  tests  each  of  the 
above  proxies’  relation  to  actual  tax  shelter  use. 


®  Note  that  this  analysis  is  not  intended  to  find  the  most  appropriate  measure  for  general  tax  aggressiveness.  Because  tax 
aggressiveness  is  a  matter  of  judgment,  degree,  and  scope,  some  measures  may  not  be  related  to  tax  sheltering  in 
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Tax  Sheltering  in  a  FIN  48  Reporting  Environment 

The  second  supplemental  analysis  examines  the  usefulness  of  the  model  in  inferring  tax 
sheltering  in  a  FIN  48  disclosure  environment.  For  publicly  traded  firms  with  fiscal  years  begin¬ 
ning  after  December  15,  2006,  FIN  48  requires  disclosure  of  unrecognized  (i.e.,  uncertain)  tax 
benefits  in  the  financial  statement  footnotes.  A  common  conjecture  is  that  FIN  48  provides  the  IRS 
with  a  “roadmap”  with  which  to  discover  potentially  aggressive  tax  positions  (Mills  et  al.  2010). 
As  such,  the  additional  disclosures  of  tax  uncertainty  could  curtail  certain  tax  aggressive  behavior, 
potentially  weakening  the  applicability  of  the  model  in  the  FIN  48  environment.  Therefore,  I 
examine  the  model’s  usefulness  in  this  new  regime. 

III.  TAX  SHELTERS  AND  THEIR  CHARACTERISTICS 

Tax  Shelters 

This  study  employs  a  new,  expanded  data  set  on  corporate  tax  shelters  obtained  from  the  IRS 
Office  of  Tax  Shelter  Analysis  (OTSA).  These  data  capture  19  categories  of  specifically  identified 
illegal  corporate  tax  shelters  known  as  “Listed  Transactions,”  with  interesting  names  such  as  Son 
of  Boss  (Notice  2000-44),  Lease  Strips  (Rev.  Rul.  2003-55),  and  Partnership  Straddles  (Notices 
2002-50,  2002-65,  and  2003-54).'°  These  corporate  tax  shelters  have  been  so  widely  used  that  the 
Treasury  Department  and  IRS  have  issued  Revenue  Rulings,  Treasury  Regulations,  and  Notices 
explicitly  prohibiting  them. 

Tax  shelters  are  also  grouped  into  a  second  category  of  (Non-Listed)  “Reportable  Transac¬ 
tions.”  These  transactions  are  somewhat  more  general  in  nature."  The  OTSA  data  used  in  this 
study  capture  45  categories  of  corporate  Reportable  Transactions,  such  as  corporate-owned  life 
insurance  (COLI),  bank-owned  life  insurance  (BOLI),  and  foreign  sales  corporation  advance 
payments.  Listed  or  Reportable  Transactions  are  reported  for  the  fiscal  year,  similar  to  tax  return 
filings  (and  financial  statements).  I  define  the  indicator  variable  TaxShelter  as  equal  to  1  if  it  is 
classified  as  a  Listed  or  (Non-Listed)  Reportable  Transaction,  and  0  otherwise.  The  data  set  covers 
the  five-year  period  2000  to  2004. 

In  general.  Listed  and  Reportable  Transactions  achieve  tax  sheltering  in  one  of  three  ways. 
First,  they  artificially  inflate  or  shift  asset  bases  to  other  (typically  tax-indifferent)  parties  upon 
sale  to  decrease  gains  subject  to  tax  or  shift  gains  to  a  lightly  taxed  jurisdiction  or  party.  Second, 
they  accelerate  or  inflate  expenses  to  decrease  current  taxable  income.  Third,  they  defer  revenue  to 
understate  taxable  income.  Each  of  these  ways  can  also  influence  the  eligibility  of  certain  tax 
credits  that  can  be  claimed  directly  against  the  firm’s  tax  liability. 

OTSA  gathers  data  on  tax  shelters  through  both  enforcement  actions  and  voluntary  compli¬ 
ance  (i.e.,  self-reporting)  by  corporations.  For  example,  IRS  Form  8886  captures  information  on 
Listed  and  Reportable  Transactions,  such  as  their  type,  nature,  and  in  some  cases,  dollar  amount  of 


particular  because  they  are  not  designed  to  do  so.  This  result  does  not  necessarily  rule  out  their  ability  to  capture  other 
types  of  (non-tax  shelter)  tax  aggressiveness  or  avoidance.  The  aim  is  to  evaluate  the  ability  of  these  various  proxies  to 
capture  tax  sheltering  behavior  in  particular,  that  is,  the  most  extreme  case  of  tax  minimization. 

The  remaining  16  categories  of  coiporate  listed  transactions  used  in  this  study  are  included  in 
http:/www.irs.gov/businesses/corporations/article/0„id=  120633,00. html. 

’*  To  be  more  precise.  Listed  Transactions  are  a  special  class  of  Reportable  Transactions.  In  particular.  Listed  Transactions 
are  specifically  identified  tax  shelters  while  the  remaining  (Non-Listed)  Reportable  Transactions  are  more  general  tax 
shelter  transactions.  For  expositional  purposes  of  this  study,  as  well  as  to  distinguish  between  the  potentially  different 
levels  of  tax  sheltering,  I  separate  the  specific  from  general  tax  shelter  transactions,  calling  the  former  “Listed”  and  the 
latter  simply  “Reportable.”  In  effect,  the  short-hand  term  “Reportable”  used  in  this  study  refers  specifically  to  Report- 
able  Transactions  that  are  not  already  classified  as  Listed  Transactions. 

Previous  studies  also  interpret  Reportable  Transactions,  such  as  COLI,  as  tax  shelters  (Graham  and  Tucker  2006;  Brown 
2009;  Wilson  2009).  Examples  of  other  Reportable  Transactions  included  in  the  OTSA  data  set  are  donations  of 
intellectual  assets,  asset  securitizations,  and  extraterritorial  income  exclusions. 
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tax  benefits.  After  2004,  Schedule  M-3  also  captures  some  information  on  total  Reportable  (in¬ 
cluding  Listed)  Transactions,  such  as  their  book  and  tax  income  effects  and  a  breakdown  between 
temporary  and  permanent  book-tax  differences.  This  information  is  not  available  for  my  sample 
period,  although  I  use  both  the  8886  and  M-3  data  later  in  the  study  to  measure  tax  shelter  activity 
in  a  more  recent  FIN  48  period. 

Unfortunately,  the  OTSA  data  do  not  provide  a  clear  indication  as  to  the  motivation  for 
reporting.  For  example,  corporate  tax  shelters  thought  to  be  “voluntarily”  reported  to  the  IRS 
could  actually  be  due  to  enforcement  actions  or  their  threat.  A  related  limitation  is  that  TaxShelter 
can  simply  be  a  proxy  for  the  probability  of  IRS  audit.  Although  I  cannot  rule  out  this  possibility, 
interdependencies  surely  exist  between  tax  shelter  reporting  and  audit  regime,  as  one  setting 
influences  the  other.  I  do  not  have  data  on  IRS  audit  scoring  models  and  therefore  cannot  control 
for  this  effect.  However,  the  major  benefit  of  using  OTSA  data  is  that  inclusion  in  the  sample  is  not 
conditional  on  whether  the  shelters  are  publicly  disclosed,  litigated,  and/or  settled.  It  provides  the 
most  concentrated  sample  of  observations  yet  employed,  increasing  power  of  tests  and  as  a  result, 
generalizability  for  future  research. 

Tax  Shelter  Characteristics 

Wilson  (2009)  finds  that  tax  shelters  are  positively  related  to  total  book-tax  differences, 
performance  matched  discretionary  accruals  (see  Kothari  et  al.  2005),  size,  profitability  (return  on 
assets),  and  foreign  income.  I  extend  the  Wilson  (2009)  model  by  also  including  constructs  based 
on  the  conceptual  framework  from  the  Treasury  (1999)  detailing  the  characteristics  of  tax  shelters. 
I  focus  on  the  additional  Treasury  (1999)  characteristics  here  and  refer  the  reader  to  Wilson  (2009) 

13 

for  a  discussion  of  the  other  charactenstics. 

First,  the  Treasury  (1999)  explains  that  the  use  of  tax-indifferent  parties,  such  as  foreign  or  tax 
exempt  entities,  is  likely  to  be  a  characteristic  of  tax  sheltering.  Hines  and  Rice  (1994)  find  that 
corporate  subsidiaries  located  in  tax  haven  or  low-tax  foreign  jurisdictions  can  lower  tax  payments 
in  general.  Desai  et  al.  (2006)  explain  that  these  entities  can  participate  in  transactions  to  absorb 
taxable  income  or  deflect  tax  liability.  As  such,  they  find  that  tax  haven  operations  facilitate  a 
reduction  in  the  tax  burden  by  allowing  firms  to  reallocate  and  defer  the  U.S.  tax  related  to  foreign 
income.  Another  method  to  shelter  tax  payments  is  by  transacting  with  municipalities,  as,  for 
example,  in  the  lease-in  lease-out  (LILO)  transactions  between  Wachovia  and  U.S.  and  foreign 
municipalities  (McGill  and  Outslay  2004). I  go  a  step  further  and  test  whether  there  is  a  positive 
relation  between  tax  shelters  in  particular  and  tax  haven  operations,  which  can  include  low-tax  or 
tax-indifferent  subsidiaries. 

To  measure  the  tax  haven  aspect  of  tax-indifferent  or  low-tax  counterparties,  I  develop  an 
indicator  variable  called  TaxHaven.  Data  are  hand-collected  from  SEC  Form  10-K  Exhibit  21, 
which  enumerates  the  state  and  country  jurisdictions  in  which  a  U.S.  company’s  subsidiaries  are 
located.  I  cross-reference  Exhibit  21  to  the  OECD  (2000,  17)  list  of  tax  havens. If  a  firm  reports 
a  subsidiary  in  such  a  tax  haven,  TaxHaven  is  set  equal  to  1,  and  0  otherwise.  Because  locating  a 
subsidiary  in  a  tax  haven  is  likely  to  be  characteristic  of  tax  sheltering  consistent  with  the  Treasury 
(1999),  Desai  et  al.  (2006),  and  Hines  and  Rice  (1994),  I  predict  a  positive  association  between 


As  in  Wilson  (2009),  this  study  focuses  on  characteristics,  not  economic  incentives,  of  tax  shelter  usage. 

Wachovia  is  not  included  in  my  sample.  If  it  were,  IRS  data  disclosure  agreements  would  prevent  me  from  identifying 
it.  In  the  context  of  tax-indifferent  parties,  Wachovia  simply  illustrates  how  LILOs  can  be  used  in  practice.  LILO 
transactions  in  general  are  included  in  the  OTSA  sample  data. 

The  Organisation  for  Economic  Co-operation  and  Development  (OECD)  tracks  tax  haven  jurisdictions  and  urges  their 
compliance  with  international  standards  on  tax  administration. 
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TaxHaven  and  tax  shelters. 

Second,  the  Treasury  (1999)  explains  that  tax  shelters  exploit  inconsistent  financial  and  tax 
treatment  of  certain  transactions.  Therefore,  transactions  such  as  tax  shelters  that  create  permanent 
book-tax  differences  by  lowering  taxable  income  or  generating  tax  credits,  reduce  a  corporation’s 
effective  tax  rate,  increasing  reported  earnings  and  (perceived)  shareholder  value.  The  Treasury 
(1999,  7)  notes  that  the  focus  on  increasing  corporate  earnings  and  managing  the  reported  effec¬ 
tive  tax  rate  has  “caused  many  corporations  to  treat  their  in-house  tax  departments  as  profit 
centers,”  with  corporate  accountants  viewing  that  “a  high  effective  tax  rate  is  a  sign  of  weakness.” 
McGill  and  Outslay  (2002,  1136)  explain  that  “most  tax  shelter  products  today  have  vitality 
because  of  their  positive  impact  on  the  corporation’s  tax  liability  (reduced)  and  its  earnings  per 
share  (increased).”  Therefore,  there  is  reason  to  expect  that  tax  shelters  are  used  to  mitigate  an 
otherwise  high  effective  tax  rate  reported  to  shareholders. 

I  infer  inconsistent  book-tax  treatment  by  examining  two  items.  The  first  is  the  tax  rate 
reported  in  the  financial  statements.  The  tax  rate  that  is  most  directly  related  to  earnings  is  the 
reported  effective  tax  rate  (ETR),  or  the  ratio  of  total  tax  expense  to  total  pretax  income.  The 
Treasury  (1999)  explains  that  the  ETR  is  closely  measured  by  corporate  tax  departments  and  can 
capture  a  key  reporting  incentive  to  engage  in  tax  sheltering  behavior,  namely,  to  manage  tax 
expense.  Therefore,  I  predict  that  the  greater  the  prior-year  reported  ETR  (LagETR),  or  the  ratio  of 
total  tax  expense  (Compustat  #16)  to  total  pretax  income  (#170),  the  more  likely  it  is  that  a  firm 

17 

uses  a  tax  shelter  in  the  current  year. 

The  second  item  used  in  inferring  inconsistent  book-tax  treatment  is  the  indicator  variable 
EqEarn,  which  is  equal  to  1  if  equity  method  earnings  (#55)  are  present,  0  otherwise.  I  measure 
equity  method  earnings  because  it  is  a  peculiar  case  in  consolidation  accounting  where  a  net 
percentage  share  of  income  and  assets  of  a  20-49  percent-owned  corporate  investment  is  reported 
on  a  parent’s  consolidated  financial  statements,  but  remains  on  a  cost  basis  on  the  consolidated  tax 
return,  where  non-dividend  income  is  not  reported.  Therefore,  the  equity  method  allows  the  firm 
to  report  greater  net  assets  and  income  for  financial  purposes  than  tax  purposes.  I  test  whether  such 
an  arrangement  is  indicative  of  other  advantageous  consolidation  arrangements,  where  the  finan¬ 
cial  statements  report  greater  profits  than  tax  records.  I  predict  a  positive  relation  between  tax 
shelters  and  EqEarn. 

Third,  the  Treasury  (1999)  identifies  financial  engineering  products  as  indicative  of  tax  shel¬ 
ters.  Certain  hedges  or  structured  transactions,  for  example,  insulate  a  company  from  any  signifi¬ 
cant  economic  risk  or  adverse  financial  statement  reporting  while  engineering  deductions  or  losses 
on  the  tax  return  (see  Engel  et  al.  1999;  Mills  and  Newberry  2005;  Landsman  et  al.  2008).  As  the 
Treasury  (1999)  explains  further,  taxpayers  can  monetize  the  value  in  their  assets  or  investments, 
or  lock-in  appreciation  with  respect  to  their  property,  without  recognizing  taxable  gains  (e.g., 
through  borrowing). 

To  measure  the  use  of  financial  engineering  products,  I  use  a  proxy  for  mezzanine  financing, 
MezzFin,  capturing  convertible  debt  and  preferred  stock  (Compustat  #39/#6).  I  use  this  variable 
because  convertible  debt  has  equity-like  features,  while  preferred  stock  has  debt-like  features. 
Furthermore,  Mills  and  Newberry  (2005)  explain  (and  empirically  validate)  that  differences  in 


Tax  havens  can  also  enable  a  firm  to  engage  in  other  aggressive  tax  transactions,  such  as  transfer-pricing  or  debt 
placement  (interest  stripping)  schemes  (see  Wilson  1995;  Mills  et  al.  1998). 

I  use  the  prior-year  rather  than  contemporaneous  ETR  because  it  is  unclear  the  degree  to  which  contemporaneous  ETR 
is  simultaneously  related  to  tax  shelters.  Specifically,  the  contemporaneous  ETR  can  either  be  the  cause  or  effect  of  tax 
shelter  use.  I  also  use  the  reported  ETR  rather  than  the  marginal  tax  rate  (see  Plesko  2003)  because  the  focus  of  this  item 
in  the  Treasury  (1999)  is  on  the  financial  reporting  rather  than  pure  economic  issues  of  taxes.  Nevertheless,  sensitivity 
checks  using  contemporaneous  ETR  yield  qualitatively  similar  results. 
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financial  interest  expense  and  tax  interest  deductions  are  associated  with  the  advantageous  classi¬ 
fication  of  debt-equity  hybrid  securities  for  financial  and  tax  reporting.  I  predict  a  positive  relation 
between  tax  shelters  and  MezzFin. 

Fourth,  the  Treasury  (1999)  (and  U.S.  Senate  2003)  reports  that  the  use  of  promoters,  includ¬ 
ing  investment  banks,  law  firms,  and  auditors,  is  indicative  of  tax  shelter  usage.  The  white  paper 
states  that  promoters  attempt  to  design  tax  shelters  so  they  can  be  replicated,  marketed,  and  sold 
to  a  variety  of  clients.  The  Treasury  (1999)  reports  that  one  Big  5  accounting  firm  maintained  two 
databases  of  about  1,000  “mass  market”  tax  savings  ideas  in  the  1990s.  Promoter  usage  is  mea¬ 
sured  using  BigS,  equal  to  1  if  the  firm  is  audited  by  PricewaterhouseCoopers,  Ernst  &  Young, 
Deloitte  &  Touche,  KPMG,  or  Arthur  Andersen,  and  0  otherwise  (#149).^^ 

Fifth,  I  examine  more  generally  whether  the  firm’s  legal  circumstances  are  informative  in 
inferring  tax  shelters.  A  firm  that  is  under  lawsuit  could  be  a  harbinger  of  aggressive  practices  that 
also  include  tax  sheltering,  as  is  evident  in  the  Graham  and  Tucker  (2006)  sample.  I  measure  legal 
circumstances.  Litigation,  as  equal  to  1  if  pretax  (#372)  or  after-tax  (#373)  Litigation/Insurance 
Settlement  is  negative,  i.e.,  a  litigation  or  insurance  payout.  For  both  BigS  and  Litigation,  I  expect 
a  positive  relation  to  tax  shelter  usage. 

Finally,  I  control  for  net  operating  loss  carryforwards  (NOL)  of  the  firm.  If  the  firm  is  already 
in  a  non-taxable  position  due  to  loss  carryforwards,  I  predict  a  negative  relation  between  tax 
shelters  and  NOL,  equal  to  1  if  an  NOT  carryforward  exists  (#52  >  0),  0  otherwise.  Prior  research 
suggests  that  loss  firms  have  different  characteristics  than  positive  income  firms  (e.g.,  Joos  and 
Plesko  2005;  Klein  and  Marquardt  2006),  and  thus  can  affect  the  reliability  of  estimates  reported 
in  this  study. 


IV.  RESEARCH  METHOD 

Sample 

The  research  sample  begins  with  Compustat  and  Internal  Revenue  Service  (IRS)  corporate  tax 
return  data  successfully  matched  on  Employer  Identification  Number  (FIN)  and  fiscal  year-end 
from  years  2000  to  2004.  Only  publicly  traded  Subchapter  C  corporations  are  included  in  this 
sample  to  ensure  the  company  reports  a  tax  position  to  the  IRS.^°  This  initial  merged  data  set 
contains  26,388  firm-year  observations  (see  Table  1). 

I  first  restrict  the  initial  data  set  to  taxpayers  with  no  foreign  entity  owning  25  percent  or  more 
of  the  corporation  (from  Form  1120,  Schedule  K,  Fine  7),  leaving  16,776  firm-year  observations. 
This  step  focuses  the  sample  on  U.S.-only  donuciled  firms  by  elimdnating  any  foreign-controlled 
U.S.  corporations.  This  elimination  nutigates  differences  in  international  tax  and  financial  report¬ 
ing  rules  that  can  confound  the  results.  Next,  I  require  non-negative  Implied  Stock  Option  Expense 
reported  in  Compustat  (#399),  leaving  16,649  firm-year  observations.^'  I  then  match  and  merge 
these  observations  to  OTSA  tax  shelter  data.  OTSA  only  provides  information  on  whether  the  firm 
uses  a  tax  shelter,  but  does  not  provide  confirmatory  information  on  whether  the  firm  does  not  use 
a  tax  shelter.  Therefore,  if  confirmed  identification  of  a  tax  shelter  does  not  exist  for  a  particular 


Arthur  Andersen  existed  only  for  the  earlier  part  of  my  sample  period.  Additional  tests  classifying  the  four  currently 
operating  accounting  firms  (PwC,  E&Y,  Deloitte,  and  KPMG)  as  promoters  yield  materially  same  results. 

In  robustness  checks,  I  also  eliminate  firms  with  negative  pretax  book  income,  yielding  similar  results. 

Subchapter  S  corporations,  RICs,  and  REITs  are  excluded  because  they  normally  do  not  pay  corporate  income  taxes. 
Subchapter  C  corporations  are  subject  to  the  corporate  income  tax  as  calculated  from  the  IRS  Form  1120. 

In  calculating  the  tax  cushion  later  in  the  study,  stock  options  are  used  to  adjust  for  the  difference  between  current  U.S. 
tax  expense  and  actual  tax  on  the  tax  return.  In  such  a  construct  for  tax  cushion,  a  negative  stock  option  expense  (i.e., 
stock  option  “income”)  is  not  usable.  It  implies  a  tax  refund  from  stock  option  exercise,  even  though  an  investor  would 
not  exercise  while  in  a  loss  (or  “underwater”)  position. 
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TABLE  1 


Sample  Selection 


Firm-Year  Observations 

Number 
of  Tax 
Shelters 
(OTSA) 

Criteria 

Compustat 
and  IRS 
Data 

OTSA  Data^ 

Subchapter  C  Corporations  matched  on  EIN  and  Fiscal  Year  in 
Compustat  and  IRS  tax  return  databases,  2000-2004. 

26,388 

Less:  Observations  >25%  foreign-owned,  from  Form  1120 
Schedule  K  Line  7 

(9,612) 

16,776 

Less:  Observations  with  negative  Implied  Option  Expense 
(#399) 

(127) 

16,649 

446 

1,387 

Less:  Observations  in  Financial  Industry  (SIC  6000-6999) 

(4,045) 

(110) 

(415) 

12,604 

336 

972 

Less:  Observations  with  negative  current  U.S.  tax  expense 

(1,752) 

(61) 

(212) 

10,852 

275 

760 

Less:  Observations  missing  lagged  values 

(1,629) 

(8) 

(16) 

Remaining  2000-2004  Sample  Period  Observations: 

9,223 

267 

744 

[3,347  firms] 

[211  firms]*’ 

“  OTSA  (IRS  Office  of  Tax  Shelter  Analysis)  firm-year  observations  are  a  subset  of  the  Compustat  and  IRS  firm-year 
observations. 

**  Graham  and  Tucker  (2006)  sample:  43  unique  firms  with  152  total  firm-year  observations,  1975-2000. 

Wilson  (2009)  sample:  59  unique  firms  with  215  total  firm-year  observations,  1975-2002. 

Sample  comparison:  Wilson  (2009)  identifies  16  tax  shelter  firm-year  observations  in  2000-2002;  the  OTSA  sample 
identifies  102  tax  shelter  firm-year  observations  in  2000-2002. 


observation,  I  assume  a  tax  shelter  in  fact  does  not  exist,  and  therefore  assign  a  zero  value  for  that 
observation. 

I  delete  observations  in  the  financial  industry  (SIC  6000-6999)  due  to  the  general  unavail¬ 
ability  of  data  for  my  tests,  and  the  unique  nature  of  regulatory  requirements,  leaving  12,604 

22  ^ 
firm-year  observations.  I  further  delete  observations  with  negative  current  U.S.  tax  expense 

(Compustat  #63;  or  Total  Tax  Expense  [#16]  less  Deferred  Tax  Expense  [#50]),  leaving  10,852 
observations.  I  delete  these  observations  because,  in  calculating  the  tax  cushion  later  in  the  study, 
the  other  input  variables  (actual  tax  on  the  tax  return  and  the  tax  benefit  from  stock  options)  are 
zero  or  positive  only.  Finally,  because  lagged  and  change  variables  are  required  in  the  tax  shelter 
likelihood  model,  I  delete  observations  with  missing  prior-year  data,  leaving  a  final  sample  of 


In  particular,  values  used  in  Wilson  (2009)  and  inferred  from  the  Treasury  (1999),  such  as  foreign  pretax  income, 
research  and  development,  equity  in  earnings,  and  mezzanine  financing  are  not  available  for  financial  firms.  Also, 
leverage  is  influenced  by  regulatory  requirements  and  can  be  otherwise  incomparable  with  firms  in  other  industries. 
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9,223  firm-year  observations,  of  which  267  are  tax  shelter  observations.  Approximately  one-third 
of  the  tax  shelter  observations  are  classified  as  Listed  Transactions,  while  two-thirds  are  (Non- 
Listed)  Reportable  Transactions. 

I  do  not  employ  a  one-to-one  matched  sample  in  the  main  tests  for  two  key  reasons.  First, 
from  an  out-of-sample  estimation  and  generalizability  perspective,  it  is  unclear  a  priori  which 
firms  are  shelter  firms  and  which  firms  are  not.  Matching  a  priori  on  the  item  I  ultimately  want  to 
infer,  i.e.,  tax  shelter  usage,  would  limit  the  model’s  generalizability  and  potentially  distort  the 
extent  of  tax  shelter  representation  in  a  larger  population.  Second,  an  unmatched  sample  allows  the 
researcher  to  exploit  greater  variation  in  variables,  increasing  the  potential  to  learn  more  specific 
details  about  the  profile  of  tax  shelter  participants.  However,  to  mitigate  any  concerns  over  my 
sample  construction,  I  subsequently  report  model  validity  checks  that  also  utilize  matched  samples 
derived  from  Graham  and  Tucker  (2006). 

Tax  Shelter  Likelihood  Model 

To  generate  a  model  using  publicly  available  financial  information  to  estimate  the  likelihood 
of  tax  shelters,  I  run  the  following  pooled,  cross-sectional  logistic  regression  model  by  firm  /  in 
year  f. 

p 

,  ^  TaxShelter  .  . 

In - =  a  +  s, 

1  —  P  r-  ’ 

^TaxShelter 

where  PraxSheiter  =  —  probability  the  firm  engages  in  a  tax  shelter  transaction;  and 

PX  =  PiBTDfj  +  /32DAPft  +  /S^LeveragCfi  +  /i^Sizept  +  +  /3^ForeignIncomept 

-t  fijR&Df  t  +  IS^TaxHaverif  t  +  jSqLagETRf  t -t  PioEqEarnp^ -l-  p^iMezzEitif  t  +  PnBigSf  t 

18  26 

+  l3i2,Litigationf  j  +  Pi^NOLf  I  +  2  Py^earp^F  2 

y=15  ’  i=19 

See  Table  2  for  full  explanation  of  variables.  The  first  seven  variables,  book-tax  differences 
{BTD),  performance-matched  discretionary  accruals  {DAP),  leverage  {Leverage),  asset  size  {Size), 
profitability  {ROA),  foreign-source  income  {Foreignincome),  and  research  and  development  ex¬ 
pense  {R&D)  are  directly  from  Wilson  (2009).  The  remaining  seven  variables  are  derived  from  the 
Treasury  (1999),  as  described  above.  Industry  variables  control  for  variability  in  industry  report¬ 
ing.  Consistent  with  Petersen  (2009),  I  use  Huber->Vhite  robust  standard  errors  to  adjust  for 
potential  serial  correlation  among  multiple  observations  per  firm,  with  year  indicators  controlling 
for  cross-sectional  correlation.  All  continuous  variables  are  winsorized  at  the  1  and  99  percent 
levels  to  mitigate  the  effect  of  outliers. 


V.  RESULTS 

Descriptive  Statistics  and  Correlation  Analysis 

As  detailed  in  Table  3,  the  sample  is  composed  of  9,223  firm-year  observations,  of  which  267, 
or  2.89  percent  are  tax  shelter  observations  {TaxShelter,  Panel  A).  About  8.65  percent  of  the 
sample  reports  a  subsidiary  in  a  tax  haven  jurisdiction  {TaxHaven)  while  less  than  12.5  percent 
reports  equity  method  earnings  {EqEarn).  Almost  two-thirds  of  the  sample  has  a  Big  5  auditor 
{Big5),  and  4  percent  reports  a  legal  or  insurance  payout  {Litigation).  Over  one-third  of  the  sample 
reports  a  net  operating  loss  carryforward  {NOE).  The  most  highly  represented  industries  are 
Manufacturing  (SIC  3000-3999),  at  26  percent,  and  Chemical  (SIC  2000-2999),  at  over  18  per¬ 
cent  (Panel  B).  The  least  represented  industries  are  Diversified  (SIC  9000-9999),  at  1.58  percent, 
and  Agriculture  (SIC  0000-0999),  at  0.27  percent.  Observations  are  generally  evenly  dispersed 
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TABLE  2 

Variable  Definitions 


TaxShelter 


BTD 


DAP 


Leverage 

Size 

ROA 


=  1  if  observation  reports  a  Tax  Shelter  (Listed  or  Reportable)  Transaction  in  the 
IRS  Office  of  Tax  Shelter  Analysis  (OTSA)  database;  0  otherwise. 

=  (Pretax  Book  Income,  #170,  less  Estimated  Taxable  Income)/Total  Assets  (#6), 
where  Estimated  Taxable  Income  =  (current  federal  tax  expense,  #63,  plus 
current  foreign  tax  expense,  #64)/0.35,  less  change  in  tax  loss  carryforward 
(#52). 

=  Discretionary  accruals  from  the  performance-adjusted  modified  cross-sectional 
Jones  (1991)  model  (see  Kothari  et  al.  2005;  Wilson  2009). 

=  Long-term  debt  liability  (Compustat  #9)/Total  Assets  (#6). 

=  log(Total  Assets,  #6). 

=  Pretax  Book  Income  (#170)/Total  Assets  (#6). 


Foreignincome 

R&D 

TaxHaven 


LagETR 

EqEarn 

MezzEin 

Big5 

Litigation 

NOL 


=  Foreign  Pretax  Income  (#273)/Lagged  Total  Assets  (#6);  0  if  #273  is  missing. 

=  R&D  Expense  (Compustat  #46)/Lagged  Total  Assets  (#6);  0  if  #46  is  missing. 

=  1  if  observation  reports  in  lOK  Schedule  21  a  subsidiary  located  in  a  tax  haven 
jurisdiction  (0  otherwise),  as  defined  by  the  OECD  (2000):  Andorra,  Anguilla, 
Antigua  and  Barbuda,  Aruba,  Bahamas,  Bahrain,  Barbados,  Belize,  Bermuda, 
British  Virgin  Islands,  Cayman  Islands,  Cook  Islands,  Cyprus,  Dominica, 
Gibraltar,  Grenada,  Guernsey,  Isle  of  Man,  Jersey,  Liberia,  Liechtenstein, 
Maldives,  Malta,  Marshall  Islands,  Mauritius,  Monaco,  Montserrat,  Nauru, 
Netherlands  Antilles,  Niue,  Panama,  Samoa,  San  Marino,  Seychelles,  St  Lucia, 
St  Kitts  and  Nevis,  St  Vincent  and  the  Grenadines,  Tonga,  Turks  and  Caicos, 
U.S.  Virgin  Islands,  Vanuatu. 

=  Prior-Period  Reported  Effective  Tax  Rate  XlOO,  where  Reported  ETR  =  Total 
tax  expense  (Compustat  #16)/Total  Pretax  Book  Income  (#170^ 

=  1  if  Equity  in  Earnings  (Loss)  from  income  statement  (#55)  is  present  and  does 
not  equal  0,  and  0  otherwise. 

=  Convertible  Debt  and  Preferred  Stock  (Compustat  #39)/Total  Assets  (#6);  0  if 
#39  is  missing. 

=  1  if  Auditor  is  Deloitte  &  Touche,  Price waterhouseCoopers,  Ernst  &  Young, 
KPMG,  or  Arthur  Andersen  (Compustat  #149),  and  0  otherwise. 

=  1  if  Pretax  (#372)  or  After-tax  (#373)  Litigation/Insurance  Settlement  is 
negative,  indicating  a  payout,  and  0  otherwise. 

=  1  if  Tax  Loss  Carryforward  (#52)  >0,  and  0  otherwise. 


Year  =  1  if  observation  falls  in  any  of  years  2001-2004,  and  0  otherwise  (if  2000). 

Ind  =  1  if  observation  falls  in  any  of  one-digit  SIC  (non-financial)  industries  SICO, 

SICl,  SIC2,  SIC4,  SIC5,  SIC7,  SICS,  and  SIC9,  and  0  otherwise  (if  SIC3).  The 
financial  industry  (SIC6)  is  excluded  from  the  sample. 


through  time,  with  slightly  more  observations  in  the  earlier  than  later  years.  However,  the  uneven 
counts  of  TaxShelter  in  time  and  industry  support  inclusion  of  year  and  industry  controls. 

Examining  the  continuous  variables  for  the  sample,  the  mean  (median)  BTD  is  21.1  percent 
(0.1  percent)  of  total  assets.  The  mean  (median)  DAP  is  -0.010  (0.015)  while  Leverage  has  a 
mean  (median)  of  19.4  percent  (7.3  percent).  Size  is  highly  variable,  with  mean  (median)  values  of 
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$697  ($64)  million.  The  mean  (median)  profitability  {ROA)  is  —42.8  percent  (2.2  percent)  of  total 
assets.  Foreignincome,  R&D,  and  MezzFin  all  report  a  zero  median  value,  with  respective  means 
at  0.2  percent,  10.9  percent,  and  7.6  percent.  Finally,  the  lagged  reported  effective  tax  rate  {Lag- 
ETR)  is  16.2  percent  (8.7  percent),  while  the  75th  percentile  firm  has  a  reported  lagged  ETR  of 
over  37  percent  (Panel  C). 

Tests  for  differences  in  means  for  users  and  non-users  of  tax  shelters  indicate  that  BTD,  DAP, 
Size,  ROA,  Foreignincome,  and  LagETR  are  all  greater  for  tax  shelter  than  for  the  non-tax  shelter 
firms  (all  p  <  0.10),  consistent  with  prior  models  and  expectations  (Table  3,  Panel  D).  The  means 
tests  show  that  Leverage,  R&D,  and  MezzFin  are  significantly  smaller  for  the  tax  shelter  than  for 
the  non-tax  shelter  firms  (all  p  <  0.01).  The  Leverage  result  is  consistent  with  Graham  and  Tucker 
(2006),  who  find  a  negative  relation  between  debt  and  tax  shelter  usage.  A  priori,  it  is  unclear 
whether  intangible  activities  facilitate  tax  sheltering,  or  like  leverage,  act  as  a  substitute;  the 
negative  coefficient  on  R&D  implies  substitution.  Finally,  inconsistent  with  expectations,  results 
show  a  negative  relation  between  TaxShelter  and  mezzanine  financing  {MezzFin).  It  is  likely  that 
MezzFin  (like  R&D)  substitutes  for  plain-vanilla  Leverage,  which  itself  is  a  substitute  for  tax 
shelters. 

Tests  for  differences  in  proportions  in  Table  3,  Panel  E  indicate  that  subsidiary  locations  in  tax 
havens  (TaxHaven),  inconsistent  book-tax  treatment  {EqEarn),  use  of  promoters  {Big5),  and  liti¬ 
gation  payouts  {Litigation)  are  significantly  more  likely  for  the  tax  shelter  than  non-tax  shelter 
firms  (all  p  <  0.01).  Also,  tax  shelter  firms  are  significantly  less  likely  than  non-shelter  firms  to 
report  a  net  operating  loss  carryforward  {NOD,  p  <  0.01). 

Of  the  Wilson  (2009)  variables,  untabulated  Pearson  and  Spearman  correlations  show  that 
TaxShelter  is  positively  related  to  BTD,  Size,  ROA,  and  Foreignincome,  and  negatively  related  to 
Leverage  and  R&D  (all  p  <  0.10).  TaxShelter  is  positively  related  to  DAP  only  in  Pearson 
correlations  (p  <  0.05).  Of  the  Treasury  (1999)  variables,  untabulated  Pearson  and  Spearman 
correlations  show  that  TaxShelter  is  positively  related  to  TaxHaven,  LagETR,  EqEarn,  Big5,  and 
Litigation  (all  p  <  0.01),  and  negatively  related  to  NOL  (p  <  0.01).  MezzFin  is  negatively  related 
to  TaxShelter  in  Pearson  correlations  only  (p  <  0.01). 

Main  Logistic  Regression  Results 

This  section  reports  the  main  efforts  to  use  financial  statement  information  in  multivariate 
models  to  infer  corporate  tax  shelter  likelihood.  To  begin,  the  first  column  of  Table  4  reports  the 
findings  of  Wilson’s  Table  5,  column  3.  I  compare  and  extend  these  results  for  two  reasons.  First, 
his  sample  composition  is  more  than  one-to-one  matched,  as  is  my  OTSA-Compustat  data.  Sec¬ 
ond,  this  column’s  results  are  used  later  in  his  out-of-sample  estimation  and  market  tests.  His  tests 
show  a  positive  relation  between  tax  shelters  and  book-tax  differences  {BTD),  discretionary  ac¬ 
cruals  {DAP),  size  {Size),  profitability  {ROA),  and  Foreignincome.  Year  and  industry  controls  are 
not  included.  The  Pseudo-R^  is  13  percent. 

The  second  column  of  Table  4  replicates  the  Wilson  (2009)  model  on  the  OTSA  data,  again 
with  no  year  or  industry  controls.  Results  are  somewhat  sharper  using  the  OTSA  data,  corrobo¬ 
rating  the  findings  of  Wilson  (2009).  In  addition  to  his  results.  Leverage  is  now  negatively  related 
to  TaxShelter,  supporting  Graham  and  Tucker  (2006).  Note,  however,  that  Leverage  calculated  per 
Wilson  (2009)  only  includes  long-term  debt  (#9),  while  Graham  and  Tucker  (2006)  also  include 
the  short-term  portion  of  long-term  debt  (#34).  The  Pseudo-R^  of  the  Wilson  (2009)  model  using 
OTSA  data  is  9.8  percent. 

The  third  column  of  Table  4  replicates  the  Wilson  (2009)  model  on  the  OTSA  data,  but  adds 
year  and  industry  controls.  Results  remain  qualitatively  identical  as  in  the  previous  analysis,  with 
an  increase  in  the  Pseudo-R^  to  15.4  percent. 
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TABLE  4 


Results:  Tax  Shelter  Logistic  Model 


Dependent  Variable:  TaxShelter 


Coefficients 


Variable 

Pred. 

Coeff. 

Sign 

Wilson  as  Wilson  using  OTSA 
Reported**  Data  (1) 

Combined 

Wilson  using  Model 

OTSA  Data  (2)  (1) 

Combined 

Model 

(2) 

Constant 

-4.86  *** 

— 3  47 1  *  *  * 

—4.884***  —7.059*** 

-8.502*** 

(-4.38) 

(-29.20) 

(-16.14)  (-8.13) 

(-11.08) 

BTD 

+ 

5.20  * 

0.756** 

0  142***  0.032 

0.136 

(1.54) 

(2.13) 

(3.14)  (0.60) 

(1.25) 

[1.30] 

[1.08] 

[1.15]  [1.03] 

[1.15] 

DAP 

+ 

4.08  * 

0.748*** 

0.457**  0.924*** 

0.424 

(1.63) 

(5.19) 

(2.28)  (4.36) 

(1.12) 

[1.02] 

[2.11] 

[1.58]  [2.52] 

[1.53] 

Leverage 

- 

-1.41 

-1.036*** 

-0.963***  -0.697*** 

-0.671** 

(-1.18) 

(-5.88) 

(-5.25)  (-2.61) 

(-2.40) 

[-0.35] 

[0.36] 

[0.38]  [0.50] 

[0.51] 

Size 

+ 

0.76  *** 

0.700*** 

0  728***  1  397*** 

1.311*** 

(3.05) 

(7.44) 

(6.35)  (5.69) 

(6.12) 

[0.19] 

[2.01] 

[2.07]  [4.04] 

[3.71] 

ROA 

+ 

3.51  ** 

3.196*** 

3.235***  2.473*** 

2.425*** 

(1.75) 

(7.76) 

(6.94)  (4.08) 

(3.47) 

[0.88] 

[24.44] 

[25.42]  [11.86] 

[11.30] 

F oreignincome 

+ 

^  'J2  *** 

17.305*** 

23.575***  3.569 

8.673* 

(4.35) 

(4.10) 

(5.50)  (0.74) 

(1.67) 

[0.43] 

[>100.00] 

[>100.00]  [35.46] 

[>100.00] 

R&D 

+ 

2.43 

-4.391 

-2.359  -3.023 

-1.026 

or 

(0.58) 

(-1.17) 

(-0.99)  (-0.88) 

(-0.62) 

_ 

[0.61] 

[0.01] 

[0.09]  [0.049] 

[0.358] 

TaxHaven 

+ 

0.790*** 

1.039*** 

(4.46) 

(5.54) 

[2.20] 

[2.83] 

LagETR 

+ 

0.015*** 

0.017*** 

(4.96) 

(5.67) 

[1.02] 

[1.02] 

EqEam 

+ 

1  048*** 

0.969*** 

(6.32) 

(5.69) 

[2.85] 

[2.64] 

MezzFin 

+ 

-0.687 

-0.731 

(-1.07) 

(-1.19) 

[0.50] 

[0.48] 

Big5 

+ 

3.186*** 

3.008*** 

(3.45) 

(3.82) 

[24.20] 

[20.24] 

Litigation 

+ 

1.063*** 

0.952*** 

(4.84) 

(4.15) 

[2.89] 

[2.59] 

NOL 

— 

0.140 

0.100 

(continued  on  next  page) 
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TABLE  4  (continued) 


Dependent  Variable:  TaxShelter 


Coefficients 


Variable 

Pred. 

Coeff.  Wilson  as 
Sign  Reported^ 

Wilson  using  OTSA 
Data  (1) 

Wilson  using 
OTSA  Data  (2) 

Combined 

Model 

(1) 

Combined 

Model 

(2) 

Industry 

Controls 

Not 

Included 

Not  Included 

Included 

(0.81) 

[1.15] 

Not 

Included 

(0.54) 

[1.11] 

Included 

Year  Controls 

Not 

Included 

Not  Included 

Included 

Not 

Included 

Included 

Observations 

319 

9,223 

9,223 

9,223 

9,223 

Pseudo-R^ 

13% 

9.8% 

15.4% 

20.4% 

25.8% 

*,**,***Represent  significance  levels  at  the  p  <0.10,  0.05,  and  0.01  levels,  respectively. 

^  Results  are  as  reported  in  Wilson  (2009,  Table  5,  column  3). 

z-statistic  (odds  ratio)  appears  in  parentheses  (brackets)  below  coefficient  estimate,  except  the  bracketed  terms  in  the 
Wilson  as  Reported  column  indicate  the  marginal  change  in  the  probability  of  using  a  tax  shelter  resulting  from  a  change 
in  the  independent  variable.  Huber- White  standard  errors  are  used  to  adjust  for  potential  serial  correlation  among  multiple 
observations  per  firm.  Continuous  variables  are  winsorized  at  the  1  and  99  percentile  levels. 

See  Table  2  for  full  variable  definitions. 

All  models  reported  herein  are  significant  at  the  p  <1  percent  level  using  a  Chi-square  test. 


The  fourth  column  of  Table  4,  Combined  Model  (1),  combines  the  Wilson  (2009)  variables 
with  the  Treasury  (1999)  variables  developed  in  this  study.  In  this  column,  no  year  or  industry 
controls  are  included  to  provide  comparability  between  models.  Combined  Model  (1)  reports  that 
BTD  and  Foreignincome  are  no  longer  positively  related  to  TaxShelter,  whereas  DAP,  Leverage, 
Size,  and  ROA  remain  significant  with  the  expected  signs.  Several  Treasury  (1999)  variables  are 
significant  as  well.  Results  show  that  TaxShelter  is  positively  related  to  the  tax-indifferent  party 
proxy  TaxHaven,  the  lagged  effective  tax  rate  LagETR,  inconsistent  book-tax  treatment  EqEarn, 
use  of  promoters  BigS,  and  Litigation.  MezzFin  and  NOL  are  not  significantly  related  to  TaxShel¬ 
ter.  The  Pseudo-R^  rises  to  20.4  percent. 

The  final  column  of  Table  4,  Combined  Model  (2),  replicates  the  preceding  analysis,  but 
includes  year  and  industry  controls.  The  Wilson  (2009)  variables  BTD  and  DAP  are  not  significant. 
Size,  ROA,  and  Foreignincome  are  positively  related  to  TaxShelter,  while  Leverage  is  negatively 
related.  The  Treasury  (1999)  variables  TaxHaven,  LagETR,  EqEarn,  BigS,  and  Litigation  remain 
positively  related  to  TaxShelter  (all  p  <  0.01).  The  Pseudo-R^  of  this  model  is  25.8  percent. 

These  results  extend  the  original  Wilson  (2009)  model  by  including  proxies  for  tax  shelter 
characteristics  consistent  with  the  Treasury  (1999).  The  result  on  TaxHaven  adds  to  the  findings  of 
other  studies,  such  as  Desai  et  al.  (2006),  suggesting  that  tax  havens  and  tax-indifferent  parties 
facilitate  the  most  severe  form  of  tax  aggressiveness.  Indeed,  examining  the  OTSA  sample  more 
closely,  no  less  than  107  observations  have  tax  shelter  transactions  involving  some  form  of  tax- 
indifferent  parties.  Reportable  Transactions  included  in  this  number  are  as  a  result  of  foreign  sales 
corporations,  foreign  tax  credits,  extraterritorial  income  exclusions,  and  foreign  worthless  stock  (n 
=  30).  The  largest  proportion  of  Listed  Transactions  is  LILO  (n  =  68).  The  remaining  observa¬ 
tions  are  accounted  for  by  the  Listed  Transactions  Inflated-Basis  CARDS  and  ACM  Partnership  v. 
Commissioner  (n  =  9). 
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Next,  the  positive  relation  between  the  lagged  reported  effective  tax  rate  (LagETR)  and  the 
likelihood  of  engaging  in  a  tax  shelter  supports  the  idea  that  tax  sheltering  is  higher  in  firms  with 
relatively  larger  past  reported  tax  rates.  This  finding  is  consistent  with  anecdotal  evidence  in  the 
Treasury  (1999)  of  pressure  on  tax  departments  to  act  like  a  “profit  center,”  engineering  higher 
earnings.  It  is  also  consistent  with  firms  resorting  to  tax  shelters  if  other,  less  aggressive  methods 
are  unavailable.  The  positive  sign  on  the  inconsistent  book-tax  treatment  variable  EqEarn  indi¬ 
cates  that  the  likelihood  of  tax  shelter  usage  increases  in  book- tax  consolidation  differences.  Such 
treatment  of  entities  or  transactions  could  be  indicative  of  advantageous  reporting  in  the  financial 
statements  relative  to  tax  records. 

The  financial  complexity  variable,  MezzEin,  is  not  incrementally  significant  in  the  main  tests. 
Although  debt-equity  hybrid  instruments  can  be  related  to  more  general  tax  avoidance,  results  here 
indicate  that  they  are  not  likely  related  to  tax  sheltering  behavior  in  particular,  after  controlling  for 
other  measured  characteristics. 

The  positive  relation  between  TaxShelter  and  Big5  lends  some  support  to  the  idea  that  tax 
shelter  use  is  increasing  in  the  presence  of  (and  access  to)  promoters.  The  positive  sign  on  the 
legal  circumstances  variable  Litigation  indicates  that  insurance  or  litigation  payouts  are  an  infor¬ 
mative  sign  of  tax  sheltering.  Specifically,  firms  that  are  found  to  be  aggressive  from  a  legal 
standpoint  seem  to  be  aggressive  from  a  tax  standpoint  as  well. 

Note  that  the  odds  ratios  reported  below  each  coefficient  and  z-statistic  represent  the  odds  of 
the  given  variable  successfully  identifying  tax  shelters  to  the  odds  it  does  not.  An  odds  ratio  of  1 
indicates  no  diseriminatory  power  (equal  chance  of  the  variable  identifying  tax  shelters).  In  the 
last  column  of  Table  4,  for  example  TaxHaven  {Litigation)  reports  an  odds  ratio  of  2.83  (2.59), 
indieating  that  a  firm  with  a  subsidiary  in  a  tax  haven  (litigation  payouts)  is  2.83  (2.59)  times  more 
likely  to  use  a  tax  shelter  than  not. 

In  all,  the  Treasury  (1999)  white  paper  provides  a  useful  conceptual  guide  in  identifying  an 
extended  set  of  predictors  that  augment  Wilson’s  (2009)  analysis.  The  logistic  model  finds  support 
that  tax  shelters  are  positively  associated  with  tax  haven  subsidiaries,  lagged  reported  effective  tax 
rates,  inconsistent  book-tax  treatment,  the  use  of  promoters,  and  litigation  payouts,  as  well  as  size, 
profitability,  and  foreign  ineome.  Tax  shelter  usage  is  also  negatively  related  to  leverage,  indicat¬ 
ing  a  substitution  effect.  After  controlling  for  the  above  relations,  it  does  not  appear  that  there  is 
a  strong  incremental  positive  link  between  tax  shelters  and  book-tax  differences,  discretionary 
accruals,  or  mezzanine  financing. 

Out-of-Sample  Validation  Tests 

To  examine  the  validity  of  Table  4’s  Combined  Model  (2),  I  perform  several  validation  tests. 
The  first  two  tests  run  this  model  on  matched,  out-of-sample  period  tax  shelter  observations.  The 
third  test  ealculates  the  tax  shelter  likelihood  model  on  the  first  four  years  of  my  sample  (2000- 
2003),  then  applies  the  parameter  estimates  to  a  “holdout”  sample  from  year  2004.  I  also  report 
Receiver  Operator  Characteristic  (ROC)  Curve  results  from  each  of  these  tests.  In  brief,  the 
validation  tests  indicate  that  the  main  model  is  significantly  helpful  in  both  inferring  and  predict¬ 
ing  tax  shelter  likelihood. 

First,  I  identify  out-of-sample  tax  shelter  observations  using  Graham  and  Tucker  (2006).  I 
limit  observations  to  the  1994-1998  period  due  to  limitations  in  gathering  TaxHaven?^  Also, 
although  Graham  and  Tucker  (2006)  include  financial  institutions  in  their  study,  I  exclude  them 
due  to  data  availability  issues  on  several  variables  as  described  above.  This  out-of-sample  group 


SEC  Forms  10-K  (and  Schedules  21)  are  available  online  from  1994  onward,  making  hand-collection  of  the  TaxHaven 
variable  more  feasible  in  this  later  period. 
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yields  21  usable  tax  shelter  firm-year  observations,  to  which  I  match  a  control  sample  of  21 
non-tax  shelter  observations.^"^  If  an  observation  is  a  tax  shelter  observation,  the  indicator  variable 
G&TTaxShelter  is  set  equal  to  1,  or  0  otherwise.  I  use  a  one-to-one  match  to  validate  whether  the 
original  model  will  apply  to  such  a  sample. 

For  this  pooled  sample  of  42  observations,  I  calculate  a  tax  shelter  score  {TaxShelterScorel) 
using  firm-specific  financial  information  as  inputs  into  the  reported  coefficients  of  Combined 
Model  (2)  in  Table  4.  I  then  run  a  logistic  regression  of  G&TTaxShelter  on  TaxShelterScorel  to 
examine  if  the  score  is  positively  related  to  out-of-sample  tax  shelter  likelihood.  Results  in  Table 
5,  Panel  A  show  that  TaxShelterScorel  is  positively  related  to  G&TTaxShelter  (p  <  0.05),  indi¬ 
cating  the  model’s  parameters  are  useful  in  identifying  out-of-sample  tax  shelters,  even  in  a 
one-to-one  matched  sample  setting. 

To  supplement  this  test,  I  also  calculate  the  area  under  the  ROC  Curve  (see  Hosmer  and 
Lemeshow  2000).  An  area  under  the  ROC  Curve  of  0.50  represents  a  model  that  does  not  have 
discriminatory  power  beyond  that  of  chance,  while  higher  levels  represent  improved  power.  The 
ROC  Curve  has  been  used  extensively  in  the  bankruptcy  literature  (e.g.,  Chava  and  Jarrow  2004) 
and  accounting  literature  (e.g.,  Demers  and  Joos  2007)  to  estimate  both  the  validity  and  predictive 
ability  of  logistic  models  on  out-of-sample  period  observations.  Results  on  the  matched  sample 
show  that  the  area  under  the  ROC  Curve  is  approximately  0.71,  indicating  strong  discriminatory 
power  of  the  model  to  identify  tax  shelter  firms.  In  short,  for  a  random  pair  of  firms  with  and 
without  tax  shelters,  the  model’s  inferred  predicted  probability  of  tax  shelter  usage  (i.e.,  TaxShel- 
terScore)  correctly  identifies  the  tax  shelter  firm  approximately  7 1  percent  of  the  time. 

I  repeat  the  analysis  using  the  Graham  and  Tucker  (2006)  observations,  but  match  ten  control 
observations,  randomly  picked  from  Compustat,  to  every  one  tax  shelter  observation.  The  only 
constraints  I  apply  are  a  same-year  match  and  that  total  assets  (#6)  are  available.  I  again  calculate 
a  tax  shelter  score  {TaxShelterScorel)  by  applying  the  parameters  calculated  from  Table  4  to  each 
observation.  Results  in  Table  5,  Panel  B  indicate  a  significantly  positive  relation  between  TaxShel¬ 
terScorel  and  G&TTaxShelter  (p  <  0.01).  ROC  analysis  reveals  that  the  area  under  the  curve  is 
0.69,  indicating  strong  discriminatory  power. 

As  the  Wilson  (2009)  replication  tests  suggest,  there  likely  are  differences  in  how  the  Graham 
and  Tucker  (2006)  and  OTSA  tax  shelter  observations  are  gathered,  and  the  time  period  1994- 
1998  might  be  significantly  different  from  2000-2004.  Because  of  these  potential  differences,  I 
re-estimate  the  logistic  model  on  observations  in  years  2000  through  2003,  leaving  2004  as  the 
“holdout”  sample  period.  For  each  observation  in  2004,  I  calculate  its  tax  shelter  score  {TaxShel- 
terScoreS)  using  the  estimated  parameters  from  the  model  estimated  on  the  2000-2003  observa¬ 
tions  (untabulated).  The  logistic  regression  reveals  a  strong  positive  relation  between  TaxShelter, 
as  originally  defined,  and  TaxShelterScoreS  for  year  2004  observations  (p  <  0.01;  Table  5,  Panel 
C).  ROC  results  show  that  the  area  under  the  curve  for  this  holdout  sample  is  0.88,  indicating  an 
88  percent  chance  that  a  firm  with  tax  shelters  has  a  higher  predicted  probability  score  (i.e., 
TaxShelterScorel)  than  a  firm  without  tax  shelters,  for  random  pairs  of  firms  with  and  without 
shelter  s.^^ 

In  addition  to  the  ROC  Curves,  Table  5,  Panel  D  reports  the  ability  of  the  various  “tax  shelter 
scores”  to  identify  tax  shelters  within  10-percentile  increments  in  the  distribution  of 


Observations  are  matched  on  same-year,  same  one-digit  SIC  code,  availability  of  total  assets  (#6),  and  closest  market- 
to-book  (MTB)  ratio  (number  of  common  shares  outstanding  [#25]  X  price  per  common  share  at  fiscal  year-end  [#199], 
divided  by  book  value  of  common  equity  [#60]).  In  untabulated  tests,  MTB  is  not  related  to  TaxShelter. 

As  a  comparison,  bankruptcy  models’  ROC  Curve  areas  range  from  0.74  to  0.91  (Chava  and  Jarrow  2004). 
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TABLE  5 

Validation  Results 

Panel  A:  Out-of-Sample  Tax  Shelter  Model  Results 

(1:1  Match  on  Industry,  Year, 

and  MTB)“ 

Dependent  Variable:  G&TTaxShelter 

Variable 

Pred.  CoelT. 

Sign 

Coefficient  Using  Full  Model 

Constant 

-0.911* 

TaxShelterScorel 

+ 

(-1.84) 

2.016** 

Model  Chi-square 

(2.48) 

6.87*** 

Observations 

42 

Pseudo-R^ 

11.8% 

Area  under  ROC  Curve 

0.71 

Panel  B:  Out-of-Sample  Tax  Shelter  Model  Results 
(10:1  Random  Match  on  Year)'’ 


Dependent  Variable:  G&TTaxShelter 

Variable 

Pred. 

Coeff.  Sign 

Coefficient  Using  Full  Model 

Constant 

-3.099*** 

(-7.67) 

TaxShelterScorel 

1.675*** 

(3.08) 

Model  Chi-square 

10.18*** 

Observations 

231 

Pseudo-R^ 

7.2% 

Area  under  ROC  Curve 

0.69 

Panel  C:  Hold-Out  Sample  Tax  Shelter  Model  Results 
(Year  2004  OTSA  Data)*' 


Dependent  Variable:  TaxShelter 

Variable 

Pred. 

Coeff.  Sign 

Coefficient  Using  Full  Model 

Constant 

-6.238*** 

TaxShelterScoreS 

-1- 

(-8.32) 

3.832*** 

Model  Chi-square 

(4.98) 

79  32*** 

Observations 

1,769 

Pseudo-R^ 

12.3% 

Area  Under  ROC  Curve 

0.88 

(continued  on  next  page) 
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Panel  D:  Distribution  Analysis  of  TaxShelterScores^ 

Cumulative  Percent  of  Tax  Shelters  Correctly  Identified 


Percentile  of  Sample 

91-100 

81-90 

71-80 

61-70 

51-60 

41-50 

31-40 

21-30 

11-20 

0-10 

Total  Observations 


1:1  Match 
21  Tax  Shelter  Ohs. 

10% 

24% 

38% 

57% 

67% 

76% 

86% 

90% 

95% 

100% 

42 


10:1  Match 
21  Tax  Shelter  Ohs. 

5% 

29% 

57% 

76% 

86% 

86% 

90% 

90% 

100% 

100% 

231 


2004  Holdout  Sample 
79  Tax  Shelter  Ohs. 

52% 

80% 

89% 

95% 

95% 

97% 

99% 

99% 

99% 

100% 

1,769 


*,  **,  ***  Represent  significance  levels  at  the  p  <  0.10,  0.05,  and  0.01  levels,  respectively,  z-statistic  appears  in 
parentheses  below  coefficient  estimate. 

“  G&TTaxShelter  equals  1  if  Graham  and  Tucker  (2006)  identify  observation  as  reporting  a  tax  shelter,  and  0  otherwise. 
TaxShelterScore  1  is  the  score  for  this  sample  using  the  Combined  Model  (2)  estimation  equation  and  coefficients 
reported  in  Table  4.  Half  of  observations  are  Tax  Shelter  observations  identified  from  Graham  and  Tucker  (2006, 
1994-1998,  i.e.,  G&TTaxShelter  =  1.  The  other  half  is  a  sample  of  control  observations  matched  to  tax  shelter 
observations  based  on  year,  industry,  market-to  book  ratio  (Compustat  #25  number  of  common  shares  outstanding 
XCompustat  #199  price  per  common  share  at  end  of  fiscal  year,  divided  by  Compustat  #60,  book  value  of  common 
equity),  and  availability  of  total  assets  (#6),  i.e.,  G&TTaxShelter  =  0. 

^  G&TTaxShelter  equals  1  if  Graham  and  Tucker  (2006)  identify  observation  as  reporting  a  tax  shelter,  and  0  otherwise. 
TaxShelterScore!  is  the  score  for  this  sample  using  the  Combined  Model  (2)  estimation  equation  and  coefficients 
reported  in  Table  4.  Tax  shelter  observations  are  identified  from  Graham  and  Tucker  (2006),  1994-1998.  To  each  of  the 
tax  shelter  observations  identified  in  Graham  and  Tucker  (2006),  i.e.,  G&TTaxShelter  =  1,  10  observations  are  matched 
randomly  according  to  tax  shelter  yeai'  and  availability  of  total  assets  (#6),  i.e.,  G&TTaxShelter  =  0. 

TaxShelter  equals  1  if  observation  reports  a  Tax  Shelter  (Listed  or  Reportable)  Transaction  in  the  IRS  Office  of  Tax 
Shelter  Analysis  (OTSA)  database,  and  0  otherwise.  The  tax  shelter  likelihood  model  is  first  estimated  on  observations 
in  years  2000  through  2003.  Then  the  coefficients  are  applied  to  year  2004  observations  (holdout  sample)  to  calculate 
TaxShelterScoreS. 

Tax  shelter  scores  are  inferred  from  Table  4  and  Table  5  parameters  (i.e.,  TaxShelterScore  1  for  1:1  Matched  sample; 
TaxShelterScore!  for  10:1  Matched  Sample;  and  TaxShelterScoreS  for  the  2004  Holdout  Sample).  The  ranking  is 
ordered  from  highest  tax  shelter  scores  in  the  91-lOOth  percentile  to  the  lowest  in  the  O-lOth. 


26 

observations.  Here,  tax  shelter  scores  are  sorted  and  ranked  from  highest  (9 1-1 00th  percentile) 
to  lowest  (0-1 0th  percentile)  for  each  sample. In  the  one-to-one  matched  Graham  and  Tucker 
(2006)  sample,  the  resulting  firm-specific  tax  shelter  score  {TaxShelterScore  1)  clusters  two-thirds 
of  the  tax  shelter  users  in  the  top  half  of  sorted  observations.  In  the  ten-to-one  matched  sample,  the 
resulting  tax  shelter  score  {TaxShelterScore!)  identifies  the  majority  of  tax  shelters  in  the  top  30 
percent  of  sorted  observations.  When  the  (2000-2003)  model  is  applied  to  the  2004  holdout 
sample,  the  tax  shelter  score  {TaxShelterScoreS)  identifies  the  majority  of  tax  shelters  in  the 
highest  ten  percent  of  sorted  observations. 


This  panel  is  essentially  a  tabular  summary  of  the  ROC  Curve,  but  it  is  included  to  provide  slightly  greater  detail. 
This  method  mimics  the  IRS  audit  selection  process,  in  which  tax  compliance  risk  is  quantified  and  ranked  from  highest 
to  lowest  for  taxpayers.  Such  a  method  increases  the  probability  of  selecting  highest-risk  taxpayers  to  audit. 
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In  sum,  the  tests  of  the  likelihood  model  on  out-of-sample  and  holdout  observations  lend 
support  that  the  models  developed  it  this  study  are  largely  valid  and  can  therefore  be  used  by 
investors,  analysts,  tax  administrators,  and  tax  and  non-tax  researchers  to  generate  point  estimates 
of  corporate  tax  shelter  likelihood. 

VI.  SUPPLEMENTAL  ANALYSES 
Linking  Tax  Shelters  to  Alternative  Proxies  of  Tax  Aggressiveness 

The  main  focus  of  this  study  has  been  to  extend  and  validate  the  tax  shelter  likelihood  model 
using  variables  in  Wilson  (2009)  and  the  Treasury  (1999).  Supplemental  analyses  examine  the  link 
between  tax  shelter  use  and  various  proxies  of  tax  avoidance  and  tax  aggressiveness,  and  compare 
these  to  the  model  developed  in  this  study.  These  tests  aim  to  shed  light  on  the  idea  of  a  tax 
minimization  continuum.  To  examine  the  relation  of  each  proxy  to  tax  shelters,  I  replicate  the 
approach  of  the  holdout  sample  analysis  in  Table  5,  Panel  C.  In  particular,  I  estimate  a  tax  shelter 
likelihood  model  (i.e.,  a  univariate  logistic  regression  of  TaxShelter  on  the  tax  aggressiveness 
proxy  of  choice)  on  2000-2003  firm-year  observations,  then  apply  the  parameters  to  actual  data  in 
2004. 

I  examine  four  widely  used  proxies  for  tax  avoidance  and  tax  aggressiveness.  The  first  is  from 
Frank  et  al.  (2009),  who  calculate  tax  aggressiveness  as  the  residual  from  a  regression  of  perma¬ 
nent  book-tax  differences  on  their  known  determinants  of  intangibles,  unconsolidated  subsidiaries, 
minority  interest,  current  state  tax  expense,  change  in  NOL,  and  lagged  permanent  book-tax 
differences.  Summary  results  from  the  holdout  logistic  regression  show  that  the  discretionary 
permanent  book-tax  differences  are  not  significantly  discriminatory  in  inferring  actual  tax  shelter 
usage  (;^^  =  0.15,  p  >  0.10),  with  the  area  under  the  ROC  Curve  being  0.58  (see  Table  6). 

One  potential  explanation  for  the  lack  of  results  is  that  the  Frank  et  al.  (2009)  measure  is 
based  on  a  broader  effective  tax  rate  differential,  which  includes  more  than  permanent  book-tax 
income  differences,  such  as  tax  credits  (Hanlon  and  Heitzman  2009),  while  tax  shelters  have  also 
been  found  to  be  related  to  temporary  book-tax  differences  (Wilson  2009;  Lisowsky  et  al.  2010). 
Therefore,  perhaps  the  Frank  et  al.  (2009)  measure  captures  tax  aggressiveness  along  other  por¬ 
tions  of  the  tax  minimization  continuum  not  tested  here.  However,  because  they  also  validate  their 

measure  on  tax  shelter  observations  from  Graham  and  Tucker  (2006),  the  results  here  should  be 

28 

interpreted  with  caution. 

Next,  I  examine  the  long-run  cash  effective  tax  rate  (ETR;  Dyreng  et  al.  2008).  I  calculate  this 
value  as  the  ten-year  sum  of  cash  taxes  paid,  divided  by  the  ten-year  sum  of  pretax  income  less 
special  items.^^  As  expected,  this  sample  size  is  the 'smallest  due  to  a  requirement  for  adequate 
time-series  data.  Results  show  that  the  long-run  cash  ETR  is  not  significantly  discrinunatory  in 
inferring  tax  shelter  use  p  >  0.10).  The  area  under  the  ROC  Curve  is  0.58.  This  result 

supports  the  observation  in  Dyreng  et  al.  (2008)  that  their  measure  is  intended  to  capture  tax 
avoidance  generally  and  might  not  necessarily  capture  specific  forms  of  tax  sheltering.  The  results 


In  further  examining  the  link  between  financial  aggressiveness  and  tax  shelters,  I  test  two  separate  models  with  Listed 
{Reportable)  as  the  dependent  variable,  equal  to  1  if  the  tax  shelter  is  a  Listed  (Reportable)  Transaction,  0  otherwise.  In 
models  without  controls,  discretionary  accruals  {DAP)  are  positively  related  to  Listed  and  Reportable  (both  p  <  0.10). 
In  models  with  controls,  DAP  is  not  related  to  Listed  or  Reportable.  Although  the  link  between  financial  aggressiveness 
and  tax  shelters  appears  tenuous  here,  these  tests  also  assume  that  DAP  is  a  powerful  proxy  for  financial  aggressiveness. 
See  Wysocki  (2004)  for  a  discussion  on  the  validity  of  cuirent  discretionary  accruals  models  capturing  financial 
aggressiveness,  and  on  the  potentially  endogenous  nature  of  earnings  management  and  tax  compliance.  Robustness 
checks  using  other  measures  for  financial  aggressiveness  do  not  find  a  relation  between  TaxShelter  and  firms  that  (1) 
re-state  their  income  tax  expense;  (2)  re-state  their  net  income;  or  (3)  are  issued  an  Accounting  and  Auditing  Enforce¬ 
ment  Release  (AAER)  from  the  SEC. 

I  also  use  the  five-year  long-run  cash  ETR,  with  qualitatively  identical  results. 
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TABLE  6 


Summary  Results  of  Extended  ROC  Curve  Analysis 
Linking  Tax  Shelters  to  Various  Proxies  for  Tax  Aggressiveness 

Discretionary  Permanent  Book-Tax  Differences  Long-Run  Cash  ETR 


n  =  801 

Wald  Chi-square:  0.15  (p  >0.10) 
Area  under  ROC  Curve;  0.58 


n  =  626 

Wald  Chi-square:  1.68  (p  >0.10) 
Area  under  ROC  Curve:  0.58 


Total  Book-Tax  Differences 


Tax  Cushion  (Using  IRS  Data) 


n  =  1,769 

Chi-square:  34.18  (p  <0.01) 
Area  under  ROC  Curve:  0.61 


n  =  1,769 

Chi-square:  40.98  (p  <0.01) 
Area  under  ROC  Curve:  0.69 


Similar  to  Table  5,  Panel  C,  individual  regression  equations  of  TaxShelter  are  first  run  for  years  2000-2003  on  each  of  the 
above  proxies  for  tax  aggressiveness.  After  obtaining  coefficient  estimates,  the  respective  actual  tax  aggressiveness  proxy 
calculated  as  of  2004  is  input  into  its  related  estimation  equation.  The  area  under  the  ROC  Curve  summarizes  the  ability 
of  each  tax  aggressiveness  variables  to  infer  actual  TaxShelter  in  2004,  given  its  relation  to  TaxShelter  in  2000-2003. 
Discretionary  Permanent  Book-Tax  Differences  are  calculated  as  tbe  residual  from  a  regression  of  Permanent  BTDs  {BTD 
as  calculated  in  this  study  less  Temporary  BTDs,  or  Deferred  Tax  Expense  (#50)/0.35)  on  Intangibles  (#33/#6),  Uncon¬ 
solidated  Subsidiaries  (#55/#6),  Minority  Interest  Income  (Loss)  (#49/#6),  Current  Income  State  Tax  Expense  (#173/#6), 
Change  in  NOT  (#52/#6),  and  Lagged  Permanent  BTDs  (from  Frank  et  al.  2009).  The  Long-Run  Cash  Effective  Tax  Rate 
{ETR)  is  calculated  as  the  ten-year  sum  of  cash  taxes  paid  (#317)  divided  by  the  ten-year  sum  of  pretax  income  (#170)  less 
special  items  (#17)  (from  Dyreng  et  al.  2008).  Results  are  qualitatively  identical  using  the  five-year  sum.  Total  Book-Tax 
Differences  are  calculated  as  in  this  study,  BTD.  Tax  Cushion  is  calculated  as  follows:  {Current  U.S.  Tax  Expense  (#63)  [or 
Total  Tax  Expense  (#16)  minus  Deferred  Tax  Expense  (#50)  if  #63  is  0]  minus  Tax  Benefit  from  Stock  Options  [(#399/ 
0.65)  *  0.35]  minus  Total  Tax  After  Credits  (IRS  Form  1120  page  1,  line  31)}  divided  by  Total  Assets  (#6). 


using  the  Frank  et  al.  (2009)  and  Dyreng  et  al.  (2008)  measures  support  the  idea  that  tax  avoid¬ 
ance,  aggressiveness,  and  sheltering  proxies  likely  capture  unique  portions  along  the  tax  minimi¬ 
zation  continuum. 

Third,  I  examine  total  book-tax  differences,  calculated  as  BTD  in  this  study,  or  the  difference 
between  pretax  book  income  and  estimated  taxable  income,  scaled  by  total  assets.  Results  show 
that  total  book-tax  differences  alone  are  helpful  in  gleaning  actual  tax  shelter  behavior  {)^ 
=  34.18,  p  <  0.01).  However,  the  area  under  the  ROC  Curve  is  0.61,  indicating  that  book-tax 
differences  can  correctly  discriminate  between  a  tax  shelter  and  non-tax  shelter  firm  only  61 
percent  of  the  time.  One  explanation  for  the  weaker  result  is  that  a  firm  can  in  fact  have  a 
contemporaneously  low  (or  negative)  level  of  book-tax  differences,  then  engage  in  a  tax  shelter 
without  increasing  its  BTD  to  a  high  level,  in  absolute  terms.  Nevertheless,  BTD  alone  appears  to 
be  a  crude,  yet  informative  (and  easily  implementable)  measure  for  corporate  tax  sheltering. 
However,  recall  that  the  main  multivariate  tests  show  that  BTD  is  not  related  to  TaxShelter  when 
controlling  for  other  firm  characteristics. 

Finally,  I  test  the  link  between  tax  shelter  usage  and  the  current-year  accrual  of  the  contingent 
tax  liability  reserve,  or  tax  cushion.  For  this  test  I  employ  tax  return  data,  calculating  tax  cushion 
as  the  difference  between  current  U.S.  tax  expense  and  Total  Tax  after  Credits  (IRS  Form  1120 
Page  1  Line  31),  adjusted  for  the  tax  benefit  of  stock  options  ([#399/0.65]  X  0.35).  I  test  the 
discriminatory  power  of  tax  cushion  in  order  to  evaluate  the  extent  to  which  the  IRS  has  superior 
information  than  the  public  in  inferring  tax  risk.  Results  show  that  for  the  holdout  sample,  tax 
cushion  is  significantly  related  to  actual  tax  shelter  use  {x^  =  40.98,  p  <  0.01).  The  area  under  the 
ROC  Curve  is  0.69.  This  result  suggests  that  a  nontrivial  portion  of  the  tax  cushion  is  linked  to  tax 
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shelters.  This  result  also  suggests  that  tax  shelter  firms  correctly  anticipated  the  possibility  of  tax 
shelters  being  discovered  and  eventually  disallowed  by  tax  administrators,  even  before  FIN  48. 

Overall,  the  univariate  holdout  tests  between  tax  shelters  and  alternative  proxies  of  tax  avoid¬ 
ance  and  aggressiveness  show  that  total  book-tax  differences  (BTDs)  and  the  current-year  tax 
cushion  accrual  are  useful  in  inferring  tax  shelters,  but  discretionary  permanent  BTDs  and  long- 
run  effective  cash  ETRs  are  not.  However,  TaxShelterScore  retains  the  most  significant  discrimi¬ 
natory  power  relative  to  the  other  measures,  with  the  main  model’s  ROC  Curve  area  of  0.88.  This 
result  supports  the  refinements  of  the  Treasury  (1999)  to  the  Wilson  (2009)  model  and  provides 
researchers,  investors,  and  policymakers  with  an  additional  tool  with  which  to  infer  corporate  tax 
sheltering  behavior. 

Inferring  Tax  Sheltering  in  a  FIN  48  Environment 

The  second  supplemental  test  examines  the  main  model’s  usefulness  to  infer  tax  shelters 
during  the  FIN  48  regime.  The  assumption  is  that,  when  firms  are  compelled  to  make  greater 
disclosure  of  their  uncertain  tax  benefits,  they  are  less  likely  to  engage  in  tax  aggressive — 
including  tax  sheltering — behavior,  thus  reducing  the  power  of  the  model.  To  evaluate  this  possi¬ 
bility,  I  use  a  separate  sample  of  687  publicly  traded  firms  with  December  31,  2007,  fiscal 
year-end  dates  that  are  part  of  the  S&P  1500,  with  available  financial  data  to  calculate  the  respec¬ 
tive  TaxShelterScore  (see  Lisowsky  et  al.  [2010]  for  a  similar  sampling  methodology).  I  calculate 
TaxShelterScored  using  the  reported  coefficients  in  Table  4’s  Combined  Model  (2)  by  inputting  the 
2007  firm  data  into  this  model’s  parameters. 

To  measure  tax  shelter  usage  in  2007, 1  obtain  additional  IRS  data  on  Listed  and  (Non-Listed) 
Reportable  Transactions.  I  set  TaxShelter_FIN48Period  equal  to  1  if  in  2007  a  firm  reports  Re¬ 
portable  Transactions  on  Schedule  M-3  Line  12  (“Items  related  to  reportable  transactions”),  or 
reports  a  Listed  or  (Non-Listed)  Reportable  Transaction  on  Form  8886,  Line  2  (“Type  of  report- 
able  transaction”).  I  then  run  a  logistic  regression  of  TaxShelter_FIN48Period  on  TaxShelter- 
Score4.  Results  show  a  significant,  but  weaker  ability  to  correctly  classify  tax  shelter  firms  than  in 
Table  4,  Combined  Model  (2).  First,  TaxShelterScore4  is  marginally  significant  at  the  10  percent 
level.  Also,  the  area  under  the  ROC  Curve  is  only  0.61  (see  Table  7).  Although  still  significant,  it 
appears  that  the  ability  of  Combined  Model  (2)  to  infer  tax  sheltering  is  more  muted  in  a  FIN  48 
than  in  a  pre-FIN  48  reporting  environment.  These  results  suggest  that  tax  shelter  behavior  dissi¬ 
pates  because  of  greater  financial  disclosure  requirements  (see  Mills  et  al.  2010).  However,  the 
weaker  results  can  also  be  due  to  lower  power,  e.g.,  different  and  fewer  observations  in  2007 
compared  to  2000-04;  and/or  the  2007  sample  does  not  capture  potential  front-loading  of  tax 
benefits  if  the  initial  tax  shelter  year  is  outside  of  the  sample  period  (e.g.,  LILO).  Also,  other  clues 
in  the  FIN  48  disclosures  could  be  more  informative  to  investors  than  the  model  (Frischmann  et  al. 
2008;  Lisowsky  et  al.  2010).  Further  research  is  needed  to  analyze  these  explanations.  As  a  result, 
the  main  model  should  be  used  with  caution  in  the  recent  FIN  48  environment. 

VII.  CONCLUSION 

This  study  extends  prior  empirical  work  on  inferring  tax  shelter  participation  (Wilson  2009) 
by  applying  a  conceptual  framework  from  the  Treasury  (1999)  white  paper  on  tax  shelters.  In 
doing  so,  I  employ  confidential  tax  shelter  data  obtained  from  the  IRS,  increasing  sample  sizes, 
power,  and  generalizability.  Main  results  show  that  the  likelihood  that  a  firm  uses  tax  shelters  is 
positively  related  to  the  presence  of  its  subsidiaries  located  in  tax  havens,  foreign-source  income, 
inconsistent  book-tax  treatment,  litigation  losses,  use  of  promoters,  profitability,  and  size;  and  is 
negatively  related  to  leverage.  The  model  developed  in  this  study  increases  the  explanatory  power 
of  the  Wilson  (2009)  model  and  corroborates  the  negative  relation  between  debt  and  tax  shelters 
found  in  Graham  and  Tucker  (2006).  I  also  develop  a  metric  called  TaxShelterScore,  which 
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TABLE  7 


Inferring  Tax  Sheltering  in  a  FIN  48  Environment 
Out-of-SampIe  Tax  Shelter  Model  Results 


Dependent  Variable:  TaxShelter_FIN48Period 

Variable 

Pred. 

Coeff.  Sign 

Coefficient  Using 
Full  Model 

Constant 

-1.230* 

(-5.32) 

TaxShelterScore4 

-b 

0.380* 

(1.78) 

Model  Chi-square 

24  95  ^  ^  ^ 

Observations 

687 

Pseudo-R^ 

3.4% 

Area  under  ROC  Curve 

0.61 

*,  **,  ***  Represent  significance  levels  at  the  p  <  0.10,  0.05,  and  0.01  levels,  respectively,  z-statistic  appears  in 
parentheses  below  coefficient  estimate. 

TaxShelter_FIN48Period  equals  1  if  a  firm  reports  a  listed  or  reportable  transaction  on  its  IRS  Schedule  M-3  or  Form  8886 
in  2007,  and  0  otherwise.  TaxShelterScore4  is  the  score  for  this  sample  using  the  Combined  Model  (2)  estimation  equation 
and  coefficients  reported  in  Table  4.  The  observations  (n  =  687)  were  chosen  from  calendar  year-end  publicly  traded 
taxpayers  part  of  the  S&P1500  for  which  information  to  calculate  necessary  variables  were  available.  See  Lisowsky  et  al. 
(2010)  for  a  similar  sample  selection  composition. 


estimates  the  probability  that  a  firm  engages  in  a  tax  shelter.  Tests  using  TaxShelterScore  show  that 
the  models  have  strong  discriminatory  power  in  inferring  corporate  tax  shelter  use.  In  applying 
TaxShelterScore  to  a  holdout  sample  of  firms,  the  area  under  the  ROC  Curve  is  0.88,  indicating  an 
88  percent  chance  that  a  firm  with  tax  shelters  has  a  higher  predicted  probability  (or  TaxShelter¬ 
Score)  than  a  firm  without  a  tax  shelter. 

In  additional  analyses,  I  examine  the  relative  benefits  of  the  main  model’s  refinements  in 
measuring  tax  shelters  by  testing  the  link  between  tax  shelters  and  four  other  proxies  for  tax 
avoidance  and  tax  aggressiveness.  Results  show  a  strong  positive  relation  between  tax  shelter 
usage,  and  both  total  book-tax  differences  (Mills  1998)  and  the  contingent  tax  liability  reserve,  or 
tax  cushion  (Gleason  and  Mills  2002).  However,  there  is  no  significant  association  between  tax 
shelter  usage  and  either  discretionary  permanent  book-tax  differences  (Frank  et  al.  2009)  or 
long-run  cash  effective  tax  rates  (Dyreng  et  al.  2008).  The  main  model  remains  the  most  highly 
discriminatory.  The  results  shed  light  on  the  extent  to  which  various  measures  do  or  do  not  capture 
tax  sheltering.  Such  insight  is  helpful  to  researchers  in  designing  appropriate  measurements  and 
tests  when  the  degree  of  tax  avoidance,  aggressiveness,  or  sheltering,  is  of  paramount  interest. 

Finally,  I  test  the  usefulness  of  the  model  in  the  new  disclosure  regime  of  FIN  48,  in  which 
firms  must  disclose  unrecognized  tax  benefits.  Results  show  that  the  TaxShelterScore  metric  is 
weaker,  but  remains  significant,  in  identifying  tax  sheltering  during  this  time.  Therefore,  infer¬ 
ences  made  using  this  model  in  a  FIN  48  period  should  be  made  with  care. 

As  a  final  note  of  caution,  because  the  model  developed  in  this  study  requires  a  variety  of 
input  variables,  and  is  designed  to  measure  the  most  extreme  type  of  tax  aggressiveness — tax 
shelters — it  can  either  limit  or  sharpen  the  types  of  research  questions  for  which  the  model  is 
well-suited.  Therefore,  the  model  can  potentially  aid  researchers  in  further  exploring  interesting 
questions  on  tax  sheltering  in  particular,  such  as  its  relation  to  managerial  compensation,  value 
creation,  and  financing  risks  via  the  cost  of  capital.  Further,  U.S.  and  international  financial  and 
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tax  policymakers  would  be  interested  in  these  results  to  the  extent  changes  in  reporting  and 
disclosure  practices  are  desirable.  For  example,  future  research  could  examine  whether  book-tax 
conformity  reduces  (or  exacerbates)  tax  sheltering,  and  whether  tax  sheltering  behavior  changes 
with  additional  oversight  (e.g.,  corporate  governance)  or  audit  demands  (e.g.,  Sarbanes-Oxley). 
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ABSTRACT:  We  develop  a  model  to  examine  the  effects  of  Financial  Accounting  Stan¬ 
dards  Board  (FASB)  Interpretation  No.  48,  Accounting  for  Uncertainty  in  Income  Taxes 
(FIN  48),  on  the  strategic  interaction  between  publicly  traded  corporate  taxpayers  and 
the  government.  Several  of  our  findings  contradict  conjectures  voiced  by  members  of 
the  business  community  regarding  the  economic  effects  of  implementing  FIN  48.  Spe¬ 
cifically,  taxpayers  with  strong  facts  obtain  higher  expected  payoffs  from  uncertain  tax 
benefits  and  some  disclosed  liabilities  understate  the  expected  tax  liability.  Consistent 
with  the  common  conjectures,  however,  some  taxpayers  are  more  likely  to  be  audited 
or  are  deterred  from  entering  into  transactions  that  generate  uncertain  tax  benefits 
because  of  FIN  48. 
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I.  INTRODUCTION 

Financial  Accounting  Standards  Board  (FASB)  Interpretation  No.  48,  Accounting  for  Uncer¬ 
tainty  in  Income  Taxes  (FIN  48,  FASB  2006),  is  the  most  significant  change  in  financial 
accounting  for  income  taxes  over  the  past  deeade.  This  interpretation  prescribes  the  ac¬ 
counting  for  uncertain  income  tax  benefits  recognized  in  a  firm’s  financial  statements  in  accor¬ 
dance  with  FASB  Statement  No.  109,  Accounting  for  Income  Taxes.  Taxpayers  are  often  uncertain 
whether  the  government  will  assess  an  additional  income  tax  payment  upon  audit,  which  would 
reduce  tax  benefits  claimed  in  the  originally  filed  tax  return.  FIN  48  dictates  how  uncertain  tax 
benefits  should  be  measured  and  requires  a  taxpayer  to  disclose  any  liability  for  tax  benefits 
claimed  on  its  tax  return  that  are  not  permitted  to  be  recognized  in  its  publicly  available  financial 
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statements.  We  investigate  how  the  strategic  interaction  between  publicly  traded  corporate  taxpay¬ 
ers  and  the  government  changes  as  the  result  of  this  new  accounting  interpretation. 

Tax  uncertainty  arises  because  of  the  difficulty  in  applying  ambiguous  tax  laws  to  a  set  of 
circumstances.  Shareholders  and  creditors  previously  had  little  information  about  the  future  cash 
payments  corporations  expected  to  make  to  the  government  arising  from  disputes  over  uncertain 
tax  positions.  Although  the  EASE  intended  for  FIN  48  to  provide  financial  statement  users  with 
information  about  tax  uncertainty,  the  government  also  observes  this  new  disclosure.  Accordingly, 
the  government  receives  a  signal  about  the  taxpayer’s  level  of  uncertainty  about  its  tax  filing 
positions.  We  model  tax  uncertainty  in  a  strategic  game  between  the  taxpayer  and  government,  and 
use  a  nondisclosure  benchmark  case  to  examine  the  effect  of  the  FIN  48  disclosure  on  the  strategic 
choices  made  by  the  taxpayer  and  the  government. 

Early  models  that  examined  the  strategic  interaction  between  the  taxpayer  and  the  government 
assumed  that  the  taxpayer  knew  its  true  taxable  income  and  that  the  government  could  discover 
such  income  with  an  audit.  Thus,  taxpayers  eould  perfectly  anticipate  the  consequences  of  an  audit 
(Graetz  et  al.  1986).  Beck  and  Jung  (1989a)  and  Beck  and  Jung  (1989b)  examine  tax  compliance 
in  an  environment  in  which  taxpayers  are  uncertain  regarding  their  tax  liability;  the  latter  did  so 
using  a  game  theory  framework  in  which  audit  probabilities  are  endogenous.  Other  studies  allow 
the  government  to  observe  a  signal  regarding  the  taxpayer,  which  affects  their  strategic  interaction 
(Sansing  1993;  Mills  and  Sansing  2000;  Beck  et  al.  2000,  hereafter  BDJ).^  Our  model  features  a 
mandatory  truthful  disclosure  of  taxpayers’  private  information  via  the  new  financial  reporting 
disclosure  regulation. 

Specifically,  we  model  the  FIN  48  financial  statement  disclosure  as  a  public  signal  from  the 
taxpayer  to  the  shareholders  that  also  provides  information  to  the  government.  As  in  Beck  and 
Jung  (1989b),  we  introduce  a  continuum  of  taxpayers  that  differ  in  the  strength  of  their  filing 
positions,  but  each  taxpayer  makes  a  discrete  tax  reporting  decision.  The  government  draws 
inferences  regarding  the  taxpayer’s  type  from  both  its  tax  return  and  its  financial  statements  before 
deciding  whether  to  audit.  Our  model  has  several  features  in  common  with  BDJ.  BDJ  examine  the 
effect  of  allowing  a  taxpayer  to  make  a  voluntary  disclosure  regarding  its  private  information  in 
exchange  for  which  the  government  agrees  to  waive  a  penalty  in  case  a  claimed  tax  benefit  is  lost 
upon  audit.  Although  the  institutional  setting  examined  in  BDJ  differs  from  the  one  we  examine 
here  in  several  important  respects,  our  analysis  of  the  effects  of  disclosure  in  Section  IV  identifies 
both  similarities  and  differences  in  how  voluntary  and  mandatory  disclosure  regimes  affect  tax¬ 
payer  behavior  and  payoffs. 

Our  model  imposes  five  important  assumptions  about  the  information  environment.  As  in 
BDJ,  we  make  assumptions  that  increase  the  government’s  ability  to  draw  inferences  from  the 
disclosure  “in  order  to  create  an  environment  conducive  to  information  transfers  among  taxpayers 
and  the  tax  agency”  (BDJ,  248).  Thus,  our  assumptions  reflect  our  primary  objective,  which  is  to 
explore  the  maximum  extent  to  which  FIN  48  could  affect  strategic  choices  of  both  the  taxpayer 
and  the  government.  This  approach  ensures  that  we  determine  an  upper  bound  on  the  effects  of 
FIN  48;  different  assumptions  would  yield  weaker  effects  than  those  we  find. 

First,  we  assume  that  the  taxpayer  files  a  tax  return  with  a  single  government.  This  assumption 
eliminates  having  a  single  aggregate  disclosure  related  to  tax  uncertainties  in  multiple  political 
jurisdictions.  Second,  we  assume  that  the  tax  return  claims  at  most  one  uncertain  tax  benefit. 
Third,  we  assume  that  the  government  can  observe  whether  the  tax  return  claims  an  uncertain  tax 


'  In  BDJ,  the  signal  is  in  the  form  of  a  disclosure  by  the  taxpayer.  In  Mills  and  Sansing  (2000),  the  signal  is  whether  book 
income  exceeds  or  equals  taxable  income.  In  Sansing  (1993),  the  IRS  privately  observes  a  signal  about  the  taxpayer’s 
income,  e.g.,  the  IRS’  “DIF”  score. 
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benefit.  This  assumption  allows  the  government  to  distinguish  between  taxpayers  that  disclose  no 
FIN  48  liability  on  their  financial  statements  because  they  are  confident  of  sustaining  the  position 
and  those  taxpayers  that  did  not  enter  into  a  transaction  that  generates  an  uncertain  tax  benefit. 
Fourth,  we  assume  that  the  government  knows  the  distribution  from  which  the  taxpayer’s  private 
information  is  drawn.  This  allows  the  government  to  infer  the  strength  of  the  taxpayer’s  filing 
position  in  our  model  whenever  the  taxpayer  discloses  a  liability  associated  with  an  uncertain  tax 
benefit.  Finally,  we  assume  that  the  taxpayer  reports  truthfully  in  the  sense  that  the  firm  complies 
fully  with  the  financial  reporting  requirements  of  FIN  48.^ 

Our  model  provides  important  insights  that  are  contrary  to  two  common  conjectures  made  by 
the  business  community  regarding  the  effects  of  FIN  48.  First,  we  show  that  mandatory  disclosure 
of  uncertainty  about  tax  filing  positions  need  not  put  all  taxpayers  at  a  disadvantage  in  the  tax 
audit  process.  In  fact,  we  show  that  taxpayers  with  strong  tax  positions  have  higher  expected 
payoffs  from  claiming  uncertain  tax  positions  post-FIN  48.  To  our  knowledge,  the  possibility  that 
some  taxpayers  may  benefit  from  the  mandatory  disclosure  requirements  of  FIN  48  has  not  been 
discussed  by  the  business  community  or  standard- setters.  Second,  we  show  that  the  liability 
disclosed  in  the  financial  statements  is  not  necessarily  an  overstatement  of  the  expected  cash 
payment.  The  conjecture  that  FIN  48  liabilities  are  likely  to  be  overstated  is  due  to  the  interpre¬ 
tation’s  departure  from  the  traditional  expected  value  approach.^  Under  FIN  48,  the  taxpayer  must 
assume  that  each  uncertain  tax  benefit  is  audited  when  calculating  its  tax  liability.  Our  model 
illustrates  that  the  direction  of  any  misstatement  depends  on  the  distribution  of  tax  benefits  the 
taxpayer  expects  to  retain  upon  audit  as  well  as  the  taxpayer’s  expectation  of  the  probability  of 
being  audited.  For  some  taxpayers,  however,  we  obtain  results  consistent  with  the  common  con¬ 
jectures  about  the  effect  of  FIN  48.  Specifically,  we  find  that  some  taxpayers  have  lower  expected 
payoffs  under  FIN  48.  This  lower  payoff  arises  because  such  taxpayers  either  face  a  higher 
probability  of  audit  or  refrain  from  entering  into  transactions  that  generate  uncertain  tax  benefits 
due  to  the  higher  audit  probability. 

We  compare  the  effects  of  mandatory  disclosure  pursuant  to  FIN  48  on  taxpayer  behavior  and 
payoffs  to  the  effects  of  voluntary  disclosure  examined  in  BDJ.  We  find  that  taxpayers  with 
sufficiently  strong  support  for  the  uncertain  tax  position  do  not  change  their  behavior  as  a  result  of 
the  mandatory  disclosure  and  are  better  off  because  the  government  is  less  likely  to  audit  them. 
Taxpayers  with  weaker  facts  are  weakly  better  off  in  the  voluntary  disclosure  setting  of  BDJ,  but 
are  weakly  worse  off  in  the  mandatory  disclosure  setting  that  we  examine.  The  difference  between 
the  two  settings  is  that  the  government  must  give  up  something  (a  penalty  waiver)  in  the  voluntary 
setting  in  order  to  induce  the  taxpayer  to  disclose.  The  government  does  not  give  up  anything  to 
get  the  disclosure  in  our  mandatory  disclosure  setting.  Thus,  our  study  enriches  our  understanding 
of  the  effects  of  disclosure  on  tax  compliance  by  comparing  the  different  effects  of  voluntary 
versus  mandatory  disclosure. 

Not  surprisingly,  FIN  48  has  generated  broad  interest  and  concern  across  the  business  and 
academic  community,  as  well  as  among  investors  and  regulators.  Using  our  framework,  concurrent 


^  We  emphasize  that  our  assumptions  allow  us  to  examine  an  important  benchmark  case.  Thus,  for  example,  while  truthful 
reporting  may  not  be  descriptive  per  se,  truthful  reporting  is  conducive  to  information  transfers.  Not  only  is  this 
important  in  a  strategic  tax  compliance  framework,  it  is  also  consistent  with  the  disclosure  literature.  See  Admati  and 
Pfleiderer  (2000,  481),  who  “assume,  as  is  often  done  in  the  literature  on  disclosure,  that  all  disclosed  information  is 
truthful.  This  is  a  reasonable  assumption  when  antifraud  laws  are  rigorously  enforced.  Also,  disclosures  are  often  made 
by  third  parties,  such  as  accounting  firms,  whom  are  not  directly  affected  by  the  content  of  the  disclosure  and  for  who 
the  reputation  for  truthfulness  is  valuable.” 

^  The  expected  value  approach  with  respect  to  uncertain  tax  positions  involves  recording  a  liability  equal  to  the  prob¬ 
ability  of  the  uncertain  tax  position  being  audited  multiplied  by  the  expected  tax  deficiency,  conditional  on  an  audit 
occurring. 
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studies  by  Gupta  et  al.  (2009),  Dunbar  et  al.  (2009),  and  Lisowsky  et  al.  (2010)  examine  the 
effects  of  FIN  48  on  tax  reporting.  We  believe  our  model  can  guide  current  and  future  research 
during  the  post-FIN  48  adoption  period  by  offering  a  framework  for  thinking  about  the  potential 
economic  effects  of  FIN  48. 

In  Section  II,  we  characterize  the  tax  and  financial  reporting  environment  in  which  the  tax¬ 
payer  operates.  Section  III  models  the  strategic  interaction  between  the  taxpayer  and  government 
pre-FIN  48,  which  we  use  as  our  nondisclosure  benchmark  case.  Section  IV  models  the  recogni¬ 
tion  and  measurement  process  of  FIN  48  and  the  strategic  interaction  between  the  taxpayer  and  the 
government  post-FIN  48.  In  Section  V,  we  address  two  common  conjectures  about  the  effects  of 
FIN  48  in  the  context  of  our  model  and  compare  the  effects  of  mandatory  disclosure  to  the  effects 
of  voluntary  disclosure.  Section  VI  concludes.  All  proofs  are  in  Appendix  A. 

II.  TAX  AND  FINANCIAL  REPORTING  UNDER  TAX  UNCERTAINTY 
Tax  Reporting  under  Tax  Uncertainty 

We  first  characterize  the  corporate  income  tax  research,  filing,  and  settlement  process  to 
clarify  the  notion  of  tax  uncertainty  that  we  model  and  that  FIN  48  was  designed  to  address.  The 
taxpayer  has  an  opportunity  to  enter  into  a  business  transaction  that  produces  an  uncertain  tax 
benefit.^  Before  deciding  whether  to  enter  into  the  transaction,  the  taxpayer  privately  observes  all 
relevant  facts  and  supporting  documentation  with  regard  to  the  transaction.  An  important  aspect  of 
the  tax  reporting  process  is  that  the  taxpayer  conducts  research  for  purposes  of  applying  technical 
tax  law  to  the  facts  and  circumstances  of  the  transaction,  evaluating  various  tax  filing  positions 
(Magro  and  Nutter  2009).  If  the  taxpayer  enters  into  the  transaction,  then  the  tax  benefit  is  claimed 
on  a  tax  return  filed  with  the  government  and  becomes  the  taxpayer’s  “filing  position.” 

The  government  observes  the  tax  return  and  thus  whether  the  taxpayer  has  entered  into  a 
transaction  and  claimed  an  uncertain  tax  benefit,  but  does  not  observe  the  strength  of  the  taxpay¬ 
er’s  filing  position.  Because  it  is  uncertain  whether  the  taxpayer  will  prevail  if  the  government 
challenges  the  claimed  tax  benefit,  the  government  cannot  merely  assess  a  deficiency  upon  ob¬ 
serving  a  claimed  uncertain  tax  benefit;  instead,  it  must  conduct  a  costly  audit  in  order  to  challenge 
the  taxpayer’s  position.  If  the  tax  return  is  audited,  then  the  outcome  of  a  dispute  could  result  in 
the  taxpayer  retaining  all,  some,  or  none  of  the  tax  benefit  originally  claimed  on  the  tax  return. 
Figure  1  summarizes  the  sequence  of  play. 

Financial  Reporting  under  Tax  Uncertainty 

The  financial  reporting  problem  that  FIN  48  seeks  to  address  is  how  to  reflect  uncertain  tax 
benefits  in  its  financial  statements  during  the  time  that  potential  tax  disputes  remain  unresolved. 
The  relevant  accounting  question  is  how  much  of  the  current  tax  benefit  that  the  taxpayer  receives 
from  the  uncertain  tax  position  should  be  recognized  as  a  reduction  to  current  period  income  tax 
expense.  Because  we  compare  a  FIN  48  mandatory  disclosure  regime  to  a  benchmark  pre-FIN  48 
case  of  no  disclosure,  we  first  highlight  important  financial  reporting  features  of  the  pre-FIN  48 
regime. 

Practice  varied  pre-FIN  48  with  respect  to  recognizing  liabilities  for  uncertain  tax  positions. 
Anecdotally,  many  corporations  recognized  a  liability  for  uncertain  tax  benefits  using  a  traditional 
expected  value  approach,  although  some  recognized  the  entire  tax  benefit  as  a  liability  unless  it 
was  probable  of  being  sustained.  Even  when  the  latter  approach  was  used,  the  definition  of 


Alternatively,  the  taxpayer  may  engage  in  a  transaction  that  has  an  uncertain  tax  benefit  for  non-tax  business  reasons.  In 
this  case,  the  taxpayer’s  decision  is  not  whether  to  engage  in  the  transaction  but  is,  instead,  whether  to  claim  an 
uncertain  tax  benefit  associated  with  the  transaction  on  the  tax  return.  From  the  point  of  view  of  our  model,  this 
alternative  framing  has  no  effect. 
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FIGURE  1 

Timeline  Summarizing  the  Sequence  of  Play  between  the  Taxpayer  (T)  and  the  Government  (G) 


Note  that  Period  1  and  Period  2  do  not  imply  two  equal  time  periods,  but  rather  the  actions  of  T  (Period  1)  and  G 
(Period  2).  Period  1  is  relatively  short,  while  Period  2  may  take  many  years  to  complete. 


“probable”  varied  across  companies.  Furthermore,  the  extent  to  which  corporations  incorporated 
the  probability  of  audit  into  the  calculation  of  expected  tax  liabilities  was  unclear.  Evidence  in 
Blouin  et  al.  (2007)  suggests  that  firms  generally  recognized  smaller  liabilities  pre-FIN  48  when 
they  faced  a  lower  probability  of  audit  or  detection. 

Regardless  of  how  much  benefit  from  uncertain  tax  positions  was  recognized  in  financial 
statements  pre-FIN  48,  there  was  no  explicit  disclosure  requirement.  Thus,  even  if  a  taxpayer 
recognized  a  liability  pre-FIN  48,  the  lack  of  a  mandatory  disclosure  requirement  prevented 
outsiders  from  observing  the  liability  directly.^  Gleason  and  Mills  (2002)  document  that  even  large 
companies  that  have  material  IRS  deficiencies  do  not  voluntarily  disclose  the  amount  of  their  tax 
reserves. 

FIN  48  imposes  new  recognition,  measurement,  and  disclosure  requirements  regarding  tax 
uncertainty.  Of  particular  importance  is  that  compliance  with  the  disclosure  requirement  (that  is, 
truthful  reporting)  makes  the  taxpayer’s  level  of  uncertainty  transparent  to  the  government  in  the 
post-FIN  48  regime.  Furthermore,  the  recognition  and  measurement  requirements  serve  to  increase 
the  information  content  of  the  disclosure  by  providing  consistency  in  the  language  with  which 


^  Because  financial  statement  balance  sheets  only  present  limited  detail,  recorded  liabilities  for  unrecognized  tax  benefits 
tax  would  not  typically  be  visible  to  the  public.  For  example,  these  liabilities  may  have  been  aggregated  with  environ¬ 
mental  or  legal  liabilities.  In  addition,  liabilities  for  uncertain  tax  positions  were  sometimes  intermingled  with  deferred 
tax  liabilities  as  evidenced  by  the  prohibition  in  FIN  48  against  this  classification  (FIN  48,  117)  and  the  significant 
reclassification  adjustments  reported  at  adoption  (e.g.,  in  the  first  quarter  of  2007,  Verizon  Communications  Inc.  reports 
a  $3  billion  reduction  in  defeixed  income  taxes  and  a  corresponding  increase  in  other  liabilities  as  a  reclassification 
adjustment  upon  adoption  of  FIN  48). 
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taxpayers  express  their  level  of  uncertainty.  An  important  feature  of  the  pre-FIN  48  regime  was 
that  the  lack  of  guidance  gave  taxpayers  no  way  to  credibly  disclose  a  low  level  of  uncertainty 
with  respect  to  an  uncertain  tax  position. 

III.  TAX  COMPLIANCE  PRE-FIN  48 

Pre-FIN  48  Model 

Our  pre-FIN  48  model  examines  the  interaction  between  the  taxpayer  and  the  government 
prior  to  the  implementation  of  FIN  48.  Thus,  our  benchmark  case  is  characterized  by  tax  uncer¬ 
tainty  without  public  disclosure  of  the  taxpayer’s  level  of  uncertainty.  There  are  two  players  in  our 
model:  the  taxpayer  (T)  and  the  government  (G).  The  taxpayer  can  take  two  possible  actions.  It 
can  enter  into  a  business  transaction  (a  =  F)  or  not  (a  =  N).  The  value  of  the  transaction  has  two 
components,  x  and  V,  where  x  is  the  expected  tax  benefit  associated  with  entering  into  the  trans¬ 
action,  conditional  on  being  audited,  and  V  is  the  value  of  the  transaction  in  absence  of  the  tax 
benefit. 

For  tractability,  we  simplify  the  model  so  that  there  are  only  two  possible  tax  treatments  for 
a  transaction.  We  normalize  the  best  possible  tax  result  from  entering  into  the  transaction  to  1  and 
the  worst  possible  tax  result  to  0.  In  order  to  model  business  transactions  in  which  the  tax  benefit 
is  decisive  for  the  taxpayer’s  decision  to  enter  into  the  transaction,  we  consider  cases  in  which  —1 
<  V  <  0.  Therefore,  the  value  of  the  transaction  when  the  tax-favored  treatment  occurs  and  is 
sustained  is  F  -I-  1  >  0.  Furthermore,  the  taxpayer  would  not  enter  into  the  transaction  if  it  knew 
the  tax  outcome  would  be  unfavorable  because  V  <  0.^  Consider  V  the  pre-tax  payoff  relative  to 
a  required  return',  thus,  we  do  not  model  transactions  that  lose  money  in  raw  terms,  but  rather, 
absent  the  tax  benefit,  the  taxpayer  would  not  choose  this  transaction.  Municipal  bonds  and  tax 
credit  investments,  for  example,  do  not  lack  economic  substance,  but  they  would  be  negative  NPV 
investments  in  absence  of  the  tax  benefits  (Robinson  2010). 

Thus,  we  consider  discrete  tax  disputes  in  which  rejection  of  the  taxpayer’s  filing  position 
implies  a  definitive  alternative  tax  treatment.  However,  tax  law  ambiguity  makes  it  hard  for  each 
party  to  determine  whether  the  tax  benefit  claimed  is  valid.  The  taxpayer  and  the  government  may 
therefore  compromise  on  a  settlement,  even  though  as  a  matter  of  law  a  court  would  ultimately 
have  to  allow  or  disallow  the  entire  claimed  tax  benefit.  An  example  is  a  tax-free  spin-off  that  can 
only  be  challenged  on  the  grounds  that  it  lacks  economic  substance.  A  spin-off  qualifies  as  tax-free 
under  §355  or  it  does  not;  any  decision  about  the  applicability  of  tax  law  to  the  transaction  must 
either  grant  or  deny  the  claimed  tax  benefit  in  its  entirety.  Tax-free  mergers  provide  a  second 
example  in  which  a  recent  tax  case  illustrates  the  discrete  nature  of  many  real  world  disputes.  To 
avoid  taxable  gains  of  nearly  $1.4  billion  on  the  sale  of  a  subsidiary.  Times  Mirror  (the  Company) 
engaged  in  complex  and  unconventional  tax-free  reverse  triangular  mergers  with  Reed  Elsevier 
and  Harcourt  General  in  1998.  Upon  audit  in  2002,  the  Internal  Revenue  Service  (IRS)  recharac¬ 
terized  the  reorganization  as  a  fully  taxable  sale  based  on  an  argument  that  the  transaction  lacked 
economic  substance,  and  assessed  a  tax  deficiency  of  $550  million.  The  Company  litigated  the 
case  and  in  2006,  a  Tax  Court  judge  issued  a  ruling  that  supported  the  IRS  position  disallowing 
tax-free  treatment.  As  a  result,  the  Company  paid  $880  million  (which  included  interest  on  the 
deficiency)  and  then  appealed.  The  IRS  considered  possible  outcomes  in  the  Seventh  Circuit  Court 


®  The  notion  that  a  taxpayer  would  engage  in  transaction  only  if  the  tax  outcome  was  favorable  can  be  illustrated  more 
concretely  by  disclosures  surrounding  reorganizations.  Liberty  Media  Group,  Inc.  reports  the  following  in  its  2008 
annual  report:  “The  Redemption  and  resulting  separation  of  LEI  from  LMC  are  referred  to  as  the  Split  Off.  The  Split  Off 
is  conditioned  on,  among  other  matters,  receipt  of  stockholder  approval  and  receipt  of  a  private  letter  ruling  from  the 
IRS  and  a  tax  opinion  from  tax  counsel  and  is  expected  to  occur  in  the  second  quarter  of  2009.” 
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of  Appeals  and  decided  to  initiate  settlement.  In  late  2007,  nearly  10  years  after  the  transaction 
was  entered  into,  the  parties  agreed  to  settle  the  case  for  $536  million  (Mulligan  2007). 

The  taxpayer  in  our  model  is  fully  informed  regarding  the  facts  of  the  transaction,  in  that  it 
knows  how  its  particular  facts  fit  within  the  applicable  law,  regulations,  court  cases,  other  author¬ 
ity,  and  IRS  practices  regarding  settling  appeals.  Based  on  these  facts  and  circumstances,  the 
taxpayer  knows  the  distribution  of  possible  audit  outcomes  if  it  enters  into  the  transaction  and  the 
IRS  audits  the  tax  return.  We  assume  that  both  the  taxpayer  and  the  government  are  risk-neutral, 
which  implies  that  the  mean  value  (pretax  value  plus  the  expected  tax  benefit  retained  upon  audit) 
is  everything  the  taxpayer  needs  to  know  in  order  to  decide  whether  to  engage  in  the  transaction 
in  the  pre-FIN  48  regime.^ 

Finally,  we  assume  that  no  firm  can  make  a  credible  voluntary  disclosure  that  would  deter 
government  audits,  because  practice  varied  too  much  for  disclosures  to  be  meaningful  until  FIN  48 
established  consistent  rules  for  recognition,  measurement  and  the  content  of  the  disclosure.  For 
example,  even  if  a  firm  stated  that  it  had  recorded  no  liability  for  tax  uncertainty,  the  government 
could  not  know  whether  the  disclosure  reflected  a  low  probability  of  government  detection  or  a 
strong  case  on  its  merits.  Gleason  and  Mills  (2002)  find  that  the  largest  100  nonfinancial  firms 
rarely  disclosed  even  the  amounts  of  IRS  disputes  throughout  the  1990s,  even  when  the  proposed 
adjustments  were  material.  In  the  two  quarters  between  FIN  48  enactment  and  adoption,  Blouin  et 
al.  (2010)  find  that  firms  continued  to  not  disclose  contingent  tax  reserves.  Thus,  our  assumption 
is  consistent  with  prior  empirical  findings  about  the  frequency  of  tax  contingency  disclosures  prior 
to  FIN  48. 

Let  X  represent  the  expected  value  of  the  tax  benefit  that  will  be  retained  upon  audit  of  a 
transaction  that  produces  an  uncertain  tax  benefit.  The  realization  x  represents  the  taxpayer’s 
private  information  because,  although  the  government  can  observe  whether  the  taxpayer  entered 
into  a  transaction  that  produced  an  uncertain  tax  benefit,  it  cannot  observe  the  strength  of  the 
underlying  position.  For  example,  the  government  can  tell  from  the  tax  return  that  the  taxpayer  has 
been  a  party  to  a  merger  and  has  treated  it  as  tax-free,  but  it  cannot  tell  how  strongly  the  facts 
support  this  characterization.  In  addition,  we  assume  that  the  government  can  observe  the  pretax 
value,  V.  Thus,  in  the  pre-FIN  48  regime,  taxpayers  claiming  the  uncertain  tax  benefit  are  indis¬ 
tinguishable  to  the  government  in  terms  of  their  level  of  uncertainty.  We  assume  that  x  is  the 
realization  of  a  random  variable  drawn  from  a  uniform  [1/2,  1]  distribution.  The  taxpayer  observes 

Q 

X  before  deciding  whether  to  enter  into  the  transaction.  We  assume  x  >  1/2  because  we  focus  on 
cases  for  which  the  taxpayer  is  more  likely  than  not  to  prevail  in  a  dispute  with  the  government. 
This  ensures  that  no  paid  preparer  has  an  obligation  under  Internal  Revenue  Code  (IRC)  §6694  to 
disclose  any  information  about  the  transaction  on  the  tax  return. 

After  observing  a  tax  return  that  contains  an  uncertain  tax  benefit,  the  government  audits  the 
tax  return  with  probability  a.  The  government  faces  an  audit  cost  c  where  c  e  (0,1]  and  collects 
a  penalty  tt  on  the  expected  additional  tax  collected  (1  -  x),  where  tt  denotes  the  expected  tax 
penalty  rate  tt  e  (0,  1].^  Figure  2  summarizes  the  expected  payoffs  to  the  taxpayer  and  the 
government  when  the  taxpayer  does  and  does  not  enter  into  the  transaction.  As  illustrated  in 


^  For  purposes  of  our  focus  on  tax  compliance,  we  ignore  any  financial  reporting  preferences  a  taxpayer  might  have  for 
either  tax  benefits  or  contingent  liabilities  beyond  the  pretax  values  represented  in  F. 

*  When  we  add  the  FIN  48  disclosure  to  our  model,  we  impose  additional  structure  on  the  underlying  distribution  of 
outcomes  from  which  the  FIN  48  liability  is  derived.  We  emphasize  that  x  is  the  mean  of  the  possible  tax  benefits 
retained  upon  audit,  not  the  outcome  itself. 

®  We  impose  an  upper  bound  on  the  penalty  rate  of  100  percent  to  reflect  the  U.S.  tax  law,  in  which  the  penalty  for 
negligence  is  20  percent  (IRC  §6662)  and  the  penalty  for  fraud  is  75  percent  (IRC  §6663).  Putting  an  upper  bound  on 
the  penalty  rules  out  implausible  cases  in  which  taxpayers  with  very  strong  positions  are  intimidated  into  filing  high 
income  reports  by  the  presence  of  a  draconian  penalty  regime. 
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FIGURE  2 

Payoff  Matrix  for  the  Taxpayer  (T)  and  the  Government  (G) 


Payoff  Matrix  [T,  G] 

Audit 

No  Audit 

Enter  into  transaction  (a  =  V) 

V  +  x-n:(\-x),-x  +  n:{\~x)-c 

V+  1,-1 

Do  not  enter  into  transaction  (a  =  N) 

0,  -c 

0,0 

a  =  Y  when  the  taxpayer  enters  into  a  transaction  with  a  pretax  value,  V,  and  an  expected  tax  benefit  retained 
upon  audit,  x.  a  =  N  when  a  taxpayer  does  not  enter  into  a  transaction,  tt  is  the  penalty  rate,  and  c  is  the  cost  to 
the  government  of  conducting  an  audit. 


Figure  2,  we  normalize  the  “no  transaction,  no  audit”  payoff  to  0  such  that  the  payoff  to  the 
government  from  the  “transaction,  no  audit”  outcome  is  —1,  and  the  difference  between  the 
government’s  payoff  from  the  “transaction,  no  audit”  outcome  and  the  “transaction,  audit”  out¬ 
come  is  (1  +  7r)(l  —  x)  —  c.  Thus,  the  aggregate  payoffs  make  economic  sense,  equaling  either 
0,  V,  V  —  c,  or  —c  in  the  four  cells  of  Figure  2. 

Not  entering  into  the  transaction  {a  =  N)  gives  the  taxpayer  a  payoff  of  0.  Entering  into  the 
transaction  {a  =  Y)  gives  the  taxpayer  a  payoff  of  V  -)-  1  if  the  tax  return  is  not  audited  and  an 
expected  payoff  of  V  +  x  —  7r(l  —  x)  if  the  tax  return  is  audited.  If  the  government  does  not 
conduct  an  audit,  then  the  government’s  payoff  is  —  1  if  the  taxpayer  chooses  a  =  Y  and  0  if  the 
taxpayer  chooses  a  =  N.  If  it  conducts  an  audit,  then  the  government’s  payoff  is  — x  +  7r(l  —  x) 
—  c  if  a  =  F  and  —c  if  a  =  N.  The  payoff  from  auditing  when  a  =  N  is  —c,  because  audits  are 
costly  and  no  additional  tax  is  collected.  Thus  the  government  never  audits  when  a  —  N.  The 
government’s  payoff  from  auditing  the  transaction,  — x  +  7r(l  —  x)  —  c,  reflects  our  assumption 
that  the  government  may  not  recover  all  the  tax  on  audit.  In  our  model,  audit  outcomes  vary 
because  they  depend  on  the  strength  of  taxpayer’s  facts  and  circumstances,  which  determines  x. 

Pre-FIN  48  Equilibria 

An  equilibrium  is  an  audit  strategy  for  the  government  and  a  reporting  strategy  for  each  type-x 
of  taxpayer,  where  each  taxpayer’s  strategy  is  a  best  response  to  the  government’s  strategy  and  the 
government’s  strategy  is  a  best  response  given  the  set  of  taxpayer  strategies.  When  the  taxpayer’s 
facts  and  the  available  authority  (law,  judicial  precedents,  and  administrative  guidance)  make  the 
probability  of  retaining  more  of  the  tax  benefit  relatively  high,  we  refer  to  that  taxpayer  as  having 
a  strong  position.  When  the  taxpayer’s  facts  and  available  authority  make  the  probability  of 
retaining  more  of  the  benefit  relatively  low,  we  refer  to  that  taxpayer  as  not  having  a  strong 
position. 

Four  types  of  equilibria  arise.  We  first  describe  two  simple  equilibria  in  which  all  taxpayers 
claim  the  uncertain  tax  benefit.  We  then  develop  and  discuss  two  more  complex  equilibria  that  we 
formalize  in  Proposition  1. 

First,  if  auditing  is  sufficiently  costly,  c  >  all  taxpayers  enter  into  the  transaction  and  the 
government  conducts  no  audits.  Audit  costs  are  so  high  that  the  threat  to  audit  is  not  credible; 
hence,  all  taxpayers  enter  into  the  transaction  that  generates  uncertain  tax  benefits. 

When  audit  costs  are  low,  c  <  behavior  depends  on  the  pretax  value  V  of  the  transaction 
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and  in  some  cases  the  taxpayer’s  private  information,  x.  Figure  3  depicts  taxpayer  and  government 
behavior  for  the  equilibria  that  arise  when  audit  costs  are  low. 

If  audit  costs  are  low  and  the  pretax  value  V  of  the  transaction  is  sufficiently  high,  V  > 
then  all  taxpayers  are  willing  to  enter  the  transaction  despite  a  certain  audit.  Their  willingness 
arises  because  the  pretax  value  of  the  transaction  is  sufficiently  high,  and  the  government  is 
willing  to  audit  all  transactions  with  uncertain  tax  benefits  because  the  audit  cost  is  sufficiently 
low. 

Proposition  1  below  formalizes  the  behavior  of  the  government  and  the  taxpayer  for  the  two 
equilibria  for  which  the  behavior  of  the  taxpayer  depends  on  the  realization  of  x  when  V  <  If 
audit  costs  are  low  and  V  takes  on  an  intermediate  value,  2c  -  1  <  V  <  ^  (see  Proposition  1(a)), 
then  the  government  audits  all  tax  returns  that  claim  uncertain  tax  benefits  and  taxpayers  deter¬ 
mine  their  reporting  strategy  according  to  a  cutoff  value  x*.  Taxpayers  with  a  type-x  above  x* 
enter  into  the  transaction  even  when  they  are  certain  they  will  be  audited,  whereas  taxpayers  with 


FIGURE  3 

Pre-FIN  48  Equilibrium  When  c  <  and  Post-FIN  48  Equilibrium  When  ^  1  - 


Figure  3  illustrates  taxpayer  and  government  behavior,  respectively.  With  respect  to  the  taxpayer,  a  =  Y  when  the 
taxpayer  enters  into  a  transaction  with  a  pretax  value,  V,  and  an  expected  uncertain  tax  benefit  retained  upon 
audit,  X.  a  =  when  a  taxpayer  does  not  enter  into  a  transaction.  With  respect  to  the  government,  audit  means  the 
government  audits  all  taxpayers,  whereas  mixed  audit  means  the  government  audits  taxpayers  with  probability, 
a.  77  is  the  penalty  rate,  and  c  is  the  cost  to  the  government  of  conducting  an  audit. 
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a  type-x  below  do  not  enter  into  the  transaction.  We  emphasize  that  taxpayers  with  relatively 
strong  positions  (x  >  jc*)  claim  an  uncertain  tax  benefit  even  in  the  face  of  certain  audit. 

Finally,  when  audit  costs  are  low  and  the  transaction  value  is  low,  V  <  2c  —  1  (see  Proposition 
1(b)),  the  threat  to  audit  deters  taxpayers  from  entering  the  transaction  unless  x>  I  -  Because 
only  taxpayers  with  sufficiently  strong  facts  enter  into  the  transaction,  the  government  does  not 
have  an  incentive  to  audit  all  tax  returns  that  claim  uncertain  tax  benefits.  In  equilibrium,  the 
government  adopts  a  mixed  strategy,  auditing  some  returns  and  not  others.  Given  this  mixed 
strategy,  a  taxpayer  for  whom  x=l  -  is  indifferent  about  entering  into  the  transaction;  tax¬ 
payers  with  stronger  facts  enter  into  the  transaction  and  taxpayers  with  weaker  facts  do  not. 
Likewise,  the  government’s  expected  payoff  from  auditing  a  return  that  claims  uncertain  tax 
benefits  is  equal  to  its  payoff  from  not  auditing  the  return. 

Proposition  1: 

(a)  If  audit  costs  (c)  are  low,  c  <  and  the  pretax  transaction  value  (V)  is  intermediate, 
2c  -  1  <  L  <  then: 

(i)  G  audits  no  a  =  N  tax  returns, 

(ii)  G  audits  all  a  =  F  tax  returns, 

(iii)  each  T  chooses  a  =  Y  x>  and  chooses  a  =  N  otherwise. 

(b)  If  c  <  and  the  pretax  transaction  value  is  low,  V  <  2c  —  1,  then: 

(i)  G  audits  no  a  =  N  tax  returns, 

(ii)  G  audits  a  =  Y  tax  returns  with  probability  a  = 

(iii)  each  T  chooses  a  =  Y  '\f  x^  \  -  and  chooses  a  =  N  otherwise. 

IV.  TAX  COMPLIANCE  POST-FIN  48 

Post-FIN  48  Model 

We  next  examine  the  interaction  between  the  taxpayer  and  the  government  in  a  post-FIN  48 
regime  characterized  by  mandatory  public  disclosure  of  the  taxpayer’s  level  of  uncertainty.  FIN  48 
determines  the  amount  of  the  uncertain  tax  benefit  the  taxpayer  may  recognize  in  the  financial 
statements  under  a  two-step  process:  recognition  and  measurement.  First,  taxpayers  may  only 
recognize  tax  benefits  that  are  more  than  50  percent  likely  to  be  sustained  by  the  court  of  last 
resort  based  solely  on  the  technical  merits  of  the  filing  position.  For  positions  that  pass  the 
recognition  step,  the  measurement  step  then  determines  the  amount  of  the  tax  benefit  that  should 
be  recognized,  if  any.  The  tax  benefit  recognized  in  the  financial  statements  is  the  largest  tax 
benefit  that  cumulatively  is  greater  than  50  percent  likely  to  be  sustained  on  audit,  taking  into 
account  likely  settlements  with  the  government.  This  two-step  process  is  a  departure  from  an 
expected  value  approach.  Rather  than  recording  an  expected  benefit  (a  mean)  in  which  the  expec¬ 
tation  incorporates  audit  probability,  both  the  recognition  and  measurement  steps  under  FIN  48 
consider  a  “more  likely  than  not”  threshold  (a  median)  and  assume  the  government  has  full 
knowledge  of  the  position. 

FIN  48  requires  the  taxpayer  to  recognize  a  liability  for  the  unrecognized  tax  benefit  to  offset 
some  or  all  of  the  total  tax  benefit  claimed  on  the  tax  return.  Because  the  recognized  benefit  is  the 
median  value  of  the  taxpayer’s  retained  tax  benefit  (the  more  likely  than  not  amount),  the  recog¬ 
nized  liability  is  equal  to  1  minus  the  median  value  of  the  taxpayer’s  retained  tax  benefit.  Most 
notable  in  our  model  is  the  requirement  under  FIN  48  that  the  taxpayer  disclose  the  liability  for  the 
unrecognized  tax  benefit  in  its  publicly  available  financial  statements.  Thus,  post-FIN  48  the 
government  can  observe  both  an  uncertain  tax  position  in  the  tax  return  and  the  FIN  48  disclosure 
in  the  financial  statements.  This  allows  the  government  to  form  a  more  precise  belief  regarding  the 
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strength  of  the  taxpayer’s  filing  position  than  it  could  pre-FlN  48.  The  government  can  use  this 
revised  belief  to  determine  its  audit  strategy.  Finally,  the  government  can  observe  the  value  of  the 
pretax  features  of  the  transaction,  V.  The  cost  of  the  audit,  c,  is  unchanged  in  our  post-FIN-48 
model,  because  although  the  government  can  identify  the  existence  of  the  transaction  by  observing 
a,  it  must  still  conduct  a  costly  audit  to  analyze  the  transaction. 

We  assume  that  the  taxpayer’s  facts  are  sufficiently  strong  that  they  pass  the  recognition  step. 
Although  some  transactions  would  not  satisfy  the  criteria  for  recognition,  other  features  of  the 
institutional  environment  make  this  difficult  to  hide  from  the  government.  For  example,  a  paid 
preparer  has  an  obligation  to  disclose  the  existence  of  a  transaction  that  would  fail  the  “more  likely 
than  not”  standard  (IRC  §6694;  BDJ;  Pauly  2008).  Because  our  focus  is  solely  on  the  effect  of  FIN 
48  on  the  strategic  interaction  between  publicly  traded  corporate  taxpayers  and  the  government, 
we  confine  our  attention  to  settings  in  which  the  government  can  only  condition  its  audit  decision 
on  the  existence  of  the  uncertain  tax  position  on  the  tax  return  and  the  firm’s  FIN  48  disclosure. 

To  model  the  measurement  processes  of  FIN  48,  we  introduce  the  function  m{x),  which 
represents  the  median  tax  benefit  retained  upon  audit,  given  x.  The  uncertain  tax  benefit  recog¬ 
nized  under  FIN  48  is  1  —  m(x).  We  make  no  assumption  about  whether  m{x)  is  greater  or  less 
than  x;  medians  can  be  above  or  below  means.  We  assume  that  m(x)  is  continuous,  m'(x)  >  0  for 
all  m(x)  <  1,  and  m(7)  =  1. 

We  allow  the  taxpayer  to  retain  the  full  tax  benefit  with  some  positive  probability  p(x).  This 
implies  that  the  expected  retained  benefit  given  some  of  the  benefit  is  lost,  which  occurs  with 

probability  1  -  p{x),  is  so  that  the  expected  retained  benefit  is  equal  to  x.  We  define  Xj  as 

the  smallest  value  of  x  for  which  m(x)  =  1 .  We  denote  this  cutoff  point  as  X5  to  signify  the  point 
above  which  the  taxpayer’s  type-x  is  “strong.”  We  provide  an  example  of  a  probability  distribution 
of  audit  outcomes  that  is  consistent  with  all  of  our  assumptions  regarding  p{x)  and  m(x)  in 
Appendix  B. 

We  denote  the  disclosed  FIN  48  liability  as  D,  where  D  e  [0,1].  If  x  >  X5  (i.e.,  taxpayer  has 
a  strong  position),  then  m{x)  =  1  and  so  D  =  0  because  D  =  1  —  m(x).  If  x  <  X5  (i.e.,  taxpayer 
does  not  have  a  strong  position),  then  0  <  Z)  <  1.  The  function  m(x)  is  common  knowledge,  so  if 
0  <  D  <  1,  then  the  government  can  infer  x.  In  contrast,  if  D  =  0,  the  government  only  knows 
that  X  >  x^.  In  summary,  the  mean  and  median  settlement  outcomes  are  linked,  so  we  let  x  be  the 
mean  settlement  and  m(x)  be  the  median  settlement.  We  make  x  the  main  random  variable,  and 
m(x)  is  expressed  as  a  function  of  x. 

Two  types  of  FIN  48  disclosures  thus  correspond  with  the  measurement  process  of  FIN  48:  D 
=  0or0<D<l.A  disclosure  of  Z)  =  0  in  the  financial  statements  is  consistent  with  two 
situations.  First,  D  =  0  when  a  =  N,  because  the  taxpayer  did  not  claim  an  uncertain  tax  benefit. 
Second,  D  =  0  when  the  taxpayer  claims  an  uncertain  tax  benefit  and  has  a  strong  position  (i.e., 
the  taxpayer  records  D  =  I  -  m(x)  where  m(x)  =  1).  Because  the  government  can  observe  from 


Recall  that  we  assume  there  is  only  one  transaction  in  our  model,  so  a  FIN  48  disclosure  fully  reveals  which  transaction 
is  uncertain.  For  most  companies  of  any  complexity,  the  FIN  48  disclosure  will  be  an  aggregate  liability  for  multiple  tax 
positions  in  multiple  taxing  jurisdictions.  Although  FASB’s  initial  exposure  draft  proposed  requiring  corporations  to 
disclose  domestic  and  foreign  components  of  unrecognized  tax  benefits  separately,  that  finer  disclosure  did  not  survive 
the  adopted  standard.  Thus,  even  if  the  FIN  48  disclosure  helps  the  IRS  select  the  taxpayer  for  audit,  the  FIN  48 
disclosure  does  not  lower  the  cost  of  conducting  the  audit.  An  important  real  world  example  that  supports  this  feature 
in  our  model  is  the  recent  Textron  case  (Textron  v.  Comm.  2007),  in  which  the  court  disallowed  the  IRS  access  to 
workpapers  prepared  during  an  audit.  The  business  community  has  speculated  that  one  of  the  key  ways  that  the 
government  will  reduce  their  audit  cost  as  a  result  of  FIN  48  is  by  obtaining  documentation  used  to  prepare  the  FIN  48 
financial  statement  disclosure.  The  Textron  case  seems  to  provide  taxpayers  with  support  for  asserting  that  the  work 
product  privilege  (Arthur  Young  v.  Comm.  1984)  applies  to  tax  accrual  workpapers  that  were  provided  to  the  taxpayer’s 
independent  auditors. 
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the  tax  return  whether  the  taxpayer  has  claimed  an  uncertain  tax  benefit,  it  can  distinguish  between 
these  two  situations  for  which  D  =  0. 

We  assume  the  independent  audit  process  ensures  that  the  disclosure  D  is  truthful,  i.e.,  con¬ 
sistent  with  FIN  48.  As  a  result,  taxpayers  without  strong  positions  cannot  simply  decide  to 
disclose  D  =  0  to  pool  with  taxpayers  with  strong  positions.  Introducing  another  actor  to  the  game 
(the  independent  auditor)  would  complicate  our  model  and  obscure  our  goal  for  this  initial  study. 
Our  goal  from  imposing  this  simplifying  assumption  is  to  strengthen  the  government’s  information 
set  so  that  we  can  evaluate  the  maximum  effect  of  FIN  48  on  the  strategic  interaction  between  the 
taxpayer  and  the  government. 

We  depict  the  possible  realizations  of  x  and  associated  value  of  D  in  Figure  4. 

Post-FIN  48  Equilibria 

We  consider  the  equilibria  that  can  occur  in  two  cases:  (1)  x  >  jc^,  and  (2)  x  <  .  When  x 

>  Xs,  the  taxpayer  has  a  strong  position,  and  we  begin  our  analysis  with  this  case. 

Taxpayer  with  a  Strong  Position  (at  >  ^5) 

A  taxpayer  that  claims  an  uncertain  tax  benefit  with  a  strong  position  discloses  D  =  0.  When 
the  government  observes  a  =  Y  and  D  =  0,  it  knows  that  jc  lies  in  the  interval  [x^,  1].  As  before, 
we  first  describe  two  simple  equilibria  in  which  taxpayers  always  claim  the  uncertain  tax  position. 
We  formalize  more  eomplex  cases  for  taxpayers  with  strong  positions  in  Proposition  2. 

If  Xg  is  sufficiently  high,  >  1  -  then  the  disclosure  D  =  0  deters  the  government  from 
auditing.  With  no  government  audits,  all  taxpayers  with  strong  positions  claim  the  tax  benefit. 

If  <  1-7:7:;;;,  then  the  strategies  of  the  taxpayer  and  the  government  depend  on  the  pretax 
value  V  of  the  transaction  and  the  realization  of  x.  When  the  value  of  the  transaction  is  high,  y 

>  a  taxpayer  with  a  strong  position  is  willing  to  claim  the  tax  benefit  even  if  an  audit  is 
certain.  In  this  case,  all  taxpayers  with  a  strong  position  claim  the  tax  benefit,  and  the  government 
audits  all  returns  that  claim  the  tax  benefit. 


FIGURE  4 

Possible  Realizations  of  the  Expected  Tax  Benefit,  x,  and  Associated  Values  of  the  Disclosed 

FIN  48  Liability,  D 
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The  expected  tax  benefit  retained  on  audit  is  x.  The  median  retained  benefit  on  audit  is  m{x).  Z)  =  0  is  disclosed 
when  taxpayers  have  strong  positions,  which  implies  x  >  in  our  model.  0  <  Z)<  1,  or  1  -  w(x),  is  disclosed 
when  taxpayers  do  not  have  strong  positions,  which  implies  that  7  <  x  <  X5  in  our  model. 
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When  the  value  of  the  transaction  is  intermediate,  2c  -  I  <V  <  or  low,  y  <  2c  —  1,  the 
behavior  of  the  taxpayer  depends  on  the  realization  of  x.  We  characterize  the  behavior  of  the 
taxpayer  with  a  strong  position  and  the  government  in  Proposition  2  below. 

When  V  takes  on  an  intermediate  value,  2c  -  1  <  y<  (see  Proposition  2(a)),  the  gov¬ 
ernment  audits  all  tax  returns  that  claim  uncertain  tax  benefits  and  taxpayers  determine  their 
reporting  strategy  according  to  a  cutoff  value  x*.  Taxpayers  with  a  type-x  above  x*  enter  into  the 
transaction  even  when  they  are  certain  they  will  be  audited,  while  taxpayers  with  a  type-x  below 
X*  do  not  enter  into  the  transaction. 

When  the  transaction  value  is  low,  V  <2c  —  1  (see  Proposition  2(b)),  the  threat  to  audit  deters 
taxpayers  from  entering  the  transaction  unless  x  ^  1  -  Because  only  taxpayers  with  suffi¬ 
ciently  strong  facts  enter  into  the  transaction,  the  government  does  not  have  an  incentive  to  audit 
all  tax  returns  that  claim  uncertain  tax  benefits.  In  equilibrium,  the  government  adopts  a  mixed 
strategy,  auditing  some  returns  and  not  others. 

Proposition  2:  When  x>  xg,  D  =  0  for  both  a  =  Y  and  a  =  N. 

(a)  If  the  lowest  value  of  x  for  which  no  liability  is  recognized  under  FIN  48  is  low  x^ 
<  1  -  and  the  value  (V)  of  the  transaction  is  intermediate,  2c  -  1  <  V  <  then: 

(i)  G  audits  no  a  =  N  tax  returns, 

(ii)  G  audits  all  a  =  7  tax  returns,  and 

(iii)  each  T  chooses  a  =  7  if  x  >  and  chooses  a  =  N  otherwise. 

(b)  If  X5  <  1  -  and  the  value  of  the  transaction  is  low,  V  <  2c  —  1,  then: 

(i)  G  audits  no  a  =  N  tax  returns, 

(ii)  G  audits  a  =  7  tax  returns  with  probability  a  =  and 

(iii)  each  T  chooses  a  =  7ifx>l  -  and  chooses  a  =  N  otherwise. 

Taxpayer  without  a  Strong  Position  (x  <  -<^s) 

When  X  <  x^,  the  taxpayer  is  more  likely  than  not  to  lose  part  of  the  claimed  tax  benefit. 
When  a  taxpayer  without  a  strong  position  enters  into  a  transaction  that  generates  an  uncertain  tax 
benefit,  it  must  disclose  0  <  D  <  1,  where  D  =  1  -  m(x).  Unlike  the  case  in  which  Z)  =  0,  when 

the  government  observes  a  =  7  and  0  <  Z)  <  1,  the  taxpayer’s  private  information,  x,  is  fully 

revealed  to  the  government  because  the  government  knows  m(x).  In  this  setting,  both  the  taxpayer 
and  the  government  are  in  a  full  information  environment.  We  characterize  the  behavior  of  the 
taxpayer  without  a  strong  position  and  the  government  in  Proposition  3  below. 

Proposition  3:  When  \  <x<xs,D  =  1  -  ot(x)  when  a  =  7 and  D  =  0  when  a  =  N. 

(a)  If  X  >  1  -  77^  then: 

(i)  G  audits  no  tax  returns,  and 

(ii)  all  T  chooses  a  =  Y. 

(b)  If  ^  <  X  <  1  -  then: 

(i)  G  audits  no  a  =  N  tax  returns, 

(ii)  G  audits  all  a  =  7  tax  returns,  and 

(iii)  each  T  chooses  a  =  7  if  x  >  and  chooses  a  =  N  otherwise. 

We  illustrate  the  behavior  of  taxpayers  and  the  government  when  c  <  ^  in  Figures  3,  5,  and 
6.  Figure  3  represents  the  situation  in  which  x^  <  1  -  Recall  also  that  Figure  3  depicts  the 
pre-FIN  48  equilibria.  Figure  5  represents  X5  >  1  -  77^,  and  Figure  6  represents  1  - 
<  1  - 

We  emphasize  that  the  region  of  each  figure  for  which  x  >  X5  is  the  region  for  which  all 
taxpayers  that  engage  in  the  transaction  disclose  D  =  0  and  the  region  for  which  x  <  x^  is  the 
region  for  which  all  taxpayers  that  engage  in  the  transaction  disclose  D  =  1  -  m(x)  >  0.  Thus, 
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FIGURE  5 

Comparing  Equilibria  Pre-FIN  48  and  Post-FIN  48  When  >  1  -  ^ 


“1  itl  0 

I  /  2 


because  Figure  5  shows  a  high  range  for  Xs,  it  depicts  the  situation  in  which  audit  costs  are 
relatively  high,  whereas  Figure  3  represents  a  situation  in  which  audit  costs  are  relatively  low. 

Figure  3  characterizes  the  behavior  of  taxpayers  and  the  government  both  prior  to  FIN  48  and 
after  FIN  48  when  <  1  -  Most  taxpayers  disclose  no  FIN  48  liability  because  Xs  is  so  low. 
In  the  extreme  case  in  which  X5  =  ^,  the  disclosure  has  no  information  content  because  all  taxpay¬ 
ers  make  the  same  disclosure.  For  taxpayers  that  disclose  a  FIN  48  liability,  the  audit  costs  are  so 
low  that  the  government  does  not  need  the  disclosure  to  convince  it  to  audit.  Therefore,  the  FIN  48 
disclosure  does  not  affect  either  taxpayer  or  government  behavior  when  <  1  - 

Figures  5  and  6  illustrate  the  behavior  of  the  government  and  the  taxpayers  both  pre-FIN  48 
and  post-FIN  48.  The  figures  are  similar,  in  that  each  features  a  threshold  value,  x*,  for  which 
post-FIN  48  all  x  >  x*  enter  into  the  transaction  and  the  government  does  not  audit.  For  all  x  < 
X*,  the  taxpayer  enters  into  the  transaction  (and  the  government  audits)  post-FIN  48  when  V  is 
high  and  does  not  enter  into  the  transaction  when  V  is  low.  The  only  difference  between  Figures 
5  and  6  is  that  the  threshold  value  x*  is  less  than  X5  in  Figure  5  but  is  equal  to  x^  in  Figure  6.  This 
difference  is  driven  by  audit  costs.  In  Figure  5,  some  taxpayers  enter  into  the  transaction  despite 
disclosing  a  FIN  48  liability.  In  Figure  6,  audit  costs  are  low  enough  that  taxpayers  are  deterred 
from  entering  into  the  transaction  unless  they  have  no  FIN  48  liability.  We  interpret  these  figures 
when  discussing  our  insights  in  Section  V. 
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FIGURE  6 

Comparing  Equilibria  Pre-FIN  48  and  Post-FIN  48  When  1  -  ^  <  1  - 


V.  INSIGHTS  AND  CONTRIBUTIONS 

Our  model  contributes  to  our  understanding  of  FIN  48  by  challenging  commonly  expressed 
beliefs  in  the  business  press  regarding  the  effects  of  FIN  48.  It  also  highlights  the  differences 
between  the  effects  of  mandatory  and  voluntary  disclosure  regimes.  First,  we  show  how  some  of 
our  results  contradict  conjectures  by  the  business  press.  Second,  we  compare  our  results  to  those 
from  the  BDJ  model  of  voluntary  disclosure  in  a  strategic  tax  compliance  setting. 

Challenging  Common  FIN  48  Conjectures 

A  common  conjecture  about  the  effects  of  FIN  48  is  that  the  new  accounting  interpretation 
will  typically  harm  taxpayers  and  aid  the  government  by  providing  a  “road  map”  to  a  taxpayer  s 
controversial  tax  return  filing  positions.  In  discussing  FIN  48  during  an  SEC  speech,  Chester 
Spatt,  Chief  Economist  and  Director  of  the  Office  of  Economic  Analysis  of  the  SEC,  stated  that 
“providing  publicly  more  information  about  the  taxpayer’s  position  on  salient  tax  issues  may 
provide  a  ‘roadmap’  for  the  government  that  may  undercut  the  firm’s  bargaining  power  in  the 
associated  tax  disputes”  (Spatt  2007).“  Similarly,  Allergan,  Inc.’s  comment  letter  to  the  FASB 


''  Available  online  at:  http://www.sec.gOv/news/speech/2007/spch030807css.htm  as  of  July  22,  2010,  Speech  by  SEC 
staff:  “The  Economics  of  FIN  48:  Accounting  for  Uncertainty  in  Income  Taxes.” 
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argues  that  disclosures  of  accrued  liability  “will  provide  a  readily  visible  red  flag  to  all  taxing 
authorities  as  to  the  magnitude  of  potential  audit  issues  for  which  the  taxing  authorities  should  be 
looking.”^^ 

By  comparing  the  behavior  of  taxpayers  and  the  government  pre-  and  post-FIN  48,  we  show 
that  some  taxpayers  that  claim  uncertain  tax  benefits  with  relatively  strong  positions  are  better  off 
post-FIN  48.  The  shaded  regions  in  Figures  5  and  6  are  those  in  which  the  equilibrium  behavior 
changes  due  to  FIN  48.  Consider  Figure  5.  When  x  ^  1  -  and  the  FIN  48  disclosure  reveals 
that  X  is  high,  the  “road  map”  makes  taxpayers  better  off  because  the  government  does  not  audit 
the  transaction  and  makes  the  government  better  off  because  it  does  not  audit  taxpayers  with 
strong  positions.  FIN  48  provides  a  credible  signal  that  the  taxpayer  has  relatively  strong  facts — a 
signal  that  makes  both  parties  better  off.  The  effects  of  FIN  48  are  more  subtle  when  1  -  -~ 
<  X  <  1  -  and  V  <  2c  —  1  If  x  ^  then  the  FIN  48  disclosure  deters  the  taxpayer  from 
engaging  in  the  transaction.  This  makes  the  government  better  off  and  the  taxpayer  worse  off.  On 
the  other  hand,  if  x  >  then  the  taxpayer  is  worse  off  because  it  faces  certain  audit,  but  the 

government  is  better  off  because  the  government  conditions  its  audit  on  the  FIN  48  disclosure. 
Figure  6  is  similar  to  Figure  5;  the  only  difference  is  the  size  of  the  regions.  Specifically,  the  size 
of  the  shaded  areas  in  which  both  parties  are  better  off  increases  and  the  size  of  the  shaded  areas 
in  which  the  government  is  better  off  but  the  taxpayer  is  either  indifferent  or  worse  off  decreases. 

In  contrast.  Figure  3  reveals  that  the  behavior  of  neither  the  taxpayer  nor  the  government 
changes  when  audit  costs  are  sufficiently  low.  When  X5  <  1  -  most  taxpayers  disclose  D  =  0, 
either  because  they  do  not  engage  in  the  transaction  or  because  x  >  X5.  Because  most  taxpayers 
make  the  same  FIN  48  disclosure,  it  is  of  little  use  as  a  signal.  Conversely,  taxpayers  with  D  >  0 
were  being  audited  in  the  pre-FIN  48  regime,  so  the  disclosure  of  their  relatively  weak  facts  has 
no  effect  on  their  probability  of  being  audited. 

Finally,  we  consider  the  effects  of  FIN  48  when  c  >  There  are  two  possibilities,  which 
we  explain  but  do  not  illustrate  in  a  figure.  If  c  >  then  auditing  is  too  costly  in  either  the 
pre-FIN  48  or  post-FIN  48  environments.  In  each  case,  every  taxpayer  engages  in  the  transaction 
and  the  government  does  not  audit.  If  ^  <  c  <  ^,  then  auditing  is  too  costly  pre-FIN  48  but 
not  necessarily  post-FIN  48  because  the  government  can  condition  its  audit  decision  on  the 
taxpayer’s  FIN  48  disclosure.  In  absence  of  a  FIN  48  disclosure,  the  government  lacks  a  credible 
threat  to  audit  any  taxpayer  if  it  would  be  unwilling  to  audit  the  average  taxpayer  (x=  |).  With  a 
FIN  48  disclosure,  the  government  is  weakly  better  off  because  it  audits  some  taxpayers  that  do 
not  have  strong  facts  and  deters  other  taxpayers  from  claiming  the  uncertain  tax  position.  Taxpay¬ 
ers  are  weakly  worse  off. 

Overall,  we  see  that  FIN  48  can  make  the  government  better  off  but  never  makes  it  worse  off. 
The  FIN  48  “road  map”  is  weakly  valuable  to  the  government.  But  the  road  map  can  also  be 
valuable  to  the  taxpayer,  deterring  audits  with  low  expected  payoffs  to  the  government.  Of  course, 
the  road  map  can  also  make  the  taxpayer  worse  off,  either  by  deterring  the  taxpayer  from  entering 
into  a  transaction  with  uncertain  tax  benefits  or  by  subjecting  the  taxpayer  to  a  higher  audit 
probability. 

A  second  common  conjecture  is  that  FIN  48  may  cause  many  issuers  of  GAAP  financial 
statements  to  significantly  overstate  their  ultimate  tax  liabilities  (BNA  2006).  Critics  of  FIN  48 
note  that  because  the  recognition  and  measurement  process  of  FIN  48  assumes  that  the  govern¬ 
ment  will  audit  a  particular  tax  return  filing  position,  the  departure  from  expected  value  will  cause 
the  recorded  liability  to  be  greater  than  the  expected  tax  payment.  J.  Howard  Stecker,  the  V.P.  and 


Available  online  at:  http://www.fasb.org/jsp/FASB/CommentLetter_C/CommentLetterPage?project_id=  1215-001  as  of 
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Chief  Tax  Officer  of  Prudential  Financial,  wrote  a  comment  letter  to  the  FASB  stating  that  the 
proposed  interpretation  “will  have  the  effect  of  consistently  overstating  the  tax  accruals  for  un¬ 
certain  tax  positions  [relative  to  the  responsible  professional  judgment  under  FAS  5]  in  direct 
conflict  with  the  conceptual  framework  of  the  Board.” 

Recall  that  x  is  the  expected  value  of  the  tax  benefit  associated  with  an  uncertain  tax  benefit 
that  will  be  retained  upon  audit,  a  is  the  probability  of  being  audited,  and  D  is  the  difference 
between  the  “as  filed”  tax  benefit  of  1  and  the  median  tax  benefit  retained  on  audit.  Therefore,  the 
expected  tax  liability  is  a;(l  —  x);  however,  the  disclosed  FIN  48  liability,  D,  is  either  1,  0,  or  1  — 
m(x).  We  denote  a  FIN  48  liability  as  either  overstated  when  the  disclosed  liability  exceeds  the 
expected  liability,  understated  when  the  expected  liability  exceeds  the  disclosed  liability,  or  as 
correctly  stated  when  the  two  are  equal. 

When  D  =  0,  the  disclosed  liability  is  correctly  stated  when  or  =  0  and  is  understated 
otherwise.  When  D  =  1,  the  disclosed  liability  is  overstated.  There  are  two  factors  that  determine 
whether  the  disclosed  liability  is  overstated  or  understated  when  D  =  \  —  m(x);  the  distribution  of 
expected  tax  benefit  retained  upon  audit  and  the  probability  of  audit.  We  explore  both  factors  in 
turn.  First,  the  implications  of  the  distribution  of  the  expected  tax  benefit  retained  on  audit  can  be 
examined  by  comparing  the  expected  liability,  1  —  x,  to  the  disclosed  liability,  1  —  m(x),  condi¬ 
tional  upon  audit  (i.e.,  when  a  =  1).  These  two  liabilities  can  differ  because  the  median  retained 
tax  benefit,  m(x),  can  be  higher  or  lower  than  the  mean  retained  tax  benefit,  x.  Second,  the 
implications  of  the  probability  of  an  audit  can  be  addressed  by  comparing  the  expected  liability, 
a(l  -  x)  to  the  disclosed  liability,  1  -  m(x).  The  expected  liability  conditional  upon  audit,  1  -  x, 
decreases  when  a  <  1.  However,  a  <  I  does  not  change  the  amount  of  the  disclosed  liability 
because  FIN  48  does  not  incorporate  audit  probabilities  into  the  measurement  process.  Thus,  when 
the  disclosed  liability,  1  -  m(x),  exceeds  the  expected  liability  conditional  upon  audit,  I  -  x,  a  < 
1  exacerbates  the  amount  by  which  the  disclosed  liability  is  overstated.  However,  when  the 
expected  liability  conditional  upon  audit,  1  -  x,  exceeds  the  disclosed  liability,  1  -m(x),  a  <  1 
results  in  a  disclosed  liability  that  could  be  higher,  lower,  or  equal  to  the  expected  liability, 
depending  on  the  equilibrium  value  of  a. 

Comparison  to  Voluntary  Disclosure  Research 

BDJ  analyze  the  effect  of  voluntary  taxpayer  disclosure  on  the  strategic  interaction  between 
the  taxpayer  and  the  government.  Specifically,  BDJ  model  the  effect  of  allowing  the  government 
to  waive  a  penalty  if  the  taxpayer  discloses  the  existence  of  a  transaction  that  generates  uncertain 
tax  benefits.  An  important  objective  of  BDJ  is  to  determine  the  conditions  under  which  taxpayer 
disclosure  would  occur.  The  disclosure  in  BDJ,  if  made,  truthfully  reveals  the  existence  of  the 
transaction  rather  than  the  strength  of  the  taxpayer’s  position;  thus,  in  equilibrium,  the  decision  to 
disclose  provides  only  a  coarse  signal  regarding  the  taxpayer’s  private  information.  BDJ  show  that 
the  disclosure  can  benefit  both  sides  of  the  transaction  because  the  taxpayer  can  avoid  penalties, 
whereas  the  government  enjoys  lower  audit  costs. 

In  contrast,  we  model  mandatory  disclosure  such  that  examining  the  conditions  under  which 
disclosure  occurs  is  not  relevant.  An  important  difference  of  mandatory  disclosure  is  that  the 
disclosure  itself  does  not  directly  affect  the  payoffs  associated  with  the  players’  possible  actions; 
recall  voluntary  disclosures  are  made  in  BDJ  to  obtain  penalty  reductions.  Furthermore,  the  man¬ 
datory  disclosure  pursuant  to  FIN  48  allows  the  government  to  learn  the  strength  of  the  taxpayer’s 
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uncertain  tax  position  directly,  instead  of  inferring  it  from  the  equilibrium  disclosure  decisions  of 
all  taxpayers. 

Although  the  voluntary  and  mandatory  disclosure  settings  feature  important  institutional  dif¬ 
ferences,  the  effects  of  disclosure  in  each  setting  have  both  similarities  and  differences.  Taxpayers 
with  sufficiently  strong  facts  (low  p  in  BDJ,  high  jc  in  our  study)  do  not  change  their  behavior 
when  disclosure  is  introduced  in  either  the  voluntary  or  mandatory  disclosure  settings,  but  actually 
benefit  from  the  possibility  of  disclosure  in  each  setting  because  they  are  audited  less  than  if  no 
disclosure  was  possible.  In  BDJ,  taxpayers  with  strong  facts  claim  the  uncertain  tax  benefit  and  do 
not  disclose  the  transaction  (i.e.,  the  expected  value  of  the  penalty  reduction  is  so  low  given  the 
strength  of  their  position  that  it  does  not  outweigh  the  possibility  of  providing  a  roadmap  to  the 
government).  However,  the  fact  that  other  taxpayers  do  disclose  induces  the  government  to  audit 
taxpayers  with  strong  facts  less  frequently.  In  the  mandatory  disclosure  setting,  it  is  the  direct 
disclosure  of  the  strong  taxpayers’  facts  that  deters  the  government  from  auditing  such  taxpayers. 

Some  taxpayers  with  weaker  facts  do  change  their  behavior  when  disclosure  is  introduced,  but 
in  quite  different  ways  in  the  two  settings.  When  disclosure  is  voluntary,  the  possibility  of  disclo¬ 
sure  induces  some  of  these  taxpayers  to  claim  the  tax  benefit  and  disclose  the  transaction;  the 
possibility  of  disclosure  makes  these  taxpayers  better  off.  In  contrast,  taxpayers  with  weaker  facts 
are  weakly  worse  off  in  the  mandatory  disclosure  setting,  as  their  disclosure  induces  weakly  more 
aggressive  auditing  from  the  government.  The  difference  in  the  two  settings  is  that  the  government 
must  give  up  something  (a  penalty  waiver)  in  the  voluntary  setting  in  order  to  induce  the  taxpayer 
to  disclose.  The  government  does  not  give  up  anything  to  get  the  disclosure  in  the  mandatory 
disclosure  setting.  Thus,  taken  together,  BDJ  and  our  study  create  a  cohesive  understanding  of  the 
effects  of  voluntary  and  mandatory  disclosure  on  strategic  tax  compliance. 

VI.  CONCLUSION 

We  investigate  how  the  strategic  interaction  between  publicly  traded  corporate  taxpayers  and 
the  government  changes  as  a  result  of  a  new  accounting  interpretation,  FIN  48,  which  requires 
these  taxpayers  to  disclose  liabilities  in  public  financial  statements  for  uncertain  tax  benefits 
claimed  on  a  tax  return.  Our  model  provides  two  interesting  insights  that  could  not  be  gleaned 
from  a  cursory  examination  of  FIN  48. 

First,  we  show  that  taxpayers  are  not  necessarily  harmed  by  FIN  48.  In  fact,  we  find  that 
taxpayers  with  strong  positions  have  higher  expected  payoffs  post-FIN  48.  Although  the  disclosed 
liability  does  provide  a  roadmap  in  our  simple  one-position,  one-jurisdiction  model,  mandatory 
disclosure  permits  taxpayers  with  strong  facts  to  benefit  through  truthful  reporting  of  little  or  no 
liability.  This  positive  effect  of  FIN  48  for  some  taxpayers  has  not  been  documented  to  date.  In 
addition,  taxpayers  with  weaker  facts  are  not  necessarily  harmed  by  FIN  48,  even  when  the 
disclosure  provides  a  “perfect  roadmap”  for  the  government.  This  observation  results  from  the 
nuanced  relation  between  x  and  c,  whereby  uncertain  tax  benefits  claimed  cannot  be  extracted 
from  taxpayers  except  through  a  costly  audit,  settlement,  and  collection  process. 

Second,  we  find  that  the  liability  disclosed  under  FIN  48  can  be  overstated  or  understated 
relative  to  the  expected  cash  payment.  In  contrast.  International  Financial  Reporting  Standards 
(IFRS)  differ  from  U.S.  Generally  Accepted  Accounting  Principles  (GAAP)  by  using  an  expected 
value  approach,  which  differs  from  the  two-step  recognition  and  measurement  process  under 
GAAP.  If  expected  values  are  more  relevant  to  shareholders,  then  convergence  toward  IFRS 
standards,  accompanied  by  disclosure,  could  potentially  improve  financial  reporting. 

Our  model  imposes  certain  simplifying  assumptions.  Critical  among  these  is  truthful  reporting 
of  the  FIN  48  liability,  because  we  assume  that  independent  auditors  and  regulators  assure  such 
reporting  in  the  post-Sarbanes-Oxley  period.  If  independent  auditors  assure  such  truthful  reporting 
and  if  the  IRS  relies  upon  it,  then  one  effect  of  FIN  48  appears  to  be  the  substitution  of  indepen- 
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dent  auditors  for  the  government.  That  is,  if  the  independent  auditors  decide  that  the  tax  position 
is  so  strong  that  that  all  the  benefit  can  be  recognized  and  no  liability  recorded,  then  the  IRS  can 
forgo  examination.  Because  all  public  corporations  receive  independent  audits  of  their  GAAP 
financial  statements  but  the  IRS  does  not  have  the  budget  to  examine  all  taxpayers,  shifting  this 
role  could  increase  overall  tax  compliance  by  directing  IRS  attention  to  taxpayers  with  the  weakest 
cases  and  deterring  some  aggressive  tax  claims.  To  the  extent  that  mandatory  disclosure  and 
post-SOX  auditing  rigor  requires  independent  auditors  to  evaluate  the  merits  of  uncertain  tax 
benefits,  government  audit  costs  could  decrease  or  government  audits  will  be  t^u•geted  more  ef¬ 
fectively.  However,  there  may  be  a  higher  deadweight  social  cost  due  to  a  higher  independent 
audit  costs.  Future  analytical  work  could  consider  incentives  and  constraints  for  taxpayers  to 
falsify  their  disclosures. 

We  believe  our  model  also  has  various  testable  empirical  implications.  Over  time,  researchers 
could  study  whether  the  IRS  more  frequently  audits  returns  with  high  disclosed  liabilities  relative 
to  claimed  tax  benefits  or  whether  FIN  48  reduces  the  amount  of  tax  planning  done  in  large  public 
companies.  In  the  short  run,  experiments  and  field  studies  could  contribute  evidence  to  test  some 
of  the  implications  of  our  model. 


APPENDIX  A 

Proof  of  Proposition  1  (Pre-FIN  48) 

(a)(i)  Not  auditing  when  a  =  A  is  a  dominant  strategy  for  the  government  because  c  >  0. 
(a)(ii)  The  government  will  audit  when  a  =  Y  ii  £[— v  +  7r(l  —  x)  —  c]  >  —  1.  Given  that 
only  taxpayers  with  x  >  choose  a  =  Y,  E[x  \a=Y]-  ^2+277'^-  Therefore,  the  gov¬ 
ernment’s  expected  payoff  from  auditing  when  a  =  Fis  ^-c>-l  because  V  >  2c 
—  1 .  Therefore,  the  government  audits  all  tax  returns  of  a  =  F. 

(a)  (hi)  A  taxpayer  of  type  x  will  choose  a  =  F  if  and  only  if  the  expected  payoff  is  greater 

than  0,  the  payoff  from  choosing  a  =  N.  Given  that  the  government  audits  all  tax 
returns  of  a  =  F,  the  taxpayer’s  expected  payoff  from  reporting  a  =  Y  is  E\y  +  x  - 
tt{\  -  x)]  >  0  for  all  x  ^  Therefore,  taxpayers  for  whom  x  >  ^  choose  a  =  Y, 
and  other  taxpayers  choose  a  =  N. 

(b) (i)  Same  argument  as  in  (a)(i). 

(b)(ii)  Given  that  only  taxpayers  with  x  >  1  -  choose  a  =  Y,  E[x  |  a  =  F]  =  .  The 

government  is  indifferent  between  auditing  and  not  auditing  tax  returns  of  a  =  F 
because  E[-x  -f  7r(l  -  x)  -  c]  =  -1.  Therefore,  the  government’s  strategy  of 
auditing  tax  returns  of  a  =  F  with  probability  a  =  ^  is  a  best  response.  Because  V  < 
2c  —  1,  Of  is  bounded  above  by  1  and  because  V  >  —1,  a  is  bounded  below  by  0. 
(b)(iii)  A  taxpayer  of  type  x  will  choose  a  =  F  if  and  only  if  the  expected  payoff  is  greater 
than  0,  the  payoff  from  choosing  a  =  N.  Given  that  the  government  audits  when  a  = 
Y  with  probability  the  expected  payoff  from  choosing  a  =  F  is  a[V  +  x 

(1+V)[2c-(1+17)(1-x)]  2r  ^ 

-7r(l-x)]-l-(l-Q;)[F+l]  = - - ^0  for  all  x>l-^.  Because  c 

<  ^  the  cutoff- value  for  x  is  greater  than  QED 

Proof  of  Proposition  2  (Post-FIN  48:  x  >  X5) 

(a)(i)  Not  auditing  when  a  =  A  is  a  dominant  strategy  for  the  government  because  c  >  0. 
(a)(ii)  The  government  will  audit  when  a  =  F  if  ^[-x  +  7r(l  -  x)  -  c]  >  -1.  Given  all 
taxpayers  for  whom  x>f^  choose  a  =  Y,  the  government’s  expected  payoff  from 

auditing  when  a  =  F  is  ^  \  >  - 1  because  A  >  2c  —  1,  and  £[x  |  x  s  =  2(1+7^)  • 

Therefore,  the  government  audits  whenever  a  =  Y. 
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(a) (iii)  A  taxpayer  of  type  x  will  choose  a  =  7  if  and  only  if  the  expected  payoff  is  greater 

than  0,  the  payoff  from  choosing  a  =  N.  Given  that  the  government  audits  all  tax 
returns  of  a  =  Y,  the  expected  payoff  to  the  taxpayer  when  a  =  F  is  E[V  +  x  —  7r(l 
—  x)]  >  0  for  all  X  > 

(b) (i)  Same  argument  as  in  (a)(i). 

(b)(ii)  Given  that  only  taxpayers  for  whom  x  >  1  -  ~  choose  a  =  Y,  E[x  |  a  =  F]  =  ^7^- 
The  government  is  indifferent  between  auditing  and  not  auditing  tax  returns  of  a  =  F 
because  E[—x  +7r(l  —  x)  —  c]=— 1.  Therefore,  the  government’s  strategy  of 
auditing  tax  returns  of  a  =  F  with  probability  a  =  ^  is  a  best  response.  Because  V  < 
2c  —  1,  Q!  is  bounded  above  by  1  and  because  V  >  —1,  is  bounded  below  by  0. 
(b)(iii)  A  taxpayer  of  type  x  will  choose  a  =  F  if  and  only  if  the  expected  payoff  is  greater 
than  0,  the  payoff  from  choosing  a  =  N.  Given  that  the  government  audits  tax  returns 
of  a  =  F  with  probability  a  =  the  expected  payoff  from  reporting  a  =  F  is 


a;[V  +  x-7r(l  -x)]  +  (l  -  a)[V  +  1]  = 


(l+V)[2c-(l+7r)(l-;c)] 

2c 


^  0  for  all  X  i 


1-^ 

^  1  +  T7- 


The 


cutoff-value  for  x  is  greater  than  X5  because  X5  <  1  -  QED 


Proof  of  Proposition  3  (Post-FIN  48:  ^  <  x  <  X5) 

(a)(i)  Not  auditing  when  a  =  A  is  a  dominant  strategy  for  the  government  because  c  >  0. 
(a)(ii)  The  government  will  audit  when  a  =  F  if  £■[— x  +  7r(l  —  x)  -  c]  >  —1.  Auditing  all 
tax  returns  of  a  =  F  is  a  best  response  by  the  government  because  x  < 

(a) (iii)  A  taxpayer  of  type-x  will  choose  a  =  F  if  and  only  if  the  expected  payoff  is  greater 

than  0,  the  payoff  from  choosing  a  =  N.  Given  that  the  government  audits  all  tax 
returns  of  a  =  F,  the  expected  payoff  from  choosing  a  =  FisV  +  x—  tt{1  —  x)>0, 
which  is  positive  if  and  only  if  x  > 

(b) (i)  Same  argument  as  in  (a)(i).  The  government  will  audit  a  tax  return  of  a  =  F  if  77(1  — 

x)—  X  —  c>— 1.  Not  auditing  when  a  =  F  is  a  best  response  by  the  government 
because  x  ^ 

(b)(ii)  Given  that  the  government  does  not  audit  any  tax  returns,  all  taxpayers  prefer  a  =  F  to 
a  =  N.  QED 


APPENDIX  B 

To  model  the  measurement  processes  of  FIN  48,  we  introduce  the  function  m(x),  which 
represents  the  median  tax  benefit  retained  upon  audit,  given  x.  The  uncertain  tax  benefit  recog¬ 
nized  under  FIN  48  is  equal  to  1  —  m{x).  We  make  no  assumption  about  whether  m{x)  is  greater 
or  less  than  x;  medians  can  be  above  or  below  means.  We  assume  that  m{x)  is  continuous,  m{0)  = 
0,  and  m(i)  =  1.  We  also  assume  that  m'(x)  >  0  for  all  m(x)  <  1.  Below,  we  provide  an  example 
of  a  probability  distribution  of  audit  outcomes  that  is  consistent  with  all  of  these  assumptions. 

Suppose  a  firm  with  private  information  x  retains  a  tax  benefit  y  that  is  equal  to  1  with 
probability  p{x)=x^,  and  drawn  from  a  distribution  /(y)  =x[y^"’] ,  0<y<l  otherwise.  This  dis¬ 
tribution  has  the  following  properties.  First: 


x[y^  =  1 . 


which  is  required  for  a  valid  probability  density  function.  Second: 
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p{x)  +  [1  -/?(jc)]  xy[y^  ^]dy=x, 

Jo 

so  the  expected  retained  tax  benefit  upon  audit  is  equal  to  the  value  of  the  taxpayer’s  private 
information.  Third,  the  median  of  the  distribution,  m(x),  is  the  value  of  y  that  solves: 

rm(x)  , 

Jo  J 

which  has  a  solution  of  m{x)  =  [2(1-  x^y\~^'^,  1  /  2  <  x  <  ^  /  2,  and  m(x)  =  1  if  V2  /  2  <  x  <  1 , 
which  has  all  the  assumed  properties  of  the  m(x)  function. 
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ABSTRACT:  We  conduct  an  experiment  to  investigate  whether  the  effects  of  an  SAS 
No.  90-like  quality  assessment  standard  (QAS)  on  auditors’  decision  to  accept  a  client- 
preferred  accounting  method  is  jointly  moderated  by  the  nature  of  advice  from  an  audit 
firm’s  technical  department  and  the  strength  of  the  client’s  justification.  Results  indicate 
that,  in  the  absence  of  advice,  the  presence  of  QAS  does  not  affect  auditors’  decision, 
regardless  of  client  justification  strength.  However,  in  the  presence  of  advice  from  the 
technical  department,  the  presence  of  QAS  significantly  reduces  auditors’  propensity  to 
accept  the  client-preferred  method,  but  only  when  the  advice  explicitly  recommends  the 
use  of  the  most  appropriate  method  and  the  client’s  justification  is  strong.  These  find¬ 
ings  demonstrate  the  complementary  roles  that  professional  standards  and  audit  firms’ 
technical  departments  play  in  enhancing  the  quality  of  auditors’  decisions,  and  indicate 
that  the  nature  of  advice  matters. 

Keywords:  advice;  quality  assessment;  justification;  auditor  judgment. 

Data  Availability:  Contact  the  authors. 

I.  INTRODUCTION 

Improving  the  quality  of  accounting  information  provided  to  market  participants  has  been  a  key 
focus  of  the  accounting  profession  and  regulators  in  recent  years,  more  so  following  several 
widely  publicized  accounting  scandals  and  audit  failures.  Among  other  measures,  standards 
have  been  introduced,  both  in  the  U.S.  (e.g.,  SAS  No.  90,  AICPA  1999)  and  internationally  (e.g., 
ISA  260,  lAASB  2009a),  to  require  auditors  to  discuss  with  listed  companies’  audit  committees 
their  judgments  about  the  appropriateness  or  quality,  not  just  acceptability,  of  the  accounting 
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principles  and  estimates  in  audited  financial  statements.  Implicit  in  this  requirement  is  the  expec¬ 
tation  that,  while  there  may  be  different  methods  to  account  for  a  transaction  under  generally 
accepted  accounting  principles  (GAAP),  to  enhance  the  quality  of  financial  reporting,  the  most 
appropriate  accounting  method  that  best  reflects  the  economic  substance  of  the  transaction  should 
be  used.  However,  prior  research  suggests  that  despite  such  a  requirement,  auditors  continue  to 
accept  client-preferred  accounting  methods  even  when  they  may  not  be  the  most  appropriate  (e.g., 
Kadous  et  al.  2003;  Shafer  et  al.  2004).  Thus,  understanding  factors  that  affect  auditors’  decisions 
on  clients’  accounting  methods  remains  an  important  research  question. 

The  theory  of  motivated  reasoning  suggests  that  individuals  are  prone  to  biased  reasoning  in 
favor  of  a  directional  goal,  subject  to  “reasonableness  constraints”  (Kunda  1990).  In  this  study,  we 
investigate  whether  the  effect  of  an  SAS  No.  90-like  standard  requiring  auditor  assessment  of  the 
quality,  not  just  acceptability,  of  a  client’s  accounting  methods  (hereafter  referred  to  as  “quality 
assessment  standard”  or  “QAS”)  on  auditors’  decisions  is  contingent  on  the  presence  and/or  nature 
of  advice  and  the  strength  of  the  client’s  justification  for  its  preferred  accounting  method.  We 
examine  these  two  factors,  as  they  could  impose  varying  degrees  of  “reasonableness  constraints” 
to  counter  auditors’  directional  goal  of  supporting  client-preferred  accounting  methods.  Providing 
evidence  on  this  issue  sheds  light  on  the  boundary  conditions  under  which  prior  findings  on  the 
efficacy  of  quality  assessment  standards  hold  (e.g.,  Kadous  et  al.  2003),  as  well  as  how  these 
factors  interact  to  jointly  influence  auditors’  proclivity  to  support  client-preferred  accounting 
methods. 

Consultation  with  other  experts  within  the  audit  firm,  including  personnel  from  the  firm’s 
technical  department,  constitutes  an  important  part  of  an  audit  firm’s  quality  control  mechanisms 
(e.g.,  AICPA  2006,  2009;  lAASB  2009b)  and  is  widely  used  for  resolving  ambiguous  or  conten¬ 
tious  accounting  issues  encountered  by  auditors  (e.g.,  Gibbins  and  Emby  1985).  However,  prior 
studies  have  not  systematically  investigated  how  the  presence  or  nature  of  advice  received  by 
auditors  affects  their  decisions,  either  independently  or  jointly  with  other  features  in  the  audit 
environment.^  Research  in  psychology  suggests  that,  although  incorporating  advice  from  others 
generally  increases  decision  accuracy,  individuals  often  underutilize  or  discount  the  advice  re¬ 
ceived  (see  Bonaccio  and  Dalai  [2006]  for  a  review  of  this  literature).  Thus,  providing  evidence  on 
the  extent  to  which  the  presence  or  nature  of  advice  moderates  the  effect  of  QAS,  by  either 
tightening  or  loosening  the  reasonableness  constraints,  contributes  to  a  better  understanding  of  the 
role  and  limitations  of  consultations  in  enhancing  auditors’  decisions. 

In  addition,  we  investigate  whether  the  strength  of  the  client’s  justification  for  its  preferred 
accounting  method  further  moderates  the  joint  effects  of  QAS  and  advice,  given  that  firms  with 
incentives  to  adopt  aggressive  accounting  methods  would  likely  attempt  (or  be  required  by  their 
auditors)  to  justify  their  accounting  method  choices.  To  the  extent  that  the  reasonableness  con¬ 
straints  on  auditors’  proclivity  to  support  client-preferred  methods  are  loosened  by  strong  justifi¬ 
cations  provided  by  the  client,  the  incremental  role  and  impact  of  QAS  and  advice  may  increase. 

To  provide  evidence  on  the  above  issues,  we  conduct  a  two-stage  experiment  with  auditors 
from  two  Big  4  firms  in  Singapore,  where  SAS  No.  90-like  standards  have  not  been  implemented 
at  the  time  of  the  experiment.^  In  the  first  stage,  participants  decide  whether  to  accept  a  client- 


'  Salterio  (1996)  and  Salterio  and  Koonce  (1997)  examine  a  context  where  auditors  need  to  make  their  own  judgment 
about  the  appropriateness  of  client’s  accounting  treatments  based  on  precedents  received  from  the  firm’s  central  research 
unit  (i.e.,  no  specific  advice  on  the  appropriateness  of  the  accounting  issue  is  provided).  Perkins  (2003)  examines  the 
effects  of  accountability  type  and  justifiee  preference  on  tax  accountants’  informal  consultation  and  documentation 
stylization  behaviors,  but  not  how  advice-recipients’  decisions  are  influenced  by  advice  received. 

^  We  employ  a  two-stage  experiment  to  capture  participants’  pre-  and  post-advice  decisions  so  as  to  assess  how  auditors’ 
decisions  are  influenced  by  the  advice  provided.  Bonaccio  and  Dalai  (2006)  argue  that  such  a  design  should  be  used. 
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preferred  accounting  method  relating  to  a  new  sales  contract  in  the  absence  of  any  advice.  We 
manipulate,  between-subjects,  the  presence  of  QAS  and  the  strength  of  the  client’s  justification  for 
its  preferred  accounting  method.  This  stage  allows  us  to  determine  the  effects  of  QAS  and  client 
justification  strength  in  the  absence  of  advice,  and  provides  a  baseline  for  assessing  the  incremen¬ 
tal  effects  of  the  advice  subsequently  provided. 

In  the  second  stage,  participants  make  the  same  decision  on  whether  to  accept  the  client- 
preferred  method  after  receiving  advice  from  the  audit  firm’s  technical  department.^  The  advice 
reduces  the  ambiguity  surrounding  the  accounting  issue  by  clarifying  that,  while  the  client- 
preferred  method  is  an  acceptable  method  under  GAAP,  an  alternative,  less  aggressive  method  is 
the  most  appropriate  method.  This  stage  allows  us  to  examine  the  extent  to  which  auditors  would: 
(1)  continue  to  support  the  client-preferred  method,  given  that  it  has  been  identified  by  the  tech¬ 
nical  department  to  be  an  acceptable,  albeit  not  the  most  appropriate,  method  under  GAAP;  or  (2) 
reject  the  client-preferred  method  in  favor  of  the  most  appropriate  method  identified  by  the  tech¬ 
nical  department."^’^  In  addition,  we  manipulate  the  nature  of  the  advice,  by  either  including  or  not 
including  an  explicit  recommendation  to  use  the  most  appropriate  method,  to  investigate  whether 
providing  an  explicit  recommendation  has  an  incremental  effect  in  reducing  auditors’  propensity  to 
support  the  client-preferred  method  (as  it  further  tightens  the  reasonableness  constraints)  beyond 
information  provided  in  the  advice  regarding  the  relative  quality  of  the  client’s  method.^ 

Results  indicate  that,  in  the  absence  of  any  advice  from  the  technical  department,  auditors’ 
propensity  to  accept  the  client-preferred  method  is  significantly  influenced  by  the  strength  of  the 
client’s  justification  but  not  by  the  presence  of  QAS.  However,  in  the  presence  of  the  technical 
department’s  advice  indicating  that  an  alternative  method  is  the  most  appropriate  method,  we  find 
a  significant  three-way  interaction  between  presence  of  QAS,  nature  of  the  advice,  and  client 
justification  strength.  Specifically,  when  the  client’s  justification  is  weak,  auditors’  propensity  to 
accept  the  client- preferred  method  is  not  affected  by  the  presence  of  QAS,  regardless  of  the  nature 
of  the  technical  department’s  advice.  However,  when  the  client’s  justification  is  strong,  auditors’ 
propensity  to  accept  the  client-preferred  method  decreases  in  the  presence  of  QAS,  but  only  when 


where  possible,  to  facilitate  the  evaluation  of  advice  utilization.  Importantly,  this  design  replicates  what  often  happens 
in  practice.  Discussions  with  the  participating  firms’  technical  department  staff  indicate  that  auditors  who  are  seeking 
consultation  on  an  issue  are  required,  wherever  possible,  to  state  their  preliminary  position  on  the  issue  being  consulted. 
Hence,  auditors  generally  make  their  judgments  twice,  both  before  and  after  receiving  advice  from  the  technical 
department. 

^  Although  auditors  could  potentially  seek  and  receive  advice  from  various  sources,  we  examine  the  context  where  advice 
is  sought  and  provided  by  the  firm’s  technical  department  given  that  prior  research  indicates  that  audit  firms’  technical 
departments  serve  as  an  important  source  of  information  and  consultation  for  auditors  (Gibbins  and  Emby  1985;  Danos 
et  al.  1989;  Salterio  and  Denham  1997).  The  audit  firm’s  technical  department  may  also  be  referred  to  as  the  accounting 
consultation  unit,  professional  practice  department,  professional  standards  group,  etc.  Salterio  and  Denham  (1997) 
describe  the  common  structure  and  features  of  these  technical  departments. 

We  do  not  examine  the  context  where  the  advice  indicates  that  the  client-preferred  method  is  not  an  acceptable  method 
under  GAAP,  since  we  expect  few,  if  any,  auditors  would  accept  an  accounting  method  that  clearly  violates  GAAP. 
Moreover,  this  context  is  less  germane  for  testing  the  efficacy  of  QAS,  since  the  intent  of  such  a  standard  is  to  encourage 
auditors  to  consider  the  appropriateness,  not  just  acceptability,  of  clients’  accounting  methods,  particularly  in  situations 
where  there  are  alternative  methods  that  are  acceptable  under  GAAP. 

^  Our  study  examines  a  context  where  advice  issued  by  the  audit  firm’s  technical  department  is  non-binding  (e.g., 
Kennedy  et  al.  1997).  Based  on  our  discussions  with  the  technical  department  staff  of  the  participating  firms,  in  general, 
the  technical  department  personnel  provide  their  opinion  on  the  accounting  issue  under  consultation,  and  leave  the  final 
decision  to  the  discretion  of  the  audit  engagement  partner. 

^  The  technical  department  staff  of  both  the  participating  firms  indicate  that  in  situations  where  there  are  more  than  one 
accounting  methods  that  are  acceptable  under  GAAP  for  a  transaction  undertaken  by  the  client,  the  technical  department 
would,  when  consulted  on  the  issue,  communicate  the  acceptability  of  the  client-preferred  method  under  GAAP,  identify 
the  method  deemed  most  appropriate,  and  frequently  but  do  not  always  explicitly  recommend/suggest  that  the  client 
should  adopt  the  most  appropriate  accounting  method. 
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the  advice  includes  an  explicit  recommendation  that  the  client  should  adopt  the  most  appropriate 
method.  These  results  suggest  that  the  effect  of  QAS  is  jointly  moderated  by  the  nature  of  advice 
received  and  the  strength  of  the  client’s  justification. 

Our  study  contributes  to  the  literature  in  several  ways.  First,  our  study  adds  to  an  important 
stream  of  research  that  investigates  factors  affecting  auditors’  propensity  to  support  client- 
preferred  accounting  methods  (e.g.,  Trompeter  1994;  Flackenbrack  and  Nelson  1996;  Salterio 
1996;  Salterio  and  Koonce  1997;  Jenkins  and  Haynes  2003;  Kadous  et  al.  2003)  by  demonstrating 
how  such  decisions  can  be  jointly  influenced  by  both  intra-  and  extra-firm  quality  control  mecha¬ 
nisms  as  well  as  the  strength  of  clients’  justifications.  More  specifically,  our  study  extends  prior 
research  on  the  efficacy  of  SAS  No.  90-like  quality  assessment  standards  (Kadous  et  al.  2003)  by 
providing  evidence  on  factors  that  could  moderate  their  effects.  Our  finding  that  such  a  standard 
reduces  auditors’  propensity  to  accept  a  client-preferred,  strongly  justified  method  only  when  the 
advice  from  the  technical  department  contains  an  explicit  recommendation  demonstrates  the 
complementary  roles  that  professional  standards  and  audit  firms’  technical  departments  play  in 
enhancing  auditors’  decisions.  Second,  our  results  show  that  merely  reducing  the  ambiguity  of  the 
accounting  issue  by  clarifying  the  relative  quality  of  client-preferred  methods  {vis-d-vis  other 
alternative  methods)  may  not  be  sufficient  to  mitigate  auditors’  propensity  to  accept  less  appro¬ 
priate  client-preferred  methods,  even  in  the  presence  of  QAS.  Additional  steps  such  as  the  tech¬ 
nical  department  explicitly  recommending  (or  advocating)  the  use  of  the  most  appropriate  methods 
to  enhance  financial  reporting  quality  may  be  necessary.  This  finding  has  important  implications 
for  audit  firms  in  terms  of  the  nature  of  advice  communicated  by  their  technical  departments,  as 
well  as  for  regulators  in  terms  of  the  potential  need  to  further  clarify  auditors’  and  audit  commit¬ 
tees’  responsibility  for  promoting  the  use  of  the  most  appropriate  accounting  methods.  Last,  we 
add  to  the  broader  literature  on  the  impacts  of  advice  on  individuals’  judgments  and  decisions 
(e.g.,  Sniezek  and  Buckley  1995;  Yaniv  2004;  Bonaccio  and  Dalai  2006)  by  demonstrating  how 
the  influence  of  advice  on  professionals’  decisions  is  contingent  on  its  nature,  strength  of  justifi¬ 
cations  provided  by  others,  as  well  as  the  presence  of  external  constraints  (e.g.,  professional 
standards). 

Section  II  presents  our  hypothesis  development.  Section  III  follows  with  a  description  of  the 
research  methods  used,  and  Section  IV  presents  the  results.  Section  V  concludes  with  a  discussion 
of  the  study’s  main  findings,  limitations,  and  opportunities  for  future  research. 

11.  HYPOTHESIS  DEVELOPMENT 

A  number  of  prior  studies  have  documented  auditors’  proclivity  to  support  client-preferred 
accounting  methods  within  reasonable  limits  (e.g.,  Trompeter  1994;  Hackenbrack  and  Nelson 
1996;  Salterio  1996;  Salterio  and  Koonce  1997;  Jenkins  and  Haynes  2003;  Kadous  et  al.  2003).  In 
the  study  most  closely  related  to  ours,  Kadous  et  al.  (2003)  find  that  the  presence  of  a  quality 
assessment  standard  (QAS)  does  not  mitigate  auditors’  propensity  to  support  an  aggressive  method 
preferred  by  the  client.  Instead,  it  exacerbates  the  influence  of  directional  goal  commitment,  an 
individual  pro-client  motivational  factor,  on  auditors’  decisions.  The  theory  of  motivated  reasoning 
suggests  that  auditors  with  pro-client  directional  goals  are  likely  to  support  their  client-preferred 
method  within  reasonableness  constraints — that  is,  as  far  as  the  decisions  are  justifiable  (Kunda 
1990).  One  would  expect  the  presence  of  QAS  (which  requires  auditors’  assessment  and  audit 
committee  communication  of  the  quality  of  clients’  accounting  methods)  to  impose  a  reasonable¬ 
ness  constraint,  thereby  making  it  less  justifiable  for  auditors  to  support  the  client-preferred 
method.  However,  this  effect  is  not  observed  in  Kadous  et  al.  (2003),  which  we  surmise  could  be 
due  to  one  of  the  following  reasons:  (1)  auditors  may  not  be  aware,  in  the  absence  of  any  advice, 
that  there  is  a  more  appropriate  method,  particularly  given  the  ambiguity  of  the  accounting  issue; 
and  (2)  even  if  auditors  are  aware  that  there  is  a  more  appropriate  method,  they  may  continue  to 
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support  the  client-preferred  method  because  a  QAS-like  standard  does  not  explicitly  mandate 
auditors  to  ensure  that  the  most  appropriate  accounting  method  is  adopted  by  the  client.  Indeed, 
recent  survey  research  in  the  U.S.  suggests  that,  even  in  the  post-SAS  No.  90  environment, 
auditors  indicate  that  they  may  not  insist  on  the  use  of  the  most  appropriate  method  by  the  client, 
as  long  as  the  client-preferred  method  is  acceptable  under  GAAP  (Shafer  et  al.  2004).  Thus,  even 
in  the  presence  of  QAS,  auditors  may  continue  to  support  the  client-preferred  method. 

Kadous  et  al.  (2003)  examine  a  context  in  which  no  advice  on  the  relative  quality  of  the 
client’s  accounting  method  is  available,  and  the  client  provides  a  relatively  weak  justification  for 
their  preferred  method  based  on  simplicity  of  the  method  and  a  desire  to  manage  earnings  expec¬ 
tations  (see  Kadous  et  al.  2003,  768).  In  practice,  auditors  are  likely  to  seek  advice  from  other 
experts  to  help  resolve  ambiguous  and/or  contentious  accounting  issues  (e.g.,  Gibbins  and  Emby 
1985).  Moreover,  a  client  with  incentives  to  adopt  aggressive  accounting  methods  is  likely  to 
provide  strong  (rather  than  weak)  justifications  in  support  of  its  preferred  method.  These  factors 
could  independently  or  jointly  moderate  the  influence  of  QAS  on  auditors’  propensity  to  support 
client-preferred  methods  by  either  tightening  or  loosening  the  reasonableness  constraints  on  audi¬ 
tors’  proclivity  to  support  client-preferred  methods.  Our  study  sheds  light  on  these  issues  by 
examining  whether  the  effect  of  QAS  is  greater  in  the  presence  of  advice  from  the  audit  firm’s 
technical  department  on  the  relative  quality  of  the  client’s  preferred  method,  and  whether  this 
moderating  effect  of  advice  is  contingent  on  the  nature  of  the  advice  and  the  strength  of  the 
client’s  justification  for  its  preferred  method. 

Suppose  that  auditors  seek  and  receive  advice  from  their  firm’s  technical  department  indicat¬ 
ing  that  although  the  client-preferred  method  is  an  acceptable  method  under  GAAP,  an  alternative 
(less  aggressive)  accounting  method  is  the  most  appropriate  method.^  The  indication  of  the  most 
appropriate  method  by  the  technical  department  reduces  the  ambiguity  of  the  accounting  issue  by 
providing  auditors  with  a  benchmark  to  evaluate  against  the  more  aggressive  client-preferred 
method.  This  should  tighten  the  reasonableness  constraints  on  auditors’  proclivity  to  support  the 
client-preferred  method  by  reducing  the  justifiability  of  accepting  a  less  appropriate  method  pre¬ 
ferred  by  the  client.  To  what  extent  will  the  presence  of  the  advice  enhance  the  effect  of  QAS  in 
reducing  auditors’  propensity  to  support  the  client-preferred  method? 

Prior  research  suggests  that  individuals  use  advice  to  enhance  the  justifiability  of  their  deci¬ 
sion  or  share  responsibility  with  others  for  their  decision  rather  than  enhancing  decision  quality 
per  se  (Harvey  and  Fischer  1997;  Kennedy  et  al.  1997),  and  that  even  advice  from  experts  may  not 
be  optimally  used  by  advice-recipients  due  to  egocentric  discounting  (e.g.,  Yaniv  and  Kleinberger 
2000;  Bonaccio  and  Dalai  2006).  Hence,  even  in  the  presence  of  QAS,  auditors  could  selectively 
or  strategically  use  information  in  the  advice  (that  the  client-preferred  method  is  an  acceptable, 
albeit  not  the  most  appropriate,  method  under  GAAP)  to  support  the  client-preferred  method, 
given  a  pro-client  directional  goal.  On  the  other  hand,  if  the  advice  from  the  technical  department 
incorporates  an  explicit  recommendation  that  the  client  should  use  the  most  appropriate  method, 
this  should  further  tighten  the  reasonableness  constraints  on  auditors’  motivated  reasoning  by 
making  it  even  less  justifiable  to  accept  a  less  appropriate  client-preferred  method,  since  doing  so 
would  imply  ignoring  the  explicit  recommendation  provided  by  the  firm’s  technical  department. 
Thus,  the  presence  of  the  technical  department’s  advice  is  more  likely  to  enhance  the  effect  of 
QAS  in  reducing  auditors’  propensity  to  support  the  client-preferred  method  when  that  advice 
contains  an  explicit  recommendation  to  use  the  most  appropriate  method  than  when  it  does  not. 


’  Consistent  with  the  general  advice  literature  (see  Gibbons  2003;  Bonaccio  and  Dalai  2006),  we  use  the  term  “advice” 
broadly  to  include  one  that  provides  information  about  different  alternatives,  suggests  new  alternatives,  and/or  recom¬ 
mends  a  specific  course  of  action  (Gibbons  2003;  Bonaccio  and  Dalai  2006).  Thus,  advice  may  or  may  not  contain  an 
explicit  recommendation. 
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However,  we  argue  that  the  moderating  effect  of  advice  nature  on  the  effect  of  QAS  discussed 
above  is  likely  to  be  greater  (or  more  observable)  when  the  client’s  justification  for  its  preferred 
method  is  strong  than  when  it  is  weak.  Psychology  research  suggests  that  when  people  are  moti¬ 
vated  to  evaluate  an  externally  provided  message,  they  will  carefully  scrutinize  the  arguments  in 
the  message  to  arrive  at  their  own  conclusions  (Petty  and  Cacioppo  1986).  This  reasoning  suggests 
that  the  strength  of  the  client’s  justification  for  its  preferred  accounting  method  is  likely  to  posi¬ 
tively  influence  auditors’  assessments  of  the  acceptability  and  appropriateness  of  the  method, 
thereby  increasing  their  perceived  justifiability  of  supporting  the  method.  Further,  psychological 
research  in  motivated  reasoning  (e.g.,  Kunda  1990)  and  elastic  justification  (e.g.,  Hsee  1995) 
suggests  that,  given  pro-client  incentives  and  ambiguity  of  the  accounting  issue  (e.g.,  Hackenbrack 
and  Nelson  1996;  Salterio  and  Koonce  1997;  Kadous  et  al.  2003),  auditors  are  likely  to  use  strong 
justifications  provided  by  the  client  to  bolster  their  support  for  the  client-preferred  method.  Thus, 
a  strong  (weak)  client  justification  likely  loosens  (tightens)  the  reasonableness  constraints  on 
auditors’  proclivity  to  accept  client-preferred  methods  (Kunda  1990;  Hsee  1995),  by  enhancing 
(reducing)  the  justifiability  of  supporting  the  client-preferred  method.  Consequently,  any  incre¬ 
mental  effect  of  QAS  or  advice  in  reducing  the  propensity  of  auditors  to  support  client-preferred 
methods,  through  the  tightening  of  reasonableness  constraints,  is  likely  greater  (or  more  observ¬ 
able)  in  the  presence  of  a  strong  client  justification  than  a  weak  client  justification.*  Taken  to¬ 
gether,  the  above  discussions  suggest  that  the  effect  of  QAS  in  reducing  auditors’  propensity  to 
accept  the  client-preferred  method  will  be  greater  when  the  advice  from  the  technical  department 
contains  an  explicit  recommendation  and  when  the  client  justification  is  strong.  Our  hypothesis  on 
how  the  effect  of  QAS  is  jointly  moderated  by  advice  nature  and  client  justification  strength  is 
formally  stated  as  follows  (Figure  1,  Panel  A,  illustrates  our  expectations): 

Hypothesis:  Given  the  technical  department’s  advice  that  an  alternative  accounting  method 
(other  than  the  client’s  preferred  method)  is  the  most  appropriate  method,  the 
presence  of  a  quality  assessment  standard  will  reduce  auditors’  propensity  to 
accept  the  client’s  preferred  method,  particularly  when  the  advice  explicitly 
recommends  the  use  of  the  most  appropriate  method  and  the  client  has  provided 
a  strong  justification  for  its  preferred  method. 

III.  METHOD 

Design 

We  conduct  the  experiment  in  two  stages.  In  stage  1  (in  which  no  advice  is  provided),  we 
employ  a  2  X  2  between-subjects  design,  with  QAS  (absent  or  present)  and  client  justification 
strength  (weak  or  strong)  as  manipulated  factors.  In  stage  2  (in  which  participants  are  provided 
with  hypothetical  advice  from  the  firm’s  technical  department),  we  employ  a  2  X  2  X  2  between- 
subjects  design,  with  QAS,  client  justification  strength,  and  advice  nature  (with  or  without  an 
explicit  recommendation)  as  manipulated  factors.  Our  experimental  case  is  adapted  from  that  of 
Kadous  et  al.  (2003).^  The  case  involves  an  ambiguous  revenue  recognition  issue  pertaining  to  a 
new  sales  contract.  The  client  prefers  to  use  the  straight-line  method  to  recognize  the  revenue 
evenly  over  the  contract  period,  which  will  enable  it  to  meet  its  earnings  projection  for  the  year 


Kadous  et  al.  (2003)  show  that  even  in  the  presence  of  a  weak  client  justification,  auditors  are  still  susceptible  to  the 
influence  of  their  directional  goal  commitment.  Thus,  although  we  argue  that  the  expected  joint  effects  of  QAS  and 
advice  nature  is  likely  smaller  when  the  client’s  justification  is  weak  (versus  strong),  our  argument  does  not  preclude  the 
possibility  that  auditors  may  still  be  influenced  by  their  directional  goals  to  support  client-preferred  methods  in  the 
presence  of  a  weak  client  justification,  as  further  demonstrated  by  the  results  of  our  further  analyses,  reported  later. 

^  We  thank  Kathryn  Kadous  for  making  the  case  available  to  us. 
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FIGURE  1 

Prediction  and  Findings  on  the  Effects  of  QAS,  Advice  Nature,  and  Client  Justification 
Strength  on  Auditors’  Propensity  to  Accept  Client-Preferred  Method  (in  Stage  2) 

Panel  A:  Predicted  Effects  of  QAS,  Advice  Nature,  and  Client  Justification 
Strength  on  Auditor’s  Stage  2  Decisions 
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Panel  B :  Observed  Effects  of  QAS,  Advice  Nature,  and  Client  Justification 
Strength  on  Auditor’s  Stage  2  Decisions 
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and  maintain  an  increasing  earnings  trend.  However,  this  method  is  potentially  aggressive  relative 
to  an  alternative  method  based  on  actual  usage.  An  average  of  32  percent  of  the  participating 
auditors  in  Kadous  et  al.’s  (2003)  study  accepted  the  client-preferred  straight-line  method  across 
various  experimental  conditions,  suggesting  that  there  is  ambiguity  (and  doubt)  surrounding  the 
appropriateness  of  this  method. 
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We  manipulate  the  presence  of  QAS  by  providing  half  the  participants  with  instructions 
similar  to  the  requirements  underlying  SAS  No.  90.  Specifically,  these  participants  were  asked  to 
assume  that  the  auditing  standards  require  them  to  discuss  with  the  audit  committee  the  quality 
(that  is,  the  appropriateness),  not  just  the  acceptability,  of  the  client’s  accounting  principles  and 
underlying  estimates.  Further,  they  were  told  that  “the  quality  (that  is,  the  appropriateness)  of  an 
accounting  method  is  a  function  of  the  extent  to  which  it  is  consistent  with  the  conceptual  frame¬ 
work  and  accounting  standards,  and  the  degree  to  which  it  captures  the  economic  substance  of  the 
transaction  under  consideration.”  In  addition,  they  were  asked  to  rate  the  appropriateness  of  the 
client’s  method  of  recognizing  revenue  for  the  contract  (i.e.,  the  straight-line  method)  prior  to 
deciding  whether  to  accept  the  client’s  method.  These  instructions  were  repeated  in  the  second 
stage  of  the  experiment,  in  which  participants  were  provided  with  advice  from  the  technical 
department  (described  below).  For  the  other  half  of  the  participants,  no  such  instructions  were 
provided  in  either  stage  of  the  experiment.  Instead,  these  participants  were  asked  to  decide 
whether  to  accept  the  client’s  use  of  the  straight-line  method  without  first  assessing  the  appropri¬ 
ateness  of  the  method. 

We  manipulate  the  nature  of  the  advice  provided  by  the  technical  department  in  terms  of 
whether  it  contains  an  explicit  recommendation  that  the  most  appropriate  method  should  be  used. 
All  participants  were  told  to  assume  that  their  firm’s  technical  department  has  been  consulted  on 
the  issue.  They  were  provided  a  summary  of  the  key  points  communicated  by  the  technical 
department,  which  describes  four  alternative  methods  for  recognizing  the  contract  revenue  and 
indicates  that,  while  the  client’s  preferred  method  is  an  acceptable  method  under  GAAP,  the  actual 
usage  rate  method  is  the  most  appropriate  method.  For  participants  in  the  “advice  with  explicit 
recommendation”  condition,  the  technical  department’s  communication  included  a  recommenda¬ 
tion  that,  although  the  straight-line  method  is  acceptable  under  GAAP,  to  enhance  financial  re¬ 
porting  quality,  the  client  should  adopt  the  most  appropriate  method  (i.e.,  the  actual  usage  rate 
method).  This  recommendation  was  not  provided  to  participants  in  the  “advice  without  explicit 
recommendation”  condition. 

Last,  we  manipulate  the  strength  of  the  client’s  justification  for  the  use  of  the  straight-line 
method  by  varying  the  content  of  the  justification  provided  by  the  client’s  Chief  Financial  Officer 
(CFO).  Specifically,  for  the  weak  client  justification  condition,  the  CFO  argues  that  the  straight- 
line  method  is  appropriate  for  recognizing  revenue  from  the  contract,  as  it  is  easy  to  apply  and  it 
generates  a  smooth  income  stream.  This  justification  is  similar  to  that  used  in  Kadous  et  al.  (2003). 
In  contrast,  for  the  strong  client  justification  condition,  the  CFO  argues  that  the  method  is  appro¬ 
priate  based  on  past  consumption  patterns  that  suggest  that  the  overall  usage  over  the  contract 
period  may  not  exceed  the  minimum  usage  that  the  customer  has  agreed  to  pay.  The  latter  justi¬ 
fication  is  stronger  than  the  former,  since  it  is  based  on  an  analysis  of  the  transaction  rather  than 
on  simplicity  or  income-smoothing  objectives.’*^ 

Our  main  dependent  variable  of  interest  is  auditors’  final  (stage  2)  decision  on  whether  to 
accept  the  client’s  use  of  the  straight-line  method  for  recognizing  the  contract  revenue,  given  the 
technical  department’s  advice.” 

Administration  and  Procedures 

The  experiment  was  administered  in  a  controlled  setting  during  the  participating  firms’  train¬ 
ing  sessions.  In  stage  1,  participants  were  randomly  assigned  to  each  of  the  four  client  QAS/client 


Research  by  Bame-Aldred  and  Kida  (2007,  506)  indicates  that  auditors  do  not  consider  justifications  such  as  avoiding 
variability  in  the  earnings  stream  to  be  as  important  as  the  client  does  for  supporting  the  use  of  a  revenue  recognition 
method.  We  developed  our  strong  justification  based  on  the  response  of  one  of  the  pilot-test  participants  who  teaches 
advanced  financial  accounting  and  is  a  former  senior  audit  manager. 

"  Although  our  focus  is  on  auditors’  final  decision  in  the  presence  of  the  technical  department’s  advice,  we  also  analyze 
the  change  in  auditors’  decisions  between  stages.  We  report  the  related  results  as  part  of  further  analyses. 
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justification  strength  treatments.  In  stage  2,  we  hold  constant  the  client  justification  strength/QAS 
treatment,  and  participants  within  each  justification  strength/QAS  treatment  were  randomly  as¬ 
signed  to  one  of  the  two  nature  of  advice  treatments.  The  experimental  materials  for  the  two 
experimental  stages  were  enclosed  in  separate  envelopes.  At  the  start  of  the  experiment,  the 
participants  were  provided  with  a  brief  introduction  to  the  project,  including  an  assurance  of 
confidentiality,  and  were  asked  to  provide  their  name  to  acknowledge  participation.  They  then 
proceeded  to  read  the  experimental  case  relating  to  a  hypothetical  client  that  contains  our  manipu¬ 
lation  of  QAS  and  client  justification  strength,  as  described  above.  Following  this  instruction, 
participants  were  asked  to  make  several  judgments  on  the  case,  including  whether  to  accept  the 
client-preferred  method. 

Upon  completing  stage  1,  participants  were  asked  to  place  the  completed  case  materials  back 
into  the  envelope  provided  and  proceed  to  the  next  envelope.  They  were  first  asked  to  complete  a 
short  questionnaire  eliciting  their  assessments  of  the  complexity  of  the  accounting  issue,  engage¬ 
ment  risk  of  the  client,  and  their  knowledge  on  the  accounting  issue,  as  well  as  some  demographic 
information.  They  then  proceeded  to  perform  the  stage  2  experimental  task  by  reading  a  hypo¬ 
thetical  advice  memorandum  received  from  their  firm’s  technical  department  and  making  similar 
judgments  on  the  case  as  in  stage  1.  After  completing  the  stage  2  experimental  task,  they  com¬ 
pleted  a  debriefing  questionnaire  (enclosed  in  a  separate  envelope),  which  includes  manipulation 
check  and  other  questions. 

Participants 

Participants  were  129  experienced  audit  seniors,  supervisors,  and  managers  from  two  Big  4 
firms  in  Singapore.  Three  participants  were  eliminated  due  to  incomplete  responses,  yielding  a 
final  sample  of  126  auditors,  comprising  66  audit  seniors  (with  an  average  audit  experience  of  3.6 
years,  ranging  from  3  to  8  years)  and  59  audit  supervisors  and  managers  (with  an  average  audit 
experience  of  5.5  years,  ranging  from  4  to  16  years). The  majority  (80  percent)  of  the  partici¬ 
pants  indicated  experience  auditing  listed  companies,  and  about  half  of  them  (5 1  percent)  partici¬ 
pated  in,  or  prepared  for,  discussions  with  clients’  audit  committees.  On  average,  over  the  past  two 
years,  participants  have  consulted  the  technical  department  personnel  on  clients’  accounting  issues 
on  a  formal  basis  (i.e.,  by  submitting  a  formal  written  request)  in  29  percent  of  their  audit 
engagements,  and  on  an  informal  basis  (such  as  over  the  phone)  in  49  percent  of  their  audit 
engagements.  They  have  also  consulted  other  audit  colleagues  in  the  firm  (outside  the  audit 
engagement  team  and  the  technical  department)  on  clients’  accounting  issues  in  64  percent  of  their 
audit  engagements,  on  average,  over  the  past  two  years.  None  of  the  above  experience  measures 
vary  significantly  between  the  experimental  conditions  at  the  5  percent  level.  Our  results  do  not 
change  when  we  exclude  participants  who  do  not  have  experience  in  listed  company  audits,  nor 
when  we  include  any  of  the  other  experience  measures  as  a  covariate  in  our  analyses.  None  of 
these  experience  measures  are  significantly  associated  with  the  dependent  variable  at  the  5  percent 
level. 


We  handed  out  the  three  envelopes  all  at  once,  with  the  first  envelope  placed  on  top.  At  the  start  of  the  experiment,  we 
verbally  instructed  participants  to  open  the  three  envelopes  (labeled  A,  B,  and  C)  in  sequence  (i.e.,  to  proceed  to 
envelope  B  only  after  they  have  completed  the  materials  in  envelope  A  and  placed  the  completed  responses  back  into 
envelope  A,  and  so  on).  The  researchers  also  moved  around  the  training  room  to  observe  whether  participants  followed 
the  instruction,  and  no  violations  were  observed. 

One  participant  did  not  indicate  his  or  her  rank.  Similar  results  for  hypothesis  tests  are  obtained  when  this  participant  is 
eliminated  from  the  analyses.  The  hypothesis  test  results  do  not  vary  significantly  by  firm  or  rank  at  the  5  percent  level. 
Unless  otherwise  stated,  two-tailed  p-values  are  reported.  One-tailed  p-values  are  reported  for  directional  tests. 
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IV.  RESULTS 

Manipulation  and  Other  Checks 

We  did  not  conduct  a  manipulation  check  on  the  quality  assessment  standard  (QAS)  manipu¬ 
lation,  given  that  participants  were  either  told  or  not  told  about  the  standard’s  requirements  (i.e.,  to 
assess  and  communicate  with  the  audit  committee  the  quality  of  the  client’s  accounting  methods), 
and  only  participants  in  the  condition  where  QAS  is  present  were  asked  to  assess  the  appropri¬ 
ateness  of  the  client’s  preferred  method.  To  the  extent  that  participants  assess  the  quality  of  the 
client’s  accounting  method  even  without  being  asked  to  do  so,  our  tests  should  be  biased  against 
finding  an  effect  of  QAS  (see  also  Kadous  et  al.  2003,  769,  footnote  8).  As  a  check  on  our 
manipulation  of  the  nature  of  the  firm’s  advice  in  stage  2,  we  ask  participants  to  rate  the  extent  to 
which  the  technical  department  explicitly  recommends  that  the  client  should  adopt  the  most 
appropriate  method  to  enhance  financial  reporting  quality  (1  =  very  low  extent,  7  =  very  high 
extent).  Participants’  ratings  are  higher  in  the  “advice  with  explicit  recommendation”  condition 
than  in  the  “advice  without  explicit  recommendation”  condition  (means  =  5.1  and  4.6,  respec¬ 
tively;  p  =  0.038),  suggesting  that  they  attend  to  our  manipulation  of  advice  nature.'"^  Participants 
rate  the  client’s  justification  to  be  stronger  (means  =  4.4  and  3.3,  respectively,  p  <  0.001;  1  = 
very  weak;  7  =  very  strong)  and  more  conceptually  sound  (means  =  4.3  and  3.4  respectively;  p 
<  0.001;  1  =  very  low  extent;  7  =  very  high  extent)  in  the  strong  client  justification  condition 
than  in  the  weak  client  justification  condition.  Thus,  our  manipulation  of  client  justification 
strength  appears  successful.'^ 

We  also  ask  participants  to  indicate  the  extent  to  which  they  agree  with  the  following  state¬ 
ment:  “When  there  is  more  than  one  method  of  accounting  for  a  transaction  that  could  be  deemed 
acceptable  under  GAAP,  auditors  should  require  the  client  to  use  the  most  appropriate  accounting 
method”  (1  =  disagree  completely,  7  =  agree  completely).  Participants’  mean  agreement  rating  is 
significantly  higher  than  neutral  (mean  =  5.4;  p  <  0  .001),  and  does  not  vary  by  nature  of  advice 
or  other  experimental  conditions  (all  p  >  0.245).  Thus,  whether  the  advice  contains  an  explicit 
recommendation  does  not  appear  to  influence  auditors’  belief  that  they  should  ensure  the  use  of  the 
most  appropriate  method  by  the  client,  although  we  predict  and  find  (as  reported  later)  that 
auditors’  decisions  are  influenced  by  advice  nature.  This  disconnect  between  what  professionals 
believe  they  should  do  and  what  they  actually  do  has  been  demonstrated  in  prior  research  (e.g., 
Libby  and  Tan  1999). 

Recall  that  participants  in  the  condition  in  which  QAS  is  present  are  asked  to  assess  the 
appropriateness  of  the  client-preferred  method  prior  to  deciding  whether  to  accept  it.  Consistent 
with  our  expectation  that  client  justification  strength  will  positively  influence  auditors’  assessment 
of  the  appropriateness  of  the  client-preferred  method,  participants’  appropriateness  ratings  for  the 
client-preferred  method  are  significantly  higher  when  client  justification  is  strong  than  when  it  is 
weak,  in  both  stage  1  (means  =  0.46  and  0.33,  respectively,  with  0  percent  =  not  at  all  appro- 


We  ask  a  subset  of  participants  (n  =  105)  to  indicate  the  most  appropriate  method  identified  by  the  technical  department 
and  whether  the  technical  department  assessed  the  client-preferred  straight-line  method  to  be  an  acceptable  method 
under  GAAP.  The  majority  (n  =  86  or  82  percent)  answered  both  questions  correctly,  and  we  obtain  similar  results  for 
the  tests  of  hypotheses  based  on  these  86  participants.  These  participants  generally  agree  with  the  technical  department’s 
choice  of  the  most  appropriate  method  (mean  =  5.6;  1  =  disagree  completely;  7  =  agree  completely),  although  their 
agreement  ratings  are  higher  when  client  justification  is  weaker  than  when  it  is  stronger  (means  =  5.9  and  5.3, 
respectively;  p  =  0.019).  Neither  the  nature  of  the  advice  nor  the  presence  of  QAS  has  any  significant  impact  on  the 
agreement  ratings  at  the  5  percent  level. 

Participants  do  not  have  differential  beliefs  about  the  extent  to  which  the  client’s  justification  is  based  on  the  desire  to 
manage  earnings  across  the  client  justification  strength  conditions  (means  =  4.7  for  both  conditions;  p  >  0.500;  1  = 
very  low  extent;  7  =  very  high  extent).  Thus,  our  manipulation  of  client  justification  strength  does  not  result  in 
differential  assessments  of  the  client’s  incentive  to  manage  earnings,  which  could  otherwise  affect  auditors’  reliance  on 
management  representations  (see  Anderson  et  al.  2004). 


The  Accounting  Review 
American  Accounting  Association 


September  2010 


Effects  of  Technical  Department’s  Advice 


1753 


priate,  100  percent  =  completely  appropriate;  p  =  0.025)  and  stage  2  (means  =  0.53  and  0.40, 
respectively;  p  =  0.015).^^  Advice  nature  does  not  significantly  affect  auditors’  stage  2  appropri¬ 
ateness  assessments,  whether  independently  or  jointly  with  client  justification  strength  (all  p  > 
0.154).  We  also  ask  a  subset  of  participants  in  all  experimental  conditions  in  stage  1  whether  they 
believe  that  the  client-preferred  method  is  acceptable  under  GAAP.  A  higher  proportion  of 
participants  answer  “yes”  when  the  client’s  justification  is  strong  than  when  it  is  weak  (proportions 
=  0.50  and  0.27,  respectively;  p  =  0.012).  These  proportions  do  not  vary  significantly  by  QAS 
(all  p  >  0.342).  Overall,  the  results  indicate  that  client  justification  strength  influences  auditors’ 
assessments  of  both  the  acceptability  and  appropriateness  of  the  client-preferred  method  as  ex¬ 
pected. 

Tests  of  Hypothesis 

Our  research  hypothesis  predicts  that,  given  the  technical  department’s  advice  that  an  alter¬ 
native  accounting  method  (other  than  the  client’s  preferred  method)  is  the  most  appropriate 
method,  the  presence  of  a  quality  assessment  standard  will  reduce  auditors’  propensity  to  accept 
the  client’s  preferred  method,  particularly  when  the  advice  explicitly  recommends  the  use  of  the 
most  appropriate  method  and  the  client  has  provided  a  strong  justification  for  its  preferred  method. 
We  first  analyze  the  effects  of  QAS  and  client  justification  strength  on  auditors’  decisions  in  the 
absence  of  advice.  Table  1,  Panel  A  presents  the  proportion  of  auditors  who  accept  the  client- 
preferred  straight-line  method  (hereafter,  PAccept)  by  experimental  treatment  in  stage  1,  in  which 
advice  is  absent.  Panel  B  of  Table  1  presents  the  categorical  ANOVA  results,  with  QAS  and  client 
justification  strength  as  independent  variables,  and  the  auditors’  stage  1  decision  as  the  dependent 
variable.  Results  indicate  a  significant  main  effect  of  client  justification  strength,  with  a  higher 
PAccept  when  the  client’s  justification  is  strong  than  when  it  is  weak  (PAccept  =  0-43  and  0.15, 
respectively;  p  <  0.001).  Neither  the  main  effect  of  QAS  nor  the  interaction  effect  between  client 
justification  strength  and  QAS  is  statistically  significant  (all  p  >  0.389).^^  These  results  indicate 
that  in  the  absence  of  any  advice,  auditors’  propensity  to  accept  the  client-preferred  method  is 
significantly  influenced  by  the  strength  of  the  client’s  justification,  but  not  by  the  presence  of  QAS. 
The  latter  result  is  consistent  with  Kadous  et  al.  (2003).^® 

Table  2,  Panel  A  presents  the  PAccept  by  experimental  treatment  in  stage  2,  in  which  advice  is 
provided.  Panel  B  of  Table  2  presents  the  categorical  ANOVA  results,  with  QAS,  client  justifica¬ 
tion  strength,  and  advice  nature  as  independent  variables,  and  the  auditors’  stage  2  decision  as  the 
dependent  variable.  Results  indicate  a  statistically  significant  three-way  interaction  (p  =  0.047).^° 
Tests  of  the  simple  effect  of  QAS  within  each  of  tfie  advice  nature/client  justification  strength 
conditions  indicate  that  PAccept  is  significantly  lower  in  the  presence  of  QAS  than  in  its  absence 


Eight  participants  in  the  “QAS  present”  condition  did  not  provide  their  appropriateness  assessments  in  stage  1,  presum¬ 
ably  because  of  the  greater  ambiguity  of  the  accounting  issue  in  the  absence  of  any  advice.  Inferences  relating  to  our 
tests  of  the  hypothesis  remain  the  same  when  these  participants  are  excluded. 

This  question  was  inserted  only  after  the  first  training  session.  Hence,  the  analysis  is  based  on  102  participants  who 
responded  to  the  question.  We  did  not  ask  this  question  in  stage  2  of  the  experiment  given  that  the  advice  provided 
already  indicates  that  the  client-preferred  method  is  an  acceptable  method  under  GAAP. 

**  Including  advice  nature  as  a  third  independent  variable  in  the  analysis  does  not  change  these  results.  Advice  nature  does 
not  significantly  affect,  nor  interact  with  the  other  independent  variables  to  jointly  affect,  auditors’  decisions  (all  p  > 
0.196). 

We  also  elicited,  at  the  end  of  stage  1,  participants’  assessments  of  their  likelihood  of  consulting  (1)  their  firm’s  technical 
department  personnel  (mean  =  0.67),  and  (2)  other  colleagues  outside  the  audit  team  and  technical  department  (mean  = 
0.69),  on  the  accounting  issue.  These  assessments  do  not  vary  significantly  by  QAS  or  client  justification  strength  (all 

p  >  6.188). 

The  three-way  interaction  remains  marginally  significant  (p  =  0.069)  when  we  replace  our  mampulated  client  justifi¬ 
cation  strength  variable  with  the  participants’  perceived  strength  of  client’s  justification  using  a  dichotomous  variable 
based  on  a  median  split. 
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only  when  the  advice  contains  an  explicit  recommendation  and  client  justification  is  strong 
(PAccept  =  0-24  and  0.61,  respectively;  p  =  0.015),  but  not  in  the  other  three  conditions  (all  p  > 
0.159).^'  These  results  provide  support  for  our  research  hypothesis,  and  suggest  that  both  the 
nature  of  the  advice  and  the  strength  of  the  client’s  justification  jointly  moderate  the  effect  of 
QAS.  Figure  1,  Panel  B,  illustrates  the  results.  Alternative  analyses  of  the  three-way  interaction 
indicate  that  PAccept  is  significantly  lower  when  the  advice  contains  an  explicit  recommendation 
than  when  it  does  not,  only  in  the  condition  where  QAS  is  present  and  client  justification  is  strong 
(PAccept  =  0-24  and  0.75,  respectively;  p  =  0.001),  but  not  in  the  other  three  QAS/client  justifica¬ 
tion  strength  conditions  (all  p  >  0.182).^^ 

Additional  Analyses  on  Change  in  Auditors’  Decisions  between  Stages 

Our  tests  of  hypothesis  above  focus  on  auditors’  final  decision  on  the  client’s  accounting 
method  given  the  advice  from  the  technical  department,  and  demonstrate  how  the  effect  of  QAS 
is  moderated  by  the  nature  of  the  advice  and  the  strength  of  the  client’s  justification.  Here,  we 
analyze  how  the  presence  of  advice  changes  auditors’  decisions  between  stages.  Table  3  presents 
the  PAccept  within  each  of  the  between-subject  treatment  conditions  at  both  stages.  As  shown  in 
Table  3,  PAccept  increases  from  stage  1  to  stage  2  in  all  conditions  except  when  client  justification 
is  strong,  QAS  is  present,  and  the  advice  contains  an  explicit  recommendation.  These  results 
suggest  that,  with  the  exception  of  the  above-mentioned  condition,  auditors  strategically  use  in¬ 
formation  provided  in  the  technical  department’s  advice  (that  the  client-preferred  is  an  acceptable, 
albeit  not  the  most  appropriate,  method  under  GAAP)  to  increase  their  support  for  the  client- 
preferred  method. 

We  perform  a  mixed  categorical  ANOVA  that  includes  all  three  between-subject  variables 
(i.e.,  QAS,  advice  nature,  and  client  justification  strength)  and  participants’  stage  1  and  stage  2 
decisions  as  repeated  measures  (hereafter.  Stage).  Results  (untabulated)  indicate  no  statistically 
significant  effects  at  the  5  percent  level  except  for  the  main  effects  of  Stage  (p  <  0.001),  advice 
nature  (p  =  0.041),  and  client  justification  strength  (p  =  0.001),  as  well  as  the  interaction  effect 
between  advice  nature  and  client  justification  strength  (p  =  0.050)  and  the  interaction  effect 
between  Stage  and  QAS  (p  =  0.009).  The  last  result  suggests  that  the  increase  in  PAccept  from 
stage  1  to  stage  2  is  moderated  by  QAS. 

Further  analyses  indicate  that  when  client  justification  is  weak,  only  the  main  effects  of  stage 
and  client  justification  strength  are  statistically  significant  at  the  5  percent  level,  suggesting  that 
neither  QAS  nor  advice  nature  moderates  the  increase  in  PAccept  from  stage  1  to  stage  2.  However, 
when  client  justification  is  strong,  there  is  a  statistically  significant  interaction  effect  between 
Stage  and  QAS  (p  =  0.005).^^  Further  tests  indicate  that  the  effect  of  Stage  is  statistically  signifi- 


Inferences  relating  to  our  tests  of  hypothesis  remain  the  same  when  we  include  auditors’  pre-advice  (stage  1)  decision 
as  a  control  variable  in  the  analyses.  We  also  re-perform  the  analyses  using  auditors’  perceptions  of  the  accounting 
method  that  would  ultimately  be  used  in  the  financial  statements  after  discussions  with  the  management  and  audit 
committee.  Results  (based  on  119  participants  who  responded)  similarly  indicate  that  presence  of  QAS  does  not 
significantly  reduce  auditors’  perception  that  the  client’s  preferred  method  will  ultimately  be  used  except  in  the  condition 
where  the  advice  contains  an  explicit  recommendation  and  client  justification  is  strong  (p  =  0.029). 

We  also  test  the  joint  impacts  of  QAS  and  advice  nature  on  the  influence  of  client  justification  strength  on  auditors’ 
decisions.  Results  show  that  in  the  absence  of  QAS,  client  justification  strength  significantly  increases  Pacccpo  regardless 
of  advice  nature  (p  =  0.024).  However,  in  the  presence  of  QAS,  client  justification  strength  significantly  increases 
PAccept  only  when  advice  does  not  contain  an  explicit  recommendation  (p  =  0.003),  but  not  when  it  contains  an  explicit 
recommendation  (p  =  0.189). 

Although  these  results  suggest  a  significant  interaction  effect  between  client  justification  strength,  QAS  and  Stage, 
neither  the  three-way  interaction  effect  between  Stage,  QAS,  and  client  justification  (p  =  0.155)  nor  the  four-way 
interaction  effect  between  Stage,  QAS,  advice  nature,  and  client  justification  strength  (p  =  0.105)  is  statistically 
significant  in  the  mixed  categorical  ANOVA  reported  earlier. 
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TABLE  3 

Proportions  of  Auditors  Accepting  the  Client-Preferred  Method  (P Accept)  i**  stage  1  and 
stage  2,  and  the  Difference  in  PAccept  between  Stages,  for  each  Experimental  Condition 


Client 

Justification 

Strength 

Advice  Nature 
(Explicit 

Recommendation 

Provided?) 

Quality 

Assessment 

Standard 

n 

^Accept 

(stage  1) 

p 

^  Accept 

(stage  2) 

Increase 
(decrease) in 

f*Accept  from 
stage  1  to  2 

Weak 

No 

Absent 

15 

0.13 

0.53 

0.40 

No 

Present 

17 

0.18 

0.29 

0.11 

Yes 

Absent 

14 

0.14 

0.36 

0.22 

Yes 

Present 

13 

0.15 

0.46 

0.31 

Strong 

No 

Absent 

16 

0.44 

0.81 

0.37 

No 

Present 

16 

0.63 

0.75 

0.12 

Yes 

Absent 

18 

0.33 

0.61 

0.28 

Yes 

Present 

17 

0.35 

0.24 

(0.11) 

cant  in  the  absence  of  QAS  (p  =  0.003)  but  not  in  its  presence  (p  >  0.500).  As  shown  in  Table  3, 
in  the  presence  of  QAS  and  adviee  with  explicit  recommendation,  PAccept  increases  (by  3 1  percent) 
from  stage  1  to  stage  2  when  client  justification  is  weak,  but  decreases  (by  11  percent)  from  stage 
1  to  stage  2  when  elient  justification  is  strong.  The  latter  result  could  be  due  to  auditors’  height¬ 
ened  suspicion  of  client’s  persuasive  tactics  (i.e.,  the  provision  of  a  strong  justification  to  influence 
their  decisions),  causing  them  to  guard  against  their  own  proclivity  to  increase  support  for  the 
client-preferred  method,  particularly  given  the  presence  of  QAS  and  explicit  recommendation  by 
the  technical  department.  These  results  potentially  explain  why  PAccept  in  stage  2  is  the  lowest  in 
the  QAS  present,  advice  with  explicit  recommendation,  and  strong  client  justifieation  condition, 
and  suggest  that  strong  justifications  by  the  client  may  “baekfire”  on  the  client  in  certain  situations. 

Overall,  the  results  provide  some  evidence  that  auditors  strategically  use  information  provided 
in  the  advice  to  bolster  their  support  for  the  client-preferred  method,  except  for  the  condition  in 
which  the  client  has  provided  a  strong  justification  for  its  preferred  method  in  the  presence  of  QAS 
and  the  technical  department’s  advice  explicitly  recommends  the  use  of  the  most  appropriate 
method.  The  finding  that,  despite  the  provision  of  a  weak  justification  by  the  client  for  their 
preferred  method,  auditors  increase  their  support  for  the  method  after  receiving  the  technical 
department’s  advice,  even  in  the  presence  of  QAS  and  an  explicit  recommendation  by  the  tech¬ 
nical  department  to  use  the  most  appropriate  method,  suggests  that  auditors  are  still  susceptible  to 
the  influence  of  pro-client  directional  goals  under  these  conditions. 

Further  Evidence  on  the  Effects  of  Explicit  Recommendation 

Our  main  hypothesis  and  results  suggest  that  the  provision  of  an  explicit  recommendation  in 
the  technical  department’s  advice  reduces  the  justifiability  of  supporting  the  client-preferred 
method  and,  consequently,  auditors’  propensity  to  accept  the  method  in  the  presence  of  QAS  and 
strong  client  justifications.  To  provide  supporting  evidence  for  this  eonclusion,  we  conducted  an 
additional  experiment  involving  the  two  advice  nature  treatments,  holding  constant  the  client 
justification  strength  (strong)  and  QAS  (present)  conditions.  The  design  and  instruments  used  were 


Kadous  et  al.  (2003)  similarly  find  that  auditors  are  susceptible  to  the  influence  of  pro-client  directional  goal  commit¬ 
ment  even  in  the  presence  of  QAS  and  a  weak  client  justification. 
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identical  to  those  in  the  main  experiment  for  the  corresponding  treatments,  with  the  exception  of 
additional  questions  inserted  in  the  debriefing  questionnaire  to  elicit  additional  assessments  (dis¬ 
cussed  below).  Participants  are  40  audit  seniors  with  an  average  of  3.5  years  of  audit  experience. 
Participants  in  the  “advice  with  explicit  recommendation”  condition  rate  the  extent  to  which  the 
technical  department  explicitly  recommends  that  the  client  should  adopt  the  most  appropriate 
method  to  enhance  financial  reporting  quality  higher  than  participants  in  the  advice  without  ex¬ 
plicit  recommendation  condition  (means  =  5.5  and  4.6,  respectively;  p  =  0.043),  indicating  a 
successful  manipulation  of  advice  nature. 

Consistent  with  the  results  of  our  main  experiment,  participants  are  less  likely  to  accept  the 
client-preferred  method  in  stage  2  when  the  advice  contains  an  explicit  recommendation  than 
when  it  does  not  (PAccept  =  0-35  and  0.70,  respectively;  p  =  0.001).  Their  pre-advice  decisions  in 
stage  1  do  not  vary  by  advice  nature  (p  =  0.337).  Although  only  marginally  significant,  partici¬ 
pants  rate  the  client’s  use  of  the  straight-line  method  (i.e.,  the  client’s  preferred  method)  as  less 
justifiable  when  the  advice  contains  an  explicit  recommendation  than  when  it  does  not  (means  = 
4.3  and  4.9,  respectively;  p  =  0.074,  one-tailed).  We  also  investigate  whether  the  provision  of  an 
explicit  recommendation  in  the  technical  department’s  advice  influences  auditors’  decision  through 
its  effect  on  auditors’  perceptions  of  (1)  the  discretion  or  flexibility  that  they  have  in  accepting  or 
rejecting  the  client-preferred  method;  (2)  the  extent  to  which  the  audit  firm  prefers  the  client  to  use 
the  most  appropriate  method;  or  (3)  the  usefulness  of  the  technical  department’s  communications 
in  terms  of  clarifying  which  accounting  method  is  the  most  appropriate  method  or  which  account¬ 
ing  method  the  client  should  use.  None  of  these  assessments  vary  significantly  by  advice  nature 
(all  p  >  0.286).  Last,  we  measure  auditors’  directional  goal  commitment  (i.e.,  to  build  a  justifi¬ 
able  case  for  characterizing  the  client-preferred  method  to  be  an  acceptable  revenue  recognition 
method  in  the  circumstances),  using  the  five-item  scale  reported  in  Kadous  et  al.  (2003,  767),  to 
examine  whether  it  is  altered  by  the  presence  of  an  explicit  recommendation  in  the  technical 
department’s  advice. Results  indicate  no  significant  difference  in  directional  goal  commitment 
between  the  advice  nature  conditions  (p  =  0.347).  Overall,  these  results  support  our  argument  that 
the  provision  of  an  explicit  recommendation  in  the  technical  department’s  advice  decreases  audi¬ 
tors’  propensity  to  accept  the  client-preferred  method  by  reducing  auditors’  perceived  justifiability 
of  supporting  the  client-preferred  method  (in  other  words,  tightening  the  reasonableness  con¬ 
straints  on  auditors’  motivated  reasoning). 

V.  CONCLUDING  REMARKS 

This  study  demonstrates  how  a  key  quality  control  mechanism  in  audit  firms,  advice  provided 
by  the  firm’s  technical  department,  interacts  with  other  features  in  the  audit  environment  (specifi¬ 
cally,  justifications  provided  by  the  client  and  standards  issued  by  the  profession)  to  jointly  influ¬ 
ence  auditors’  propensity  to  accept  client-preferred  accounting  methods.  We  find  that,  in  the 


The  majority  (83  percent)  has  been  involved  in  auditing  listed  companies,  while  45  percent  of  them  have  participated  in, 
or  prepared  for  discussions  with  the  clients’  audit  committees.  On  average  over  the  past  two  years,  they  have  consulted 
the  technical  department  personnel  formally  (informally)  in  30  percent  (37  percent),  and  other  colleagues  in  68  percent, 
of  their  audit  engagements.  These  experience  measures  do  not  vary  between  experimental  treatments. 

We  also  elicit  participants’  agreement  with  the  statement  “The  technical  department’s  communications  on  a  specific 
accounting  issue  reflect  the  audit  firm’s  preference  on  the  accounting  issue”  on  a  seven-point  scale  (1  =  disagree 
completely;  7  =  agree  completely).  Participants  generally  agree  with  this  statement  (mean  =  4.96)  regardless  of  advice 
nature  (p  >  0.500).  Consistent  with  the  results  of  the  main  experiment,  participants  also  agree  with  the  statement  “When 
there  is  more  than  one  method  of  accounting  for  a  transaction  that  could  be  deemed  acceptable  under  GAAP,  auditors 
should  require  the  client  to  use  the  most  appropriate  accounting  method”  (mean  =  5.8)  regardless  of  advice  nature  (p  > 
0.500). 

Results  of  factor  analysis  on  responses  to  the  five  items  indicate  only  one  factor  with  an  eigenvalue  greater  than  1  (3.08). 
This  factor  explains  62  percent  of  the  variance  in  the  measures  and  Cronbach  alpha  is  0.839. 
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absence  of  any  advice,  auditors’  propensity  to  support  a  client-preferred  method  is  significantly 
influenced  by  the  strength  of  the  client’s  justification,  but  not  by  the  presence  of  an  SAS  No. 
90-like  quality  assessment  standard.  In  contrast,  when  auditors  are  provided  with  advice  from  the 
firm’s  technical  department  indicating  that  an  alternative  method  is  the  most  appropriate  method, 
auditors’  propensity  to  accept  the  client-preferred  method  is  reduced  by  the  presence  of  the  quality 
assessment  standard,  but  only  when  the  advice  explicitly  recommends  the  use  of  the  most  appro¬ 
priate  method  and  the  client  has  provided  a  relatively  strong  justification  for  its  preferred  method. 

Our  findings  highlight  the  role  and  limitations  of  in-house  consultation  and  advice  in  enhanc¬ 
ing  auditors’  decisions,  and  suggest  that  the  nature  of  advice  sought  by,  and  presented  to,  auditors 
matters.  Specifically,  advice  that  merely  reduces  the  ambiguity  surrounding  the  relative  quality  of 
alternative  accounting  methods  may  not  be  sufficient  to  counteract  auditors’  proclivity  to  support 
a  less  appropriate  aceounting  method  preferred  by  the  client,  even  in  the  presence  of  a  quality 
assessment  standard.  Including  an  explicit  recommendation  in  the  advice  to  use  the  most  appro¬ 
priate  method  is  instrumental  in  helping  auditors  overcome  the  influence  of  strong  client  justifi¬ 
cations,  but  only  in  the  presence  of  a  quality  assessment  standard.  These  findings  demonstrate  the 
complementary  roles  that  professional  standards  and  audit  firms’  technical  departments  play  in 
enhancing  auditors’  accounting  policy  decisions. 

Our  within-subject  analyses  of  changes  in  auditors’  decisions  between  stage  1  and  stage  2  of 
the  experiment  indicate  that,  except  for  the  condition  in  which  the  client  has  provided  a  strong 
justification  for  its  preferred  method  in  the  presence  of  QAS  and  the  technical  department’s  advice 
explicitly  recommends  the  use  of  the  most  appropriate  method,  auditors  strategically  use  informa¬ 
tion  in  the  advice  (that  the  client-preferred  method  is  an  acceptable,  albeit  not  the  most  appropri¬ 
ate,  method  under  GAAP)  to  bolster  their  support  for  the  client-preferred  method  even  when  it 
may  not  be  the  most  appropriate  method.  We  find  that,  except  for  the  above-mentioned  condition, 
the  proportion  of  auditors  who  accept  the  client-preferred  method  actually  increases  (rather  than 
decreases)  in  the  presence  of  the  advice.  These  results  suggest  that  the  presence  of  advice  could 
lead  to  an  unintended  consequence  of  bolstering  auditors’  support  for  client-preferred  methods, 
especially  if  the  advice  suggests  that  these  methods,  while  not  the  most  appropriate,  do  not  lead  to 
an  outright  violation  of  GAAP. 

Several  limitations  of  our  study  should  be  noted.  First,  we  examine  the  effects  of  advice 
received  from  a  single  source  (i.e.,  the  audit  firm’s  technical  department).  However,  there  are 
different  sources  of  advice  within  the  firm  with  which  auditors  can  potentially  consult,  and  our 
results  may  not  extend  to  other  sources  of  advice.  Future  research  could  examine  whether  differ¬ 
ential  attributes  among  alternative  sources  of  advice  (such  as  expertise)  influence  auditors’  pro¬ 
pensity  to  seek  and  use  advice  from  specific  sources  (e.g.,  Arnold  and  O’Connor  1999). 

Second,  prior  literature  suggests  that  individuals  are  more  likely  to  follow  solicited  advice 
relative  to  unsolicited  advice  (Gibbons  2003).  Although  our  participants  indicate  that  they  are 
likely  to  consult  the  firm’s  technical  department  on  the  accounting  issue  (see  footnote  19),  it  is 
unclear  whether  our  results  would  hold  had  we  allowed  them  to  solicit  advice  themselves  rather 
than  providing  them  with  unsolicited  advice.  Future  research  could  examine  whether  our  results 
are  sensitive  to  whether  advice  is  solicited. 

Third,  we  examine  a  context  in  which  the  advice  provides  some  support  for  the  client- 
preferred  method  by  indicating  that  it  is  acceptable  under  GAAP,  albeit  not  the  most  appropriate 
method.  While  this  context  is  directly  relevant  for  testing  the  efficacy  of  a  quality  assessment 
standard,  which  requires  auditors  to  assess  the  appropriateness,  not  just  acceptability,  of  client’s 
accounting  methods,  it  also  provides  an  opportunity  for  auditors  to  use  the  advice  to  bolster  their 
support  for  the  client-preferred  method.  Thus,  our  results  may  not  hold  had  the  advice  indicated 
that  the  client-preferred  method  is  of  questionable  merit,  as  auditors  would  find  it  less  justifiable 
to  support  such  a  method  even  in  the  absence  of  a  quality  assessment  standard  or  an  explicit 
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recommendation.  Nonetheless,  future  research  could  investigate  whether  auditors  continue  to  sup¬ 
port  the  client’s  preferred  method  even  in  the  presence  of  advice  that  indicates  that  the  client’s 
method  is  of  questionable  merit.  Another  avenue  for  future  research  is  to  examine  how  the 
availability  of  potentially  conflicting  advice  from  multiple  sources  influences  auditors’  decisions 
(e.g.,  Sniezek  and  Buckley  1995;  Yaniv  and  Milyavsky  2007). 

Fourth,  we  examine  factors  influencing  the  decisions  of  auditors  as  advice  recipients,  not 
advice-givers.  Thus,  our  study  does  not  address  the  question  of  whether  and  how  various  contex¬ 
tual  features  (such  as  client  preferences)  influence  the  quality  of  advice  provided  to  auditors  by 
advice-givers  such  as  the  technical  department  personnel.  To  the  extent  that  client  preferences 
influence  the  quality  of  advice  provided  by  advice-givers,  audit  effectiveness  could  be  compro¬ 
mised. 

Fifth,  while  our  participants  are  experienced  audit  seniors  and  managers,  our  results  might  not 
apply  to  audit  partners  who  make  the  ultimate  decisions  on  clients’  accounting  methods.  None¬ 
theless,  understanding  the  effects  of  advice  on  the  judgments  and  decisions  of  audit  seniors  and 
managers  is  important,  given  that  audit  seniors  and  managers  are  involved  in  consultations  with 
the  technical  department  and  other  colleagues  and  regularly  provide  inputs  to  audit  partners  in  the 
resolution  of  clients’  accounting  issues.  Future  research  could  examine  whether  our  results  vary 
depending  on  the  auditors’  rank  (i.e.,  partners  versus  managers/seniors)  or  other  individual  auditor 
attributes  that  our  study  does  not  capture,  such  as  tacit  managerial  knowledge  (e.g.,  Shankar  and 
Tan  2006).  Future  research  could  also  investigate  whether  other  client  features  such  as  audit 
committee  effectiveness  (e.g.,  Ng  and  Tan  2003)  or  additional  regulatory  measures  introduced 
under  the  Sarbanes-Oxley  Act  of  2002  (e.g..  Nelson  and  Tan  2005)  moderate  the  findings  reported 
here. 
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ABSTRACT:  Prior  research  shows  that  an  audit  supervisor’s  active  intervention  in  a 
subordinate’s  judgment  distorts  that  judgment.  However,  subordinates’  judgments  are 
only  one  input  into  audit  team  judgments.  How  do  supervisors  finalize  audit  team  judg¬ 
ments  after  actively  intervening  in  their  subordinates’  judgments?  In  an  experiment 
using  audit  teams,  supervisors  with  weaker  or  stronger  goals  to  reach  a  client-preferred 
conclusion  either  were  or  were  not  asked  to  first  actively  coach  a  subordinate’s  judg¬ 
ment  (i.e.,  active  intervention)  before  reviewing  it  and  finalizing  the  audit  team’s  judg¬ 
ment.  Supervisors’  intervention  influenced  subordinates’  inputs,  which,  in  turn,  supervi¬ 
sors  incorporated  into  their  final  judgments.  More  interestingly,  intervention  biased 
supervisors’  final  judgments,  controlling  for  supervisor  directional  goal  strength  and  for 
concurrent  effects  on  subordinates’  inputs.  However,  supervisors  distorted  their  judg- 


We  thank  two  anonymous  reviewers,  S.  Kachelmeier  (editor),  R.  Bloomfield,  J.  Brazel,  T.  Kida,  R.  Libby,  M.  Nelson,  E. 
O’Donnell,  B.  Pomeroy,  J.  Smith,  H.-T.  Tan,  workshop  participants  at  Cornell  University,  Loyola  University  Chicago,  The 
Ohio  State  University,  as  well  as  conference  participants  at  the  2008  ISAR  Conference  for  their  helpful  comments.  An 
earlier  paper  based  only  on  Experiment  2  also  benefited  from  the  helpful  comments  of  R.  Balakrishnan,  M.  Bamber,  J. 
Herbold  K.  Kadous,  D.  Kleinmuntz,  S.  Krische,  K.  Krumwiede,  R.  Ramsay,  H.-T.  Tan,  J.  Wilks,  M.  Zimbelman,  confer¬ 
ence  participants  at  the  AAA  Audit  Midyear  Meeting,  and  workshop  participants  at  Brigham  Young  University,  Florida 
State  University,  The  Ohio  State  University,  and  The  University  of  Georgia.  We  thank  K.  Fanning  for  research  assistance. 
Finally,  we  thank  the  audit  firm  and  professionals  who  participated. 

Editor’s  note:  Accepted  by  Steven  Kachelmeier. 

Submitted:  July  2008 
Accepted:  February  2010 
Published  Online:  August  2010 


1763 


1764 


Peecher,  Piercey,  Rich,  and  Tubbs 


merits  less  as  they  perceived  a  larger  technical  knowledge  advantage  over  subordi¬ 
nates.  In  a  second  experiment,  auditors  appear  aware  of  the  bias-reducing  knowledge 
advantage  effects  but  unaware  of  the  bias-increasing  active  intervention  effects.  We 
discuss  implications  for  audit  team  judgments  and  audit  quality  control. 

Keywords:  audit  team  judgments;  client  pressure;  directional  goals;  active 
intervention;  technicai  knowledge. 

Data  Availability:  Data  are  available  from  the  authors. 

1.  INTRODUCTION 

Audit  supervisors  ideally  guide  and  review  subordinate  auditors’  work  in  ways  that  improve 
the  judgments  of  subordinates  and,  ultimately,  of  audit  teams  (e.g.,  Bamber  et  al.  1988; 
Gibbins  and  Trotman  2002;  Phillips  1999;  Rich  2004;  Rich  et  al.  1997a;  Shankar  and  Tan 
2006;  Tan  and  Tan  2008).  Prior  research  shows  that  supervisors  who  allude  to  preferred  audit 
conclusions  when  “coaching”  or  actively  intervening  in  subordinates’  judgments  can  bias  the 
subordinates’  judgments  (e.g.,  Peecher  1996;  Brown  et  al.  1999;  Wilks  2002;  Piercey  2009).  The 
conditions  under  which  subordinate  judgment  bias  distorts  audit  team  judgments,  however,  are 
unclear,  as  subordinates’  judgments  are  only  an  input.  Supervisors  sometimes  form  their  own 
initial  judgments,  and  they  inevitably  finalize  judgments  for  the  audit  team  (Rich  et  al.  1997a). 

This  observation  raises  the  research  question  of  how  supervisors  who  have  directional  goals 
and  guide  their  subordinates  via  active  intervention  finalize  audit  team  judgments  (as  compared  to 
supervisors  who  have  the  same  directional  goals,  but  do  not  intervene).  We  examine  this  question 
by  investigating  the  effects  of  a  supervisor’s  active  intervention,  directional  goals,  and  perceived 
technical  knowledge  advantage  over  subordinates  on  the  supervisor’s  finalized  judgments  for  the 
audit  team.  We  also  examine  whether  auditors  understand  how  these  factors  moderate  bias  in  audit 
team  judgments. 

That  supervisors  with  directional  goals  commonly  use  active  intervention  to  coach  subordi¬ 
nates’  judgments  raises  two  worrisome  possibilities  for  audit  team  judgments.  One  is  that,  after 
coaching  the  subordinate,  supervisors  would  anchor  on  the  subordinate’s  judgment,  preserving  any 
intervention-induced  distortions  in  the  audit  team’s  judgment.  This  possibility  would  validate 
concerns  raised  elsewhere  (e.g.,  Peecher  1996;  Wilks  2002).  An  even  more  worrisome  possibility 
is  that  active  intervention  incrementally  biases  the  supervisors’ judgments,  controlling  for  concur¬ 
rent  effects  of  intervention  on  subordinates’ judgments.  If  so,  active  intervention  would  likely  pose 
greater  audit  effectiveness  concerns  than  previously  understood  from  prior  theory.  In  the  auditing 
environment,  client  pressures  can  create  directional  goals  for  auditors  to  reach  biased  judgments  of 
financial  statements.  Research  suggests  that  such  directional  goals  to  reach  client-preferred  audit¬ 
ing  judgments  contributed  to  many  prominent  accounting  scandals,  prompted  reforms  in  the 
Sarbanes-Oxley  Act  (SOX),  and  continue  to  put  audit  effectiveness  at  risk,  even  post-SOX  (Moore 
et  al.  2006;  Nelson  2006;  Bazerman  et  al.  2006). 

To  investigate  our  research  questions,  we  conduct  two  experiments.  In  the  first  experiment, 
participants  were  industry-specialist  auditors  in  supervisor-subordinate  teams.  The  audit  teams’ 
task  was  to  assess  the  discount  rate  used  to  value  a  financial  asset  securitization,  which  provides 
a  relevant  and  timely  setting  for  testing  our  hypotheses.'  Supervisors  had  either  stronger  or  weaker 
directional  goals  to  reach  client-preferred  accounting  estimates  and  either  were  or  were  not  asked 


Empirical  evidence  suggests  managers  use  the  discount  rate  for  securitizations  to  manage  earnings  (Dechow  and 
Shakespeare  2009).  For  example,  regarding  the  audited  financial  statements  of  New  Century  Financial,  Harris  (2008,  1) 
quotes  bankruptcy  manager  Michael  Missal’s  allegation  that  the  auditors  ‘“acquiesced  to  a  lot  of  what  New  Century 
wanted  to  do’  ...  One  problem  was  ‘the  discount  rate  being  used  to  compute  the  residual  interest  valuation  ...  The 
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to  actively  coach  a  subordinate’s  preliminary  audit  judgment  before  reviewing  it  and  finalizing  the 
audit  team’s  judgment.  We  find  that  subordinates’  judgments  (influenced  by  the  active  interven¬ 
tion)  are  reflected  in  supervisors’  finalized  judgment.  Further,  intervention  leads  to  incremental 
bias  in  supervisors’  finalized  judgment  beyond  that  attributable  to  any  effects  on  subordinates’ 
judgments. 

We  also  examine  how  supervisors’  perceptions  of  their  own  technical  knowledge  (compared 
to  that  of  their  subordinates)  influences  audit-team  judgment  bias.  We  hypothesize  that  this  ad¬ 
vantage  would  decrease  such  bias,  based  on  the  idea  that  supervisors  sense  less  of  a  need  to 
persuade  themselves  about  the  reasonableness  of  their  preferred  audit  conclusions  when  they 
believe  themselves  to  have  greater  subject-matter  expertise  within  the  audit  team.  As  predicted,  a 
perceived  technical  knowledge  advantage  reduces  the  biasing  effects  of  supervisors’  directional 
goals  on  finalized  audit  team  judgments.  Thus,  while  coaching  via  active  intervention  exacerbates 
supervisors’  goal-consistent  bias,  there  are  conditions  under  which  supervisors’  preferences  are 
less  apt  to  cause  audit  team  judgment  bias. 

In  a  second  experiment,  we  examine  auditors’  perceptions  of  how  these  factors  influence 
audit-team  judgment  bias.  This  experiment  is  motivated  by  the  idea  that  auditors  must  first  under¬ 
stand  determinants  of  audit-team  judgment  bias  to  be  able  to  design  quality  control  policies  that 
effectively  mitigate  it  (Lerner  and  Tetlock  1999;  Moore  and  Loewenstein  2004).  We  find  that, 
while  auditors  understand  the  bias-rcJMcmg  effects  of  a  supervisor’s  knowledge  advantage  over  a 
subordinate,  they  do  not  understand  the  hi&s-inflating  effects  of  supervisors’  coaching  via  active 
intervention.  Quality  control  activities  would  likely  improve  if  auditors’  understanding  were  im¬ 
proved. 

Our  theory  and  experimental  findings  have  potentially  important  audit  effectiveness  implica¬ 
tions.  As  Kadous  et  al.  (2008)  note,  motivated  reasoning  occurs  when  accountants  change  their 
judgments  to  become  more  consistent  with  external  client  preferences.  We  posit  active  interven¬ 
tion  increases  the  tendency  of  audit  supervisors  to  do  exactly  that.  As  Wilks  (2002)  points  out, 
intervening  earlier  in  a  subordinate’s  judgment  process  could  undermine  audit  quality  because  of 
distortionary  effects  on  subordinates’  judgments.  Our  findings  suggest  that,  by  intervening,  super¬ 
visors  not  only  influence  their  subordinates,  but  also  distort  their  own  judgments.  When  supervi¬ 
sors  have  directional  goals  to  reach  client-preferred  conclusions,  our  study  suggests  that  their 
judgments  would  be  less  biased  by  client  preferences  if  supervisors  did  not  actively  coach  their 
subordinates  and  instead  received  more  independent  judgments  from  their  subordinates. 

Audit  firms,  standard-setters,  and  regulators  could  consider  training  and  quality  control  poli¬ 
cies  that  take  better  account  of  these  sources  of  dudit-team  judgment  bias.  As  one  example, 
individual  auditors  would  likely  benefit  from  greater  awareness  of  how  supervisors’  active  inter¬ 
vention  can  elevate  audit-team  judgment  bias,  especially  when  conditions  suggest  the  supervisor 
also  has  a  client-preferred  directional  goal.  As  another  example,  the  PCAOB  (2008,  1)  has  iden¬ 
tified  cold  or  concurring-partner  reviews  (i.e.,  back-end  reviews  by  audit  partners  without  inter¬ 
vening  in  the  audit  as  it  unfolds)  as  “probably  the  most”  frequent  audit  deficiency  in  current 
practice,  and  the  SEC  (2010)  recently  approved  the  PCAOB  s  AS  No.  7,  a  new  auditing  standard 
that  increases  their  importance.  Our  study  suggests  that  such  reviews  would  be  less  vulnerable  to 
this  new  source  of  bias,  as  they  usually  are  done  without  intervention.  Our  findings  potentially  also 
inform  other  organizational  contexts  in  which  supervisors  can  coach  a  subordinate  s  input  and 
have  directional  pressures  to  reach  preferred  conclusions. 


company  wanted  it  as  low  as  possible,’  so  that  the  valuations  could  be  as  high  as  possible.”  Such  subjective  estimates 
are  common  in  financial  statement  audits  (Abdolmohammadi  1999)  and  provide  opportunity  for  directional  goals  to 
influence  supervisors’  judgments  (Kadous  et  al.  2003). 
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We  develop  a  theoretical  framework  and  hypotheses  in  Section  II,  describe  the  method  and 
results  of  Experiments  1  and  2  in  Sections  III-VI,  and  conclude  in  Section  VII. 

IL  THEORY  AND  HYPOTHESES 

Supervisors’  Directional  Goals  and  Active  Intervention  in  Subordinates’  Judgments 

As  Kunda  (1990,  480)  notes,  “motivated  reasoning  phenomena  ...  fall  into  two  major  catego¬ 
ries:  those  in  which  the  motive  is  to  arrive  at  an  accurate  conclusion  ...  and  those  in  which  the 
motive  is  to  arrive  at  a  particular,  directional  conclusion.”  She  uses  the  terms  accuracy  goals  and 
directional  goals  to  distinguish  these  broad  categories.  Directional  goals  can  self-originate  in  a 
person,  such  as  when  investors’  preferred  investment  outcomes  depend  on  whether  they  hold  a 
short  or  long  position  (Hales  2007),  or  a  person  can  inherit  directional  goals  from  socially  signifi¬ 
cant  others,  such  as  when  auditors  inherit  directional  goals  from  their  supervisors  (Piercey  2009) 
or  from  their  clients  (Kadous  et  al.  2003).^ 

Regardless  of  their  origination,  directional  goals’  activation  and  influence  need  not  be,  and 
usually  are  not,  fully  consciously  processed  as  if  they  were  intentional  goals  (Chartrand  and  Bargh 
1996;  Moore  et  al.  2006,  18;  Wilks  2002,  54).  Indeed,  at  the  conscious  level,  decision  makers  who 
have  directional  goals  try  to  convince  themselves  and  others  that  they  are  reasonable  and  impar¬ 
tial,  but  this  is  only  an  “illusion  of  objectivity,”  as  these  goals  significantly  bias  their  judgments 
(Pyszczynski  and  Greenberg  1987,  302). 

The  need  to  maintain  reasonableness  limits  how  much  bias  directional  goals  can  infuse  into  a 
person’s  judgments  (Kunda  1990).  However,  factors  designed  to  encourage  auditors  to  think  more 
deeply  and  reasonably  do  not  necessarily  reduce  and,  in  fact,  can  increase  the  amount  of  bias 
caused  by  underlying  directional  goals  (cf.  Kunda  1990,  491).  In  Kadous  et  al.  (2003),  for  ex¬ 
ample,  auditors  inherit  a  directional  goal  to  accept  an  aggressive,  client-preferred  accounting 
method.  About  half  the  auditors  are  asked  to  identify  the  most  appropriate  accounting  method  and 
to  provide  a  written  justification  for  their  choice  before  deciding  whether  to  accept  the  client’s 
method.  Compared  to  auditors  not  asked  to  do  these  tasks,  these  auditors  became  significantly 
more  likely  to  accept  the  client-preferred  accounting  method.  This  result  occurred  even  though 
most  auditors  (70  of  98)  characterized  the  client-preferred  method  as  conceptually  suboptimal 
when  identifying  the  most  appropriate  method. 

An  important  contextual  factor  that  likely  moderates  how  deeply  supervisors  with  directional 
goals  process  information  during  the  review  process  is  whether  they  actively  intervene  in  or 
“coach”  their  subordinate’s  judgment.  Related  prior  research  indicates  that  when  subordinates 
inherit  client-preferred  directional  goals  from  supervisors,  subordinates  change  their  judgment 
processes  to  favor  client-preferred  audit  conclusions  (e.g..  Brown  et  al.  1999;  Peecher  1996; 
Piercey  2009;  Shankar  and  Tan  2006;  Turner  2001;  Wilks  2002).  The  supervisors  in  these  studies’ 
experiments  are  hypothetical,  so  these  studies  cannot  empirically  examine  whether  supervisors 
with  directional  goals  maintain  objectivity  in  the  nature  of  messages  they  convey  to  subordinates. 
These  studies  also  cannot  address  the  arguably  more  important  question  of  whether  active  inter¬ 
vention  moderates  the  influence  of  supervisors’  directional  goals  on  their  own  finalized  judgments 
for  audit  teams. 

Such  moderating  effects  are  potentially  critical  because  supervisors  commonly  engage  in 
active  intervention  using  a  variety  of  modes.  One  mode  is  “real-time”  review,  which  began  in  the 
mid-1990s  as  a  means  to  supplement  and,  for  lower-risk  conditions,  even  supplant  traditional 
“back-end”  review  (e.g..  Rich  et  al.  1997b;  Winograd  et  al.  2000;  Gibbins  and  Trotman  2002). 


2 

For  other  examples  of  accountants  inheriting  directional  goals  from  their  clients  and/or  supervisors  see,  e.g.,  Brown  et 
al.  1999,  Kadous  and  Magro  2001,  Kadous  et  al.  2008,  Peecher  1996,  Salterio  and  Koonce  1997,  and  Wilks  2002. 
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While  the  prevalence  of  real-time  review  appears  to  have  waned  recently,  many  modes  of  active 
intervention  continue  to  flourish.  Common  modes  include  informal  or  formal  face-to-face  discus¬ 
sions,  interviews,  and  various  kinds  of  electronic  communications  (e.g.,  Agoglia  et  al.  2008; 
Brazel  et  al.  2004;  Shankar  and  Tan  2006;  Tan  and  Tan  2008).  Recent  field  studies  report  that 
supervisors  use  discussions,  interviews,  and  electronic  communications,  and  modes  of  active 
intervention  likely  will  continue  to  evolve  (Fargher  et  al.  2005;  Payne  et  al.  2010;  Vera-Munoz  et 
al.  2006). 

Regardless  of  communication  mode,  our  active  intervention  construct  refers  to  whether  and 
the  degree  to  which  a  supervisor  “coaches”  or  “guides”  a  subordinate’s  judgment  before  reviewing 
it  as  opposed  to  not  coaching  or  guiding  and  reviewing  a  more  independent  judgment  from  a 
subordinate.  Active  intervention  includes,  to  varying  degrees,  intervention  cognition  and  interven¬ 
tion  content.  Intervention  cognition  refers  to  the  automatic  and  controlled  cognition  that  occurs 
throughout  intervention.^  It  is  goal-directed,  including  supervisors’  private  thinking  and  their 
thinking  about  what  (and  what  not)  to  convey  to  the  subordinate  and  why  as  well  as  the  best  way 
to  say  things  (cf.  stylization  in  Rich  et  al.  [1997a]  and  Rich  [2004]).  Intervention  content  refers  to 
properties  of  messages  conununicated  to  subordinates  (e.g.,  one-  versus  two-sided,  include  versus 
omit  preferred  conclusions,  start  versus  finish  with  strongest  arguments,  etc.;  Petty  and  Wegener 
1998).  Whenever  supervisors  convey  intervention  content  to  subordinates,  some  incremental  de¬ 
gree  of  intervention  cognition  has  occurred. 

Motivated  reasoning  theory  holds  that  even  individuals  with  strong  directional  goals  strive  to 
reach  conclusions  that  seem  defensible  to  themselves  and  impartial  to  others  (Kunda  1990).  Thus, 
they  take  steps  to  maintain  an  illusion  of  objectivity  (Pyszczynski  and  Greenberg  1987).  As  Kunda 
(1990,  483)  notes: 

[T]he  objectivity  of  this  justification  construction  process  is  illusory  because  people  do  not  realize 
that  the  process  is  biased  by  their  goals,  that  they  are  accessing  only  a  subset  of  their  relevant 
knowledge,  that  they  would  probably  access  different  beliefs  and  rules  in  the  presence  of  different 
directional  goals,  and  that  they  might  even  be  capable  of  justifying  different  conclusions  on  differ¬ 
ent  occasions,  (emphasis  added) 

One  way  for  audit  supervisors  who  actively  intervene  in  their  subordinates  judgment  to  maintain 
this  illusion  would  be  to  edit  or  censor  what  they  say  to  subordinates,  trying  to  avoid  bias  in 
intervention  content.  Overt  bias  in  communicated  content  would  impair  justifiability.  Nevertheless, 
their  own,  deeper  cognition  in  thinking  about  what  to  and  not  to  convey  includes  automatic 
goal-driven  judgment  processes  that  bias  supervisors  toward  reaching  and  justifying  preferred 
conclusions  (cf.  Kadous  et  al.  2003;  Kunda  1990;  Piercey  2009).  Therefore,  the  attempt  to  provide 
bias-free  content  to  subordinates,  even  if  successful,  likely  does  not  translate  into  bias-free  cog¬ 
nition  and  so  gives  supervisors  with  directional  goals  a  false  sense  of  security  about  the  justifi¬ 
ability  (i.e.,  neutrality  or  objectivity)  of  their  conclusions. 

Because  active  intervention  triggers  more  cognition  directed  at  self-justifying  supervisors’ 
preferred  judgments,  actively  intervening  supervisors  likely  reach  audit  judgments  that  are  more 
biased  by  their  directional  goals  than  those  who  have  the  same  goals  but  do  not  intervene, 
controlling  for  any  concurrent  effects  of  intervention  content  on  their  subordinate  s  judgments. 
This  reasoning  supports  the  following  hypothesis:^ 


^  Automatic  cognition,  in  contrast  to  controlled  cognition,  occurs  without  awareness  and  is  hard  to  stop  once  initiated 
(Sherman  et  al.  2008).  ,  ,  . 

We  do  not  posit  a  hypothesis  regarding  the  influence  of  supervisors’  directional  goals  on  the  intervention  content  since 
motivated  reasoning  theory  would  allow  for  a  null  (no-effects)  prediction.  We  do,  however,  measure,  report,  and  control 
for  the  influence  of  supervisors’  intervention  content  on  the  subordinate’s  judgment  as  part  of  testing  HI. 
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HI:  A  supervisor’s  intervention  increases  the  effects  of  directional  goals  to  reach  a  preferred 
conclusion  on  his/her  finalized  judgment  for  the  audit  team,  even  after  controlling  for 
any  effects  of  intervention  on  the  subordinate’s  initial  judgment. 

Note  that  one  cannot  attribute  empirical  support  for  HI  solely  to  supervisors  using  their  role 
to  adjust  for  a  subordinate  not  reaching  the  supervisor’s  preferred  conclusion,  because  non¬ 
intervening  supervisors  with  the  same  directional  goals  can  make  the  same  adjustment.  In  that 
case,  the  finalized  judgments  of  supervisors  who  intervene  would  equal  those  of  supervisors  who 
do  not  intervene  but  have  the  same  goals.  However,  if  judgments  of  intervening  supervisors  differ 
from  those  who  do  not  intervene,  while  holding  constant  their  directional  goals  and  controlling  for 
subordinates’ judgments  (HI),  active  intervention  must  create  additional  distortion  in  supervisors’ 
judgments,  via  its  cognition  component. 

Supervisors’  Directional  Goals  and  Perceived  Technical  Knowledge  Advantage  over  Subor¬ 
dinates 

Absent  theory,  one  could  imagine  that  a  perceived  technical  knowledge  advantage  over  sub¬ 
ordinates  could  increase  or  decrease  supervisor  judgment  bias.  As  one  possibility,  supervisors 
could  consciously  or  unconsciously  lever  such  a  perceived  advantage  to  instill  more  directional- 
goal  consistent  bias  into  the  audit  team’s  conclusions.  Alternatively,  such  an  advantage  could 
prompt  supervisors  to  be  objective,  better  fulfilling  their  quality-control  role  (cf.  Libby  and  Trot- 
man  1993).  Motivated  reasoning  theory,  interestingly,  supports  the  alternate  possibility,  for  two 
reasons. 

First,  motivated  reasoning  theory  holds  that  individuals  with  directional  goals  limit  the 
amount  they  bias  their  judgments,  as  noted  above,  to  maintain  the  illusion  of  objectivity  to 
themselves  and  others.  This  need  to  appear  objective  creates  reasonableness  constraints  on  moti¬ 
vated  reasoning  (see  Kunda  1990).  Factors  that  increase  the  salience  of  reasonableness  constraints 
curtail  the  extent  to  which  supervisors  with  directional  goals  instill  bias  into  audit  team  conclu¬ 
sions  (e.g.,  Cuccia  et  al.  1995;  Salterio  and  Koonce  1997;  Piercey  2009;  Ng  and  Shankar  2010). 
Supervisors  who  perceive  a  larger  technical  knowledge  advantage  over  subordinates  likely  are,  on 
average,  more  sensitive  to  reasonableness  constraints  and,  therefore,  less  likely  to  bias  their  judg¬ 
ments.  For  example,  supervisors  who  feel  that  they  provide  the  majority  of  the  technical  knowl¬ 
edge  within  the  audit  team  for  assessing  discount  rates  are  likely  to  become  more  sensitive  to  the 
reasonableness  of  a  lenient  discount  rate  for  the  valuation  of  a  fairly  risky  asset  when  finalizing 
team  judgments. 

Second,  motivated  reasoning  theory  also  holds  that  individuals’  directional  reasoning  is  in¬ 
strumental.  That  is,  they  engage  in  just  enough  one-side  reasoning  to  attain  their  goal,  even  if 
more  bias  would  still  be  viewed  as  reasonable  (Boiney  et  al.  1997).  As  supervisors  feel  that  their 
technical  knowledge  dominates  their  subordinates’,  they  likely  sense  less  of  a  need  to  self-justify 
their  judgments  and  therefore  engage  in  less  directional  reasoning.  Consequently,  their  interven¬ 
tion  cognition  entails  less  rehearsal  of  various  arguments  to  justify  reaching  a  preference- 
consistent  conclusion.  Thus,  motivated  reasoning  theory  suggests  the  following  hypothesis: 

H2:  A  supervisor’s  perceived  technical  knowledge  advantage  (over  subordinates)  reduces  the 
effects  of  a  supervisor’s  directional  goals  on  his  (or  her)  finalized  assessments  for  the 
audit  team.^ 


^  Theory  does  not  warrant  predicting  that  the  effects  of  H2  will  occur  differently  in  the  intervention  and  no-intervention 
conditions  (i.e.,  a  three-way  interaction).  In  our  development  of  HI,  we  posit  that  intervention  affects  auditors’  judg- 
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III.  EXPERIMENT  1:  METHOD 

Participants 

We  recruited  61  financial  services  industry  specialist  auditors  to  participate  in  a  valuation  task 
in  a  financial  services  setting.  The  vast  majority  of  auditors  (95  percent)  indicated  that  they  had 
financial  services  industry  expertise.  The  auditors  also  provided  experience  information  in  the 
post-test  questionnaire,  summarized  in  Table  1 . 

We  assigned  auditors  to  the  role  of  supervisor  or  subordinate  based  on  experience  information 
provided  by  their  firm,  so  we  had  a  potential  of  30  supervisor- subordinate  pairs.®  Nearly  90 
percent  of  the  subordinate  participants  were  seniors  or  staff/assistants.  Of  the  supervisor  partici¬ 
pants,  55  percent  were  in-charges  and  45  percent  were  seniors.  Compared  to  subordinates,  super¬ 
visors  had  more  experience  in  professional  accounting  (t  =  6.9,  p  <  0.001),  and  in  reviewing 


TABLE  1 


Experiment  1:  Audit  Teams 

Subordinates  Supervisors 


Financial  Services  Industry  Expertise 

28 

27 

(97%) 

(93%) 

Rank 

Assistant/First- Year 

2 

0 

Staff 

(7%) 

10 

0 

Senior 

(34%) 

14 

13 

(48%) 

(45%) 

In-Charge 

3 

16 

(10%) 

(55%) 

Experience  in  Professional  Accounting 

Mean 

1.8  years 

4.1  years 

Median 

1.8  years 

4.0  years 

Minimum 

0.0  years 

1.5  years 

Maximum 

4.0  years 

8.3  years 

Standard  Deviation 

0.9  years 

1.5  years 

Experience  as  a  Supervisor 

Mean 

0.4  years 

2.2  years 

Median 

0.3  years 

2.0  years 

Minimum 

0.0  years 

0.2  years 

Maximum 

1.5  years 

5.0  years 

Standard  Deviation 

0.4  years 

1.2  years 

n 

29 

29 

ments  in  unconscious  ways  (results  of  Experiments  1  and  2  support  this).  If  we  expected  that  auditors  are  aware  of 
intervention  effects,  we  might  expect  supervisors  who  perceive  a  larger  knowledge  advantage  to  attempt  to  offset  this 
effect  However  we  do  not  believe  auditors  are  conscious  of  intervention  effects  and  therefore  do  not  predict  this.  Since 
auditors  do  believe  goals  to  bias  their  judgments  (as  in  Wilks’  [2002]  Experiment  2  and  our  own  Experiment  2),  we  do 
expect  supervisors  who  perceive  a  larger  knowledge  advantage  to  try  to  offset  their  own  directional  goals,  as  H2 

predicts.  ,  ,  .  ^ 

^  The  sample  excludes  the  61st  auditor,  with  whom  we  paired  one  of  the  authors  present  at  the  expenment. 
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subordinates’  work  (t  =  6.8,  p  <  0.001).^  Each  auditor  knew  his/her  role,  but  not  the  identity  of 
his/her  counterpart,  who  was  in  a  different  room.  We  used  this  control  to  prevent  unmeasured 
peripheral  social  cues  from  influencing  our  results. 

We  excluded  one  outlying  observation  (the  supervisor’s  finalized  judgment  for  the  audit  team 
was  an  unreasonable  50  percent  discount  rate)  to  avoid  observing  spurious  effects.*  This  provided 
us  with  a  sample  of  29  usable  supervisor-subordinate  teams. 

Design 

Figure  1  summarizes  the  experimental  design,  and  Table  2  summarizes  variables  in  the  order 
that  participants  encountered  them  in  the  task.  All  participants  read  a  case  about  a  hypothetical 
auditing  client  and  information  relating  to  the  client’s  valuation  of  a  financial  asset.  The  dependent 
variable  of  interest  is  each  supervisor’s  finalized  judgment  (for  the  audit  team)  of  a  discount  rate 
for  the  valuation  of  a  client’s  asset  (FinalJgmt).  Lower  values  indicate  more  aggressive  (i.e., 
client-preferred)  valuations.  All  supervisors  received  input  (a  preliminary  discount  rate)  from  a 
subordinate  (SubJgmt)  before  forming  the  finalized  judgment  {FinalJgmt).  However,  half  of  the 
supervisors  were  first  asked  to  provide  guidance  for  the  subordinate’s  preliminary  judgment.  Audit 
teams  were  randomly  assigned  to  this  intervention  (/)  treatment,  the  experiment’s  only  manipu¬ 
lated  factor  (Figure  1). 

Experimental  Task 
Introduction 

Both  supervisors  and  subordinates  read  an  introduction  that  described  a  hypothetical  auditing 
client.  Retail  Credit,  Inc.  (RCI),  the  financing  arm  of  a  clothing  retailer.  The  supervisors’  instru¬ 
ment  stated  that  they  would  receive  input  from  another  professional  from  their  firm,  whom  they 
should  treat  as  a  subordinate  under  their  supervision.  Similarly,  the  subordinates’  instrument  stated 
that  another  professional,  whom  they  should  treat  as  a  supervisor,  would  review  their  work. 

Directional  Goals  to  Reach  an  Engagement  Partner’s  Preferred  Conclusion 

Supervisors  inherited  a  directional  goal  from  their  partner  for  the  audit  team.  In  particular,  the 
supervisors’  instruments  described  the  engagement  partner  as  generally  preferring  to  allow  client- 
preferred  positions  on  subjective  accounting  estimates  and  included  a  question  that  asked  partici¬ 
pants  to  interpret  the  partner’s  preferences  for  the  audit  team  judgments.  Supervisors  used  an 
11 -point  scale  to  indicate  whether  they  thought  the  partner  preferred  more  conservative/income- 
decreasing  estimates  (—5),  neutral  estimates  (0),  or  less  conservative/income-increasing  estimates 


^  As  part  of  the  experimental  control  that  kept  audit  team  members  anonymous  to  one  another  (as  mentioned  above),  no 
participants  knew  how  many  years  of  experience  the  other  auditor  with  whom  they  were  paired  had.  Additionally,  as 
explained  in  the  description  of  the  task  (below)  the  instrument  simply  asked  participants  to  assume  for  the  purposes  of 
the  case  their  role  as  supervisor  or  subordinate,  and  to  assume  the  opposite  role  for  their  anonymous  team  member. 
Thus,  a  priori,  the  anonymous  teams  should  prevent  inadvertent  differences  in  self-reported  actual  experience  (or  other 
unmeasured  social  cues)  between  supervisors  and  subordinates  (Table  1)  from  influencing  our  results.  Consi.stent  with 
this,  we  do  not  detect  any  effects  of  actual  experience  gap  in  our  results  ex  post. 

All  other  observations  ranged  from  9  percent  to  17  percent.  The  outlying  observation  responded  with  a  50  percent 
discount  rate  despite  receiving  case  information  of  8  percent  to  14  percent  as  a  range  for  valuations  of  instruments  with 
“typical”  risk  in  the  industry  and  a  subordinate’s  input  of  11  percent.  Multiple  diagnostic  tests  suggest  that  this 
observation  is  statistically  outlying  as  well  as  contextually  nonsen.sical  (Neter  et  al.  1996). 
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FIGURE  1 

Experiment  1:  Design 


No-Intervention  Condition  t/=  0) 


Supervisor  Subordinate 


Intervention  Condition  (I  =  1) 


Supervisor  Subordinate 
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Variable 

8 


G 

I 


DRG 


WG 


Index 


SubJgmt 


k 

K 

FinalJgmt 


TABLE  2 
Experiment  1 

Variables  in  Chronological  Order 

Description 

The  supervisor’s  assessment  of  partner’s  preferences  for  allowing  more 
client-preferred  (+5),  neutral  (0),  or  conservative  (—5)  valuations  of  the 
illiquid  financial  asset.  Higher  values  indicate  stronger  directional  goals  for 
client-preferred  valuations. 

Supervisors’  mean-centered  goals  (i.e.,  G  =  g  ~g). 

Manipulated  intervention.  Each  supervisor  either  does  (coded  1)  or  does  not 
(coded  0)  have  an  opportunity  to  provide  directional  rate  guidance  (DRG)  or 
written  guidance  (WG)  to  his  or  her  subordinates  before  the  subordinate 
provides  a  preliminary  discount  rate  {SubJgmt). 

The  directional  rate  guidance  that  supervisors  who  intervene  provide  to  their 
subordinates,  indicating  the  extent  to  which  the  discount  rate  should  be 
above  (+5),  at  (0),  or  below  (-5)  industry  averages.  Smaller  values  indicate 
more  aggressive,  client-preferred  valuations. 

The  written  guidance  that  supervisors  who  intervene  provide  to  subordinates, 
indicating  whether  the  discount  rate  should  be  higher  (positive)  or  lower 
(negative).  WG  is  computed  as  the  number  of  written  points  supporting  a 
higher  discount  rate  ( WG^^lg^  minus  the  number  of  written  points  supporting 
a  lower  discount  rate  (ITG^^;  i.e.,  WG=WGt,igh-WGi„^). 

A  composite  index  that  summarizes  the  contents  of  supervisors’  intervention 
{DRG  and  WG),  indicating  whether  each  supervisor’s  guidance  collectively 
suggests  higher  or  lower  discount  rates  to  his  or  her  subordinate.  Index  is 
the  first  principal  component  from  a  factor  analysis  of  DRG,  WG^igh,  and 

W'Gfo,. 

The  subordinate’s  judgment  of  a  discount  rate,  to  be  reviewed  as  a 
preliminary  input  by  his  or  her  supervisor,  who  will  finalize  a  judgment 
{FinalJgmt)  for  the  audit  team. 

The  supervisor’s  perceived  technical  knowledge  advantage  (-1-5)  or 
disadvantage  (-5),  relative  to  the  subordinate. 

Supervisors’  mean-centered  technical  knowledge  advantage  (i.e.,  K  =  k-k). 
The  supervisor’s  finalized  judgment  of  a  discount  rate  for  the  audit  team. 
Smaller  values  indicate  more  aggressive,  client-prefened  valuations. 


(  +  5).' 

We  define  the  variable  g  as  supervisors’  perceptions  of  the  partner’s  preferences.  A  well- 
replicated  finding  is  that  variation  in  an  engagement  partner’s  preferences  affects  auditors’  direc¬ 
tional  goals  {g)  to  reach  more  conservative  or  more  client-preferred  conclusions  (e.g.,  Wilks  2002; 
Brown  et  al.  1999;  Peecher  1996).  For  interpretive  reasons  discussed  in  the  results  (footnote  15), 
we  also  define  G  as  the  mean-centered  transformation  of  g  (i.e.,  G  =  g  -  g).  We  treat  larger  values 
of  g  and  G  as  indicators  of  stronger  supervisor  directional  goals  of  favoring  client-preferred, 
income-increasing  estimates. 


We  fashioned  the  language  communicating  the  partner’s  preference  after  the  optimistic  condition  in  Piercey  (2009). 
Overall,  participants  perceived  the  partner  s  preferences  tor  their  judgments  to  favor  income-increasing  estimates  (t  = 
3.76,  p  <  0.001).  As  a  realism  check,  a  post-test  question  also  asked  auditors  about  the  degree  of  their  awareness  (in 
practice)  of  supervisors  preferences  regarding  the  latitude  that  partners  allow  management  in  making  valuation  esti¬ 
mates.  On  a  0  “never  aware”  to  10  “extremely  aware”  scale,  mean  ratings  are  6.4  for  subordinates  and  6.2  for 
supervisors.  Both  groups’  ratings  are  above  0  (p’s  <  0.001)  and  the  midpoint  of  “somewhat  aware”  (p’s  <  0.003). 
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The  subordinates’  instrument  was  silent  about  partner  and  manager  preferences.  We  omitted 
such  information  to  reinforce  our  intervention  (/)  manipulation:  supervisors’  intervention  content 
would  be  the  only  way  that  subordinates  could  learn  about  such  matters  and  that  depended  on 
whether  they  were  randomly  assigned  to  the  intervention  or  no-intervention  condition. 

Case  Details 

Supervisors  and  subordinates  then  read  identical  information  about  the  valuation  of  a  financial 
asset.  The  client  had  securitized  its  credit  card  accounts  receivables  (selling  them  while  retaining 
a  10  percent  interest  in  their  cash  flows).  The  instrument  described  how  factors  such  as  the 
risk-free  rate,  default  risk,  credit  ratings,  collection  terms,  first-loss  terms,  pre-payment  rates, 
macroeconomic  factors,  liquidity,  and  volatility  could  affect  the  discount  rate  used  to  value  this 
retained  interest.  It  provided  8  percent  to  14  percent  as  a  typical  range  of  discount  rates  used  by 
credit  card  issuers  to  value  retained  interest,  and  9  percent  as  the  discount  rate  used  by  MBNA,  the 
largest  and  one  of  the  most  stable  credit  card  issuers.  The  provision  of  typical  discount  rates  for 
comparison  makes  this  an  analytical  procedures  task  (Ricchiute  2006). 

The  instrument  also  stressed  that  (1)  lower  discount  rates  increase  the  fair  value  estimate, 
which,  (2)  in  turn,  increases  the  client’s  realized  gain  on  securitization.  It  then  asked  two  questions 
to  quiz  participants  on  those  relationships.  The  instrument  then  provided  supervisors  and  subor¬ 
dinates  identical  details  about  RCI’s  portfolio  of  retained  interest,  some  (other)  details  suggesting 
its  risk  was  above  (below)  the  industry  average. 

Supervisor  Intervention  and  the  Subordinate’s  Preliminary  Judgment 

As  our  research  question  focuses  on  how  active  intervention  moderates  the  influence  of 
supervisors’  directional  goals  on  their  finalized  judgments,  we  desired  a  strong  manipulation  of 
active  intervention.^^  So,  in  one  level,  no  intervention  occurred  at  all,  while  we  prompted  both 
intervention  content  and  cognition  in  the  other  level.  In  the  no-intervention  condition,  supervisors 
first  assessed  a  preliminary  discount  rate  and  their  own  task-relevant  technical  knowledge,  and 
then  received  the  subordinates’  independently-formed  discount  rate  judgments  (SubJgmt),  along 
with  the  subordinates’  self-assessed  technical  knowledge. 

In  the  active  intervention  condition,  supervisors  also  first  assessed  a  preliminary  discount  rate 
and  their  own  task-relevant  knowledge.  However,  their  case  materials  then  advised  them  that  their 
subordinate  expected  to  receive  some  guidance  from  them  before  coming  up  with  a  preliminary 
judgment  for  the  discount  rate  (SubJgmt).  These  supervisors  were  asked  to  rate  the  importance  of 
four  case  details  (e.g.,  the  credit  ratings  of  RCI’s  customers)  on  a  1  to  5  scale  (ranging  from 
unimportant  to  important),  to  guide  the  subordinate  whether  the  discount  rate  should  be  below,  at, 
or  above  the  industry  average  (DRG),  and  to  provide  free-form  written  guidance  (ITG).  Thus, 


The  extent  to  which  and  conditions  under  which  partner  preferences  trickle  down  to  subordinates  in  practice  are 
important  empirical  questions.  Our  experiment  provides  one  test  of  this.  The  subordinates  were  unaware  of  supervisor 
preferences  (and  all  supervisors  were  aware  of  this)  so  that  we  cleanly  test  how  and  where  supervisors  inject  directional 
goals  into  audit  team  judgments.  As  our  theoretical  development  of  HI  discusses,  this  is  more  likely  to  occur  via 
intervention  cognition  influencing  the  supervisors’  finalized  judgments  than  via  overtly  biased  intervention  contents, 
which  would  threaten  justifiability. 

"  Providing  supervisors  and  subordinates  with  the  same  case  information  at  this  stage  allowed  them  both  to  reach  the 
intervention  and  subordinate  input  stages  at  approximately  the  same  time.  This  helped  maintain  control  over  the 
administration  of  the  experiment.  It  also  helped  ensure  that  supervisors’  ratings  of  their  technical  knowledge  advantage 
were  not  contaminated  by  instrument-driven  differences. 

12  strong  manipulation  is  consistent  with  the  tenet  to  “design,  plan,  and  conduct  research  so  that  the  experimental 
conditions  are  as  different  as  possible”  (Kerlinger  and  Lee  2000,  459).  In  practice,  many  variants  of  active  intervention 
exist,  with  some  being  more  active  and  others  less  active  than  our  treatment  condition.  Extant  theory  would  predict  such 
variation  to  influence  the  magnitude  but  not  the  direction  of  the  effect  of  active  intervention  on  supervisors’  judgment 
bias. 
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intervening  supervisors  provided  guidance  for  the  subordinate’s  discount  rate  with  some  support¬ 
ing  reasons  for  the  guidance. 

We  decided  to  structure  the  intervention  in  this  manner  to  help  ensure  that  participants  as¬ 
signed  to  the  intervention  condition  would  engage  in  more  active  intervention  cognition  than 
would  supervisors  not  sufficiently  motivated  to  intervene.  However,  structuring  the  intervention  in 
this  way  limits  our  ability  to  separate  the  effects  of  the  particular  way  in  which  the  instrument 
prompted  intervention  cognition  from  the  intervention  itself.  For  example,  we  cannot  isolate  the 
extent  to  which  the  effects  we  observe  are  driven  by  prompting  supervisors  for  importance  ratings 
of  the  details  relevant  to  their  judgment,  for  directional  rate  guidance  (DRG),  or  for  free-form 
written-guidance  iWG).  Instead,  we  can  only  observe  whether  the  active  intervention  as  we  op¬ 
erationalized  it  (including  all  of  these  prompts)  is  sufficient  to  show  an  effect.  We  discuss  addi¬ 
tional  limitations  of  this  design  choice  in  Section  VII. 

Higher  (lower)  values  of  DRG  indicate  higher  (lower)  directional  rate  guidance.  WG^igh 
equals  the  number  of  written  points  favoring  a  higher  rate,  while  WGig„  equals  the  number  of 
written  points  favoring  a  lower  rate.  WG  equals  the  “net”  written  guidance  (i.e.,  WG  =  WG^igh 
-  WGig^^)P  We  also  created  Index,  a  composite  summarizing  the  contents  of  a  supervisor’s  inter¬ 
vention  (including  both  directional  and  written  guidance).  Specifically,  Index  is  the  first  principal 
component  score  from  a  factor  analysis  of  DRG,  WG^ig^,  and  WGio„„. 

Subordinates  in  the  intervention  condition  then  received  their  supervisor’s  guidance,  formed 
their  preliminary  discount  rate  judgment  (SubJgmt),  and  assessed  their  own  task-relevant  technical 
knowledge.  As  in  the  no-intervention  condition,  supervisors  then  received  the  subordinates’  judg¬ 
ments  (SubJgmt)  and  self-assessed  knowledge,  and  then  finalized  a  judgment  for  the  audit  team 
(FinaUgmt). 

Supervisors’  Perceived  Knowledge  Advantage 

We  measured  supervisors’  perceived  technical  knowledge  advantage  over  subordinates  by 
having  each  supervisor  rate  whether  they  believe  their  own  technical  knowledge  dominates,  ap¬ 
proximates,  or  is  dominated  by  that  of  the  subordinate.  We  define  k  as  supervisors’  perception  of 
their  own  technical  knowledge  relative  to  subordinates’.  For  interpretive  reasons  discussed  in  the 
results  (footnote  15),  we  also  define  K  as  the  mean-centered  transformation  of  k  (i.e.,  K=k-k). 
Larger  values  of  k  and  K  indicate  more  perceived  technical  knowledge  relative  to  the 
subordinate.^"^ 


WGhigh,  tV'Gio,,,,  and  WG  are  the  averages  of  scores  assigned  by  two  content-coders:  an  author  who  was  blind  to  the 
remainder  of  the  data  when  content-coding  the  supervisors’  interventions,  and  a  doctoral  student  who  was  not  blind  to 
the  remainder  of  the  supervisors’  data,  but  was  unaware  of  the  hypotheses  and  the  subordinates’  data.  The  two  content- 
coders’  scores  are  highly  positively  correlated  (r  =  0.89,  p  <  0.001  for  r  =  0.88,  p  <  0.001  for  WGio„-,  and  r 

=  0.91,  p  <  0.001  for  WG).  Cronbach’s  a  indicate  very  high  inter-rater  reliability  (a  =  0.93  for  WGf^igf,,  a  =  0.93  for 
WG,„,„  and  a  =  0.94  for  WG). 

We  measured  k  at  this  point  in  the  task  because  our  theory  would  predict  supervisors’  ex  ante  impressions  of  their 
technical  knowledge  advantage  to  influence  their  judgments.  We  made  this  design  choice  because,  had  we  measured  it 
in  a  post-test  questionnaire,  our  tests  of  H2  would  have  been  confounded  by  a  reverse  causality  explanation  that 
supervisor  participants  altered  their  post-judgment  assessments  of  k  ex  post  to  justify  the  extent  to  which  goals  influ¬ 
enced  their  judgments  (MacKinnon  [2008]  and  Kenny  [2008]  point  out  that  experiments  should  measure  such  variables 
first  when  they  would  be  tainted  by  such  reverse-causality).  However,  we  acknowledge  that  measuring  k  as  we  did  can 
modify  behavior.  We  believe  that  supervisors  likely  consult  their  perceptions  of  the  technical  knowledge  of  subordinates 
relative  to  themselves  when  getting  their  input  on  a  technical  matter,  and  we  made  our  measurement  as  innocuous  as 
possible.  Supervisors  in  both  conditions  assessed  k  after  assessing  their  own  technical  knowledge  and  receiving  their 
subordinates’  self-assessed  technical  knowledge.  We  anticipated  that  supervisors’  perceived  knowledge  advantage  over 
subordinates  (k)  would  be,  on  average,  a  self-serving  re-interpretation  of  their  own  and  their  subordinates’  self- 
assessments.  Results  suggest  that  this  is  the  case:  supervisors’  perceived  knowledge  advantage  is  positively  related  to  the 
difference  between  the  supervisors’  and  subordinates’  self-assessments  (p  =  0.048),  but  self-servingly  re-interpreted. 
Specifically,  on  average,  supervisors  perceived  a  significant  technical  knowledge  advantage  over  their  subordinates  (p  = 
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Dependent  Variable  and  Post-Test  Questionnaire 

Supervisors  then  finalized  the  discount  rate  for  the  audit  team  {FinalJgmt),  our  primary 
dependent  variable  of  interest.  Finally,  both  supervisors  and  subordinates  placed  all  of  the  experi¬ 
mental  materials  up  to  that  point  in  an  envelope  and  then  completed  a  post-test  questionnaire. 

IV.  EXPERIMENT  1:  RESULTS 

Manipulation  Checks 

While  supervisors  in  the  intervention  condition  could  provide  guidance,  those  in  the  no¬ 
intervention  condition  had  no  means  of  communicating  with  their  subordinates.  These  features  of 
the  experiment  gave  us  direct  control  over  ensuring  a  successful  intervention  manipulation.  Ad¬ 
ditionally,  the  post-test  questionnaire  asked  all  supervisors  and  all  subordinates  to  rate  how  much 
they  perceived  that  the  supervisor’s  intervention  influenced  the  subordinate’s  judgment.  The  au¬ 
ditors  rated  this  on  a  scale  from  0  (“there  was  no  intervention;  the  subordinate  made  an  indepen¬ 
dent  assessment”)  to  10  (“a  high  amount  of  intervention;  the  guidance  strongly  influenced  the 
subordinate’s  assessment”).  Mean  perception  ratings  are  higher  in  the  intervention  condition  than 
in  the  no-intervention  condition  for  subordinates  (5.3  versus  3.4,  t  =  2.22,  Pone-taiied  =  0-017)  and 
weakly  higher  for  supervisors  (5.2  versus  3.6,  t  =  1.54,  Pone-taiied  =  0.067). 

Comprehension  Checks 

At  the  beginning  of  the  materials,  two  questions  measured  whether  participants  understood 
that  (1)  a  decrease  in  the  discount  rate  increases  the  fair  value  of  the  residual  interest,  and  (2)  an 
increase  in  the  fair  value  of  the  residual  interest  increases  the  client’s  gain.  Of  the  29  supervisors, 
27  (93  percent)  answered  the  first  question  correctly  and  26  (90  percent)  answered  the  second 
question  correctly.  Of  the  29  subordinates,  29  (100  percent)  answered  the  first  question  correctly 
and  24  (83  percent)  answered  the  second  question  correctly.  Overall,  it  appears  that  the  auditors 
generally  understand  the  relationships  between  the  discount  rate  and  the  client’s  valuation  and 
gain. 

Replicating  the  Effects  of  Supervisors’  Intervention  Content  on  Subordinates’  Judgments 

We  examined  whether  the  directional  contents  of  supervisors’  intervention  influenced  their 
subordinates’  judgments,  consistent  with  prior  studies  (e.g.,  Peecher  1996;  Wilks  2002).  Compar¬ 
ing  our  study  to  prior  research,  two  differences  are  noteworthy;  First,  only  half  of  our  subordinates 
received  intervention  content.  This  leaves  us  with  a  modest  sample  of  n  =  14  audit  teams  to 
observe  whether  the  directional  content  of  the  interventions  directionally  influences  subordinates’ 
judgments,  replicating  prior  research.  Second,  prior  studies  actually  manipulated  the  intervention 
content  from  hypothetical  supervisors  (who  favored  either  more  conservative  or  more  aggressive 
accounting  estimates)  to  test  for  its  directional  influence  on  subordinates  judgments,  using 
experimenter-provided  language  that  was  held  constant  within  conditions  (e.g.,  Peecher  1996, 
Wilks  2002).  Our  experiment  let  actual  supervisors  provide  their  own  language  to  subordinates, 
allowing  for  a  replication  and  robustness  test  from  experimenter-provided  (prior  studies)  to 
participant-provided  intervention  content  (this  study). 


0  037)  even  though  the  difference  between  supervisors’  and  subordinates’  self-assessments  did  not  really  indicate  to 
supervisors  such  an  advantage  (p  =  0.21).  We  originally  included  supervisors’  self-serving  perceptions  of  their  own 
knowledge  advantage  {k)  in  the  instrument  (rather  than  rely  on  the  computed  difference  between  the  self-assessments) 
because  our  theory  would  predict  the  former  to  be  more  important  than  the  latter  in  influencing  supervisor  judgment. 
Interestingly,  and  theory-consistent,  we  find  supervisors’  perceived  knowledge  advantage  to  be  significant  in  our  tests  of 
H2  (see  Section  IV),  but  not  the  computed  difference  in  the  self-assessments  of  knowledge,  even  though  the  two  are 
positively  correlated. 
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We  find  evidence  at  p  <  0.06  that  supervisors’  combined  directional  rate  guidance  and  written 
guidance  directionally  influenced  subordinates,  replicating  prior  research.  First,  recall  that  we 
defined  Index  as  the  first  principal  component  of  DRG,  WChigh,  and  WGi^^,  and,  as  such.  Index  is 
a  composite  measure  of  each  supervisor’s  intervention  content.  The  factor  analysis  (Table  3,  Panel 
A)  yields  Index  as  a  weighted  average  of  DRG,  WG^igh,  and  -  WGig^.  These  signs  of  the  factor 
loadings  suggest  that  the  first  principal  component  (which  summarizes  65  percent  of  the  variance 
in  DRG,  WGhigh,  and  WGig^)  captures  the  supervisors’  collective  directional  guidance.  Second,  as 
Table  3,  Panel  B  shows,  SubJgmt  is  weakly  positively  correlated  with  Index,  as  well  as  with  DRG 
and  WG.  Specifically,  subordinates’  discount  rates  are  lower  (higher)  when  supervisors  suggested 
lower  (higher)  rates  {DRG)  and  as  supervisors  provided  fewer  (more)  items  of  written  guidance 
favoring  higher  rates. 

Tests  of  Hypotheses 

Hypothesis  1  predicts  that  a  supervisor’s  intervention  distorts  his  or  her  finalized  judgment  for 
the  audit  team,  beyond  those  of  supervisors  who  have  the  same  goals  but  do  not  intervene,  and 
after  controlling  for  any  effects  on  the  anchor  provided  by  the  subordinate’s  judgment.  In  contrast, 
H2  predicts  that  a  supervisor’s  perceived  knowledge  advantage  tempers  the  directional  effect  of 
goals  on  the  supervisor’s  finalized  judgment. 

We  use  the  following  linear  regression  model  to  test  our  hypotheses: 

FinalJgmt  =  /3o  +  (dj  +  P2G  +  +  P^{I  X  G)  +  P^{I  X  K)  +  ^(,{G  X  K)  +  /3j{I  X  G  X  K) 

+  (B^SubJgmt  +  e .  ( 1 ) 


TABLE  3 


Experiment  1 

Effects  of  Supervisor’s  Intervention  on  Subordinates’  Judgments 


Panel  A:  Factor  Analysis  of  Supervisor’s  Intervention** 


DRG  factor  coefficient 
WG^igh  factor  coefficient 
WGig„  factor  coefficient 
%  variance  explained 
n 


Index 

(First  Principal  Component) 

1.36 

1.12 

-0.21 

65% 

14 


Panel  B:  Correlations  with  Subordinates’  Judgments  {SubJgmt)^ 


Predicted 

Sign 

Pearson’s  r 

P 

Spearman’s  r 

P 

Kendall’s  t 

P 

DRG 

+ 

0.34 

0.118 

0.42 

0.066 

0.34 

0.060 

+ 

0.45 

0.055 

0.54 

0.024 

0.34 

0.062 

WGlg^ 

— 

0.06 

0.840 

0.04 

0.879 

0.01 

0.952 

WG 

-b 

0.39 

0.086 

0.37 

0.094 

0.31 

0.077 

Index 

-b 

0.45 

0.054 

0.44 

0.058 

0.32 

0.059 

“  Variable  definitions  appear  at  Table  2. 

^  Except  for  those  of  WGi„„,  the  correlation  estimates  are  in  the  predicted  direction  and  p-values  are  one-tailed. 
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FinalJgmt  is  the  supervisor’s  finalized  judgment  of  the  most  reasonable  discount  rate  for  the  audit 
team  {lower  values  indicate  more  aggressive,  client-preferred  judgments).  /  =  1  for  audit  teams  in 
the  intervention  condition,  and  0  otherwise.  G  is  the  supervisor’s  mean-centered  perception  of  the 
partner’s  preferences  {higher  values  indicating  more  aggressive,  client-preferred  preferences), 
while  K  is  the  supervisor’s  mean-centered  perceived  task-relevant  technical  knowledge  relative  to 
the  subordinate  {higher  values  indicating  a  larger  perceived  knowledge  advantage  over  the  sub¬ 
ordinate).  SubJgmt  (the  subordinate’s  preliminary  judgment)  is  a  control  variable.  A  classic  an¬ 
choring  effect  (Tversky  and  Kahneman  1974)  would  predict  the  coefficient  for  SubJgmt  to  be 
significantly  positive.  Including  SubJgmt  allows  for  a  cleaner  test  of  HI  by  controlling  for  any 
concurrent  effects  impounded  in  the  subordinate’s  initial  judgment.  We  test  the  above  model  and 
a  parsimonious  model  that  drops  the  two  unhypothesized  and  insignificant  terms  I  X  K  and  I  X  G 
X  K  in  Table  4.^^ 

As  Table  4,  Panel  A  indicates,  the  coefficient  for  SubJgmt  is  significantly  positive  (pone-taiied 
<  0.039),  suggesting  that  subordinates’  judgments  (and  any  effects  impounded  therein)  are  indeed 
reflected  in  the  supervisors’  finalized  judgments  for  the  audit  team.^^ 

Hypothesis  1  predicts  that  the  coefficient  for  7  X  G  will  be  negative.  In  other  words,  G  will 
influence  FinalJgmt  more  for  intervening  supervisors  than  for  non-intervening  supervisors,  after 
controlling  for  any  effects  impounded  in  SubJgmt.  Larger  values  of  G  indicate  that  the  supervisor 
perceives  that  the  engagement  partner  prefers  more  aggressive,  client-preferred  valuations.  Lower 
values  of  FinalJgmt  indicate  lower  discount  rates  and,  therefore,  more  aggressive  valuations. 

As  predicted  by  HI,  the  coefficient  for  7  X  G  is  significantly  negative  (Pone-taiied  —  0-016). 
This  result  supports  HI.  This  negative  interaction  indicates  that  the  same  goals  to  reach  a  preferred 
conclusion  influenced  supervisors’  finalized  discount  rates  more  among  supervisors  who  intervene 
than  among  those  who  do  not,  controlling  for  effects  concurrently  impounded  in  subordinates’ 
judgments.  When  7=1,  the  coefficient  for  G  on  FinalJgmt  is  weakly  negative  in  the  full  model 
(pone-taiied  =  0.056)  and  significantly  negative  in  the  parsimonious  model  (pone-taiied  =  0.033;  Table  4, 
Panel  B).  When  7=0,  the  coefficient  for  G  on  FinalJgmt  is  statistically  insignificant  (p  >  0.10; 
Table  4,  Panel  A). 

Hypothesis  2  predicts  that  supervisors’  perceived  knowledge  advantage  K  will  counteract  the 
effects  of  G  on  FinalJgmt.  As  shown  in  Table  4,  Panel  A,  the  coefficient  iox  G  X  K  is  significantly 
positive  (Pone-taiied<  0.001).  TMs  interaction  indicates  that  supervisors  selected  a  discount  rate 


We  use  the  mean-centered  variables  G  znA  K  rather  than  g.and  k  for  the  following  reason;  While  our  independent 
construct  of  interest  are  g  and  k,  mean-centering  these  two  continuous  variables  is  necessary  to  properly  interpret  the 
t-tests  in  the  above  regression  model  as  tests  of  HI  and  H2  (Echambadi  and  Hess  2007).  Mean-centering  continuous 
variables  is  the  equivalent  of  switching  from  a  test  of  simple  effects  to  a  test  of  main  effects  in  classical  ANOVA 
(Echambadi  and  Hess  2007;  Irwin  and  McClelland  2001).  For  example,  if  we  had  used  g  and  k  in  the  above  model,  then 
our  test  of  HI  in  Table  4  would  show  that  intervention  amplifies  directional  goals  when  k  =  0  (Irwin  and  McClelland 
2001).  Using  the  mean-centered  variables  G  and  K  appropriately  switches  the  test  of  HI  in  Table  4  to  a  test  of  at  /  X 
G  across  the  average  of  all  levels  of  k  rather  than  at  a  single  value  that  is  not  representative  of  the  distribution  of  k  (Irwin 
and  McClelland  2001;  Echambadi  and  Hess  2007).  Correspondingly,  HI  predicts  intervention  and  goals  to  interact  not 
just  at  one  particular  value  of  k,  but  across  all  levels  of  k.  Mean-centering  accomplishes  this  without  increasing  the  risk 
of  Type  I  errors  (Echambadi  and  Hess  2007).  We  continue  to  use  simple-effects  coding  of  the  categorical  variable 
intervention  (/)  for  exposition  (e.g..  Table  4,  Panel  B).  When  we  re-run  Table  4  using  main  effects  coding  for  /  (i.e.,  7 
=  1  or  -1,  as  in  classical  ANOVA  tests),  those  results  are  statistically  similar  to  Table  4  and  similarly  support  our 
hypotheses.’  Following  Echambadi  and  Hess  (2007),  we  test  our  regression  models  for  multicollinearity  using  the 
uncentered  variables  that  represent  the  constructs  of  interest  (g  and  k)  and  multiple  diagnostic  tests.  These  tests  indicate 
that  we  do  not  have  multicollinearity  problems  (e.g.,  all  VIFs  <  10). 

Prior  research  indicates  that  supervisors  are  influenced  by  their  subordinates’  inputs  (e.g.,  Ricchiute  1999).  This  effect 
has  primarily  been  shown  under  conditions  in  which  supervisors  were  relying  on  subordinates  for  information.  We  thank 
an  anonymous  reviewer  for  pointing  out  that  our  study  extends  this  prior  work  by  testing  whether  supervisors  still  rely 
on  their  subordinates’  judgments  as  an  input  even  when  they  are  aware  that  they  have  approximately  the  same  set  of 
information  as  subordinates. 
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TABLE  4 
Experiment  1 

General  Linear  Regressions  of  Supervisors’  Finalized  Judgments 
Finaljgmt  =  Po  +  Pi^  =  P2^  =  Ps^  =  P/I  x  G)  +  ps(I  x  K)  +  p^(G  x  K)  +  Pj(I  x  G  x  K) 
+  PgSubJgmt  +  e  ^ 


Panel  A:  Regression  Analysis 


Variable 

Predicted 

Sign^ 

Coefficient 
(t-  statistic) 

Coefficient 
(t-  statistic) 

Intercept 

8.63 

8.76 

(10.10)*** 

(10.41)*** 

7 

0.16 

0.19 

(0.64) 

(0.74) 

G 

0.1-3 

0.14 

(1.61) 

(1.71) 

K 

0.43 

0.38 

(4.05)*** 

(4.04)*** 

IX  G 

-(HI) 

-0.26 

-0.28 

(-2.32)** 

(-2.66)*** 

IX  K 

-0.26 

— 

(-1.17) 

— 

G  X  K 

+  (H2) 

0.29 

0.29 

(6.18)*** 

(6.58)*** 

I  X  G  X  K 

-0.14 

— 

(-0.89) 

— 

SubJgmt 

-b 

0.16 

0.15 

(1.98)** 

(1.85)** 

R2 

0.884 

0.868 

Adjusted 

0.836 

0.830 

n 

28^^ 

28" 

Panel  B:  Supplementary  Analysis 


Variable 

Predicted 

Sign^ 

Coefficient 
(t-  statistic) 

Coefficient 
(t-  statistic) 

G  when  I  =  1 

— 

-0.13 

-0.15 

(i.e.,  P2  +  Pa) 

(-1.67)* 

(-1.94)** 

*,  **,  ***  Indicate  p  <  0.10,  p  <  0.05,  and  p  <  0.01,  respectively. 
“  Variable  definitions  appear  at  Table  2. 

*’  Directional  predictions  are  tested  with  one-tailed  t-tests. 

One  sample  unit  is  excluded  due  to  missing  data. 


more  inconsistent  (consistent)  with  their  directional  goals  as  their  perceived  knowledge  advantage 
increased  (decreased).  This  result  supports  Finally,  Table  4  shows  that  the  models  explain  a 


’’  In  supplementary  analyses  of  H2,  we  mean-split  K  into  a  dichotomous  variable  to  gain  a  better  understanding  of  the 
nature  of  this  G  X  K  interaction.  Results  of  these  tests  indicate  that  H2  is  driven  primarily  by  higher-knowledge 
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high  percentage  of  the  variance  in  supervisors’  finalized  judgments  (adjusted  approximately  83 
percent)/® 

Supplementary  Analyses  and  Discussion 

As  noted  in  the  theoretical  development  of  HI,  people  engaged  in  motivated  reasoning  gen¬ 
erally  do  not  consciously  bias  their  judgments.  Rather,  they  bias  their  judgments  in  subtle,  uncon¬ 
scious  ways  that  maintain  an  appearance  of  objectivity  (Kunda  1990;  Kadous  et  al.  2003;  Piercey 
2009).  Because  of  this,  HI  does  not  predict  intervening  supervisors  to  bias  their  intervention 
content  in  order  for  active  intervention — via  intervention  cognition — to  influence  the  discount 
rates  they  choose.  Moreover,  it  is  not  necessary  that  intervention  content  be  biased  by  directional 
goals  in  order  for  these  goals  to  distort  intervention  cognition  in  a  goal-consistent  direction. 
Regardless  of  how  intervention  content  concurrently  affects  subordinates,  intervention  cognition  is 
predicted  to  amplify  the  effects  of  an  intervening  supervisor’s  own  directional  goals  on  the  super¬ 
visor’s  final  judgments  (HI). 

We  find  evidence  of  this  causal  explanation  in  supplementary  analyses.  First,  the  post-test 
questionnaire  provides  evidence  that  the  intervention  effects  observed  in  our  tests  of  HI  are 
unconscious.  It  asked  all  supervisors  to  rate  the  extent  to  which  they  believed  that  the  engagement 
partner’s  views  influenced  their  judgments  on  a  scale  from  0  to  10.  Supervisors  indicated  that  the 
partner’s  preference  had  at  least  some  influence  on  their  judgments  (t  =  7.39,  p  <  0.001). 
However,  this  did  not  vary  by  intervention  condition  (3.09  versus  2.93,  t  =  0.18,  p  =  0.86).  This 
suggests  that  supervisors  were  unaware  that  intervention  increased  the  effect  of  directional  goals 
on  their  final  judgments. Second,  while  subordinates  were  indeed  influenced  by  the  direction  of 
the  guidance  they  received  (replicating  prior  studies.  Table  3),  supervisors  did  not  systematically 
tilt  their  intervention  content  in  a  goal-consistent  direction.  That  is,  we  do  not  detect  significant 
correlations  between  G  and  any  of  DRG,  WG,  Index,  or  SubJgmt  among  our  14  intervention  audit 
teams  (all  p  >  0.35).^° 

Thus,  it  is  not  simply  the  case  that  supervisors  bias  their  judgments  by  steering  their  subor¬ 
dinates  in  the  supervisors’  goal-consistent  directions  and  then  simply  anchoring  on  their  subordi¬ 
nates’  judgments,  as  suggested  by  prior  research.  The  additional  bias  in  the  finalized  judgments  of 
intervening  supervisors  cannot  simply  be  attributed  to  effects  their  intervention  content  had  on 


supervisors  tempering  their  directional  goals.  Specifically,  while  all  supervisors  provide  more  goal-consistent  discount 
rates  as  they  coach  subordinates  (t  =  -3.08  p  =  0.006),  high-knowledge  supervisors  provide  less  goal-consistent 
discount  rates  (t  =  4.34,  p  <  0.001).  Low  knowledge  supervisors,  on  the  other  hand,  do  not  (t  =  0.83,  p  =  0.43). 

We  interpret  our  high  adjusted  as  indicative  that  the  model  variables  explain  systematic  versus  residual  variance 
within  the  experimental  task  well,  controlled  extraneous  sources  of  variance  well,  or  both  (Kerlinger  and  Lee  2000).  We 
do  not  interpret  this  as  indicating  that  the  independent  variables  explain  similarly  high  amounts  of  variance  in  the  real 
world,  for  several  reasons.  Other  sources  of  variance  in  the  real  world  (e.g.,  interpersonal  social  cues  that  would 
accompany  intervention)  were  controlled  within  the  experimental  setting  but  are  not  in  the  real  world.  Additionally,  the 
experiment  measures  variables  of  interest  to  the  experimental  task  with  response  scales  designed  to  control  variance. 
Overall,  the  model’s  adjusted  suggests  successful  experimental  design  and  good  predictive  validity  within  the 
experimental  setting  (Neter  et  al.  1996;  Maxwell  and  Delaney  1990;  Kerlinger  and  Lee  2000). 

Consistent  with  this,  supervisors  rated  their  perceptions  of  the  accuracy  of,  their  confidence  in,  and  the  justifiability  of 
their  judgments.  We  find  no  significant  differences  in  these  ratings  between  the  intervention  and  no-intervention  groups, 
further  supporting  the  notion  that  HI  is  an  unconscious  effect.  In  contrast,  H2  predicts  that  a  larger  perceived  knowledge 
advantage  will  lead  supervisors  to  reduce  their  directional  goals,  which  suggests  that  knowledge  effects  in  H2  can  be 
conscious  Consistent  with  this,  supervisors’  accuracy,  confidence,  and  justifiability  ratings  do  increase  with  perceived 
technical  knowledge  advantage  (r  =  0.29,  p  =  0.064;  r  =  0.48,  p  =  0.004;  r  =  0.28,  p  =  0.072,  respectively). 

Prior  research  uses  hypothetical  supervisors  and  implicitly  assumes  that  the  supervisor’s  intervention  content  (i.e.,  DRG, 
WG)  and  the  supervisor’s  directional  goals  (i.e.,  G)  represent  the  same  construct  (e.g.,  Peecher  1996;  Brown  et  al.  1999; 
Wilks  2002;  Piercey  2009).  We  improve  upon  prior  literature  by  using  actual  audit  teams,  an  experimental  design  that 
allows  us  to  disentangle  these  two  constructs,  and  a  causal  model  that  acknowledges  them  as  distinct  constructs  that  can 
even  be  uncorrelated  as  supervisors  with  directional  goals  attempt  to  maintain  an  appearance  of  objectivity. 
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their  subordinates.  It  also  cannot  simply  be  attributed  to  intervening  supervisors  adjusting  for  a 
subordinate’s  failure  to  be  influenced  by  intervention  in  the  supervisor’s  goal-consistent  directions. 
Because  both  groups  of  supervisors  received  subordinate  input  uninfluenced  by  G,  the  effects  of 
HI  are  not  due  to  differences  in  the  subordinate’s  input.  Non-intervening  supervisors’  subordinates 
were  also  not  influenced  by  intervention,  and  non-intervening  supervisors  with  the  same  goals 
could  have  made  the  same  adjustment  to  the  finalized  discount  rate,  but  did  not  (Table  4).  Rather, 
these  supplemental  findings  underscore  the  significance  of  H 1 ;  intervention  cognition  appears  to 
cause  supervisors  to  bias  their  judgments  more  than  non-intervening  supervisors  with  the  same 
goals  to  reach  a  preferred  conclusion,  after  controlling  for  any  effects  of  intervention  on  the  initial 
anchor  provided  by  subordinates.  These  supplemental  findings,  together  with  our  tests  of  HI,  also 
highlight  that  the  subordinate’s  judgment  (the  primary  focus  of  prior  research)  is  only  an  input  into 
the  audit  team  judgment.  We  extend  prior  literature  with  new  evidence  that  intervention  effects  on 
the  supervisor  introduce  a  distinct  bias  into  audit  team  judgments,  separate  from  subordinate 
effects. 


V.  EXPERIMENT  2:  METHOD 

Motivation 

Whether  these  effects  are  conscious  or  unconscious  is  a  crucial  determinant  of  the  likelihood 
that  auditors  can  successfully  attempt  to  improve  their  own  or  other  auditors’  judgments  via 
conscious  review  or  quality-control  procedures  (Lemer  and  Tetlock  1999;  Wegener  et  al.  2001; 
Moore  and  Loewenstein  2004).  If  auditors  are  unaware  of  the  biasing  effects  of  a  supervisor’s 
intervention  on  the  supervisor,  then  training  or  policy  changes  would  likely  help  pre-empt  or 
correct  the  biasing  effects  of  a  supervisor’s  intervention.  Training  interventions  could  include 
making  auditors  aware  of  this  effect  and  encouraging  them  to  consider  their  own  directional  goals 
or  striving  to  obtain  relatively  more  independent  work  from  their  subordinates.  Policy  interven¬ 
tions  could  include  increased  emphasis  on  reviews  by  individuals  who  are  more  removed  from  the 
audit,  such  as  the  concurring  or  cold  partner  reviews  that  the  PCAOB  currently  deems  insufficient 
in  practice  (PCAOB  2008).  Thus,  the  findings  from  Experiment  1  not  only  inform  prior  auditing 
research  and  theory,  but  they  could  also  have  important  policy  implications  for  audit  firms  and 
audit  regulators.  The  less  aware  auditors  are  of  these  effects  shown  in  Experiment  1,  the  more 
informative  and  adverse  these  implications  can  be.  This  warrants  an  experiment  designed  more 
specifically  to  test  auditors’  awareness  of  these  effects  (cf.  Wilks  2002,  Experiment  2). 

Participants 

One  hundred  five  auditors  from  a  major  accounting  firm  volunteered  as  participants.  Unlike 
Experiment  1,  our  sample  unit  for  this  experiment  is  an  individual  auditor.  On  average,  partici¬ 
pants  had  approximately  three  years  of  audit  experience  and  1 .3  years  as  supervisors.  Six  partici¬ 
pants  did  not  complete  the  experimental  instrument,  giving  us  a  sample  of  99  auditors. 

Design 

We  use  a  within-subjects  design:  a  3  X  2  X  2  balanced,  incomplete-block  design  (see  Winer 
1971,  622).  The  manipulated  factors  are  G  (goals  to  reach  a  more  conservative,  neutral,  or  ag¬ 
gressive  conclusion),  I  (intervention  or  no  intervention),  and  K  (knowledge  advantage  or  no 
knowledge  advantage).  These  factors  described  the  supervisor  of  a  hypothetical  auditing  team. 
Each  participant  received  a  block  of  six  of  the  12  possible  cases,  and,  for  each,  judged  the  amount 
of  bias  that  would  be  present  in  the  audit  team  judgments. 

We  chose  a  within-subjects  design  because  the  objective  of  the  experiment  is  to  measure 
auditors’  intentionally  expressed  beliefs  about  how  G,  /,  and  K  influence  the  amount  of  bias  in 
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audit  team  judgments  (Libby  et  al.  2002).^'  The  incomplete  block-design  kept  the  task  short. 

The  variable  G  manipulates  the  supervisor’s  goals  to  reach  a  preferred  audit  conclusion  (in  the 
same  asset  securitization  context).  The  neutral  conditions  described  the  supervisor  as  preferring 
the  best  possible  or  highest  quality  accounting  policies,  without  prejudice  to  earnings  management 
objectives.  The  aggressive  conditions  described  the  supervisor  as  encouraging  clients  to  use  ag¬ 
gressive  accounting  practices.  The  conservative  conditions  describe  the  supervisor  as  encouraging 
clients  to  use  conservative  accounting  practices. 

The  variable  I  manipulates  whether  the  supervisor  actively  intervened  in  the  subordinate’s 
judgment.  In  the  intervention  conditions,  the  supervisor  explained  his  expectations  for  the  esti¬ 
mate,  and  guided  the  subordinate  as  s/he  (the  subordinate)  formed  a  preliminary  judgment.  In  the 
no-intervention  conditions,  the  subordinate  developed  his/her  judgment  without  any  guidance 
from  the  supervisor.  In  all  conditions,  the  subordinate  then  documented  his/her  judgment  and 
forwarded  this  documentation  to  the  supervisor.  The  supervisor  then  reviewed  the  subordinate’s 
memo  and  finalized  the  audit  team’s  estimate  of  fair  value  by  making  any  adjustments  deemed 
necessary  to  the  subordinate’s  judgment.  Note  that  this  narrative  describing  the  audit  teams  par¬ 
allels  what  took  place  among  actual  audit  teams  in  Experiment  1. 

K  manipulated  the  supervisor’s  technical  knowledge  relative  to  the  subordinate.  The 
knowledge-advantage  (no-knowledge  advantage)  conditions  described  the  supervisor  as  possess¬ 
ing  tremendous  (moderate)  technical  accounting  expertise  and  as  quite  (somewhat)  knowledgeable 
about  accounting  for  asset  securitizations.  In  comparison,  all  conditions  described  the  subordinate 
as  possessing  moderate  technical  accounting  expertise  and  as  being  somewhat  knowledgeable 
about  asset  securitizations. 

Task 

Before  commencing,  participants  read  a  description  of  a  fictional  client’s  asset  securitization 
transaction  (the  same  context  as  in  Experiment  1)  and  answered  a  fill-in-the-blank  question  de¬ 
signed  to  test  their  understanding  of  how  estimates  of  the  fair  value  of  residual  interest  influence 
reported  income. 

Dependent  Variable  and  Post-Test  Questionnaire 

Eor  each  case,  participants  rated  the  extent  to  which  bias  causes  the  audit  team’s  estimate  of 
fair  value  to  deviate  from  the  best  possible  estimate.  These  ratings  were  on  a  seven-point  scale 
from  -3  (“Audit  team  estimate  likely  is  extremely  low”)  to  0  (“Audit  team  estimate  undoubtedly 
is  the  best  possible  estimate”)  to  -f  3  (“Audit  team  estimate  likely  is  extremely  high”).  This  rating 
is  our  primary  dependent  variable  of  interest  for  Experiment  2,  Perceived  Bias.  Upon  completing 
the  cases,  participants  placed  their  answers  in  an  envelope  and  completed  a  post-test  questionnaire, 
which  included  manipulation  checks  and  demographic  questions. 

VI.  EXPERIMENT  2:  RESULTS 

Manipulation  Checks 

In  post-test  questions,  we  replicated  language  used  in  different  levels  of  our  manipulations 
and  asked  participants  to  evaluate  the  language  along  five-point  scales.  Eor  G,  the  scale  ranged 
from  1  (“The  manager’s  preferences  are  highly  biased  toward  conservative  accounting  estimates”) 
to  5  (“The  manager’s  preferences  are  highly  biased  toward  aggressive  accounting  estimates”). 


As  Libby  et  al.  (2002)  note,  within-subjects  designs  are  particularly  desirable  for  experiments  that  investigate  individu¬ 
als’  current  beliefs  or  thinking  about  hov.'  manipulated  factors  directionally  affect  phenomena  of  interest.  Within-subjects 
designs  also  create  a  high-power  environment  for  testing  the  presence  or  absence  of  intentionally  expressed,  current 
beliefs  (Neter  et  al.  1996). 


The  Accounting  Review 


September  2010 
American  Accounting  Association 


1782 


Peecher,  Piercey,  Rich,  and  Tubbs 


Mean  ratings  are  1.36  for  the  conservative  level,  2.76  for  the  neutral  level,  and  4.78  for  the 
aggressive  level,  and  each  change  is  significant  (all  p  <  0.001).  For  I,  the  scale  ranged  from  1 
(“There  was  no  manager  intervention”)  to  5  (“There  was  an  extreme  amount  of  intervention”). 
Mean  ratings  are  2.49  for  the  no-intervention  level  and  4.13  for  the  intervention  level,  a  significant 
increase  (p  <  0.001).  For  K,  the  scale  ranged  from  1  (“The  senior’s  technical  accounting  knowl¬ 
edge  is  much  greater  than  the  manager’s”)  to  5  (“The  manager’s  technical  knowledge  is  much 
greater  than  the  senior’s”).  The  mean  ratings  are  3.54  for  the  no-knowledge-advantage  level  and 
4.71  for  the  knowledge  advantage  level,  a  significant  increase  (p  <  0.001). 

Comprehension  Checks 

As  mentioned  previously,  participants  responded  to  pre-test,  fill-in-the-blank  questions  that 
tested  their  understandings  of  bias,  noise,  and  whether  the  effects  of  increases  in  the  estimated  fan- 
value  of  retained  interest  affect  income.  Of  the  99  participants,  95  (96  percent)  answered  the  bias 
question  correctly,  while,  for  the  accounting  question,  88  (89  percent)  answered  correctly,  seven 
answered  incorrectly,  three  answered  incoherently,  and  one  did  not  respond.  Similar  to  Experiment 
1,  participants  generally  appeared  to  understand  the  earnings  implications  of  the  estimated  fair 
value  of  residual  interest. 

Perceived  Bias  in  Supervisors’  Finalized  Judgments 

An  analysis  of  the  variance  of  Perceived  Bias  (untabulated)  yields  only  significant  main 
effects  of  G  (F  =  93.3,  p  <  0.001)  and  K  =  8.26,  p  =  0.004).  With  respect  to  G,  the 
least-squares  mean  Perceived  Bias  ratings  are  1.86  for  the  conservative  conditions,  0.71  for  neutral 
conditions,  and  3.15  for  the  aggressive  conditions.  These  bias  ratings  for  the  two  directional  goal 
conditions  (conservative  and  aggressive),  averaged  together,  are  significantly  higher  than  those  of 
the  neutral  goal  conditions  (t  =  11.61,  Pone-taiied  O-OOl)-  These  findings  suggest  that  auditors 
believe  that  directional  goals  to  reach  a  preferred  conclusion  bias  supervisors’  finalized  judgments. 
With  respect  to  K,  the  least-squares  mean  bias  ratings  are  2.12  for  the  no-knowledge-advantage 
conditions  and  1.70  for  the  knowledge-advantage  conditions,  a  significant  decrease  (t  =  2.87, 
Pone-tailed  =  0.002).  Thus,  auditors  appear  to  believe  that  a  supervisor’s  knowledge  advantage  re¬ 
duces  bias  in  supervisors’  finalized  judgments. With  respect  to  /,  we  find  no  evidence  that 
auditors  believe  that  intervention  increases  bias  in  supervisors’  finalized  judgments.  The  least- 
squares  mean  bias  ratings  are  1.84  in  the  no-intervention  conditions  and  1.98  in  the  intervention 
condition  (F  <  1).  Thus,  auditors  do  not  appear  to  believe  that  intervention  influences  bias. 

VII.  CONCLUSION 

Results  from  the  two  experiments  collectively  support  several  conclusions  with  implications 
for  both  theory  and  practice.  First,  consistent  with  a  concern  raised  in  prior  research  that  super¬ 
visor  interventions  distort  subordinates’  judgments,  we  find  evidence  that  subordinates’  judgments 
are  reflected  in  the  finalized  judgment  for  the  audit  team.  Thus,  supervisors  can  influence  and  then 
anchor  on  subordinates’  judgments.  More  relevant  to  our  research  question,  intervention  leads  to 
distortions  in  the  supervisor’s  finalized  judgment  for  the  audit  team  beyond  any  concurrent  effects 
on  the  subordinate’s  judgment.  Specifically,  after  taking  subordinates’  judgments  into  account, 
supervisors  who  intervene  reach  even  more  goal-consistent  conclusions  than  do  supervisors  who 


Note  that  this  main  effect  in  auditors’  Perceived  Bias  in  supervisors’  finalized  judgment  is  consistent  with  the  effect  (G 
X  K)  on  the  actual  levels  of  reviewer’s  finalized  judgments,  observed  in  Experiment  1.  Because  Experiment  1  uses 
finalized  judgment  levels  as  the  dependent  variable,  we  infer  the  tempering  effect  of  K  on  bias  from  its  tempering  the 
bias-inducing  effects  of  goals  to  reach  a  preferred  directional  conclusion  G  (i.e.,  G  X  A")  on  FinalJgmt  (cf.  Kunda  1990). 
In  Experiment  2,  we  measure  Perceived  Bias  directly,  and  the  main  effect  of  K  analogously  suggests  that  auditors 
believe  K  to  have  a  tempering  effect  on  bias. 
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have  the  same  goals  but  do  not  intervene.  This  suggests  that  intervention  creates  additional  cog¬ 
nitive  effects  in  the  mind  of  supervisors,  consistent  with  theory  underlying  HI. 

Formally  or  informally,  supervisors  have  the  option  to  intervene  relatively  early  in  subordi¬ 
nates’  judgment  processes.  One  view  regards  supervisors  as  reducing  the  bias  in  subordinates’ 
judgments,  and  therefore  more  active  intervention  would  lead  to  more  objective  judgments.  While 
prior  research  has  focused  on  the  effect  of  intervention  on  subordinates’  judgments,  we  add  to  this 
body  of  research  by  considering  a  setting  in  which  coaching  or  actively  intervening  in  a  subordi¬ 
nate’s  judgment  generates  bias  in  supervisors  themselves  as  they  finalize  audit  team  judgments. 

Our  findings  from  Experiment  1  suggest  that  supervisor  intervention  increases  motivated 
reasoning  bias  in  audit  team  judgments  through  its  effects  on  the  intervening  supervisor.  Experi¬ 
ment  2  provides  evidence  that  auditors  likely  do  not  understand  these  effects.  A  lack  of  under¬ 
standing  of  these  effects  precludes  a  realistic  likelihood  that  supervisors  could  consciously  attempt 
to  adjust  for  them  (Wegener  et  al.  2001),  short  of  training  or  policy  changes.  To  counter  this  risk, 
it  would  be  prudent  for  firms  to  train  auditors  about  how  intervention  affects  bias  in  audit  team 
judgments.  Supervisors  could  try  to  become  conscious  of  these  intervention  effects,  or  more 
simply  avoid  unnecessary  early  intervention  in  their  subordinates’  judgments,  allowing  their  input 
to  be  more  independent.  This  advice  is  likely  especially  prudent  when  stronger  sources  of  client 
pressure  that  could  affect  the  supervisors’  independence  are  present.  Additionally,  audit  firms 
could  rely  more  on  reviews  done  by  supervising  auditors  who  are  more  removed  from  the  decision 
making  of  the  subordinates  they  review.  At  the  partner  level,  these  “cold”  concurring  partner 
reviews  have  this  purpose.  The  PCAOB  (2008)  currently  identifies  such  reviews  as  probably  the 
most  frequently  cited  audit  deficiency  in  current  practice,  and  is  considering  a  proposed  standard 
that  would  substantially  increase  importance  on  them  as  an  audit  quality  control. 

Our  experimental  findings  also  suggest  that,  as  supervisors  perceive  that  their  technical 
knowledge  dominates  their  subordinates  more,  they  temper  the  influence  of  their  directional  goals 
to  reach  a  preferred  conclusion  on  audit  team  judgments.  Moreover,  auditors  appear  to  understand 
that  perceived  technical  knowledge  advantages  temper  bias  in  supervisors’  judgments;  this  effect 
and  auditors’  understanding  thereof  enable  supervisors  to  better  use  such  knowledge  advantages  as 
a  cue  to  temper  their  directional  goals,  better  fulfilling  the  quality  control  role  of  audit  review  (cf. 
Rich  2004;  Kunda  1990). 

Kennedy  and  Peecher  (1997)  document  that  supervisors  tend  to  be  overconfident  in  their 
subordinates’  technical  knowledge,  and  that  this  overconfidence  increases  as  the  knowledge  gap 
between  them  increases  (cf.  Messier  et  al.  2008).  Linking  this  observation  to  our  theory  and 
findings,  those  supervisors  who  tend  to  be  more  aware  of  the  actual  knowledge  gap  probably  are 
more  likely  to  act  as  a  reasonableness  check  in  audit  team  judgments  as  the  actual  technical 
knowledge  gap  increases.  However,  this  is  less  likely  to  occur  for  the  modal  supervisor  who 
overestimates  subordinates’  technical  knowledge,  and  does  so  increasingly  as  the  actual  gap  in¬ 
creases.  Thus,  less  overconfidence  in  assessing  the  technical  knowledge  of  one’s  subordinates  is 
likely  to  be  an  important  criterion  for  promotion  of  reviewing  auditors.  Since  Experiment  2 
suggests  that  auditors  are  aware  of  this  effect,  firm  policies  that  identify  (or  train)  supervisors  with 
more  accurate  confidence  in  their  subordinates’  judgments  are  likely  to  make  supervisors  more 
sensitive  to  reasonableness  constraints  in  auditing  team  judgments. 

Like  all  empirical-experimental  investigations,  ours  has  its  limitations.  First,  we  do  not  ex¬ 
amine  all  factors  that  affect  the  amount  of  bias  in  audit  team  judgments.  Rather,  we  limit  our 
discussion  to  speaking  to  the  moderating  consequences  of  supervisors  active  intervention  and 
perceived  knowledge  advantages  on  the  effects  that  directional  goals  have  on  supervisors  judg¬ 
ments.  Additional  research  should  examine  other  factors  that  impose  boundary  conditions  on  these 
effects.  Second,  in  Experiment  1,  we  used  measurement  rather  than  manipulation  to  achieve 
variation  in  supervisors’  directional  goals.  We  did  so  to  conserve  on  the  number  of  two-member. 
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industry-specialist  auditor  teams  needed  to  conduct  our  study.  Additionally,  theory  would  not 
predict  intervention  or  supervisor  knowledge  advantage  to  interact  differently  with  supervisor 
goals  depending  on  whether  they  are  manipulated  versus  measured.  Third,  we  could  have  achieved 
additional  realism  in  the  task  by  adding  more  discount  rate  benchmarks.  However,  our  theory 
would  not  predict  this  to  alter  the  conclusions  we  reach  so  we  forwent  such  realism  (cf.  Cook  and 
Campbell  1979;  Peecher  and  Solomon  2001;  Kachelmeier  and  King  2002).  Fourth,  while  we  do 
not  find  evidence  in  Experiment  2  that  auditors  believe  that  intervention  affects  bias  in  audit  team 
judgments,  we  cannot  rule  out  the  possibility  that  such  beliefs  exist  but  were  unobservable  due  to 
inherent  limitations  in  experimental  design,  statistical  power,  or  measurement  error. 

Finally,  the  way  in  which  we  operationalized  active  intervention  imposes  limitations  on  the 
inferences  we  can  draw.  Specifically,  Experiment  1  used  structured  prompts  to  elicit  both  inter¬ 
vention  content  and  cognition  from  supervisors  in  the  intervention  condition.  We  could  have 
allowed  supervisors  to  guide  subordinates  without  the  structured  cognition  prompts,  leaving  the 
extent  of  intervention  cognition  up  to  them.  Future  research  could  measure  key  attributes  of 
free-form  intervention  cognition  by  supervisors  as  well  as  investigate  the  attributes  that  are  more 
or  less  apt  to  lead  to  intervention-induced  biases  in  supervisors’  judgments.  Deeper  goal-consistent 
intervention  cognition  during  free-form  intervention,  for  example,  would  be  more  likely  than 
shallow  goal-consistent  processing  to  lead  to  such  bias. 

Despite  these  limitations,  we  find  evidence  with  implications  for  audit  supervisors  about  the 
determinants  of  bias  in  audit  team  judgments.  We  show  how  active  intervention  and  supervisors’ 
perceived  knowledge  advantages  over  subordinates  have  countervailing  effects  in  that  the  former 
creates  bias  while  the  latter  mitigates  bias  in  audit  team  judgments.  A  worrisome  rub  is  that 
auditors  appear  to  understand  only  the  latter,  bias-mitigating  effect. 
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ABSTRACT:  This  research  examines  whether  investors  adjust  their  assessments  of 
investment  risk  in  response  to  material  control  weakness  disclosures,  the  pervasive¬ 
ness  of  material  control  weaknesses,  and  the  detail  of  explanation  provided  regarding 
the  pervasiveness  of  material  control  weaknesses.  Findings  from  a  laboratory  experi¬ 
ment  with  97  nonprofessional  investors,  a  second  experiment  with  53  nonprofessional 
investors,  and  surveys  of  47  investors  and  28  Fortune  500  directors  confirm  prior  ar¬ 
chival  findings  that  investors  adjust  their  investment  risk  assessments  in  response  to 
material  weakness  disclosures.  More  importantly,  we  find  evidence  of  an  interactive 
effect  of  material  control  weakness  pervasiveness  and  disclosure  detail  that  is  counter 
to  the  expected  benefits  of  expanded  disclosure  desired  by  corporate  directors.  When 
material  weakness  disclosures  include  specific  and  detailed  discussion  of  the  perva¬ 
siveness  of  control  weaknesses,  investors  increase  assessments  of  investment  risk  for 
less  pen/asive  weaknesses  and  decrease  assessments  of  risk  for  more  pervasive 
weaknesses.  Results  indicate  that  these  findings  are  driven  by  different  levels  of  inves¬ 
tor  trust  in  management. 

Keywords:  internal  control;  control  pervasiveness;  control  weakness;  disclosure 
detail;  investment  risk;  management  credibility;  trust. 
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I.  INTRODUCTION 

Section  404  of  the  Sarbanes-Oxley  Act  of  2002  (SOX,  U.S.  House  of  Representatives  2002) 
and  associated  SEC  regulations  (SEC  2003)  require  management  of  public  companies  to 
establish  and  maintain  adequate  internal  control  over  financial  reporting  and  to  obtain  an 
attestation  report  by  an  independent  auditor.  In  addition  to  promoting  investor  confidence,  the 
requirements  of  Section  404  of  SOX  are  intended  to  provide  investors  with  more  accurate  infor¬ 
mation  regarding  the  integrity  of  financial  reports.  Further,  internal  control  weakness  disclosures 
are  believed  to  be  effective  signals  of  risks  to  investors  created  by  potentially  unreliable  financial 
reports,  allowing  investors  to  make  better  forward-looking  assessments  of  investment  risk 
(PCAOB  2007). 

The  objectives  of  this  research  are  to  examine  whether  nonprofessional  investors  adjust  their 
assessments  of  investment  risk  in  response  to  material  control  weakness  (hereafter,  control  weak¬ 
ness)  disclosures,  the  pervasiveness  of  control  weaknesses,  and  the  detail  of  explanations  provided 
regarding  the  pervasiveness  of  control  weaknesses.  Existing  archival  research  indicates  that  con¬ 
trol  weakness  disclosures  result  in  adverse  stock  price  reactions  (e.g.,  Beneish  et  al.  2008;  Ham- 
mersley  et  al.  2008),  providing  evidence  that  investors  perceive  risks  related  to  material  control 
weaknesses. 

The  current  research  leverages  the  advantages  of  experimentation  to  extend  archival  findings 
and  explore  the  underlying  causes  of  the  effects  of  pervasiveness  and  explanation  detail  on  inves¬ 
tors’  judgments.  We  first  examine  the  effects  of  the  pervasiveness  of  control  weaknesses  on 
perceived  investment  risk.  Pervasive  weaknesses  are  those  at  the  financial  statement  and  entity 
level  that  might  affect  overall  control  objectives,  while  less  pervasive,  or  isolated,  control  weak¬ 
nesses  are  typically  associated  with  specific  accounts  and  disclosures  (Kinney  1989;  Messier  and 
Austen  2000;  PCAOB  2007).  Isolated  weaknesses  affect  one  account  or  transaction  and  pervasive 
weaknesses  have  system-wide  effects  that  are  inherently  less  precise  (i.e.,  more  vague)  (Doss 
2004).  Examples  of  pervasive  weaknesses  include  items  such  as  problems  with  the  control  envi¬ 
ronment  and  ineffective  information  technology  controls  that  affect  many  accounts  and  processes. 
Both  auditors  and  regulators  have  stated  that  investors  may  not  react  to  control  weakness  disclo¬ 
sures  if  they  cannot  comprehend  the  effects  of  weaknesses  on  future  performance  (Orenstein  2004; 
Hammersley  et  al.  2008).  Our  study  examines  whether  individual  investors  recognize  the  differ¬ 
ential  effects  of  internal  control  weaknesses  that  are  isolated  versus  pervasive,  and  whether  they 
adjust  investment  risk  perceptions  accordingly.  We  also  examine  the  effects  of  the  explanation 
detail  provided  in  material  weakness  disclosures  on  risk  assessments.  Psychology-based  theories 
of  risk  assessment  suggest  that  increased  explanation  decreases  perceived  risk.  However,  attribu¬ 
tion  theory  indicates  that  more  detailed  explanations  can  interact  with  weakness  pervasiveness, 
and  increased  risk  perceptions  can  result. 

Consistent  with  archival  research,  our  results  indicate  that  nonprofessional  investors’  assess¬ 
ments  of  investment  risk  increase  for  firms  with  material  control  weakness  disclosures,  relative  to 
firms  without  such  disclosures.  Investors  also  recognize  the  pervasiveness  of  the  control  weak¬ 
nesses,  but  the  effects  of  control  weakness  pervasiveness  are  moderated  by  the  level  of  explana¬ 
tion  provided  regarding  pervasiveness.  We  also  find  significant  evidence  for  an  effect  that  is 
counter  to  the  expected  benefits  of  detailed  explanations  currently  desired  by  corporate  directors. 
When  material  weakness  disclosures  in  the  Management  Report  on  Internal  Controls  include 
specific  and  detailed  discussion  of  the  pervasiveness  of  the  control  weaknesses,  investors  increase 
their  assessments  of  investment  risk  for  less  pervasive  weaknesses  and  decrease  their  assessments 
of  risk  for  more  pervasive  weaknesses.  That  is,  providing  a  detailed  and  explicit  explanation  that 
a  control  weakness  is  isolated  and  not  pervasive  increases  forward-looking  risk  assessments  rela¬ 
tive  to  not  providing  a  detailed  explanation.  Conversely,  providing  a  detailed  and  explicit  expla¬ 
nation  that  a  control  weakness  is  very  pervasive  decreases  forward-looking  risk  assessments  rela- 
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live  to  not  providing  a  detailed  explanation.  Supplemental  analyses  provide  an  explanation  for  this 
interactive  effect. 

Risk  perceptions  are  amplified  when  decision  makers  lose  trust  in  the  source  of  information 
(Kasperson  et  al.  1988;  Pidgeon  et  al.  2002).  Our  nonprofessional  investor  participants  appear  to 
have  believed  that  management  was  less  trustworthy  when  detailed  explanations  were  provided  for 
isolated,  non-pervasive  control  weaknesses.  On  the  other  hand,  participants  perceived  that  man¬ 
agement  was  more  trustworthy  when  detailed  explanations  were  provided  for  pervasive  control 
weaknesses.  Apparently,  nonprofessional  investors  believe  that  management  is  attempting  to 
downplay  the  significance  of  the  problem  when  detailed  explanations  are  provided  for  isolated 
control  weaknesses,  but  detailed  explanations  of  pervasive  control  weaknesses  are  viewed  as 
indicators  that  management  is  trustworthy. 

These  results  indicate  that  providing  explicit  explanations  of  control  weakness  pervasiveness 
may  have  unintended  consequences  to  the  extent  that  investors  use  the  explanations  as  cues  for 
assessing  their  trust  in  management.  Insofar  as  the  intent  of  Section  404  was  to  facilitate  the 
restoration  of  investor  confidence  in  the  financial  markets  and  provide  relevant  information  to 
assess  investment  risk,  these  results  should  be  interesting  to  legislators,  investors,  managers,  and 
directors  contemplating  the  structure  of  control  weakness  disclosures,  as  well  as  to  independent 
auditors.  We  find  that  investors  use  information  about  control  weaknesses  and  their  pervasiveness 
to  adjust  forward-looking  risk  assessments.  Additionally,  the  level  of  explanation  provided  in 
control  weakness  disclosures  significantly  affects  investors’  interpretations  of  investment  risk. 

II.  BACKGROUND  AND  DEVELOPMENT  OF  HYPOTHESES 
Control  Weaknesses  and  Investment  Risk 

The  U.S.  Congress  required  the  Securities  and  Exchange  Commission  (SEC)  to  adopt  rules  to 
implement  Section  404  of  the  Sarbanes-Oxley  Act.  Under  these  rules,  management  must  disclose 
any  material  weakness  and  may  not  conclude  that  the  company’s  internal  control  over  financial 
reporting  is  effective  if  even  one  material  weakness  exists.  Analyses  of  recent  disclosures  under 
SOX  Section  404  suggest  that  a  number  of  adverse  effects  are  associated  with  internal  control 
weaknesses.  Hammersley  et  al.  (2008)  find  that  returns  are  significantly  negative  when  material 
weaknesses  are  disclosed,  and  the  adverse  effects  of  control  weaknesses  on  returns  are  more 
pronounced  when  management  claims  that  the  control  system  is  effective  but  the  independent 
auditor  reports  that  the  system  is  ineffective.  The  authors  suggest  that  these  results  are  consistent 
with  investor  concerns  about  the  expense  required  to  remediate  weaknesses  or  the  possibility  that 
uncorrected  control  weaknesses  remain.  Similarly,  Beneish  et  al.  (2008)  find  a  significant  negative 
price  response  for  disclosing  firms  that  is  exacerbated  by  conditions  associated  with  higher  inher¬ 
ent  reporting  risk,  such  as  auditor  turnover  and  high-risk  industry  membership.  Einally,  Ogneva  et 
al.  (2007)  report  marginally  higher  cost  of  equity  for  firms  that  disclosed  a  material  weakness  in 
internal  controls  and  for  firms  that  delayed  their  Section  404  disclosures. 

Palmrose  et  al.  (2004)  state  that  material  control  weaknesses  should  adversely  affect  stock 
price  because  weaknesses  lead  to  material  misstatements,  and  these  material  misstatements  in 
financial  statements  result  in  future  stock  price  declines  when  the  misstatements  are  corrected.  The 
prior  archival  research  supports  this  perspective,  and  indicates  that  investors  recognize  the  invest¬ 
ment  risks  created  by  internal  control  weaknesses  (Ogneva  et  al.  2007;  Beneish  et  al.  2008; 
Hammersley  et  al.  2008).  The  first  hypothesis  is  designed  to  confirm  this  finding  under  controlled 
laboratory  conditions  with  nonprofessional  investors.  Given  that  we  examine  nonprofessional 
investors,  it  is  important  to  verify  the  market-based  findings  because  market  effects  could  be 
driven  by  institutional  investors  and  other  more  sophisticated  investors.  We  propose  the  following 
hypothesis  to  confirm  the  findings  of  prior  archival  research  in  a  controlled  laboratory  environ¬ 
ment. 
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HI:  The  forward-looking  investment  risk  perceptions  of  nonprofessional  investors  are  higher 
for  firms  with  material  control  weakness  disclosures  relative  to  firms  without  material 
control  weakness  disclosures,  ceteris  paribus. 

Control  Weakness  Pervasiveness 

Pervasive  control  weaknesses  are  those  weaknesses  that  influence  multiple  aspects  of  the 
financial  reporting  system,  while  less  pervasive,  or  isolated,  control  weaknesses  influence  indi¬ 
vidual  accounts  or  classes  of  transactions  (Kinney  1989;  Messier  and  Austen  2000;  PCAOB  2007). 
Extremely  pervasive  control  weaknesses,  such  as  failures  of  the  information  systems  that  produce 
financial  information,  create  ubiquitous  weaknesses  throughout  the  entire  financial  system.  Neither 
existing  archival  nor  experimental  research  has  examined  whether  investors  recognize  the  perva¬ 
siveness  of  control  weaknesses  or  whether  investors  react  differentially  to  varying  levels  of  per¬ 
vasiveness.  Prior  research  finds  that  auditors  react  to  the  pervasiveness  of  risks  (Messier  and 
Austen  2000),  but  researchers  have  not  addressed  the  ability  of  nonprofessional  investors  to  use 
information  about  weakness  pervasiveness  in  their  assessments  of  risk. 

Hammersley  et  al.  (2008)  find  that  the  market  reacts  more  to  material  weaknesses  than  to  less 
severe  control  deficiencies.  The  authors  suggest  that  material  weaknesses  are  more  likely  to  result 
in  future  restatements  of  misstated  financial  information  than  are  less  severe  control  deficiencies, 
and  material  weaknesses  trigger  greater  concerns  about  cost  of  remediation  and  future  control 
weaknesses.  Similarly,  we  expect  investors  to  react  to  the  pervasiveness  of  material  control  weak¬ 
nesses,  because  more  pervasive  control  weaknesses  have  greater  potential  to  cause  misstatements 
than  do  less  pervasive  weaknesses,  and  the  widespread  nature  of  pervasive  weaknesses  increases 
the  potential  costs  of  remediation  and  likelihood  of  future  discoveries  of  additional  control  prob¬ 
lems.  According  to  Weber  (1988)  and  Weber  and  Bottom  (1989,  1990),  decision  makers  assess 
risk  by  analyzing  the  severity  and  likelihood  of  potential  outcomes.  Pervasive  control  weaknesses 
are  more  severe  than  are  isolated  weaknesses,  and  the  likelihood  of  misstatements  increases  for 
pervasive,  relative  to  isolated,  control  weaknesses.  Accordingly,  we  expect  that  investors’  percep¬ 
tions  of  investment  risk  will  increase  for  more  pervasive  control  weaknesses,  relative  to  less 
pervasive  control  weaknesses. 

H2:  The  forward-looking  investment  risk  perceptions  of  nonprofessional  investors  increase 
as  the  pervasiveness  of  material  control  weaknesses  increases. 

Explanation  Detail 

There  are  potentially  competing  effects  of  the  level  of  explanation  provided  in  control  weak¬ 
ness  disclosures  on  investors’  risk  assessments.  Hammersley  et  al.  (2008)  find  that  vague  disclo¬ 
sures  of  material  weaknesses  (i.e.,  those  with  insufficient  explanation  detail  to  understand  the 
underlying  cause,  the  accounts  affected,  and  the  severity  of  the  weakness)  result  in  more  adverse 
market  reactions  than  do  detailed  disclosures.  This  is  consistent  with  psychological  theories  of  risk 
perception,  by  which  decision  makers  perceive  more  risk  when  a  risky  item  is  less  observable  and 
more  is  unknown  about  the  item.  Slovic  (1987)  suggests  that  risk  assessments  are  derived  from 
two  components:  “dread”  and  “unknown.”  Based  upon  Slovic’s  model  of  risk,  when  less  is  known 
about  a  material  control  weakness,  perceptions  of  the  risks  associated  with  the  control  weakness 
will  increase.  Slovic’s  model  of  risk  perception  therefore  suggests  a  main  effect  of  explanation 
detail,  with  more  explanation  detail  decreasing  perceptions  of  investment  risk  by  decreasing  the 
amount  of  unknown  information  related  to  the  weakness. 

Corporate  directors  also  subscribe  to  the  theory  that  less  detailed  disclosures  increase  inves¬ 
tors’  risk  perceptions.  A  2005  survey  of  more  than  1,100  corporate  directors  by  Pricewaterhouse- 
Coopers  (PWC  2005)  indicates  that  corporate  directors  believe  that  pervasive  control  matters 
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should  be  clearly  separated  from  those  that  are  less  pervasive  to  help  investors  gauge  the  serious¬ 
ness  of  any  weaknesses  (Kerstetter  and  Schulken  2005).  Directors  appear  to  assume  that  nonpro¬ 
fessional  investors  are  unable  to  differentiate  between  more  and  less  pervasive  control  weak¬ 
nesses,  leading  to  an  over-estimation  of  risk  related  to  less  pervasive  weaknesses.  We  survey  a 
sample  of  28  corporate  directors  from  Fortune  500  firms  to  confirm  the  assumption  that  one 
source  of  directors’  concerns  regarding  the  pervasiveness  of  control  weaknesses  involves  inves¬ 
tors’  reactions  to  these  disclosures.  Our  survey  results  indicate  that  directors  believe  that  investors 
over-react  (i.e.,  perceive  more  risk  than  is  warranted)  to  disclosures  of  more  isolated  control 
weaknesses  and  that  increasing  the  explanation  detail  provided  in  weakness  disclosures  could 
reduce  unwarranted  investor  reactions.  Directors  believe  that  providing  detailed  explanations  of 
control  weakness  pervasiveness  should  decrease  risk  perceptions  for  less  pervasive  control  weak¬ 
nesses. 

Attribution  theory,  however,  suggests  an  interactive  effect  of  explanation  detail  and  perva¬ 
siveness,  rather  than  a  main  effect  of  explanation  detail.  In  addition,  Hammersley  et  al.’s  (2008) 
definition  of  “vague”  disclosures  contains  features  of  both  pervasiveness  and  explanation  detail. 
Thus,  how  these  two  aspects  of  control  weaknesses  and  related  disclosures  affect  investors  re¬ 
mains  an  open  question.  According  to  attribution  theory,  decision  makers  evaluate  messages  (such 
as  management’s  disclosures  concerning  control  weaknesses)  by  comparing  messages  with  the 
incentives  of  the  messenger  (Eagly  and  Chaiken  1993).  Attribution  theory  indicates  that  recipients 
(e.g.,  investors)  of  disclosures  from  a  messenger  (e.g.,  management)  develop  expectations  of  the 
contents  of  disclosures,  and  recipients  determine  the  validity  of  the  disclosure  and  the  credibility 
of  the  messenger  based  upon  comparisons  of  expectations  and  actual  disclosures  (Eagly  and 
Chaiken  1975).  Mercer  (2005,  725)  defines  management’s  credibility  related  to  financial  reporting 
as  “investor’s  beliefs  about  management’s  trustworthiness  and  competence  in  financial  disclo¬ 
sure.”  Thus,  trust  is  a  key  component  of  credibility.  If  a  message  is  consistent  with  the  messenger’s 
incentives,  then  recipients  view  the  message  as  biased  and  lose  trust  in  the  messenger.  Good  news 
disclosures  are  inherently  viewed  more  skeptically  by  recipients,  while  bad  news  disclosures  are 
more  readily  accepted,  because  bad  news  disclosures  are  inconsistent  with  the  messengers’  incen¬ 
tives  (Eagly  and  Chaiken  1993). 

Managers  have  incentives  to  exaggerate  good  financial  news  and  downplay  bad  news  (Hutton 
et  al.  2003).  Earnings  forecast  research  finds  that  investors  believe  that  bad  news  forecasts  are 
credible  because  management  has  few  incentives  to  report  bad  news  (Skinner  1994;  Softer  et  al. 
2000).  As  a  result,  investors  consistently  view  bad  news  disclosures  as  more  credible  than  good 
news  disclosures.  However,  managers  are  also  more  likely  to  create  explanations  for  bad  news  that 
externalize  the  causes  of  bad  news,  and  investors  expect  management  to  downplay  bad  news 
(Baginski  et  al.  2000). 

Attribution  research  in  psychology  and  accounting  suggests  that  investors  expect  that  bad 
news  will  be  accompanied  by  attempts  to  marginalize  the  bad  news,  and  explanations  that  down¬ 
play  bad  news  are  viewed  as  biased.  Investors,  however,  do  not  expect  management  to  provide 
detailed  explanations  that  explain  the  serious  nature  of  more  pervasive  weaknesses  (i.e.,  they  do 
not  expect  management  to  emphasize  bad  news),  and  investors  are  likely  to  attribute  greater  levels 
of  trustworthiness  to  managers  that  reveal  the  serious  nature  of  pervasive  weaknesses.  Research 
demonstrates  connections  between  trust  in  management  and  investor  risk-taking  (e.g.,  Ryan  and 
Buchholtz  2001;  Das  and  Teng  2004;  Schwarzkopf  2006).  This  research  generally  finds  that 
investors  must  perceive  an  acceptable  level  of  trustworthiness  in  management  before  they  decide 
to  take  the  risk  of  investing  in  that  firm,  and  increased  trust  can  result  in  decreased  perceptions  of 
investment  risk.  Thus,  following  attribution  and  trust  theories,  increased  explanation  detail  will 
increase  perceptions  of  risk  for  less  pervasive  weaknesses  and  will  decrease  perceptions  of  risk  for 
more  pervasive  weaknesses. 
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Given  the  competing  predictions  of  attribution  theory  and  psychological  theories  of  risk 
assessment,  we  test  the  following  null  hypothesis: 


H3:  Increasing  the  level  of  explanation  for  material  control  weaknesses  will  not  affect  per¬ 
ceived  investment  risk. 


III.  METHOD 

Design  and  Participants 

The  experiment  involved  a  2  X  2  between-participants  design  with  a  control  group  included 
as  a  fifth  experimental  condition.  The  experiment  manipulated  the  pervasiveness  of  material  in¬ 
ternal  control  weaknesses  (low  and  high)  and  the  detail  of  explanation  of  the  control  weaknesses 
(low  and  high).  The  control  group  case  materials  did  not  include  any  internal  control  weaknesses. 
This  group  was  used  to  evaluate  the  effects  of  a  material  weakness  disclosure  on  investors’ 
investment  risk  perceptions  relative  to  risk  perceptions  related  to  annual  reports  where  no  material 
weaknesses  are  identified  (HI).  Participation  in  the  study  was  voluntary. 

Participants  were  M.B.A.  students  from  two  large,  public  U.S.  universities.  Elliott  et  al. 
(2007)  find  that  M.B.A.  students  are  reasonable  surrogates  for  nonprofessional  investors,  and 
M.B.A.  students  have  been  employed  as  surrogates  for  nonprofessional  investors  in  multiple  prior 
studies  investigating  investor  behavior  (e.g.,  Hirst  et  al.  1995;  Maines  and  McDaniel  2000;  Hodge 
et  al.  2004).  The  participants  had  an  average  of  6.1  years  of  management  experience;  54  percent 
were  male.  Participants  were  asked  if  they  understood  the  Management  Report  on  Internal  Con¬ 
trols.  On  a  Likert  scale  ranging  from  1  (Not  at  All)  to  9  (Completely),  the  average  response  was 
6.08. 

Task  and  Procedure 

Participants  were  randomly  assigned  to  treatment  conditions.  The  experiment  provided  par¬ 
ticipants  with  general  instructions,  case  materials,  and  a  post-experiment  questionnaire.  The  task 
was  completed  under  controlled  conditions  and  under  the  supervision  of  an  experimenter.  Partici¬ 
pants  read  the  case  materials  and  responded  to  a  series  of  questions  about  investment  risk  and 
trust.  The  case  described  a  manufacturing  company  and  included  several  financial  ratios  from  the 
prior  three  years  and  quarterly  stock  price  data  from  the  prior  eight  quarters.  The  company 
description,  ratio  data,  and  stock  price  data  were  derived  from  an  actual  manufacturing  company 
(the  company  name  was  changed  for  the  experiment).  The  brief  sets  of  ratio  and  stock  price  data 
were  included  to  demonstrate  that  the  company  experienced  a  steady  improvement  in  financial 
ratios  and  steady  growth  in  earnings  and  stock  price  over  the  period  described,  creating  a  favorable 
impression  of  the  firm  and  its  associated  investment  risk  prior  to  the  control  weakness  disclosure. 

After  reading  the  case  materials,  participants  received  the  Management  Report  on  Internal 
Controls  described  in  AS  No.  5  (PCAOB  2007)  and  the  auditor’s  opinion  on  the  Management 
Report  on  Internal  Controls.  The  management  reports  and  the  auditor  reports  followed  the  format 
described  by  AS  No.  5.  The  reports  were  identical  across  treatment  conditions,  with  the  exception 
of  the  specific  control  weaknesses  identified  and  the  level  of  explanation  of  those  weaknesses.  The 
management  report  in  the  control  condition  did  not  contain  any  disclosures  of  material  internal 
control  weaknesses. 

In  the  four  treatment  conditions  with  material  weaknesses,  management  identified  a  control 
weakness  in  its  report  on  internal  controls  and  the  auditor  concurred  with  management’s  conclu¬ 
sions.  As  a  result,  the  auditor’s  report  was  unqualified  with  respect  to  management’s  assertion  that 
internal  controls  were  not  effective,  and  adverse  with  regard  to  the  effectiveness  of  controls. 
Hence,  there  was  no  conflict  between  the  findings  of  management  and  the  auditor,  and  there  was 
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no  indication  of  a  management  failure  to  detect  the  control  weaknesses.  Therefore,  any  differences 
in  participants’  risk  perceptions  are  due  to  the  manipulations  of  the  material  control  weaknesses 
and  the  level  of  explanation  of  those  weaknesses. 

Independent  Variables 

Both  the  low-  and  high-pervasiveness  control  weaknesses  involved  a  lack  of  adequate 
password/access  controls.  The  text  in  the  opinion  paragraph  that  described  the  material  weakness 
is  presented  below  for  the  low-pervasiveness  and  low  explanation  conditions: 

Low  pervasiveness  (low  explanation): 

The  material  weaknesses  resulted  from  a  failure  to  maintain  adequate  password  and  access 
controls  over  the  portion  of  the  Enterprise  Resource  Planning  system  that  ensures  proper 
recording  of  sales  in  the  appropriate  reporting  period. 

In  the  high  explanation  level,  the  following  text  was  added  to  the  end  of  the  opinion  paragraph: 

Low  pervasiveness  (additional  text  for  high  explanation): 

Access  and  password  controls  over  the  system  for  recording  sales  prevent  unauthorized  use 
of  the  system  and  unauthorized  changes  to  the  system  by  employees,  management,  and 
outsiders.  The  system  reviews  sales  and  shipping  transactions  that  occur  near  period-end. 
The  access  controls  ensure  that  sales  revenues  are  always  recorded  in  the  proper  account¬ 
ing  period.  The  insufficient  password  and  access  controls  in  the  system  for  recording  sales 
can  only  affect  the  recording  of  revenue  for  specific  sales  transactions  near  period-end,  and 
this  deficiency  cannot  affect  any  other  aspect  of  the  financial  reporting  system. 

The  control  weakness  and  explanation  level  for  the  high-pervasiveness  condition  are  presented 
below: 

High  pervasiveness  (low  explanation): 

The  material  weaknesses  resulted  from  a  failure  to  maintain  adequate  password  and  access 
controls  over  the  entire  Enterprise  Resource  Planning  system. 

High  pervasiveness  (additional  text  for  high  explanation): 

The  Enterprise  Resource  Planning  system  includes  all  financial  and  nonfinancial  systems 
and  databases  within  the  organization.  Password  and  access  controls  prevent  unauthorized 
use  of  the  system  and  unauthorized  changes  to  the  system  by  employees,  management,  and 
outsiders.  Insufficient  password  and  access  controls  over  the  enterprise  resource  planning 
system  can  influence  the  integrity  of  many  aspects  of  both  financial  reporting  systems  and 
nonfinancial  systems,  and  the  weaknesses  could  have  resulted  in  material  misstatements 
throughout  the  financial  statements. 

Dependent  Variables 

After  reading  and  analyzing  the  Management  Report  on  Internal  Controls  and  the  auditor 
report  expressing  an  opinion  on  internal  controls,  as  well  as  an  opinion  on  the  management  report, 
participants  made  forward-looking  assessments  of  the  risk  of  a  moderate  stock  price  decline  (1—10 
percent)  within  the  next  year  (where  1  =  very  low,  5  =  moderate,  and  9  =  very  high).  This 
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measure  of  investment  risk  captures  an  individual’s  risk  of  loss.  We  use  the  risk  of  a  stock  price 
decline  as  our  measure  of  investment  risk  for  two  reasons.  First,  risk  is  typically  viewed  in  both 
the  accounting  and  psychology  contexts  as  a  risk  of  personal  loss  (e.g.,  Fischhoff  1995;  Olsen 
1997;  Loewenstein  et  al.  2001).  Stock  price  declines  represent  clear  and  measurable  losses  to 
investors.  Second,  we  pre-tested  multiple  measures  of  investment  risk  with  more  than  100  M.B.A. 
students  enrolled  in  an  investments  course,  and  the  risk  of  a  stock  price  decline  was  found  to  be 
the  most  representative  of  personal  investment  risk.  We  include  a  second  dependent  variable: 
“How  will  the  stock  price  of  Davis  Manufacturing  Company  react  to  the  control  weakness  dis¬ 
closure??’’  (where  —  5  =  Significant  Decline,  1  =  No  Effect,  and  5  =  Significant  Increase)  to 
allow  for  the  possibility  that  participants  will  believe  that  control  weaknesses  could  favorably 
affect  future  performance.^  Finally,  we  assess  trust  in  management  with  the  following  question: 
“Do  you  trust  the  management  of  Davis  Manufacturing  Company  to  accurately  represent  the 
financial  position  of  the  firm?”  (where  1  =  Definitely  No,  5  =  Moderately,  and  9  =  Definitely 
Yes). 


IV.  RESULTS 

Manipulation  Checks  and  Preliminary  Tests 

The  participants  were  asked  about  the  presence/absence  of  a  material  control  weakness.  Four 
participants  responded  incorrectly  (i.e.,  four  participants  who  were  in  a  treatment  group  with  a 
control  weakness  responded  that  there  was  no  control  weakness,  indicating  that  they  did  not  attend 
to  the  experimental  materials)  and  one  response  was  an  extreme  outlier.  These  five  participants  are 
not  included  in  any  of  the  analyses.  Regarding  the  manipulation  check  for  the  pervasiveness 
treatment,  the  difference  in  the  mean  response  for  the  low-pervasiveness  (5.18)  conditions  is 
significantly  lower  than  the  mean  response  for  the  high-pervasiveness  (6.86)  conditions  (p  < 
0.001).  Thus,  the  pervasiveness  manipulation  appears  successful. 

Analyses  of  the  dependent  variable  (forward-looking  assessments  of  the  risk  of  moderate 
stock  price  decline)  are  shown  in  Table  1.  Panel  A  of  Table  1  provides  means,  standard  deviations, 
and  sample  sizes  for  all  treatment  conditions  (including  the  control  group)  and  main  effects  for  the 
risk  of  a  moderate  stock  price  decline.  The  pattern  of  means  suggests  a  potential  interactive  effect 
of  control  weakness  pervasiveness  and  explanation  detail  on  investment  risk  perceptions,  but  no 
main  effects  are  readily  apparent.  Next,  we  use  a  regression  model  to  conduct  preliminary  tests, 
where  the  dependent  variable  is  the  assessment  of  risk  of  a  moderate  stock  price  decline.  The 
independent  variables  are  the  control  weakness  pervasiveness,  explanation  detail,  and  an  interac¬ 
tion  term  (pervasiveness  X  explanation);  and  the  covariates  include  demographic  factors  (i.e.,  age, 
gender,  management  experience).  None  of  the  demographic  factors  are  statistically  significant; 
thus,  they  are  not  included  in  our  reported  analyses. 

Tests  of  Hypotheses 

Our  first  hypothesis  (HI)  proposes  that  forward-looking  risk  assessments  of  nonprofessional 
investors  increase  for  firms  with  control  weakness  disclosures  relative  to  firms  without  control 
weakness  disclosures.  We  test  this  hypothesis  with  an  ANOVA  model  that  includes  the  four 
treatment  conditions  and  the  control  group.  The  dependent  variable  reflects  the  participants’  as¬ 
sessment  of  the  risk  of  a  moderate  stock  price  decline,  and  the  independent  variable  is  the  treat¬ 
ment  group  (five  groups).  The  group  variable  is  statistically  significant  in  the  model  (F  =  1072.25, 
p  <  0.001).  Post  hoc  mean  comparisons  using  the  Bonferroni  procedure  indicate  that  assessments 
of  the  risk  of  a  moderate  stock  price  decline  are  lower  in  the  control  group  relative  to  each  of  the 


1 


No  participants  indicated  an  improvement  in  performance  resulting  from  the  control  weakness. 
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TABLE  1 

Descriptive  Statistics  and  ANOVA 
(Dependent  Variable  =  Risk  of  Moderate  Stock  Price  Decline) 

Panel  A:  Descriptive  Statistics 


Mean  (Standard  Deviation)  {Sample  Size}  Responses  across 

Treatment  Conditions 

Explanation  Detail 

Main  Effect: 

Pervasiveness^ 

Low 

High 

Pervasiveness 

Low 

5.65 

6.47 

6.06 

(1.96) 

(1.42) 

(1.74) 

{17} 

{17} 

{34} 

High 

6.59 

5.68 

6.18 

(1.40) 

(1.96) 

(1.71) 

{22} 

{22} 

{44} 

Main  Effect:  Explanation 

6.18 

6.05 

6.10 

(1.72) 

(1.75) 

(1.73) 

{39} 

{39} 

{78} 

Control  Group:  3.74  (1.56)  {19} 

Panel  B:  ANOVA  Model 

Sum  of 

Mean 

Source  Squares 

df 

Square 

F 

p-value 

Pervasiveness  (P)^  0.115 

1 

0.115 

0.039 

0.843 

Explanation  (E)'’  0.035 

1 

0.035 

0.012 

0.913 

P  X  E  14.394 

1 

14.394 

4.927 

0.030 

Error  216.209 

74 

2.922 

Panel  C:  Planned  Contrasts 

Low 

High 

t-value 

Explanation 

Explanation 

(p-value)*^ 

Low  Pervasiveness 

5.65 

6.47 

1.42 

(p  <  0.10) 

High  Pervasiveness 

6.59 

5.68 

1.78 

(p  <  0.05) 

““  Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasiveness 
involves  lack  of  access  controls  over  the  entire  system. 

*’  Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific  effects  of 
the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and  effects  of  the  control 
weaknesses  are  described  in  detail. 

One-sided. 


four  treatment  groups  (all  p  0.01).  These  results  indicate  that  the  mean  risk  assessments  are 
significantly  lower  for  a  firm  without  a  control  weakness  relative  to  a  firm  with  a  control  weak¬ 
ness,  thereby  supporting  HI  and  findings  from  archival  research. 

The  analyses  we  conduct  to  test  H2  and  H3  do  not  include  the  control  group,  as  control 
weakness  pervasiveness  and  explanation  level  cannot  be  manipulated  when  no  control  weaknesses 
are  present.  Hypothesis  2  posits  that  the  forward-looking  investment  risk  assessments  of  nonpro¬ 
fessional  investors  increase  as  the  pervasiveness  of  material  control  weaknesses  increases.  Panel  B 
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of  Table  1  presents  the  results  of  an  ANOVA  model  where  the  dependent  variable  reflects  the 
participants’  assessment  of  the  risk  of  a  moderate  stock  price  decline,  and  the  independent  vari¬ 
ables  are  control  weakness  pervasiveness  (low  versus  high),  explanation  detail  (low  versus  high), 
and  the  interaction  term.  The  coefficient  for  control  weakness  pervasiveness  is  not  statistically 
significant  (F  =  0.039,  p  >  0.50).  However,  this  result  must  be  evaluated  cautiously,  as  there  is 
also  a  significant  interaction  between  pervasiveness  and  explanation  that  we  explore  further  in  the 
analyses  that  follow. 

Hypothesis  3  is  tested  with  the  main  effect  of  explanation  detail  and  the  interaction  of  per¬ 
vasiveness  and  explanation  detail.  Results  in  Table  1,  Panel  B  indicate  that  explanation  detail  is 
not  statistically  significant  (F  =  0.012,  p  >  0.50),  but  the  interaction  of  pervasiveness  and  expla¬ 
nation  level  is  statistically  significant  (F  =  4.927,  p  =  0.03).^  Figure  1  presents  the  interaction.  To 
further  analyze  the  interaction  effect,  we  conduct  two  planned  contrasts  to  test  the  predictions  of 
attribution  theory  (see  Panel  C  of  Table  1).  Attribution  theory  predicts  that  explanation  will 
increase  risk  assessments  for  less  pervasive  weaknesses  and  decrease  risk  assessments  for  more 
pervasive  weaknesses.  A  planned  contrast  comparing  low  explanation  (5.65)  to  high  explanation 
(6.47)  within  the  low-pervasiveness  condition  indicates  that  assessed  risk  is  significantly  higher 
for  higher  levels  of  explanation  detail,  relative  to  lower  levels  of  explanation  detail  (t  =  1.42,  p  < 
0.10,  one-tailed  test).  A  second  planned  contrast  comparing  low  explanation  (6.59)  to  high  expla¬ 
nation  (5.68)  within  the  high-pervasiveness  condition  indicates  that  assessed  risk  is  significantly 
higher  for  low  explanation,  relative  to  high  explanation  (t  =  1.78,  p  <  0.05,  one-tailed  test). 

The  pattern  in  Figure  1,  the  planned  contrast  results,  and  the  mean  responses  all  support  an 
attribution  theory  perspective.  Contrary  to  the  expectations  of  corporate  directors,  detailed  expla¬ 
nations  of  control  weakness  pervasiveness  increase  risk  assessments  for  isolated  control  weak¬ 
nesses,  but  decrease  risk  assessments  for  pervasive  control  weaknesses.  This  finding  is  further 
analyzed  and  discussed  in  the  “Supplemental  Analyses  and  Additional  Data  Collection”  section. 

Supplemental  Analyses  and  Additional  Data  Collection 

Attribution  theory  suggests  that  trust  in  management  is  affected  by  differences  in  disclosure 
expectations  versus  actual  disclosures.  Following  attribution  theory,  we  expect  that  when  manage¬ 
ment  describes  the  isolated  nature  of  less  pervasive  material  weaknesses,  investors’  expectations 
that  management  will  attempt  to  cover  up  bad  news  will  be  confirmed.  Thus,  trust  in  management 
will  decline.  Similarly,  when  investors  are  surprised  by  management’s  detailed  disclosures  that 
emphasize  the  negative  nature  of  a  pervasive  weakness,  attribution  theory  predicts  that  trust  in 
management  will  increase.  Table  2  reports  the  results  of  an  ANOVA  model  where  the  dependent 
variable  is  trust  in  management  and  the  independent  variables  are  control  weakness  pervasiveness, 
explanation  detail,  and  the  interaction  term.  The  analysis  indicates  that  perceptions  of  trust  display 
a  similar  pattern  of  results  to  perceptions  of  changes  in  stock  price.  There  is  a  statistically  signifi¬ 
cant  interaction  of  pervasiveness  and  explanation  detail  (p  =  0.005).  Figure  2  depicts  the  inter¬ 
action.  Increasing  explanation  detail  for  pervasive  weaknesses  increases  trust  in  management, 
while  increasing  explanation  detail  for  isolated  weaknesses  decreases  trust  in  management.  This 
result  is  supported  by  planned  contrasts  that  test  for  attribution  theory’s  predictions  for  differences 
in  trust  perceptions  across  levels  of  explanation  (see  Panel  C,  Table  2).  A  planned  contrast  com¬ 
paring  low  explanation  (5.76)  to  high  explanation  (5.06)  within  the  low-pervasiveness  condition 
indicates  that  trust  in  management  is  significantly  lower  for  higher  levels  of  explanation  detail, 
relative  to  lower  levels  of  explanation  detail  (t  =  1.43,  p  <  0.10,  one-tailed  test).  A  second 


^  We  repeated  the  analyses  described  above  with  our  alternative  measure  of  the  effects  of  the  control  weakness  disclosure 
on  stock  price.  The  hypotheses  tests  produce  the  same  statistical  conclusions. 
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FIGURE  1 

Effects  of  the  Interaction  of  Pervasiveness  and  Explanation  Detail  on  Assessments  of  the 

Likelihood  of  a  Moderate  Stock  Price  Decline 


Pervasiveness 

—  Low 
--  High 


The  figure  displays  the  estimated  marginal  means  from  the  ANOVA  analysis  of  participants’  ratings  of  the 
likelihood  of  a  moderate  stock  price  decline. 

Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasive¬ 
ness  involves  lack  of  access  controls  over  the  entire  system. 

Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific 
effects  of  the  control  weaknesses  are  not  elaborated.  In  the  hfgh  explanation  level,  the  pervasiveness  and  effects 
of  the  control  weaknesses  are  described  in  detail. 


planned  contrast  comparing  low  explanation  (5.45)  to  high  explanation  (6.64)  within  the  high¬ 
pervasiveness  condition  indicates  that  trust  in  management  is  significantly  higher  for  higher  levels 
of  explanation  detail,  relative  to  lower  levels  of  explanation  detail  (t  —  2.72,  p  <  0.01,  one-tailed 
test). 

To  further  examine  the  trust  findings,  we  employ  path  analysis  (Figure  3).  The  relationship 
between  the  interaction  of  pervasiveness  and  explanation  level  and  trust  in  management  is  positive 
and  statistically  significant  (/3  =  0.327,  p  <  0.01),  and  the  relationship  between  trust  and  assess¬ 
ments  of  risk  is  negative  and  statistically  significant  (yS  =  -0.223,  p  <  0.05).  In  addition,  the 
interaction  of  pervasiveness  and  explanation  detail  has  a  direct,  negative  effect  on  assessments  of 
investment  risk  (yS  =  -0.567,  p  <  0.05).  From  the  model,  it  is  apparent  that  the  interaction  of 
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TABLE  2 

Descriptive  Statistics  and  ANOVA 

(Dependent  Variable  =  Trust  in  Management) 

Panel  A:  Descriptive  Statistics 

Mean  (Standard  Deviation)  {Sample  Size}  Responses  across 

Treatment  Conditions 

Explanation  Detail” 

Main  Effect 

Pervasiveness^ 

Low 

High 

Pervasiveness 

Low 

5.76 

5.06 

5.41 

(1.60) 

(1.30) 

(1.47) 

{17} 

{17} 

{34} 

High 

5.45 

6.64 

6.05 

(1.10) 

(1.71) 

(1.54) 

{22} 

{22} 

{44} 

Main  Effect:  Explanation 

5.59 

5.94 

5.77 

(1.33) 

(1.72) 

(1.54) 

{39} 

{39} 

{78} 

Control  Group:  6.86  (1.34)  {19} 

Panel  B:  ANOVA 

Sum  of 

Mean 

Source  Squares 

df 

Square  F 

p-value 

Pervasiveness  (P)“  7.702 

1 

7.702  3.688 

0.059 

Explanation  (E)'’  1.086 

1 

1.086  0.520 

0.473 

P  X  E  17.086 

1 

17.086  8.181 

0.005 

Error  154.550 

74 

2.088 

Panel  C:  Planned  Contrasts 

Low 

High 

t-value 

Explanation 

Explanation 

(p-value)*^ 

Low  Pervasiveness 

5.76 

5.06 

1.43 

(p  <  0.10) 

High  Pervasiveness 

5.45 

6.64 

2.72 

(p  <  0.01) 

“  Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasiveness 
involves  lack  of  access  controls  over  the  entire  system. 

’’  Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific  effects  of 
the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and  effects  of  the  control 
weaknesses  are  described  in  detail. 

One-sided. 


pervasiveness  and  explanation  detail  indirectly  affects  risk  assessments  through  effects  on  trust  in 
management,  but  the  interaction  also  directly  influences  risk  assessments.  Thus,  trust  explains  a 
portion  of  the  effects  of  the  interaction  on  risk  assessments,  but  not  all  of  the  effects.  There  may 
be  other  mediating  variables  that  also  explain  the  interactive  effects  of  pervasiveness  and  expla¬ 
nation  detail. 

Follow-Up  Survey 

In  order  to  better  understand  how  trust  influences  perceptions  of  investment  risk,  we  survey 
(under  controlled  laboratory  conditions)  an  additional  47  M.B.A.  students  with  an  average  of  3.2 
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FIGURE  2 

Effects  of  the  Interaction  of  Pervasiveness  and  Explanation  Detail  on  Trust  in  Management 


Pen/as  iveness 

- Low 

--  High 


The  figure  displays  the  estimated  marginal  means  from  the  ANOVA  analysis  of  participants’  ratings  of  their  trust 
in  management. 

Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasive¬ 
ness  involves  lack  of  access  controls  over  the  entire  system. 

Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific 
effects  of  the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and  elfects 
of  the  control  weaknesses  are  described  in  detail. 


years  of  management  experience.  Participants  responded  to  the  following  three  items:  (1)  “In 
general,  does  the  trustworthiness  of  a  firm’s  top  management  affect  the  market  price  of  the  firm?” 
[response  scale;  1  =  Definitely  No,  5  =  Moderately,  9  Definitely  Yes],  (2)  In  general,  does  the 
trustworthiness  of  a  firm’s  top  management  affect  the  risk  that  financial  disclosures  of  the  firm  will 
not  be  accurate?”  [response  scale:  1  =  Definitely  No,  5  =  Moderately,  9  =  Definitely  Yes].  (3)  In 
general,  how  is  the  long-term  financial  performance  of  a  firm  affected  if  management  has  low 
levels  of  trustworthiness?”  [response  scale:  -5  =  Very  Poor  Performance,  1  =  No  Effect,  5  = 
Very  High  Performance].  The  mean  responses  (standard  deviations)  to  survey  items  are  7.36  (1.69) 


The  Accounting  Review 


September  2010 
American  Accounting  Association 


1800 


Rose,  Norman,  and  Rose 


FIGURE  3 
Structural  Model 


*  p  <  0.05,  two-tailed 
**  p  <  0.01,  two-tailed 


*,  **  p  <  0.05  and  p  <  0.01,  respectively,  two-tailed. 

This  figure  displays  a  path  analysis  that  examines  whether  trust  in  management  mediates  the  relationship  be¬ 
tween  the  interaction  of  pervasiveness  and  explanation  detail  and  assessments  of  investment  risk.  Low  perva¬ 
siveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasiveness  involves  lack  of  access 
controls  over  the  entire  system.  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the 
specific  effects  of  the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and 
effects  of  the  control  weaknesses  are  described  in  detail.  Assessments  of  investment  risk  are  participants’  ratings 
of  the  likelihood  of  a  moderate  stock  price  decline.  Coefficients  represent  standardized  betas. 


for  item  (1),  —3.37  (1.65)  for  item  (2),  and  7.28  (1.08)  for  item  (3).  For  items  (1)  and  (2),  the 
responses  are  significantly  different  from  a  response  of  “Definitely  No”  (p  <  0.001),  and  are  also 
significantly  greater  than  a  response  of  “Moderately”  (p  <  0.001).  For  item  (3),  the  mean  response 
is  significantly  less  than  “No  Effect”  (p  <  0.001).  Taken  together,  the  survey  responses  indicate 
that  concerns  about  management’s  trustworthiness  affect  investors’  perceptions  of  information  risk 
and  risk  of  actual  declines  in  performance.  Investors  appear  to  be  concerned  that,  consistent  with 
some  recent  archival  research  (Francis  et  al.  2005;  Aboody  et  al.  2005;  Ecker  et  al.  2006;  Kravet 
and  Shevlin  2009),  information  risk  is  priced,  and  they  also  perceive  that  management  credibility 
is  associated  with  a  firm’s  future  performance.  Pilot  testing  also  suggested  that  trust  in  manage¬ 
ment  can  be  associated  with  concerns  about  the  potential  for  restatements  in  the  future.  While  we 
cannot  disentangle  which  of  these  perceptions  drives  our  results,  it  is  apparent  that  the  interaction 
of  pervasiveness  and  explanation  detail  has  the  potential  to  alter  perceptions  of  information  risk, 
risks  related  to  actual  future  performance,  and  risks  that  the  market  will  price  management’s  lack 
of  honest  disclosures. 

Other  Analyses 

Next,  we  perform  analyses  to  rule  out  alternative  interpretations  of  our  findings.  Potentially, 
increased  explanation  detail  could  have  unintended  consequences  on  investors’  perceptions  of  the 
pervasiveness  of  control  weaknesses.  For  example,  if  additional  explanation  makes  the  low¬ 
pervasiveness  condition  appear  more  pervasive  or  the  high-pervasiveness  condition  appear  less 
pervasive,  then  this  would  explain  our  interaction  results.  Table  3  presents  an  analysis  of  the 
effects  of  pervasiveness  and  explanation  detail  on  investors’  assessments  of  the  pervasiveness  of 
the  control  weakness.  The  pattern  of  means  refutes  this  alternative  explanation,  and  there  is  a 
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TABLE  3 


Descriptive  Statistics  and  ANOVA 
(Dependent  Variable  =  Assessments  of  Weakness  Pervasiveness) 


Panel  A:  Descriptive  Statistics 

Mean  (Standard  Deviation)  {Sample  Size}  Responses  across  Treatment  Conditions 

Explanation  Detail'’ 

Pervasiveness^  Low  High 


Main  Effect 
Pervasiveness 


Low 

High 

Main  Effect:  Explanation 

5.24 

(1.82) 

{17} 

6.18 

(1.76) 

{22} 

5.77 

(1.83) 

{39} 

5.12 

(1.90) 

{17} 

7.55 

(1.37) 

{22} 

6.49 

(2.01) 

{39} 

5.18 

(1.83) 

{34} 

6.86 

(1.71) 

{44} 

6.13 

(1.94) 

{78} 

Panel  B:  ANOVA 

Source 

Sum  of 
Squares 

df 

Mean 

Square 

F 

p-value 

Pervasiveness  (P)^ 

54.595 

1 

54.595 

18.743 

0.001 

Explanation  (E)'’ 

7.444 

1 

7.444 

2.556 

0.114 

P  X  E 

10.521 

1 

10.521 

3.612 

0.061 

Error 

215.551 

74 

2.913 

^  Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasiveness 
involves  lack  of  access  controls  over  the  entire  system. 

Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific  effects  of 
the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and  effects  of  the  control 
weaknesses  are  described  in  detail. 


statistically  significant  interaction  of  pervasiveness  and  explanation  detail  (F  -  3.612,  p  -  0.06). 
Assessments  of  pervasiveness  increase  when  more  explanation  is  provided  (7.55),  relative  to  when 
less  explanation  is  provided  (6.18)  for  the  high-pervasiveness  condition.  In  the  low-pervasiveness 
condition,  assessments  of  pervasiveness  do  not  increase  when  more  explanation  is  provided  (5.12), 
relative  to  when  less  explanation  is  provided  (5.24).  Based  only  upon  perceptions  of  pervasive¬ 
ness,  increased  explanation  would  be  expected  to  decrease  assessments  of  investment  risk  for  less 
pervasive  risks  and  increase  assessments  of  investment  risk  for  more  pervasive  risks.  We  find 
precisely  the  opposite  results. 

Follow-Up  Experiment 

We  conducted  a  follow-up  experiment  to  provide  additional  evidence  that  attribution  theory 
explains  our  results,  rather  than  the  results  being  an  artifact  of  the  wording  of  our  disclosures.  The 
follow-up  experiment  employed  the  same  dependent  and  independent  variables,  and  used  the  same 
experimental  materials,  but  with  one  change.  Rather  than  providing  participants  with  the  Manage¬ 
ment  Report  on  Internal  Controls  and  the  auditor’s  opinion  on  the  Management  Report  on  Internal 
Controls,  we  informed  participants  that  management  had  identified  a  material  control  weakness 
and  that  the  auditor  concurred  that  there  was  a  material  control  weakness.  Therefore,  there  were  no 
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lengthy  reports  provided  in  the  follow-up  experiment.  Most  importantly,  the  description  of  the 
control  weakness  was  provided  by  the  external  auditor,  rather  than  management.  The  case  mate¬ 
rials  stated  the  following:^ 

The  external  auditor  has  described  the  material  weakness  as  follows:  “The  material  weaknesses 
resulted  from  a  failure  to  maintain  adequate  password  and  access  controls  over  the  portion  of  the 
Enterprise  Resource  Planning  system  that  ensures  proper  recording  of  sales  in  the  appropriate 
reporting  period.” 

The  purpose  of  this  alternative  approach  is  to  remove  the  effects  predicted  by  attribution 
theory.  While  the  wording  of  the  disclosures  remains  the  same,  the  disclosures  no  longer  come 
from  management,  and  the  descriptions  of  the  weaknesses  are  prepared  by  the  external  auditor. 
Thus,  the  new  experiment  removes  expectations  about  management’s  incentives  to  downplay  bad 
news  and  overstate  good  news  because  the  disclosures  are  not  prepared  by  management.  With 
management  incentives  and  investor  expectations  removed,  we  no  longer  expect  increased  expla¬ 
nation  detail  to  increase  trust  in  management  for  pervasive  weaknesses  or  decrease  trust  in  man¬ 
agement  for  isolated  weaknesses.  In  addition,  we  no  longer  expect  that  increased  explanation 
detail  will  decrease  risk  assessments  for  pervasive  weaknesses  or  increase  risk  assessments  for 
isolated  weaknesses.  The  follow-up  experiment  makes  it  possible  to  verify  that  attribution  theory 
explains  the  interactive  effects  of  pervasiveness  and  explanation  detail  on  risk  assessments. 

Tables  4  and  5  present  results  from  the  follow-up  experiment.  Fifty-three  M.B.A.  students 
(from  the  same  population  of  M.B.A.  students  as  the  main  experiment)  with  an  average  of  2.8 
years  of  management  experience  participated.  From  Table  4,  it  is  apparent  that  increased  perva¬ 
siveness  increases  assessments  of  risk  (main  effect,  F  =  27.009,  p  =  0.004),  but  there  is  no  longer 
a  significant  interaction  of  pervasiveness  and  explanation  detail  (F  =  0.244,  p  >  0.50).  Further, 
the  pattern  of  means  in  Panel  A  of  Table  4  and  the  significant  main  effect  of  explanation  detail 
indicate  that  risk  assessments  decrease  whenever  explanations  are  more  detailed  (F  =  18.416,  p  = 
0.015).  Overall,  the  findings  for  assessments  of  investment  risk  are  consistent  with  directors’ 
expectations  and  the  tenets  of  psychological  theories  of  risk  (Slovic  1987).  That  is,  pervasiveness 
increases  risk  assessments,  and  additional  explanation  has  the  potential  to  decrease  the  “unknown” 
and  decrease  risk  assessments.  When  management  incentives  are  removed,  identical  descriptions 
of  the  control  weaknesses  no  longer  produce  the  pattern  of  effects  found  in  the  main  experiment. 

Similarly,  analyses  of  trust  in  management  no  longer  follow  the  pattern  found  in  the  main 
experiment.  Table  5  presents  the  trust  results.  There  are  no  statistically  significant  effects  of 
pervasiveness  or  explanation  detail  on  trust  in  management  in  the  follow-up  experiment.  Without 
investors’  expectations  of  management’s  incentives,  and  without  consistency  or  inconsistency 
between  these  expectations  and  the  content  of  disclosures  from  management,  the  disclosures  do 
not  alter  trust  in  management.  Taken  together,  the  results  of  this  experiment  help  to  rule  out  the 
possibility  that  the  wording  of  the  explanations  was  the  driver  of  the  results.  Rather,  the  expecta¬ 
tions  that  management  will  downplay  bad  news  and  exaggerate  good  news,  and  the  effects  of  these 
expectations  on  attributions  related  to  management  trustworthiness  apparently  caused  the  interac¬ 
tive  effects  of  pervasiveness  and  explanation  detail  on  perceptions  of  risk. 

V.  DISCUSSION  AND  CONCLUSIONS 

Investors  rely  upon  audited  financial  statements  to  make  investment  decisions,  trusting  man¬ 
agement’s  assertions  and  the  independent,  expert  opinion  of  the  auditors  regarding  the  fairness 


^  This  is  the  statement  from  the  low-pervasiveness/low  explanation  detail  treatment.  Each  of  the  four  treatment  conditions 
employed  descriptions  of  the  control  weakness  that  were  identical  to  the  descriptions  in  the  main  experiment,  and  each 
description  was  prefaced  by  the  statement  that  the  external  auditor  was  describing  the  weakness. 
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TABLE  4 

Follow-Up  Experiment 
Descriptive  Statistics  and  ANOVA 
(Dependent  Variable  =  Risk  of  Moderate  Stock  Price  Decline) 

Panel  A:  Descriptive  Statistics 

Mean  (Standard  Deviation)  {Sample  Size}  Responses  across  Treatment  Conditions 

Explanation  Detail** 

- - -  MainEffect 

Pervasiveness**  Low  High  Pervasiveness 


Low 

High 

Main  Effect:  Explanation 

5.08 

(1.44) 

{12} 

6.29 

(1.59) 

{14} 

5.73 

(1.62) 

{26} 

3.67 

(1.55) 

{15} 

5.33 

(2.18) 

{12} 

4.41 

(2.01) 

{27} 

4.29 

(1.64) 

{27} 

5.85 

(1.91) 

{26} 

5.06 

(1.93) 

{53} 

Panel  B:  ANOVA 

Source 

Sum  of 
Squares 

df 

Mean 

Square 

F 

p-value 

Pervasiveness  (P)*“ 

27.009 

1 

27.009 

27.009 

0.004 

Explanation  (E)** 

18.416 

1 

18.416 

18.416 

0.015 

P  X  E 

0.707 

1 

0.707 

0.244 

0.623 

Error 

141.774 

49 

2.893 

Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasiveness 
involves  lack  of  access  controls  over  the  entire  system. 

Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific  effects  of 
the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and  effects  of  the  control 
weaknesses  are  described  in  detail. 


with  which  the  annual  financial  statements  have  been,presented.  However,  corporate  failures  over 
the  past  decade  suggest  that  neither  the  auditors  nor  management  of  failed  companies  were  fully 
committed  to  protecting  the  public  trust  (Powers  et  al.  2002;  Rosen  2003).  The  Sarbanes-Oxley 
Act  and  associated  SEC  regulations  were  created  to  restore  investor  confidence  in  financial  mar¬ 
kets  and  to  provide  investors  with  high-quality  information  for  assessing  investment  risk.  We 
investigate  the  effects  of  control  weakness  disclosures,  as  required  by  Section  404  of  the  Sarbanes- 
Oxley  Act,  on  investors’  perceptions  of  investment  risk. 

The  results  from  laboratory  experiments  with  nonprofessional  investors  indicate  that  assess¬ 
ments  of  investment  risk  increase  for  firms  with  material  control  weakness  disclosures,  relative  to 
firms  without  such  disclosures.  This  finding  is  consistent  with  archival  research  that  finds  negative 
stock  price  reactions  resulting  from  control  weakness  disclosures,  indicating  that  investors  use 
information  about  internal  control  weaknesses  when  evaluating  investment  risk. 

Directors  believe  that  more  detailed  explanation  of  control  weaknesses  and  their  pervasive¬ 
ness  will  reduce  risk  assessments  when  the  control  weaknesses  are  not  pervasive,  and  they  desire 
to  protect  the  interests  of  the  firms  they  represent  by  clearly  explaining  the  limited  effects  of  less 
pervasive  weaknesses  on  financial  reporting  systems.  This  perspective  is  supported  by  theories  of 
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TABLE  5 


Follow-Up  Experiment 
Descriptive  Statistics  and  ANOVA 
(Dependent  Variable  =  Trust  in  Management) 


Panel  A:  Descriptive  Statistics 

Mean  (Standard  Deviation)  {Sample  Size}  Responses  across  Treatment  Conditions 

Explanation  Detail** 

Pervasiveness^  Low  High 


Main  Effect 
Pervasiveness 


Low 

High 

Main  Effect:  Explanation 

4.75 

(2.22) 

{12} 

4.79 

(1.43) 

{14} 

4.77 

(1.80) 

{26} 

5.13 

(1.77) 

{15} 

4.58 

(2.07) 

{12} 

4.89 

(1.89) 

{27} 

4.96 

(1.95) 

{27} 

4.69 

(1.72) 

{26} 

4.83 

(1.83) 

{53} 

Panel  B:  ANOVA 

Source 

Sum  of 
Squares 

df 

Mean 

Square 

F 

p-value 

Pervasiveness  (P)“ 

0.868 

1 

0.868 

0.248 

0.620 

Explanation  (E)'* 

0.107 

1 

0.107 

0.031 

0.862 

P  X  E 

1.126 

1 

1.126 

0.322 

0.573 

Error 

171.257 

49 

3.495 

“  Pervasiveness:  Low  pervasiveness  involves  lack  of  access  controls  over  a  single  system,  while  high  pervasiveness 
involves  lack  of  access  controls  over  the  entire  system. 

’’  Explanation  Detail:  In  the  low  explanation  level,  the  pervasiveness  of  the  control  weaknesses  and  the  specific  effects  of 
the  control  weaknesses  are  not  elaborated.  In  the  high  explanation  level,  the  pervasiveness  and  effects  of  the  control 
weaknesses  are  described  in  detail. 


risk  perception,  but  attribution  theory  predicts  different  results.  We  find  that  investors  do  recognize 
the  pervasiveness  of  internal  control  weaknesses  and  they  use  this  information  to  generate  invest¬ 
ment  risk  assessments.  However,  the  anticipated  benefits  of  additional  disclosure  sought  by  direc¬ 
tors  are  not  supported.  Rather,  results  suggest  that  providing  a  detailed  explanation  that  a  control 
weakness  is  isolated  and  not  pervasive  increases  forward-looking  risk  assessments  relative  to  not 
providing  such  explanation.  Conversely,  providing  an  explanation  that  a  control  weakness  is  very 
pervasive  decreases  forward-looking  risk  assessments  relative  to  not  providing  a  detailed  expla¬ 
nation.  While  we  find  that  investors  use  information  about  control  weakness  pervasiveness  to 
assess  risk,  pervasiveness  is  a  multi-faceted  construct  (e.g.,  multiple  versus  single  accounts,  larger 
versus  smaller  dollar  amounts,  less  versus  more  vague  effects  on  financial  statement).  Future 
research  will  be  necessary  to  determine  which  components  of  pervasiveness  are  most  important  to 
investors,  if  this  is  desirable. 

We  find  evidence  that  changes  in  trust  in  management  likely  explain  the  pattern  we  observe. 
The  nonprofessional  investors  in  our  experiments  believe  that  management  is  not  trustworthy 
when  detailed  explanations  are  provided  for  non-pervasive  control  weaknesses,  while  investors 
believe  management  is  trustworthy  when  detailed  explanations  are  provided  for  pervasive  control 
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weaknesses.  The  results  indicate  that  investors  perceive  that  management  is  being  dishonest  when 
detailed  explanations  are  provided  for  isolated  control  weaknesses  and  that  management  is  at¬ 
tempting  to  cover  up  the  seriousness  of  the  control  weakness  by  claiming  that  it  is  non-pervasive. 
Detailed  explanations  of  pervasive  control  weaknesses,  however,  are  viewed  as  indicators  that 
management  is  trustworthy — that  management  is  openly  admitting  the  serious  nature  of  the  con¬ 
trol  weakness. 

Overall,  our  findings  respond  to  concerns  from  archival  research  studies  that  investors’  risk 
judgments  are  more  complex  than  suggested  by  prior  accounting  risk  models  and  that  financial 
risk  judgments  are  influenced  by  probabilities,  outcomes,  and  numerous  behavioral  variables 
(Koonce  et  al.  2005).  Further,  Schwarzkopf  (2006)  claims  that  we  do  not  really  understand  what 
makes  a  business  situation  appear  more  or  less  risky  to  an  individual,  and  therefore,  managers 
should  make  a  greater  effort  to  understand  how  investors  view  risk  and  to  consider  more  open 
stakeholder  dialog,  more  involved  message  content,  and  increased  transparency.  The  explanations 
provided  by  management  in  control  weakness  disclosure,  and  the  interaction  of  these  explanations 
with  the  pervasiveness  of  the  weaknesses  have  meaningful  effects  on  investors’  judgments  of  risk, 
and  we  provide  insight  into  the  complex  processes  that  affect  these  risk  judgments. 
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PETER  O.  CHRISTENSEN  and  GERALD  A.  FELTHAM,  Equity  Valuation  (Hanover,  MA; 
Foundations  and  Trends®  in  Accounting,  2009,  ISBN  978-1-60198-272-8,  Vol.  4,  No.  1, 
pp.  X,  115). 


Let  me  begin  by  saying  that  this  is  a  great  booklet.'  The  authors  propose  using  the  riskless  spot  rate  and 
forward  rates,  as  opposed  to  risk-adjusted  discount  rates,  in  discounting  future  earnings  and  cash  flows  to  value 
equity.  It  is  a  key  conceptual  departure  from  practitioners’  orthodoxy.  Like  most  analytical  works,  the  beauty  of  its 
conciseness  and  precision  is  evident  throughout.  The  authors  also  put  together  various  parts  of  the  analysis  through 
a  consistent  and  overarching  theme,  making  sure  that  readers  are  not  lost  in  the  mathematical  details  and  do  not 
lose  sight  of  the  big  picture.  In  my  mind,  they  strike  an  almost  perfect  balance  between  making  the  book  precise 
and  solid  for  academics,  and  at  the  same  time  engaging  and  persuasive  for  practitioners.  Because  of  this,  it  appeals 
to  a  wide  range  of  readers.  1  highly  recommend  this  book  to  anyone  who  is  interested  in  the  area  of  accounting- 
based  valuation.  Overall,  on  the  scale  of  1  to  5,  1  would  rate  this  book  a  solid  4. 

Although  the  authors  did  a  great  job  in  making  this  book  engaging  and  accessible,  it  is  definitely  not  a 
beginners’  book.  Readers  are  assumed  to  possess  a  fair  amount  of  knowledge  about  the  valuation  literature  as  well 
as  the  common  modeling  ingredients  and  techniques.  The  good  news  is  that  the  authors  make  frequent  references 
to  a  couple  of  related  books  with  more  detailed  analysis  (Christensen  and  Feltham  2003;  Penman  2007).  Thus,  for 
those  who  are  interested  in  delving  into  more  details,  these  are  readily  available  in  those  books. 

The  book  is  titled,  broadly,  as  Equity  Valuation.  It  deals  mostly  with  one  particular  issue:  In  estimating  equity 
value  through  discounting  future  payoffs,  should  we  adjust  for  risk  in  the  numerator  (i.e.,  the  forecasted  future 
payoffs)  or  in  the  denominator  (i.e.,  the  discount  rate)?  This  is  an  important  question.  In  performing  accounting- 
based  valuation  analysis,  one  usually  replaces  cash  flow  with  either  residual  income  or  abnormal  earnings  growth. 
Either  way  we  need  to  deal  with  the  issue  of  what  discount  rate  should  be  used  in  calculating  residual  income. 
Following  the  traditional  approach  of  incorporating  risk  into  such  discount  factors  poses  two  serious  problems. 
First,  when  such  discount  factors  are  stochastic,  which  is  often  the  case  with  the  non-deterministic  term  structure 
of  interest  rates,  inflation,  and/or  risk  adjustment,  we  are  faced  with  the  difficult  task  of  determining  the  covariance 
between  the  discount  rate  and  the  book  value  in  calculating  further  residual  earnings.  Second,  and  more  impor¬ 
tantly,  if  the  discount  rate  already  incorporates  risk  adjustment,  then  it  contradicts  one  of  the  important  goals  of  the 
whole  analysis,  namely,  to  use  accounting  data  as  input  for  the  calculation  of  proper  risk  adjustment.  This  is  why 
it  is  crucial  for  us  to  establish  a  theory  of  how  to  adjust  for  risk  in  the  numerator,  leaving  the  discount  rate 
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deterministic.  Of  course,  even  putting  aside  the  need  for  incorporating  accounting  in  equity  valuation,  it  is  utterly 
essential  to  be  able  to  determine  a  proper  discount  rate  in  the  valuation  of  any  claim. 

Within  the  equity  valuation  literature,  this  issue  has  been  largely  ignored.  Since  the  establishment  of  the 
CAPM  model,  the  finance  literature  on  valuation  has  mostly  flourished  in  some  other  areas,  most  noticeably 
market  micro-structure,  and  most  recently  behavioral  finance.  Within  the  neoclassical  framework,  work  has  been 
done  to  address  issues  such  as  adaptive  learning,  stochastic  expected  stock  return,  and  stochastic  risk  premium. 
This  paper  revisits  a  quite  fundamental  issue,  i.e.,  should  we  adjust  risk  in  the  numerator  or  the  denominator?  It 
extends  the  recent  works  by  Feltham  and  Ohlson  (1999)  and  Christensen  and  Feltham  (2003),  which  are  direct 
descendants  of  the  seminal  work  of  Rubinstein  (1976),  who  characterized  the  relationship  between  the  valuation 
of  cash  flow  streams  and  the  prices  of  zero-coupon  bonds  in  a  multi-period  setting  under  uncertainty. 

The  authors  advocate  adjusting  risk  in  the  numerator  rather  than  in  the  denominator.  In  other  words,  they 
propose  using  the  riskless  spot  rate  and  forward  rates  as  the  discount  rates  in  valuation.  The  authors  list  several 
problems  with  the  traditional  approach  in  estimating  the  risk-adjusted  discount  rate.  They  are: 

•  many  of  the  ideas  are  developed  in  a  single-period  setting,  which  cannot  be  easily  carried  over  to 
multi-period  setting.  Most  noticeably,  calculating  cost  of  capital  as  a  weighted  average  of  cost  of  equity 
and  cost  of  debt  is  likely  to  introduce  error  into  a  multi-period  setting,  because  the  internal  rate  of  return 
on  a  portfolio  of  assets  differs  from  the  weighted  average  of  the  internal  rates  of  returns  on  the  individual 
assets; 

•  with  stochastic  expected  stock  returns,  the  estimate  of  average  return  becomes  a  biased  measure  of  the 
average  cost  of  capital  used  in  valuation,  due  to  Jensen’s  inequality;  and 

•  even  with  recent  asset-pricing  models,  which  address  the  stochastic  expected  return  issue  with  date- 
specific,  risk-adjusted  discount  rates,  such  rates  may  not  exist  in  certain  cases. 


The  authors  develop  an  equilibrium  asset-pricing  model  with  risk  adjustments  depending  on  the  time-series 
properties  of  cash  flows  and  the  accounting  policy.  They  show  that  operating  characters  such  as  the  growth  and 
persistence  of  earnings  can  affect  the  risk  adjustment. 

What  are  the  highlights  of  this  book?  The  book  contains  five  chapters  and  three  appendices.  Chapters  2  to  5 
each  contain  separate  yet  closely  related  topics.  Chapter  2  reviews  and  identifies  problems  with  the  implementa¬ 
tion  of  the  classical  model.  In  Chapters  3  to  5,  the  authors  develop  an  accounting-based,  multi-period  asset-pricing 
model  with  KARA  utility.  My  preferenees  are  Chapters  2  and  5.  Chapter  2  contains  a  critical  review  of  the 
elassical  valuation  approach  with  a  constant  risk-adjusted  discount  rate.  As  noted  above,  the  authors  highlight 
several  problems  in  estimating  these  models.  Many  of  these  issues  are  not  properly  acknowledged  and/or  dealt 
with  in  many  of  the  textbooks.  The  authors  provide  a  nice  step-by-step  analysis  of  the  problems  and  possible 
solutions. 

Chapter  5  contains  the  punch  line.  The  authors  push  ahead  with  the  idea  of  adjusting  risk  in  the  numerator, 
and  deal  with  the  thorny  issue  of  identifying  and  simplifying  the  so-called  “pricing  kernel.”  Although  the  final 
model  involves  a  rather  simplifying  assumption  of  a  simple  VAR  model  of  the  stochastic  processes  of  residual 
income  and  for  the  consumption  index,  it  provides  striking  and  promising  ideas  of  how  to  estimate  and  adjust  for 
risk  based  on  fundamentals,  as  opposed  to  stock  return.  It  provides  a  nice  illustration  of  how  to  ineorporate 
time-change  risk  characteristics  of  firms  with  the  change  in  firms’  operations  (captured  by  the  change  in  residual 
income).  This  is  very  encouraging. 

There  are  some  unsettling  issues  in  this  book.  Not  surprisingly,  I  find  the  authors’  review  of  the  classical 
valuation  approach  to  be  somewhat  tilted  toward  the  negative  side.  For  instanee,  many  of  the  problems  cited  arise 
from  the  practice  of  estimating  a  single,  constant  risk-adjusted  discount  rate  for  all  future  periods.  This  seems  to 
be  based  on  the  assumption  that  firms’  risk  characteristics  do  not  change  materially  over  future  periods.  Of  eourse, 
this  is  a  grossly  simplified  approach  in  dealing  with  the  issues  of  time-changing  interest  rates  and  inflation.  To  me, 
errors  introduced  by  sueh  an  approach  reflect  more  the  shortcomings  in  the  empirieal  or  practical  implementation, 
rather  than  the  shortcomings  in  the  valuation  approaeh  per  se.  As  noted  by  the  authors,  using  date-specifie  discount 
rates  can  avoid  many  of  the  problems.  After  all,  under  most  circumstances  in  a  neo-classieal  framework,  putting 
the  risk  adjustment  in  the  numerator  or  in  the  denominator  may  simply  be  an  easy  mathematical  transformation. 
In  some  cases,  of  eourse,  adjusting  risk  in  the  denominator  does  not  lead  to  any  solution  to  the  problem.  In  that 
sense,  adjusting  in  the  numerator  is  more  flexible. 

After  finishing  the  book,  I  asked  myself  the  following  question:  Am  I  convinced  that  the  practice  of  adjusting 
risk  in  the  diseount  rate  should  be  abolished?  The  answer  seems  unclear,  for  a  couple  of  reasons.  First,  despite  the 
authors’  admirable  effort  in  bringing  context  to  it,  the  concept  of  “consumption  index”  still  seems  rather  elusive. 
As  a  result,  it  lacks  the  appeal  of  the  traditional  CAPM,  namely,  a  clear  and  intuitive  idea  of  risk  adjustment. 
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Second,  given  that  the  “consumption  index”  or  pricing  kernels  are  by  nature  event-price  measures,  measuring  the 
“consumption  index”  empirically  is  likely  to  introduce  considerable  error.  Such  error,  in  all  hkelihood,  may  exceed 
what  we  observe  with  the  calculation  of  WACC  using  CAPM.  Of  course,  this  is  ultimately  an  empirical  issue. 
From  a  difference  perspective,  such  concerns  also  make  the  issue  more  interesting  and  engaging  for  future 
research. 

In  sum,  despite  the  somewhat  unsettling  issues  mentioned  above,  this  work  makes  a  great  handbook  for 
anyone  who  is  interested  in  pursuing  research  in  accounting-based  equity  valuation.  Since  each  chapter  essentially 
deals  with  one  independent,  yet  interrelated  issue,  the  work  constitutes  a  nice  reference  book.  It  is  particularly 
useful  for  doctoral  students  who  are  interested  in  pursuing  work  in  this  area.  It  would  be  a  fun  and  engaging 
exercise  for  them  to  fill  in  the  missing  proofs  in  each  step  of  the  derivations.  I  highly  recommend  it. 

Traditionally,  core  courses  in  finance  and  accounting  divide  materials  roughly  along  the  line  of  numerator  and 
denominator:  accounting  professors  teach  students  how  to  analyze  and  forecast  future  earnings  and  cash  flows,  and 
finance  professors  teach  students  how  to  properly  discount  them.  Given  this  division  of  labor,  I  doubt  that  my 
finance  colleagues  would  happily  embrace  the  idea  of  discounting  everything  with  zero-couple  rate  since  that  does 
not  leave  them  much  to  talk  about.  But  as  an  accountant,  I  like  that  idea. 
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PAUL  M.  CLIKEMAN,  Called  to  Account:  Fourteen  Financial  Frauds  that  Shaped  the 
American  Accounting  Profession  (New  York,  NY;  Routledge,  2009,  ISBN  978-0-415- 
99698-3,  pp.  X,  345). 

This  book  is  a  complete  package  for  auditing  professors  and  students  because  it  includes  more  than  just 
descriptions  of  14  significant  accounting  scandals  that  occurred  in  the  United  States.  The  book  is  divided  into 
seven  parts.  Each  part  begins  with  a  chapter  that  provides  a  context  for  the  discussion  of  the  frauds  described  in 
that  part  and  concludes  with  a  chapter  describing  the  regulatory  fallout  from  those  frauds.  For  the  instructor,  an 
appendix  includes  a  set  of  discussion  questions  for  each  chapter  and  a  table  integrating  Called  to  Account  with 
popular  auditing  texts.  Both  instructor  and  student  will  find  the  extensive  lists  of  bibliographic  references  should 
they  wish  to  refer  to  original  sources. 

Called  to  Account  does  an  excellent  job  of  presenting  the  frauds  as  catalysts  for  the  changes  that  have  taken 
place  in  regulation,  in  accounting  and  auditing  standards,  and  in  accounting  practice  over  the  period  from  the  early 
1900s  to  the  first  part  of  the  21st  century.  The  information  about  the  frauds  is  presented  in  enough  detail  so  that 
the  reader  can  see  why  the  fraud  was  successful.  I  recall  Professor  John  Willingham  saying  a  number  of  years  ago 
that  most  partners  and  managers  in  public  practice  had  not  been  involved  in  an  audit  where  there  had  been  a  fraud 
and  thus  were  not  as  well  equipped  as  they  might  be  to  recognize  a  fraud  at  an  audit  client.  Called  to  Account 
provides  enough  detail  that  the  readers  can  easily  imagine  themselves  in  the  picture  and  being  required  to  make  the 
decisions  the  auditor  had  to  make. 

Chapter  1,  “Scandal  and  Reform,”  provides  a  history  of  public  accounting  and  regulation  in  America  begin¬ 
ning  with  the  public’s  interest  in  the  stock  market  in  the  early  years  of  the  20th  century  through  to  the  collapse  of 
WorldCom  in  2002. 
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Part  I  Birth  of  a  Profession 

Chapter  2,  “Out  of  Darkness,”  examines  the  early  history  of  capital  markets  and  the  public  accounting 
profession.  The  author  states,  “U.S.  Steel  defied  convention  when  it  published  its  first  set  of  audited  financial 
statements  in  1903”  (p.  17).  Prior  to  that  date  most  companies  believed  that  opaqueness  was  preferred  (and 
cheaper).  The  chapter  describes  the  early  attempts  by  the  fledgling  public  accounting  profession  to  encourage 
better  financial  reporting. 

In  Chapter  5,  “Into  the  Spotlight,”  Clikeman  ties  the  Ivar  Kreuger  and  McKesson  &  Robbins  frauds,  dis¬ 
cussed  in  Chapters  3  and  4,  to  the  Securities  Act  of  1933  and  the  Securities  Exchange  Act  of  1934  and  the  setting 
up  of  the  Securities  and  Exchange  Commission  (SEC).  He  also  describes  the  development  of  Statement  on 
Auditing  Procedure  No.l,  Extensions  of  Auditing  Practice,  by  a  committee  of  the  American  Institute  of  Accoun¬ 
tants  (AIA).  This  standard  mandated  the  auditor’s  observation  of  client  inventory  counts  and  direct  confirmation  of 
accounts  receivable. 


Part  II  The  Profession’s  Principle  Problem 

Chapter  6,  “Generally  Accepted  Accounting  Principles,”  traces  the  development  of  generally  accepted  ac¬ 
counting  principles  (GAAP)  from  Pacioli  in  1494  through  the  establishment  by  the  AIA’s  Committee  on  Account¬ 
ing  Procedure  (CAP)  in  1938  and  the  Accounting  Principles  Board  (APB)  in  1959.  Clikeman  chronicles  the 
problems  that  the  CAP  and  the  APB  had  in  persuading  the  public  accounting  profession  and  business  community 
of  the  need  for  better  accounting. 

The  frauds  discussed  in  this  part  are  National  Student  Marketing  and  Equity  Funding. 

Chapter  9,  “Deja  vu,”  is  a  continuation  of  Chapter  6.  The  formation  of  the  Financial  Accounting  Standards 
Board  (FASB)  is  described.  The  discussions  in  Chapter  6  about  the  problems  that  the  CAP  and  APB  had  and  in  this 
chapter  the  problems  that  the  FASB  had  in  setting  accounting  principles  that  were  generally  accepted  provide  a 
clear  picture  of  how  difficult  it  has  been  to  build  consensus.  The  chapter  also  includes  a  discussion  of  Congress’ 
Moss  and  Metcalf  investigations  and  the  Cohen  Commission  on  Auditors’  Responsibilities  (1978),  which  stated  in 
its  report  that  an  “expectation  gap”  existed  between  what  auditors  believed  they  should  do  and  what  the  public 
expected  of  auditors. 


Part  III  The  Savings  and  Loan  Crisis 

Chapter  10,  “It’s  a  Wonderful  Life,”  discusses  the  Gam-St.  Germain  Bill  passed  by  Congress  in  1982,  which 
“greatly  expanded  the  types  of  assets  savings  and  loans  were  permitted  to  own”  (p.  86).  Clikeman  describes  how 
savings  and  loan  accounting,  including  Regulatory  Accounting  Principles  (RAP),  allowed  the  savings  and  loan 
associations  to  manipulate  their  earnings  to  appear  to  be  more  profitable  than  they  were.  He  also  describes  how  the 
Federal  Savings  and  Loan  Insurance  Corporation  lowered  the  savings  and  loan  net  worth  requirement  in  the  1980s 
from  5  percent  to  4  percent  to  3  percent.  Moral  hazard  became  a  problem. 

The  frauds  discussed  are  ESM  Government  Securities  and  Lincoln  Savings  &  Loan. 

Chapter  13,  “Bank  Robbers,”  describes  Congressional  hearings  that  were  critical  of  the  management  prac¬ 
tices  of  the  savings  and  loans  and  also  of  the  auditors  of  those  savings  and  loans.  In  1989  and  1991,  the  Senate  and 
House  passed  two  acts  dealing  with  problems  with  savings  and  loans  and  with  banks,  respectively.  Both  acts  added 
significant  regulation  for  both  kinds  of  financial  institutions.  The  chapter  includes  a  discussion  of  the  Foreign 
Corrupt  Practices  Act  (FCPA)  passed  in  1977  and  the  battles  between  the  SEC  and  the  public  accountants  and 
industry  over  reporting  by  management  on  internal  control. 


Part  IV  The  Expectation  Gap 

Chapter  14,  “Auditors  and  Fraud,”  is  a  concise  history  of  auditing  from  3200  B.C.  to  the  beginning  of  the 
20th  century,  followed  by  a  discussion  of  the  evolution  of  the  auditor’s  responsibility  for  the  detection  of  fraud 
from  the  late  1800s  to  the  mid-1980s. 

The  two  frauds  reviewed  are  ZZZZ  Best  and  Crazy  Eddie. 

Chapter  17,  “Closing  the  Gap,”  describes  the  creation  in  1985  of  the  National  Commission  on  Fraudulent 
Financial  Reporting  (Treadway  Commission)  by  the  Committee  of  Sponsoring  Organizations  (COSO).  A  conclu¬ 
sion  of  the  Treadway  Commission’s  report,  which  was  issued  in  1987,  was  that  management  bore  the  primary 
responsibility  for  financial  reporting.  The  chapter  also  discusses  the  nine  “Expectation  Gap”  auditing  standards  and 
the  replacement  of  Statement  on  Auditing  Standards  (SAS)  No.  53  with  SAS  No.  82,  which  in  turn  was  replaced 
by  SAS  No.  99. 
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Part  V  The  Litigation  Crisis 

Chapter  18,  “Auditors’  Legal  Liability,”  begins  with  the  statement,  “Auditors  in  the  early  1900s  were  subject 
to  scant  discipline”  (p.l59),  and  then  provides  a  brief  summary  of  auditors’  liability  to  third  parties.  The  rest  of  the 
chapter  provides  a  historical  perspective  of  auditors’  liability  to  clients,  auditors’  liability  to  third  parties,  and 
auditors’  liability  under  statutory  law.  1  found  this  to  be  the  weakest  chapter  in  the  book,  because  the  coverage  of 
the  topic  is  too  brief  to  do  justice  to  the  topic;  auditing  texts  do  a  better  job.  Chapter  21  does  provide  more 
extensive  coverage  of  the  legal  environment.  1  believe  the  two  chapters  might  have  been  combined. 

The  two  cases  discussed  in  this  part  (Fund  of  Funds  and  MiniScribe)  are  very  different,  although  both  relate 
to  litigation  against  the  auditor. 

The  Fund  of  Funds  case  discusses  the  dilemma  that  Arthur  Andersen  &  Co.  found  itself  in  when  the  same 
office,  and  the  same  partner,  audited  two  clients  (King  Resources  and  Fund  of  Funds)  that  had  transactions 
between  them.  Client  confidentiality  was  not  accepted  as  a  defense.  It  would  have  been  instructive  if  the  author, 
perhaps  in  a  sidebar,  had  referred  to  the  case  of  Waggenheim  (Consolidata  Services,  Inc.)  v.  Alexander  Grant  & 
Co.,  19  Ohio  App  3rd  7,482  N.E.  2d  955  (1982),  where  the  accounting  firm  of  Alexander  Grant  &  Company  lost 
the  case  because  the  court  ruled  the  firm  had  violated  confidentiality. 

Chapter  21,  “Litigation  Reform,”  chronicles  the  significant  concern  that  arose  from  the  settlements  and  from 
the  reported  “6,000  liability  suits  seeking  more  than  $15  billion  in  damages”  (p.  184)  from  U.S.  accounting  firms 
in  1992.  The  author  describes  the  accounting  profession’s  perception  of  the  causes  of  the  problem;  joint  and 
several  liability  and  the  inability  of  a  successful  defendant  to  collect  costs,  and  extreme  punitive  damages  that 
were  being  awarded. 

The  Private  Securities  Litigation  Reform  Act  of  1995  “established  proportionate  liability  for  defendants  who 
did  not  knowingly  engage  in  securities  fraud”  (p.  187).  In  the  mid-1990s,  state  legislation  was  amended  to  permit 
CPA  firms  to  practice  as  limited-liability  partnerships. 


Part  VI  Beginning  of  the  End 

Chapter  22,  “Auditor  Independence,”  provides  a  thorough  discussion  of  the  topic,  beginning  with  a  historical 
perspective.  The  issues  relating  to  independence  are  discussed:  auditors  are  paid  by  their  clients,  accounting  firm 
staff  being  hired  by  clients,  nonaudit  services,  and  joint  ventures  with  clients.  The  author  discusses  the  creation  of 
the  Independence  Standards  Board  created  by  the  American  Institute  of  Certified  Public  Accountants.  One  of  the 
messages  that  the  chapter  conveys  is  that  the  CPA  profession  and  especially  the  largest  firms  fought  any  attempt 
to  regulate  them  and  constrain  their  activities. 

Waste  Management  and  Sunbeam  are  discussed  in  this  part. 

Chapter  25,  “End  of  the  Millennium,”  continues  the  discussion  of  independence  that  began  in  Chapter  22. 
There  is  a  lengthy  discussion  of  the  SEC’s  proposed  and  final  rules  on  independence  issued  in  2000.  The  author 
states  that  the  SEC  mandated  fee  disclosure,  which  resulted  in  a  number  of  the  larger  corporations  restricting  the 
non-audit  services  purchased  from  their  auditors.  The  chapter  continues  with  a  discussion  of  the  SEC’s  Staff 
Accounting  Bulletin  No.  99  and  the  requirement  that  auditors  also  consider  qualitative  materiality. 


Part  VII  From  Profession  to  Regulated  Industry 

Chapter  26,  “Professionalism,”  traces  the  evolution  of  public  accounting  from  early  in  the  20th  century,  when 
it  v/as  truly  a  profession,  through  the  gradual  erosion  of  its  standards  and  business  model  into  a  full-service 
business.  The  author  uses  the  history  of  Arthur  Andersen  to  demonstrate  that  evolution. 

The  two  frauds  that  probably  had  the  most  impact  on  the  accounting  profession,  Enron  and  WorldCom,  are 
the  focus  of  this  part. 

Chapter  29,  “The  Perfect  Storm,”  discusses  the  fall  of  Arthur  Andersen  as  a  precursor  for  the  development 
and  passing  of  the  Sarbanes-Oxley  Act  of  2002  (SOX).  There  is  a  discussion  of  both  the  Act  and  its  implemen¬ 
tation.  There  is  also  an  examination  of  SAS  No.  99  and  its  expanded  requirements  for  auditors  to  consider  with 
respect  to  possible  fraud.  The  author  recounts  the  criticism  of  SOX’s  Section  404  and  its  costs  and  impact  on 
reporting  firms  in  terms  of  the  cost  in  dollars  and  time.  The  author  states,  “Unfortunately,  nobody  has  yet 
documented  a  discernible  improvement  in  earnings  quality  post-Sarbanes-Oxley”  (p.  278).  I  can  only  conclude 
that  the  book  was  completed  before  Ashbaugh-Skaife  et  al.  (2008,  2009)  were  published.  And  I  assume  that 
completion  of  the  book  predates  the  release  of  Auditing  Standard  No.  5  by  the  Public  Company  Accounting 
Oversight  Board. 

Chapter  30,  “Conclusion,”  summarizes  the  preceding  29  chapters  with  respect  to  the  development  of  auditing 
and  audit  standards,  the  development  of  accounting  standards,  the  changing  role  of  auditor  responsibility  for  fraud, 
and  the  evolution  of  public  accounting  firms  and  regulation,  and  is  a  fitting  conclusion  to  the  book. 
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There  are  books  published  that  include  cases  on  fraud  {Fraud  Casebook:  Lessons  from  the  Bad  Side  of 
Business  [Wells  2007]  and  Contemporary  Auditing:  Real  Issues  &  Cases,  Update  [Knapp  2010]),  but  I  believe  that 
their  objectives  are  different,  and  so  they  do  not  include  the  rich  discussion  of  the  context  and  impact  of  the  frauds 
on  accounting  and  auditing  standards,  regulation,  and  the  law  that  are  included  in  Called  to  Account:  Fourteen 
Financial  Frauds  that  Shaped  the  American  Accounting  Profession. 
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E.  RICHARD  CRISCIONE,  Abraham  J.  (Abe)  Brilojf:  A  Biography,  Studies  in  the  Devel¬ 
opment  of  Accounting  Thought,  Volume  11  (Bingley,  U.K.:  JAI/Emerald  Group,  2009, 
ISBN:  978-1-84855-588-4,  pp.  xxi,  258). 

There  are  not  many  individuals  with  deep  roots  in  the  academic  accounting  profession  whose  biography 
would  have  the  potential  to  be  both  lively  and  have  content  enough  for  a  250+  page  book.  Most  accounting 
academics  are  simply  not  at  the  center  of  events  that  have  changed  the  accounting  profession  or  the  broader 
business  community.  The  vast  majority  of  accounting  academics  with  extensive  interaction  with  the  accounting 
profession  (such  as  those  who  serve  on  accounting  rule-making  bodies)  have  worked  within  the  system  in  a 
relatively  non-dramatic  and  non-confrontational  way;  they  have  chosen  not  to  want  to  get  in  front  of  the  parade  or 
not  to  lead  the  band  to  make  major  change.  Sandy  Burton,  who  was  a  proactive  Chief  Accountant  of  the  Securities 
and  Exchange  Commission  in  the  1970s,  is  perhaps  an  important  exception. 

Abraham  Briloff  is  neither  quiet  nor  lacking  in  his  willingness  to  be  confrontational.  Indeed,  his  critics  have 
often  taken  the  route  of  making  personal  attacks  on  his  willingness  to  be  at  the  center  of  controversy.  His  flowery 
language  has  only  added  to  his  colorful  stature.  It  is  difficult  to  think  of  many  accounting  academics  whose 
professional  life  over  multiple  decades  (especially  1965  to  about  2000)  would  make  engaging  reading.  Criscione’s 
book  (drawn  from  his  Ph.D.  dissertation  at  The  University  of  Mississippi)  is  a  commendable  work  of  bringing 
together  a  vast  amount  of  diverse  material  in  a  very  instructive  and  interesting  way.  While  I  spent  much  time  years 
ago  going  through  Briloff’s  writings,  I  still  found  much  that  was  both  new  and  interesting  in  this  lively  book.  It  is 
well  written  with  a  solid  underpinning  in  a  broad  base  of  field  research. 

The  beginning  of  my  own  interaction  with  Briloff  was  while  lecturing  in  the  downtown  night  program  of  The 
University  of  Chicago.  A  senior  executive  of  a  company  whose  accounting  practices  had  been  criticized  by  Briloff 
in  Barron ’s  challenged  me  to  explain  why  the  stock  price  of  his  company  plummeted  at  the  time  the  article  came 
out.  The  executive  asserted  that  the  article  was  both  “profoundly  wrong  and  offensive”  as  well  as  being  “based 
only  on  a  reading  of  publicly  available  information.”  My  response  during  the  class  did  not  satisfy  the  executive, 
and  he  challenged  me  in  front  of  the  class  to  come  back  next  week  with  a  “better  story.”  In  the  seven  days  before 
the  next  class,  I  feverishly  collected  stock  market  and  other  information  on  as  many  Briloff  articles  as  I  could 
access.  I  developed  exhibits  that  had  patterns  very  similar  to  what  was  subsequently  published  in  a  Journal  of 
Accounting  Research  (Foster  1979)  article.  Nick  Dopuch,  JAR's  very  influential  editor  at  the  time,  told  me  that, 
while  my  “Briloff  and  the  Capital  Market”  paper  was  not  a  “normal  research  piece,”  it  was  a  “fun  read”  and 
correctly  predicted  it  would  get  a  lot  of  attention.  I  suspect  that  I  was  chosen  as  the  reviewer  for  this  book  because 
of  that  1979  paper  and  a  related  paper  in  1987  (Foster  1987).  Those  were  published  a  long  time  ago.  However,  it 
was  a  joy  to  take  the  time  many  years  later  to  read  this  book  and  reflect  on  Abe  Briloff  and  his  writings. 

Criscione’s  choice  of  a  biography  subject  was  an  excellent  one.  First,  Briloff  had  a  long  career  in  which  he 
exposited  some  important  themes  regarding  “actual  and  perceived  independence  of  the  accounting  professional” 
and  the  “attempts  by  management  to  use  accounting  choices  to  present  an  overly  rosy  picture  of  their  company’s 
fundamentals.”  These  are  important  and  broadly  understandable  issues  in  both  academe  and  the  general  business 
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community.  His  themes  were  not  topics  of  interest  only  to  a  narrow  inner  circle  of  academics  who  had  little 
interest  in  whether  their  writings  had  any  relevance  in  the  commercial  world.  Second,  Briloff  wrote  and  spoke  with 
a  passion  and  in  colorful  phrases  that  resulted  in  much  polarization  of  opinion  about  his  ideas  and  him  personally. 
This  adds  to  the  narrative  in  Criscione’s  book  being  very  readable.  Briloff  was  hard-hitting  in  his  criticisms,  and 
the  responses  to  him  likewise  were  far  from  timid.  This  makes  for  fun  reading,  although  at  times  the  tone  (on  both 
sides)  makes  the  reader  cringe.  His  SafeCard  Services  article  in  Barron ’s  led  to  a  company  press  release  that  used 
Briloff’s  legal  blindness  as  the  basis  for  a  low,  cheap  shot  at  him;  “Sadly  for  the  ‘Professor,’  he  is  legally  blind, 
and,  therefore,  not  under  any  obligation  it  seems  to  determine  the  facts”  (quoted  on  page  34  of  book).  Third, 
several  of  the  positions  Briloff  took  with  regard  to  the  accounting  profession — such  as  that  management  consulting 
and  auditing  should  not  be  done  by  the  same  firm — were  subsequently  adopted  by  regulators  and  politicians. 

Briloff  was  able  to  use  several  high-profile  public  platforms  to  propound  his  ideas.  It  is  hard  to  think  of  other 
advocates  of  major  change  in  the  accounting  profession  who  had  such  effective  “bully  pulpits”  to  present  their 
case  on  an  ongoing  basis  over  many  years.  Criscione  describes  how  Briloff  started  writing  for  Barron’s.  Briloff 
wrote  a  response  to  an  article  in  The  Accounting  Review  that  criticized  his  position  on  pooling  accounting,  but  the 
Review  rejected  his  paper.  He  showed  his  “rejected  response”  to  Barron’s,  which  decided  to  run  with  it  under  the 
title  of  “Dirty  Pooling”  (pp.  28-29).  This  was  the  first  of  many  Barron’s  articles  by  Briloff  over  an  extended  time 
period.  Over  the  years,  he  had  the  very  strong  support  of  the  then  editor  of  Barron’s,  Robert  M  Bleiberg.  Multiple 
lawsuits  were  brought  against  both  Briloff  and  Barron ’s,  asserting,  among  other  things,  “libel,”  “defamation,”  and 
“negligence.”  The  second  major  platform  that  especially  irked  many  in  the  inner  circle  of  the  accounting  profes¬ 
sion  was  his  frequent  appearances  at  Washington  Congressional  hearings.  Politicians  who  sought  very  strongly 
worded  and  argued  positions  criticizing  key  aspects  of  the  “cozy  world  of  the  auditing/tax/advisory”  triangle  of 
most  accounting  firms  knew  that  Briloff  would  provide  very  targeted  positions  as  well  as  colorful  “sound  bites.” 
The  fact  that  academic  defenders  of  the  accounting  profession  (such  as  Robert  N.  Anthony  of  Harvard  Business 
School)  were  willing  to  attack  Briloff  at  the  personal  level  as  well  as  for  his  arguments  only  added  to  the  debate 
being  viewed  by  many  as  “high  theatre.”  I  found  Criscione’s  Chapter  6  on  “Mr.  Briloff  Goes  to  Washington”  an 
especially  well-presented  one  in  an  area  that  surprised  me  with  regard  to  how  much  both  politicians  and  Briloff 
were  able  to  effectively  “use”  each  other  to  put  the  accounting  profession  very  much  on  the  backfoot  on  some  key 
debates. 

Briloff  himself  was  an  important  input  into  Criscione’s  biography,  and  in  that  sense  it  is  an  authorized  one. 
However,  many  others  were  interviewed,  and  Criscione  showcases  the  arguments  of  Briloff’s  critics  as  well  his 
supporters.  Chapter  7  covers  Briloff’s  appearance  before  the  AICPA  Ethics  Division  Trial  Board.  To  put  it  mildly, 
Briloff’s  personality  and  his  umbrage  at  having  his  own  integrity  questioned  did  not  assist  him  in  these  hearings. 
I  found  one  of  the  most  interesting  aspects  of  the  book  to  be  Briloff’s  own  response  to  a  question  by  Criscione,  re 
“how  he  now  felt  about  the  investigation  and  subsequent  trial.”  Briloff  answered  that  “he  still  felt  the  charges  were 
absurd  but  he  sympathized  with  the  trial  board’s  position.  He  mused,  ‘Would  I  do  it  all  again?  Would  I  go  looking 
for  an  argument?  No’”  (p.  205). 

Criscione’s  book  provides  much  insight  into  an  important  figure  who,  over  the  1965  to  2000  period  (and 
beyond),  has  been  a  very  consistent  proponent  of  his  view  of  dhe  important  dimensions  of  independence  in  the 
accounting  profession.  Changes  made  by  regulators  and  more  begrudgingly  by  the  accounting  profession  itself 
have  moved  the  industry  structure  sizably  toward  Briloff’s  position.  A  read  of  this  book  provides  fascinating 
insights  into  an  individual  who  both  “pushed  hard”  and  was  “pushed  hard  against”  as  the  accounting  profession 
underwent  some  major  (and  to  many  within  the  profession)  wrenching  changes.  It  is  well  worth  reading  and  will 
make  an  important  contribution  to  our  knowledge  of  the  growth  and  development  of  the  accounting  profession  in 
the  last  40  years. 
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WERNER  DE  BONDT  (editor),  Financial  Accounting  and  Investment  Management  (Chel¬ 
tenham,  Glos,  U.K.:  Edward  Elgar,  2009,  ISBN:  978-1-84844-039-5,  pp.  xl,  609  (vol.  I), 
viii,  526  (vol.  II)). 

In  this  two-volume  set  the  reader  is  provided  with  an  interesting  collection  of  articles  that  relate  to  seven 
topies: 

•  Accountants  and  analysts  as  financial  intermediaries  (Volume  I,  Part  I) 

•  Measuring  corporate  earnings  and  profitability  (Volume  I,  Part  11) 

•  Financial  ratios,  the  risk  of  failure,  and  stock  returns  (Volume  I,  Part  III) 

•  Equity  valuation  (Volume  I,  Part  IV) 

•  Price-eamings  ratios,  market-to-book  ratios,  and  stock  returns  (Volume  II,  Part  I) 

•  Earnings  and  stock  returns  (Volume  II,  Part  II) 

•  Fundamental  analysis  and  stock  returns  (Volume  II,  Part  III) 

In  addition,  Professor  Werner  De  Bondt,  the  editor,  writes  a  set  of  insightful  introductory  comments  in  which 
he  briefly  discusses  each  topic,  motivates  and  describes  the  articles  he  chose,  and  explains  how  the  seven  topics 
are  linked  together.  This  is  especially  helpful  as  it  allows  the  reader  to  see  the  articles,  many  of  which  are  well 
known,  from  Professor  De  Bondt’s  perspective.  In  particular,  an  important  theme  running  through  the  introduction 
is  that  extant  research  provides  considerable  evidence  that  accounting  numbers  are  related  to  equity  prices  and 
equity  returns,  and  that  they  play  a  role  in  the  risk-assessment  process.  However,  the  evidence  also  implies  that,  in 
eertain  circumstances,  capital  market  participants  do  not  fully  understand  the  implications  of  past  and  current 
accounting  numbers  for  expected  future  payoffs  and  outcomes.  Hence,  many  unanswered  questions  remain  re¬ 
garding  the  investment-management  role  of  financial  accounting  numbers. 

The  collection  has  many  fine  qualities,  two  of  which  stand  out.  First,  when  read  together,  the  articles  that 
relate  to  a  particular  topic  nicely  illustrate  how  our  understanding  of  that  topic  has  evolved,  and  they  help  the 
reader  identify  key  research  questions  that  remain  unanswered.  For  example,  in  Volume  I,  Part  I,  which  relates  to 
the  financial  intermediation  role  of  accounting  and  analysts’  reports.  Professor  De  Bondt  begins  with  a  1964  article 
by  the  accountant  Howard  C.  Greer,  who  describes  many  of  the  complaints  that  individual  investors  had  (and 
continue  to  have)  with  the  use  and  interpretation  of  accounting  numbers  (Greer  1964).  Next,  Professor  De  Bondt 
includes  the  seminal  1968  article  by  Ball  and  Brown,  which  demonstrates  that,  notwithstanding  the  complaints  of 
individual  investors,  accounting  earnings  do  have  information  eontent  (Ball  and  Brown  1968).  An  immediate 
implication  of  this  result  is  that  there  is  a  demand  for  earnings  forecasts  and,  consequently,  the  analysts  who  make 
them.  This  clearly  motivates  the  third  article  chosen  by  Professor  De  Bondt,  which  is  by  Asquith  et  al.  (2005). 
These  authors  show  that  both  the  textual  content  as  well  as  numerical  earnings  forecasts  and  price  recommenda¬ 
tions  contained  in  a  sample  of  Institutional  Investor  All-American  analyst  reports  have  information  content. 
Finally,  Professor  De  Bondt  finishes  Volume  I,  Part  I  with  an  article  by  de  Franco  et  al.  (2007),  who  show  that 
some  analysts  provide  biased  information  and  that  small  investors  do  not  always  fully  adjust  for  the  bias.  Hence, 
the  reader  finishes  this  part  of  the  collection  realizing  that  analysts’  reports  are  informative  on  average  but  that  we 
do  not  fully  understand  the  factors  that  determine  their  quality  or  the  manner  in  which  capital-market  participants 
use  them  when  making  investment  decisions. 

Second,  even  though  each  of  the  seven  topics  covered  represent  well-developed  sub-fields  of  capital  markets 
research.  Professor  De  Bondt  manages  to  link  them  together  in  a  coherent  fashion.  He  does  this  in  two  ways.  First, 
he  provides  a  concise  yet  illuminating  description  of  the  links  between  the  topics  in  his  introduction.  Second,  he 
chooses  a  set  of  articles  that  are  very  complementary.  For  example,  each  of  the  articles  in  Volume  II,  Part  I,  which 
relates  to  different  valuation  multiples  (e.g.,  the  price-to-earnings  ratio)  are  clearly  related  to  each  of  the  six 
remaining  topics.  This  is  useful  as  it  helps  readers  understand  better  the  relation  between  various  sub-fields  and  to 
draw  connections  between  different  bodies  of  evidence. 

The  collection  also  has  some  shortcomings,  which  mostly  relate  to  the  choice  of  articles  selected  and,  thus, 
are  unavoidable,  given  that  there  is  a  vast  literature  on  the  seven  topics  covered.  Two  of  these  shortcomings  bear 
mentioning.  First,  all  of  the  recent  analytical  work  related  to  residual  income  valuation  (e.g.,  Ohlson  [1995])  and 
abnormal  earnings  growth  valuation  (e.g.,  Ohlson  and  Juettner-Nauroth  [2005])  is  omitted.  Given  the  impact  of 
this  work  on  research  and  practice,  the  failure  to  include  at  least  one  of  the  key  articles  in  this  area  is  a  limitation. 
Second,  none  of  the  recent  articles  regarding  accounting-based  estimates  of  the  expected  rate  of  return  on  equity 
capital  are  included.  These  estimates  are  becoming  increasingly  popular;  hence,  some  discussion  of  why  they  are 
used  and  their  properties  seems  warranted. 
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The  collection  will  have  value  to  a  number  of  audiences.  In  my  opinion,  the  individuals  who  will  benefit  the 
most  are  those  with  exposure  to  the  fundamentals  of  finance  and  accounting  (i.e.,  dividend  policy  and  capital 
structure  irrelevance,  market  efficiency,  asset  pricing  models,  etc.)  but  are  not  “black  belts”  in  accounting  research. 
Hence,  Ph.D.  students  in  accounting  and  finance  who  have  progressed  beyond  their  first  year;  faculty  in  account¬ 
ing  or  finance  who  want  a  new  perspective,  and  practitioners  with  a  strong  accounting  and  finance  background  will 
likely  appreciate  the  two-volume  set  the  most.  However,  because  Professor  De  Bondt  wisely  chooses  not  to  try  to 
cover  all  seven  topics  from  the  ground  up,  individuals  without  some  exposure  to  basic  accounting  and  finance  may 
find  the  two-volume  set  less  useful. 
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KEN  McPHAIL  and  DIANE  WALTERS,  Accounting  and  Business  Ethics:  An  Introduction 
(London,  U.K.:  Routledge,  2009,  ISBN:  13:978-0-415-36235-1,  pp.  xii,  225). 

This  book  is  divided  into  two  main  parts,  and  divided  further  into  nine  chapters.  Part  1  is  “How  to  think 
ethically  about  accounting.”  It  contains  an  introductory  chapter.  Chapter  2  on  the  psychology  of  moral  judgment; 
Chapter  3  presenting  a  variety  of  ethical  theories;  Chapter  4  on  the  ethical  foundations  of  accounting  in  terms  of 
political  philosophy;  and  Chapter  5  on  the  post-  and  new-modem  approaches  to  accounting  ethics.  Part  II  is  “The 
ethics  of  accounting  practice.”  It  consists  of  an  analysis  (Chapter  6)  of  the  function  of  accounting  in  the  context 
of  “free”  markets  and  of  the  morality  of  markets;  Chapter  7  on  accounting  as  a  professional  occupation;  Chapter 
8  on  the  ethics  of  international  accounting;  and  a  concluding  chapter  on  ethics,  intellectual  capital,  and  accounting 
reporting.  In  addition  to  the  text  itself,  each  chapter  contains  a  list  of  references  and  an  extensive  list  of  additional 
resources,  including  books,  articles,  podcasts,  blogs,  websites,  and  movies.  A  rough  estimate  is  that  one-third  of 
the  book  is  devoted  to  these  reference  materials.  Since  the  actual  text  of  the  book  is  quite  brief,  roughly  140  pages 
(including  a  large  number  of  illustrative  boxes),  the  book  is  a  short  introduction  to  the  issues  and  relevant 
literature,  and  is  best  viewed  as  a  reference  work  rather  than  as  a  textbook  per  se. 

Ken  McPhail  is  professor  of  social  and  ethical  accounting  at  the  University  of  Glasgow,  and  Diane  Walters 
is  a  lecturer  at  Heriot-Watt  University,  in  Edinburgh. 

This  book  is  intended  as  a  textbook  for  a  course  on  accounting  ethics.  As  the  chapter  topics  indicate,  the 
primary  purpose  of  the  book  is  to  introduce  accounting  students  to  a  wide  range  of  issues  and  theories  relating  to 
the  ethical  status  of  the  accounting  profession.  The  discussion  is  at  a  “macro”  level,  for  the  most  part;  thus,  it 
addresses  issues  relating  to  codes  of  ethics  (e.g.,  why  we  have  them,  where  they  came  from,  and  the  extent  to 
which  an  effective  international  code  of  ethics  might  be  feasible),  but  does  not  go  into  detailed  discussions  of  the 
contents  of  any  specific  code  or  of  any  specific  ethical  issue  (such  as  auditor  independence  or  conflict  of  interest). 
Possible  uses  of  this  book  would  be  as  a  basis  for  a  course  in  accounting  ethics  or  as  a  supplement  to  such  a 
course.  In  either  case,  the  end-of-chapter  lists  of  resources  should  provide  much  of  this. 
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This  review  will  proceed  from  the  assumption  that  most  accounting  instructors  in  many  countries  will  be 
relatively  unfamiliar  with  most  of  the  issues  and  theories  covered  in  this  book,  but  wish  to  learn  about  them  for 
teaching  or  scholarly  purposes.  This  suggests  that  the  book  may  be  useful  to  academics  who  wish  either  to  teach 
some  of  the  issues  contained  in  this  book,  or  to  leam  about  them  for  their  own  scholarly  development. 

From  the  point  of  view  of  the  mainstream  of  academic  accounting  thought  (which  in  much  of  the  world  is 
dominated  by  economics,  financial  economics,  and  psychology),  the  potential  value  of  this  book  is  to  present  a 
number  of  ethical  issues  and  theories  that  have  been  given  little  attention  in  North  America  and  other  places.  Thus, 
although  a  number  of  more  familiar  theories  are  presented  in  Chapters  2,  3,  and  4,  the  heart  of  Part  I  is  the 
presentation  of  the  views  of  a  variety  of  lesser-known  (from  the  standpoint  of  mainstream  academics)  social 
theorists,  including  Jean-Jacques  Rousseau,  Jurgen  Habermas,  Emmanuel  Levinas,  Jacques  Derrida,  and  Michel 
Foucault.  Following  this.  Part  II  presents  a  number  of  fundamental  issues  that,  according  to  the  authors,  face  the 
accounting  profession,  but  which  have  been  ignored  by  the  mainstream.  Chapter  2  on  moral  psychology  seems  out 
of  place,  in  part  because  it  focuses  on  the  ability  of  accountants  (and  accounting  students)  to  deal  with  day-to-day 
ethical  issues,  and  does  not  play  an  important  role  in  the  rest  of  the  book. 

Unfortunately,  the  brevity  of  the  text  limits  the  book’s  usefulness  as  a  textbook  for  most  instructors,  at  least 
in  the  North  American  context,  because  students  will  have  given  little  or  no  thought  to  either  the  theories  or  the 
issues  discussed  in  the  book.  It  also  limits  its  usefulness  for  accounting  academics,  many  of  whom  will  also  be 
neophytes.  The  problem  is  repeated  many  times  in  the  book.  For  example,  Habermas’  theory  is  described  in  three 
pages  (Chapter  5),  Foucault’s  theory  in  two  pages  (Chapter  5),  a  little  over  three  pages  are  devoted  to  Levinas 
(Chapter  4),  Kant  is  covered  in  one  page  (Chapter  3),  and  only  a  half-page  treats  Rawls  (Chapter  3).  (Less  space 
is  given  to  Rawls  than  to  a  newspaper  article  by  Prem  Sikka  [Chapter  4].)  An  important  pair  of  ethical  concepts 
generally,  and  for  accounting  in  particular,  are  rights  and  duties.  Rights  (but  not  duties)  are  dealt  with  at  somewhat 
greater  length  (Chapter  4),  but  even  here  the  agenda  of  Part  II  focuses  attention  on  human  rights,  rather  than  on, 
or  in  addition  to,  rights  (e.g.,  rights  to  privacy  or  property)  that  are  relevant  to  the  issues  practicing  accountants 
face  in  their  day-to-day  work. 

The  fundamental  problem  here  is  that  neophytes  wanting  an  introduction  to  this  literature  are  likely  to  gain 
little  from  these  discussions  of  complex  and  subtle  theories.  Academics  who  are  somewhat  familiar  with  the  work 
of  Anglo-Saxon  (e.g.,  Locke,  Hobbes,  Kant,  Mill,  and  Rawls)  or  of  Continental  philosophers  (e.g.,  Levinas, 
Derrida,  and  Foucault)  may  be  able  to  fill  in  the  blanks  themselves  by  searching  out  other  sources,  but  will  not 
gain  much  here.  Without  prior  understanding,  the  accounts  given  here  will  be  of  little  help.  For  example,  the 
authors  quote  from  a  secondary  source  about  the  essence  of  Levinas’  view:  “Morality  begins  in  the  face  to  face,” 
and,  “morality  is  the  encounter  with  the  Other  as  face”  (p.  75).  “What  on  earth  does  that  mean?”  the  authors  ask; 
unfortunately,  the  explanation  does  not  clarify  it  in  a  way  that  an  “outsider”  to  this  literature  will  understand. 

More  specifically,  the  text  in  Chapter  3,  devoted  to  the  major  ethical  theories,  is  1 1  pages  long,  including 
about  four  pages  of  boxes  on  such  topics  as  A1  Gore,  Dirty  Harry,  circle  time,  thalidomide,  and  Islamic  accounting. 
Even  having  radically  trimmed  the  types  and  variations  of  theories  discussed,  it  is  impossible  to  say  much  that  is 
intelligible  in  such  a  brief  space.  And  even  then,  the  half-page  devoted  to  Rawls  is  not  only  short  but  fundamen¬ 
tally  mistaken  in  suggesting  that  a  practicing  accountant  should  deal  with  a  material  error  by  employing  the 
original  position  argument.  For  one  thing,  the  authors  do  not  mention  Rawls’s  (1971)  famous  principles  of  justice. 
Even  though  they  refer  to  Rawls’s  famous  1955  paper  on  rules  (which  is  worth  reading)  in  another  connection, 
they  do  not  recognize  the  importance  of  rules,  which  would  be  an  important  feature  of  his  approach  to  the  example 
of  an  erroneous  financial  statement  (Rawls  1955).  (The  authors  do  discuss  “rules-based”  versus  “principle-based” 
standards,  but  not  in  this  context.) 

In  addition,  these  incomplete  discussions  are  often  left  for  the  reader  to  fill  in.  For  example,  the  discussion  of 
“knowledge  capitalism”  in  Chapter  9  suggests  that  it  provides  the  possibility  for  “empowerment  and  a  strength¬ 
ening  of  the  ‘democratic  public  sphere’”  (p.  185).  The  authors  then  suggest  that  the  reader  “[t]ry  to  link  this  kind 
of  perspective  back  to  our  discussion  of  Jurgen  Habermas  in  Chapter  5.  Think  about  how  and  whether  new 
technological  forms  contribute  towards  the  kind  of  communicative  processes  that  Habermas  promotes.”  This 
would  be  very  difficult  to  do,  based  only  on  the  three-page  discussion  of  Habermas  in  Chapter  5.  Rather  than 
leaving  this  to  the  reader  to  figure  out,  the  book  should  have  explained  this  linkage,  with  the  attendant  possibility 
of  increasing  readers’  understanding  of  Habermas’  theory. 

The  theories  presented  in  Part  I  may  be  viewed  as  the  foundations  of  Part  II,  which  directly  addresses  a 
number  of  fundamental  ethical  issues  relating  to  the  practice  of  accounting.  For  this  reason,  readers  may  find  Part 
II  more  interesting,  from  the  point  of  view  of  understanding  the  ethical  foundations  of  accounting. 

Chapter  6  discusses  the  supposed  ethical  neutrality  of  accounting.  The  basic  idea  is  that  accounting  today,  for 
the  most  part,  occurs  in  the  context  of  market-based  capitalist  economic  systems,  and  that  the  ethical  justifiability 
of  accounting  presumes  the  ethical  status  of  such  economic  systems.  For  example,  accounting  is  practiced  in  the 
context  of  a  specific  configuration  of  rights  (property  rights)  that  give  priority  to  capital  over  labor,  and  so  the 
defensibility  of  this  type  of  accounting  depends  on  the  defensibility  of  these  rights.  In  this  sense,  accountants 
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would  need  to  take  responsibility  for  their  role  in  such  an  economic  system.  This  is  unexceptionable.  Because 
accounting  (whatever  else  it  is)  is  supposed  to  be  a  practical  occupation,  it  is  successful  to  the  extent  that  it  serves 
the  purposes  set  out  for  it.  The  important  observation  is  that  accounting  is  not  “value-neutral.”  The  authors’  main 
concern  is  an  important  one:  to  consider  the  ethical  status  of  the  activity  and  institution  to  which  accounting 
contributes. 

The  main  problem  with  this  chapter  is  that  it  finds  the  ethical  foundations  of  capitalism  in  a  form  of 
utilitarianism.  This  is  surprising,  because  it  seems  clear  that  many  ethical  theories  (including  utilitarianism)  can 
provide  the  ethical  underpinnings  of  capitalism.  An  obvious  example  is  libertarianism.  Indeed,  a  major  problem 
with  this  chapter  is  that,  while  it  recognizes  that  the  mainstream  of  accounting  thought  is  capitalist  in  some  sense, 
it  does  not  engage  the  advocates  of  this  view,  who  would  agree  about  the  centrality  of  market  economies,  and 
would  emphasize  (and  defend)  the  use  of  accounting  in  securities  markets. 

Chapter  7  examines  accounting  as  a  profession,  focusing  on  the  place  of  the  profession  in  society,  rather  than 
on  specific  aspects  of  the  practice  of  accounting.  Of  special  interest  is  the  important  question  whether  accounting, 
as  a  professionalized  occupation,  acts  in  the  public  interest  (as  is  often  claimed)  or  in  the  interest  of  private  groups. 
(Whether  accounting  is  a  profession  at  all  is  addressed  only  briefly.)  For  example,  do  codes  of  ethical  conduct 
serve  the  interests  of  the  profession  or  of  society?  As  with  Chapter  6,  it  would  have  been  good  for  the  authors  to 
have  discussed  the  more  mainstream  literature  on  these  topics,  because  many  “mainstream”  academics  are  just  as 
suspicious  as  the  authors  appear  to  be  of  the  public-interest  claims  of  the  profession.  It  would  have  been  inter¬ 
esting  to  see  just  what  the  differences  are  between  the  authors’  and  the  mainstream  points  of  view. 

Chapter  8  addresses  international  issues  in  the  practice  of  accounting.  The  authors  correctly  point  out  that  the 
desirability  and  feasibility  of  trying  to  enforce  uniform  standards  of  practice  (whether  GAAP,  GAAS,  or  ethical 
standards)  present  major  difficulties.  Issues  include  respect  for  cultural  tradition  and  the  possibility  that  a  particular 
set  of  economic  and  ethical  practices  may  be  imposed  on  cultures  that  have  developed  different  traditions.  As  in 
Chapter  6,  it  is  observed  that  international  harmonization  is  not  ethically  neutral,  and  that  accountants  implicitly 
act  in  the  interest  of  whatever  economic  system  or  ideology  is  dominant.  This  is  fairly  obvious,  and  indeed  many 
accountants  and  academics  would  defend  the  current  global  system  and  the  role  of  accountants  in  it.  The  contri¬ 
bution  of  the  authors  is  to  claim  that  this  is  contestable  and  not  as  obvious  as  some  might  think. 

The  authors  appear  to  take  a  strong  view  on  this  issue,  criticizing  both  the  impact  of  globalization  on  less 
developed  economies  (the  activities  of  the  World  Bank  attract  special  attention)  and  the  activities  of  accountants 
in  helping  to  advance  a  globalization  agenda.  In  particular,  they  claim  that  accounting  may  have  a  role  in  the 
increase  in  international  terrorism  because  of  the  subjugation  of  “indigenous”  people.  For  example,  they  repeat  an 
argument  that  accountants  “broadly  defined”  in  Canada  have  been  responsible  for  First  Nations  children  being  sent 
to  boarding  schools,  for  the  murder  of  First  Nations  people  in  order  to  take  their  land  (because  accountants  paid 
the  bounty  to  the  killers),  and  the  giving  of  gifts  to  First  Nations  people.  Although  the  idea  of  role  morality  (i.e., 
the  morality  of  people  acting  in  their  role  as  accountant)  is  discussed  in  passing  elsewhere,  the  analysis  here  does 
not  recognize  that  the  ethical  status  of  taking  children  from  their  parents  or  killing  people  for  their  land  may  be 
wrong,  and  that  an  accountant  who  takes  part  in  this  is  ethically  culpable  (as  a  person);  but  this  does  not  imply  that 
the  wrongness  of  the  act  is  due  to  the  precise  occupational  role  (i.e.,  the  accountant  as  accountant)  that  a  person 
plays. 

Chapter  9,  like  Chapter  2,  is  not  a  good  fit  with  the  rest  of  the  book,  as  I  interpret  the  authors’  mission.  It  is 
not  clear  what  intellectual  capital,  and  accounting  for  it,  has  to  do  with  the  ethics  of  accounting.  However,  the 
authors  consider  whether  “ethical  knowledge”  should  be  considered  an  asset  of  a  company.  They  describe  one 
research  study  that  suggests  that  behaving  ethically  is  more  profitable;  but  they  ignore  a  very  large  literature,  the 
results  of  which  are  mixed.  In  any  case,  many  ethicists  question  the  legitimacy  of  such  a  linkage,  because 
emphasizing  it  suggests  that  the  reason  to  be  ethical  is  to  produce  profits. 

In  conclusion,  this  book  has  some  attractive  features.  First,  it  may  serve  as  an  introduction  to  a  number  of 
fundamental  ethical  issues  of  the  accounting  profession.  This  may  be  helpful  to  academics,  in  relation  either  to 
teaching  students  or  to  advance  their  own  scholarly  understanding  of  the  profession.  However,  the  text  is  brief  and 
sketchy,  sometimes  to  the  point  of  being  uninformative  (and  mistaken).  This  means  that  a  neophyte  will  get,  at 
best,  a  very  rough  outline  of  the  issues  and  the  literature,  and  someone  who  is  familiar  with  the  literature  will  not 
obtain  additional  perspective  or  understanding.  In  addition,  the  early  chapters  on  ethics  and  social  and  political 
philosophy  leave  out  a  number  of  important  theories  that  are  relevant  to  the  ethics  of  the  accounting  profession. 

The  extensive  reference  materials  at  the  ends  of  the  chapters  are  a  strength  of  the  book,  in  large  part  because 
much  of  the  material  presented  in  this  book  will  be  unfamiliar  to  most  academics  in  many  countries.  The 
teaching-oriented  resources  (blogs,  podcasts,  films,  etc.,  as  well  as  the  books  and  articles)  may  be  particularly 
valuable.  Adoption  of  this  book  for  a  course  would  require  heavy  supplementation,  possibly  with  the  assignment 
of  some  of  these  resources.  However,  the  resources  are  incomplete,  because  they  are  a  function  of  the  choices 
made  by  the  authors  about  what  to  include  in  the  book.  In  view  of  the  many  omissions,  this  is  an  important 
limitation  of  its  value  as  a  reference  work. 
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A  few  quibbles:  (1)  the  authors  sometimes  write  in  the  first  person.  This  is  OK,  except  that  sometimes  it  is 
singular  and  sometimes  plural.  And  (2)  the  book  contains  a  large  number  of  text  boxes  (many  of  them  up  to  a  page 
long)  that  discuss  various  issues  or  provide  background  information.  Unfortunately,  many  of  them  add  little  or 
nothing  to  gaining  an  understanding  of  the  issue  or  theory  in  question.  In  fact,  I  found  many  of  them  to  be 
unhelpful  and  even  distracting.  Indeed,  it  is  unclear  what  the  point  is  of  many  of  the  boxes.  In  addition,  the 
material  contained  in  the  boxes  is  almost  always  not  given  a  reference  source.  While  it  may  be  possible  to  figure 
out  from  the  end-of-chapter  references  where  the  information  came  from,  it  should  not  be  necessary.  Less  of  a 
quibble  is  that  their  number  and  length  limit  the  number  of  pages  available  for  discursive  text. 
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OTHMAR  W.  WINKLER,  Interpreting  Economic  and  Social  Data:  A  Foundation  of  De¬ 
scriptive  Statistics  (Berlin,  Germany:  Springer,  2009,  ISBN  978-3-540-68721-4,  pp.  xvi, 
265). 

Opening  this  book  by  Othmar  Winkler  is  like  splashing  oneself  with  cold  water  at  5:30  in  the  morning.  It’s 
a  wakeup  call!  The  author  lays  out  his  “call  to  arms”  in  the  preface.  In  our  quest  to  understand  or  “make  sense  of 
socio-economic  data”  (p.  vi),  we  have  come  to  rely  too  heavily  on  statistical  inference  (F,  t.  Chi-square)  and  on 
assumed  symmetry  and  continuity.  If  we  seek  insights,  we  are  enjoined  to  adopt  instead  the  descriptive  tools  of 
statistics  and  apply  them  to  aggregate  observations  and  their  categorizations.  To  understand  socio-economic 
phenomena,  it  is  essential  to  recognize  that  the  contributing  processes  are  purposeful  and  not  random. 

Winkler  appropriately  observes  that  inferential  statistical  methods  rose  to  prominence  in  the  natural  sciences. 
But  these  methods  are  typically  ill-suited  when  adopted  to  grapple  with  socio-economic  data.  He  takes  pains  to 
point  out  the  limitations  when  directly  transporting  inferential  statistical  methodology  to  economic  and  social  data. 
Among  the  more  readily  identified  differences  is  the  fact  that  social  science  data  occur  over  time  and  location  and 
are  subject-specific.  Insight  into  the  data  generation  process  commands  that  the  observer  consider  the  data’s 
uniqueness  to  each  of  these  factors. 

Games  of  chance  are  offered  as  an  ideal  setting  for  the  probability  calculus.  The  rules  of  the  game  are  known 
and  cannot  change  during  the  game.  The  trials  are  repeated  identically,  and  outcomes  can  be  identified  and 
predicted  over  the  long  run.  Economic  games,  in  business  for  example,  are  very  different  animals.  Essentially, 
none  can  be  accurately  described  as  being  repeated  under  stable  conditions  with  known  outcomes  and  subject  to 
long-term  predictability.  Over  time,  conditions  change,  cooperation  may  develop,  and  new  options  typically 
emerge.  Outcomes  are  multidimensional.  Perturbations  and  changes  over  time  present  serious  difficulties  for 
aggregation  into  a  single  measure  of  success.  Indeed,  success  is  likely  to  be  a  vector,  not  a  datum.  Winkler  writes, 
“It  simply  goes  too  far  to  view  economic  processes  as  random  experiments”  (p.  3).  One  need  only  cast  an  eye  as 
far  as  the  tightly  controlled  economies  of  the  former  European  Soviet-bloc  countries,  says  Winkler  (p.  187),  to 
observe  the  lack  of  randomness.  Could  we  deny  government  influence  even  in  mai'ket-based  economies?  He 
observes,  “Rather  [chance]  resides  in  the  observer’s  difficulty,  inability  or  lack  of  interest  to  understand  the 
deterministic  causal  system  that  actually  is  at  work”  (p.  189). 

Of  equal  if  not  greater  importance  is  the  fact  that  randomness  plays  a  very  different  role  across  disciplines. 
In  games  of  chance  and  in  the  sciences,  randomness  is  captured  by  the  probability  calculus.  It  underlies  experi¬ 
mental  design.  In  the  socio-economic  world,  the  remaining  “random”  observations  after  appropriate  aggregation 
and  classification  often  are  what  are  of  most  interest,  because  the  objective  is  to  understand  the  process  generating 
the  observations.  In  addition,  and  again  as  the  author  points  out,  there  are  “[vjery  few  statistical  observations  ... 
made  directly  by  an  objective  outside  observer”  (p.  5)  essential  to  the  randomness  assumption. 
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Accountants  will  discover  in  Chapter  1 1  that  many  of  the  statistical  tools  favored  by  the  author  have  been  and 
continue  to  be  used  in  accounting.  He  points  out  that  the  collection,  classification,  and  aggregation  (and  de¬ 
aggregation)  of  data  that  are  involved  in  preparing  the  balance  sheet  and  income  statement  are  simply  standard 
statistical  estimation  processes.  The  data  can  then  be  analyzed  horizontally  or  vertically,  as  is  typically  taught  to 
students.  The  consideration  of  the  time-series  nature  of  the  properly  categorized  measurements  leaps  out  at  the 
reader.  But  the  complexities  of  the  traditional  statistical  analysis  of  trend,  seasonal,  and  randomness  are  avoided. 
Inventories,  sales,  and  income  all  yield  to  simple  but  different  descriptive  approaches.  The  observed  extensive 
reliance  on  ratios  in  both  accounting  and  financial  analysis  is  a  natural  application  of  the  descriptive  statistical 
techniques  extolled  by  the  author.  In  addition,  methods  for  improved  ratio  analysis  are  noted.  The  interested  reader 
is  left  with  the  discovery  of  additional  connections  as  well  as  a  brief  digression  into  auditing.  With  the  exception 
of  budgeting  and  record  keeping,  management  accounting  is  ignored,  a  limitation  for  accountants  but  of  minor 
importance  to  the  text. 

The  author  exhorts  his  readers  to  use  descriptive  statistical  tools  to  understand,  evaluate,  and  interpret  social 
and  economic  activity  (e.g.,  business  formation,  entrepreneurship,  pricing,  trade,  production,  poverty,  and  so  on). 
He  would  have  one’s  tool  kit  packed  with  the  proper  use  of  ratios,  appropriate  graphing  and  tabling  presentation 
techniques,  and  frequency  tabulations  (including  their  use  to  disclose  easily  understood  measures  of  dispersion). 
He  urges  us  to  aggregate  and  classify  data  as  a  means  of  understanding  socio-economic  phenomena,  thereby 
allowing  for  the  less  tractable  and  measurable  aspects  of  reality  to  be  captured  and  interpreted.  And,  if  I  may,  we 
should  all  adopt  a  critical  mindset  when  evaluating  the  application  of  inferential  statistics  to  the  socio-economic 
literature  which  we  read  or  disseminate.  I  believe  the  author  would  approve  of  the  insights  that  Professors  Steven 
D.  Levitt  and  Stephen  J.  Dubner  bring  to  investigating  interesting  topics  in  the  social  sciences,  which  have  been 
widely  popularized  in  their  books,  Freakonomics  (2005)  and  SuperFreakonomics  (2009). 

The  book  contains  many  examples  typically  gathered  from  a  career  of  studying,  researching,  and  consulting 
that  describe  the  problems  of  forcing  an  analysis  into  the  statistical  inference  mold.  One  example  that  appears  in 
Chapter  9  and  several  other  chapters,  including  the  author’s  “Afterthoughts”  (pp.  229-232),  describes  a  study 
using  regression  analysis  for  a  trial  centered  on  male/female  pay  differentials.  When  the  results  appeared  to  be 
counter-intuitive,  the  explanation  resided  in  the  socio-economic  nature  of  the  data.  The  interpretation  of  the  slope 
coefficient  had  to  be  interpreted  in  the  light  of  the  changes  in  the  data  over  time  rather  than  as  simply  the  average 
change  in  X  for  a  unit  of  change  in  Y.  The  chapter  examines  many  of  the  limitations  of  regression  analysis  as 
typically  applied  to  socio-economic  data. 

The  author  suggests  that  our  reliance  on  the  inferential  aspects  of  mathematical  statistics  comes  from  its 
elegance  and  our  own  desire  to  be  able  to  run  with  the  elite.  In  our  research,  there  seems  to  be  a  quest  for 
mathematical  sophistication,  as  exemplified  by  advanced  modeling  and  mathematical  testing  to  satisfy  reviewers. 
We  as  instructors  find  statistical  inference  more  exciting  to  teach  and  more  likely  to  reap  the  admiration  of  our 
colleagues.  The  author  also  points  to  the  academic  hiring  function  as  seeking  those  with  expertise  and  interests 
devoted  to  statistical  inference  where  the  random  sample  is  king. 

The  guilt  lies  heavily  with  those  of  us  who  have  authored  or  teach  from  the  standard  statistical  texts.  Too 
often  we  have  not  only  undervalued  the  direct  relevance  of  descriptive  methods,  but  also  have  failed  to  identify, 
let  alone  appreciate,  the  inherent  limitations  underlying  the  applicability  of  assuming  randomness  and  the  associ¬ 
ated  applicability  of  statistical  inference. 

This  book  does  not  fit  easily  into  most  schools’  curriculum.’ However,  this  result  is  to  a  large  extent  a  problem 
with  those  schools’  curriculum  rather  than  with  the  book’s  subject  matter.  It  is  a  short  volume  consisting  of  12 
chapters  and  less  than  150  pages  of  text.  The  text  is  augmented  by  enough  text  notes— often  containing  their  own 
rewarding  gems — to  recall  the  Principles  of  Economics  text  written  by  Alfred  Marshall  (1890)  that  many  of  us 
were  required  to  digest  in  our  economics  classes.  (By  the  way,  Marshall  respected  mathematics  but  did  not  want 
it  to  dominate  economics.)  Many  examples,  culled  from  Winkler’s  extensive  writings  and  experience,  are 
sprinkled  through  the  text  and  the  aforementioned  notes.  Most  of  these  are  from  another  era,  which  makes  them 
no  less  relevant  today.  The  writing  is  clear,  if  wordy.  (The  author  feels  repetition  is  useful  for  those  who  may  wish 
to  use  portions  of  the  volume  without  missing  the  essence  of  the  argument.) 

Indeed,  statistics  classes  might  be  more  exciting  if  students  were  given  questions  or  issues  to  investigate, 
similar  to  the  pay  example  explored  by  the  author  and  described  in  some  detail  in  Chapter  9.  Discovering  methods 
that  yield  insights  and  that  reveal  the  limitations  of  investigative  techniques  might  be  rewarding  and  actually  fun. 
Accounting  professors:  take  note. 

If  your  school  or  department  has  a  seminar  that  dwells  on  the  philosophy  of  science,  treats  research  tech¬ 
niques  in  the  social  sciences,  or  just  reviews  research  studies  using  statistics,  then  a  reading  of  portions  (or  all)  of 
this  book  would  be  both  appropriate  and  desirable.  If  not,  I  would  urge  the  Dean  (or  Department  Chair)  to  provide 
each  professor,  instructor,  and  doctoral  student,  regardless  of  their  field  of  study,  a  copy  at  the  holidays  and  then 
hold  an  exam  the  day  before  classes  begin. 
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EDITORIAL  POLICY  AND  STYLE  INFORMATION 

EDITORIAL  POLICY 

According  to  the  policies  set  by  the  Publications  Committee  (which  were  endorsed  by  the  Executive 
Committee  and  were  published  in  the  Accounting  Education  News,  June  1987),  The  Accounting  Review 
“should  be  viewed  as  the  premier  journal  for  publishing  articles  reporting  the  results  of  accounting  research 
and  explaining  and  illustrating  related  research  methodology.  The  scope  of  acceptable  articles  should  embrace 
any  research  methodology  and  any  accounting-related  subject,  as  long  as  the  articles  meet  the  standards 
established  for  publication  in  the  journal  ...  No  special  sections  should  be  necessary.  The  primary,  but  not 
exclusive,  audience  should  be — as  it  is  now — academicians,  graduate  students,  and  others  interested  in 
accounting  research.” 

The  primary  criterion  for  publication  in  The  Accounting  Review  is  the  significance  of  the  contribution  an 
article  makes  to  the  literature.  Topical  areas  of  interest  to  the  journal  include  accounting  information  systems, 
auditing  and  assurance  services,  financial  accounting,  management  accounting,  taxation,  and  all  other  areas  of 
accounting,  broadly  defined.  The  journal  is  also  open  to  all  rigorous  research  methods. 

The  efficiency  and  effectiveness  of  the  editorial  review  process  is  critically  dependent  upon  the  actions 
of  both  authors  submitting  papers  and  the  reviewers.  Authors  accept  the  responsibility  of  preparing  research 
papers  at  a  level  suitable  for  evaluation  by  independent  reviewers.  Such  preparation,  therefore,  should  include 
subjecting  the  manuscript  to  critique  by  colleagues  and  others  and  revising  it  accordingly  prior  to  submission. 
The  review  process  is  not  to  be  used  as  a  means  of  obtaining  feedback  at  early  stages  of  developing  the 
research. 

Reviewers  and  editors  are  responsible  for  providing  constructive  and  prompt  evaluations  of  submitted 
research  papers  based  on  the  significance  of  their  contribution  and  on  the  rigor  of  analysis  and  presentation. 

MANUSCRIPT  PREPARATION  AND  STYLE 

The  Accounting  Review’s  manuscript  preparation  guidelines  follow  The  Chicago  Manual  of  Style  (15th  ed.; 
University  of  Chicago  Press).  Another  helpful  guide  to  usage  and  style  is  The  Elements  of  Style,  by  William 
Strunk,  Jr.,  and  E.  B.  White  (Macmillan).  Spelling  follows  Webster’s  Collegiate  Dictionary. 

FORMAT 

1.  All  manuscripts  should  be  formatted  in  12-point  font  on  8  1/2  x  11"  paper  and  should  be  double-spaced, 
except  for  indented  quotations. 

2.  Manuscripts  should  be  as  concise  as  the  subject  and  research  method  permit,  generally  not  to  exceed  7,000 
words. 

3.  Margins  should  be  at  least  one  inch  from  top,  bottom,  and  sides. 

4.  To  promote  anonymous  review,  authors  should  not  identify  themselves  directly  or  indirectly  in  their 
papers  or  in  experimental  test  instruments  included  with  the  submission.  Single  authors  should  not  use  the 
editorial  “we”. 

5.  A  cover  page  should  show  the  title  of  the  paper,  all  authors’  names,  titles  and  affiliations,  email  addresses, 
and  any  acknowledgments. 

6.  The  American  Accounting  Association  encourages  use  of  gender-neutral  language  in  its  publications. 

Pagination:  All  pages,  including  tables,  appendices  and  references,  should  be  serially  numbered.  Major 
sections  should  be  numbered  in  Roman  numerals.  Subsections  should  not  be  numbered. 

Numbers:  Spell  out  numbers  from  one  to  ten,  except  when  used  in  tables  and  lists,  and  when  used  with 
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Hyphens:  Use  a  hyphen  to  join  unit  modifiers  or  to  clarify  usage.  Eor  example,  a  cross-sectional  equation, 
re-form.  See  Webster’s  for  correct  usage. 

Keywords:  The  abstract  must  be  followed  by  at  least  three  keywords  to  assist  in  indexing  the  paper  and 
identifying  qualified  reviewers. 
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ABSTRACT/INTRODUCTION 

An  Abstract  of  about  100  words  (150  maximum)  should  be  presented  on  a  separate  page  immediately 
preceeding  the  text.  The  Abstract  should  concisely  inform  the  reader  of  the  manuscript’s  topic,  its  methods, 
and  its  findings.  Keywords  and  the  Data  Availability  statements  should  follow  the  Abstract.  The  text  of  the 
paper  should  start  with  a  section  labeled  “1.  Introduction,”  which  provides  more  details  about  the  paper’s 
purpose,  motivation,  methodology,  and  findings.  Both  the  Abstract  and  the  Introduction  should  be  relatively 
nontechnical,  yet  clear  enough  for  an  informed  reader  to  understand  the  manuscript’s  contribution.  The 
manuscript  s  title,  but  neither  the  author’s  name  nor  other  identification  designations,  should  appear  on  the 
Abstract  page. 

TABLES  AND  FIGURES 

The  author  should  note  the  following  general  requirements: 

1.  Each  table  and  figure  (graphic)  should  appear  on  a  separate  page  and  should  be  placed  at  the  end  of  the 
text.  Each  should  bear  an  Arabic  number  and  a  complete  title  indicating  the  exact  contents  of  the  table  or 
figure.  Tables  and  figures  should  define  each  variable.  The  titles  and  definitions  should  be  sufficiently 
detailed  to  enable  the  reader  to  interpret  the  tables  and  figures  without  reference  to  the  text. 

2.  A  reference  to  each  graphic  should  be  made  in  the  text. 

3.  The  author  should  indicate  where  each  graphic  should  be  inserted  in  the  text. 

4.  Graphics  should  be  reasonably  interpreted  without  reference  to  the  text. 

5.  Source  lines  and  notes  should  be  included  as  necessary. 

Equations:  Equations  should  be  numbered  in  parentheses  flush  with  the  right-hand  margin. 

DOCUMENTATION 

Citations.  Within-text  citations  are  made  using  an  author-year  format.  Cited  works  must  correspond  to  the 
list  of  works  listed  in  the  “References”  section.  Authors  should  make  an  effort  to  include  the  relevant  page 
numbers  in  the  within-text  citations. 

1.  In  the  text,  works  are  cited  as  follows:  author’s  last  name  and  year,  without  comma,  in  parentheses.  Eor 
example:  one  author,  (Berry  2003);  two  authors,  (Fehr  and  Schmidt  2003);  three  or  more  authors, 
(Dechow  et  al.  1998);  more  than  one  work  cited,  (Cole  and  Yakushiji  1984;  Dechow  et  al.  1995;  Levitt 
1998);  with  two  works  by  the  same  author(s),  (Nelson  2003,  2005). 

2.  Unless  confusion  would  result,  do  not  use  “p.”  or  “pp.”  before  page  numbers.  Eor  example,  (Dechow  and 
Dichev  2002,  41-42). 

3.  When  the  reference  list  contains  two  or  more  works  by  the  same  author  (as  the  only  author  or  first  of  two 
or  more  authors)  published  in  the  same  year,  the  suffix  a,  b,  etc.,  is  appended  to  the  date  in  the  within-text 
citations  and  m  the  “References”  section.  For  example,  (Johansson  2004a,  2004b,  2004c;  Baiman  and 
Rajan  2002a,  2002b;  Dhaliwal  et  al.  2005a,  2005b). 

4.  When  the  author’s  name  is  mentioned  in  the  text,  it  need  not  be  repeated  in  the  citation.  For  examnle: 
“Cohen  et  al.  (2005)  provide  ...” 

5.  Citations  to  institutional  works  should  use  acronyms  or  short  titles  where  practicable.  For  example: 
(NCFFR,  The  Treadway  Commission  1987). 

6.  If  the  paper  refers  to  statutes,  legal  treatises,  or  court  cases,  citations  acceptable  in  law  reviews  should  be 
used. 

Reference  List:  Every  manuscript  must  include  a  “References”  section  that  contains  only  those  works  cited 
within  the  text.  Each  entry  should  contain  all  information  necessary  for  unambiguous  identification  of  the 
published  work.  Use  the  following  formats  (which  follow  The  Chicago  Manual  of  Style)-. 

1.  Arrange  citations  in  alphabetical  order  according  to  the  surname  of  the  first  author  or  the  name  of  the 
institution  or  body  responsible  for  the  published  work. 

2.  Use  authors’  initials  instead  of  proper  names. 

3.  For  two  or  more  authors,  separate  authors  with  a  comma,  including  a  comma  before  “and”  (Dechow,  P. 
M.,  R.  Sloan,  and  A.  Sweeney). 

4.  Date  of  publication  follows  the  author’s  (authors’)  name(s). 

5.  Titles  of  journals  or  newspapers  should  not  be  abbreviated. 
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Footnotes:  Footnotes  are  not  used  for  documentation.  Textual  footnotes  should  be  used  only  for  extensions 
and  useful  excursions  of  information  that  if  included  in  the  body  of  the  text  might  disrupt  its  continuity. 
Footnotes  should  be  inserted  using  the  “footnote”  or  “endnote”  feature  of  the  word  processing  software  which 
will  automatically  number  the  footnotes  throughout  the  manuscript  with  superscript  Arabic  numerals. 
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1.  Manuscripts  currently  under  consideration  by  another  journal  or  publisher  should  not  be  submitted.  The 
author  must  state  upon  submission  that  the  work  is  not  submitted  or  published  elsewhere. 

2.  For  manuscripts  reporting  on  field  surveys  or  experiments:  If  the  additional  documentation  (e.g.,  ques¬ 
tionnaire,  case,  interview  schedule)  is  sent  as  a  separate  file,  then  all  information  that  might  identify  the 
author(s)  must  be  deleted  from  the  instrument. 

3.  Manuscripts  should  be  submitted  via  email  as  Microsoft  Word  or  PDF  file  to  the  Senior  Editor,  Steven 
Kachelmeier,  at  email  address  tar@mccombs.utexas.edu.  Please  submit  separate  files  for  (1)  the  manu¬ 
script’s  title  page  with  identifying  information  (not  forwarded  to  reviewers),  (2)  the  manuscript  with  title 
page  and  all  other  identifying  information  removed,  and  (3)  any  necessary  supplemental  files,  such  as 
experimental  instructions  and/or  response  memoranda  on  invited  revisions.  The  body  or  subject  line  of  the 
email  should  also  indicate  the  submission  fee  receipt  code  from  AAA  (see  step  4  below). 

4.  A  nonrefundable  submission  fee  is  required  unless  explicitly  waived  by  the  editor  for  invited  revisions  of 
previous  submissions.  The  submission  fee  in  U.S.  funds  is  $200.00  for  members  and  $400.00  for  non¬ 
members  of  the  AAA  payable  by  credit  card  (VISA  or  MasterCard  only).  The  payment  form  is  available 
online  at:  https://aaahq.org/AAAforms/joumals/tarsubmit.cfm.  If  you  are  unable  to  pay  by  credit  card  or 
have  any  questions,  please  contact  the  AAA  Member  Services  Team  at  (941)  921-7747  or  info@aaahq.org. 

5.  Revisions  must  be  submitted  within  12  months  from  the  decision  letter  inviting  a  revision. 

COMMENTS 

The  Accounting  Review  welcomes  submissions  of  comments  on  previous  TAR  articles.  Comments  on  articles 
previously  published  in  The  Accounting  Review  will  generally  be  reviewed  by  two  reviewers,  usually  includ¬ 
ing  an  author  of  the  original  article  (to  assist  the  editor  in  evaluating  whether  the  submitted  comment 
represents  the  prior  article  accurately)  and  an  additional  reviewer  who  is  independent  of  the  original  article. 
If  a  comment  is  accepted  for  publication,  the  original  author  will  generally  be  invited  to  reply.  All  other 
editorial  requirements,  as  enumerated  above,  apply  to  proposed  comments. 

POLICY  ON  REPRODUCTION 

An  objective  of  The  Accounting  Review  is  to  promote  the  wide  dissemination  of  the  results  of  systematic 
scholarly  inquiries  into  the  broad  field  of  accounting. 

Permission  is  hereby  granted  to  reproduce  any  of  the  contents  of  the  Review  for  use  in  courses  of 
instruction,  as  long  as  the  source  and  American  Accounting  Association  copyright  are  indicated  in  any  such 
reproductions. 

Written  application  must  be  made  to  the  American  Accounting  Association,  5717  Bessie  Drive,  Sarasota, 
FL  34233-2399,  for  permission  to  reproduce  any  of  the  contents  of  the  Review  for  use  in  other  than  courses 
of  instruction — e.g.,  inclusion  in  books  of  readings  or  in  any  other  publications  intended  for  general  distri¬ 
bution.  In  consideration  for  the  grant  of  permission  by  the  Review  in  such  instances,  the  applicant  must  notify 
the  author(s)  in  writing  of  the  intended  use  to  be  made  of  each  reproduction. 

Except  where  otherwise  noted  in  articles,  the  copyright  interest  has  been  transferred  to  the  American 
Accounting  Association.  Where  the  author(s)  has  (have)  not  transferred  the  copyright  to  the  Association, 
applicants  must  seek  permission  to  reproduce  (for  all  purposes)  directly  from  the  author(s). 

POLICY  ON  DATA  AVAILABILITY 

The  AAA’s  Executive  Committee  policy  (originally  adopted  in  1989,  and  amended  in  2009)  is  that  the 
objective  of  the  Association-wide  journals  {The  Accounting  Review,  Accounting  Horizons,  Issues  in  Account¬ 
ing  Education)  is  to  provide  the  widest  possible  dissemination  of  knowledge  based  on  systematic  scholarly 
inquiries  into  accounting  as  a  field  of  professional  research,  and  educational  activity.  To  fulfill  this  objective, 
authors  are  encouraged  to  make  their  data  available  for  use  by  others  in  extending  or  replicating  results 
reported  in  their  articles. 
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Accounting  Errors  and  Errors  of  Accounting 

John  Christensen 

University  of  Southern  Denmark 

ABSTRACT:  Accounting  should  pay  more  attention  to  errors,  as  errors  are  essential  for 
the  updating  of  beliefs.  Accounting  is  an  information  system,  and  errors  are  the  carriers 
of  information  according  to  Bayes’  Theorem.  Accountants  are  primarily  concerned  with 
the  mean  (value),  but  the  variance  of  accounting  numbers  is  equally  important.  Only 
autocorrelation  makes  historical  accounting  relevant  for  decision  purposes.  Endog¬ 
enous  errors  of  accounting  are  more  common  than  acknowledged.  First,  the  accounting 
model  is  linear,  whereas  the  world  is  nonlinear.  Second,  accounting  is  not  the  only 
information  channel,  and  accountants  must  consider  the  role  of  accounting  when  it 
supplements  other  information  sources.  This  commentary  discusses  the  consequences 
of  endogenous  errors.  Errors  are  inherent  to  accounting,  and  accountants  must  ad¬ 
dress  them. 

Keywords:  information  perspective;  accounting  errors;  permanent  earnings; 
nonlinearity;  alternative  information  sources. 


I.  INTRODUCTION 

The  accounting  system  is  a  closely  managed  information  system.  Financial  data  about  the 
firm  are  collected  with  the  purpose  of  guiding  decisions  related  to  the  firm.  The  recognition 
rules  govern  the  inclusion  of  information  in  the  accounting  library.  Recognition  rules  also 
govern  the  exclusion  of  data  from  the  accounting  system.  The  accounting  system  is  based  upon  old 
technology  dating  back  to  Pacioli  in  1494.  This  technology  continues  to  be  used,  and  the  market 
has  proved  it  useful.  Basically,  it  is  a  double  entry  system  that  simultaneously  keeps  track  of  the 
financial  stocks  and  flows  of  the  company,  with  the  comparative  advantage  that  it  is  hard  to 
manipulate.  Accountants  have  a  long  track  record  ia  maintaining  the  accounting  system  based 
upon  financial  data. 

Even  with  this  inherent  security,  the  accounting  system  is  by  no  means  flawless  (cf.  Beaver 
1998).  The  net  result  is  that  not  all  information  of  relevance  for  economic  decisions  is  included  in 
the  accounting  system.  The  accounting  system  is  effectively  limited  to  information  that  can  be 
measured  reliably.  Thus,  all  troublesome  items  are  carefully  excluded,  and,  as  a  consequence,  the 
accounting  system  steps  back  from  the  valuation  of  many  intangible  assets  and  writes  off  such 
investments  at  acquisition. 


Helpful  comments  from  Joel  Demski,  Steven  Kachelmeier,  and  semiinar  participants  at  Bocconi  University  are  gratefully 
acknowledged. 

Editor’s  note:  This  commentary,  based  on  a  lecture  at  the  2009  American  Accounting  Association  Annual  Meeting  in  New 
York,  NY,  was  invited  by  senior  editor  Steven  Kachelmeier,  consistent  with  the  AAA  Executive  Committee’s  goal  to 
promote  broad  dissemination  of  the  AAA  Presidential  Scholar  Lecture. 
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The  accounting  system  serves  many  purposes  in  a  multi-person  world.  A  conflict  among  uses 
is  inevitable,  as  pointed  out  by  Gjesdal  (1981).  Thus,  errors  are  inevitable.  More  to  the  point  is  that 
it  is  an  information  system,  and  information  is  imbedded  in  the  errors.  Hence,  the  question  of  how 
to  extract  information  from  the  endogenous  errors  of  the  accounting  system  must  be  addressed. 
This  brings  the  time-series  properties  of  accounting  numbers  into  focus.  Extracting  permanent  and 
transitory  earnings  components  from  errors  becomes  an  important  exercise,  as  I  discuss  in  Section 
II. 

Costing  and  the  managerial  accounting  system  also  contain  endogenous  errors.  Most  of  these 
errors  originate  from  the  discrepancy  between  the  accounting  model  and  the  underlying  economics 
of  the  firm.  The  old  variable-cost-versus-fixed-cost  controversy  might  be  seen  as  belonging  to  this 
type  of  error.  The  bigger  picture  is  that  the  accounting  model  is  linear,  whereas  the  world  is  hardly 
linear.  Economies  of  scale  and  scope  are  part  of  most  businesses,  and  they  certainly  form  the  basis 
for  most  multi-product  firms.  The  discrepancy  between  the  linear  accounting  model  and  the 
nonlinear  economics  induces  errors  into  the  accounting  system  and  reduces  its  usefulness  for 
decision  purposes.  The  activity-based  costing  (ABC)  accounting  system  offers  a  finely  tuned 
system,  which  mirrors  the  production  technology  of  the  firm,  albeit  using  a  linear  costing  tech¬ 
nique.  The  alternative  is  a  simple  accounting  system  using  fewer  cost  drivers.  Both  are  infected  by 
errors,  although  it  is  not  clear  that  one  should  favor  the  more  complex  system.  I  return  to  this  issue 
in  Section  III. 

Another  set  of  errors  of  the  accounting  system  relates  to  the  fact  that  accounting  is  constructed 
as  if  it  were  the  only  information  source  providing  information  to  decision  makers.  Thus,  much 
effort  is  directed  to  ensuring  that  the  accounting  system  reflects  all  available  information.  For 
example,  consider  the  fair-value  debate.  The  point  here  is  that  the  accounting  system  is  not 
necessarily  the  best  aggregator  of  information.  The  accounting  system  has  a  comparative  advan¬ 
tage  when  it  comes  to  providing  information  about  the  financial  history  of  the  firm.  Market-related 
information  might  also  reach  the  decision  makers  through  other  channels  such  as  the  pricing 
mechanism.  In  fact,  the  market  mechanism  is  a  powerful  information  aggregator,  as  suggested  by 
Roll  (1984).  Hence,  there  is  a  horse  race  between  the  accounting  system  and  the  market  mecha¬ 
nism  in  aggregating  market  information.  When  this  aggregation  is  performed  by  the  accounting 
system,  the  system  will  potentially  be  less  efficient  in  providing  historical  financial  information. 
As  both  types  of  information  are  in  demand,  there  is  a  conflict.  I  return  to  this  issue  in  Section  IV. 


II.  THE  INFORMATION  PERSPECTIVE 

Accounting  provides  information  to  decision  makers.  It  produces  numbers  that  have  proven 
useful  in  pricing  commodities  and  produces  data  that  serve  as  inputs  into  the  managerial  perfor¬ 
mance  evaluation  exercise  (Homgren  et  al.  2009).  Thus,  accounting  provides  information  for  the 
decision  making  and  control  purposes  of  the  firm  in  general  (Sunder  1997). 

Bayes  taught  us  how  our  beliefs  are  updated  once  new  information  arrives.  The  event  of 
interest  is  called  A,  and  the  new  piece  of  information  is  denoted  y.  The  beliefs  of  the  decision 
maker  are  encoded  in  the  probability  distribution,  P.  Bayes’  formula  shows  that  the  updated 
probabilities  reflect  the  relative  likelihood  of  y  when  y  has  occurred: 


P(A|y)  = 


P{A  n  y) 

Piy) 


(1) 


More  insight  might  be  obtained  when  information  is  placed  in  a  normal  distribution  frame¬ 
work.  The  value  of  the  object  in  question  is  x,  which  is  normally  distributed  with  mean  p  and 
variance  a^.  The  new  information  signal  is  y,  which  is  normal  with  mean  x  and  variance  o^. 
Consistent  updating  of  the  expectation  of  x  given  the  information  in  y  results  in  the  following 
expression; 
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X  ~  N{ix,  cP')  -value 


y  ~  N{x,  02)  -information 

£[x|y]  =  /i+-^ — 3(j-/x).  (2) 

(T  +  02 

As  anticipated,  the  updated  expectation  depends  both  on  y  and  on  the  prior  mean  p.  The 
interesting  observation  is  that  the  updated  expectation  of  x  also  depends  on  the  variances.  The 
amount  of  updating  depends  upon  the  relative  variance  of  the  new  information  as  compared  to  the 
prior  variance.  If  the  new  source  of  information  has  very  little  variance — reliable  information — 
then  the  updated  value  is  equal  to  the  signal.  When  the  signal  is  of  lower  quality,  the  updating 
adjusts  accordingly,  as  the  magnitude  of  the  updating  is  a  decreasing  function  of  the  variance  of. 

The  implication  of  this  for  an  accounting  context  can  be  shown  in  a  simple  setting,  adapted 
from  Christensen  and  Demski  (2003a).  Suppose  that  the  firm  has  a  lifetime  of  three  periods  and 
faces  perfect  capital  markets  with  an  interest  rate  of  r.  Initially,  the  owners  invest  -CFq>(), 
generating  uncertain  cash  flows  over  the  following  three  periods  of: 

CFj  =  Cj  -I-  £  -I-  £] 


CF2  =  C2  +  Oe  +  S2 


CFj,  =  C3  -t  yfi  +  £3 .  (3) 

The  expected  value  of  all  the  error  terms  £,  £1,  £2,  and  £3  is  zero  and  the  expected  net  present 
value  of  the  investment  is  also  zero.  All  error  terms  are  independent,  implying  that  the  c/s  are 
transitory  earnings  components,  whereas  s  has  a  permanent  effect,  as  it  affects  all  three  periods.  It 
is  assumed  that  all  cash  is  transferred  to  the  owners  as  immediate  dividends.  Thus,  the  firm  never 
holds  cash. 

The  risk-neutral  value  of  the  firm  is  given  at  time  0  (value  =  -CFo)  and  time  3  (value  =  0). 
The  interesting  valuation  questions  are  found  at  t  =  1  and  2.  Suppose  only  cash  flows  are  observed 
by  the  market  participants  as  time  progresses.  Given  the  efficiency  of  markets,  the  market  valua¬ 
tion  at  time  1  will  be: 

E[PVy\CF^  =  Cl  +  £  +  £1]  =  F[CF2l£  -I-  £i](l  +  r)“'  +  F[CF3l£  +  £i](l  +  =  (c2  -i-  dE[s\s 

+  £i])(l  -t  r)~^  +  (c3  -I-  7£'[£|£  +  ei])(l  +  r)~^-  (4) 

The  error  terms  provide  the  information  for  Bayesian  updating.  Furthermore,  the  autocorre¬ 
lation  of  the  error  terms  is  what  makes  the  valuation  exercise  interesting.  Only  the  permanent 
component  of  the  error  term  enters  the  value  calculation. 

The  accounting  system  will  also  reflect  the  first  period  error  term,  but  in  a  different  way. 
Accounting  is  governed  by  accounting  conventions  and  recognition  rules.  Under  historical  cost 
accounting,  the  accounting  valuation  will  start  from  the  book  value  (=  acquisition  value)  and  then 
use  periodic  depreciation  to  adjust  the  value.  The  depreciation  charge  is  predetermined  as  The 
first  period  accounting  value  is  then  and  the  period  1  accounting  income  is  f 

=  CFi  -  <5i.  The  accounting  income  reflects  the  error  terms.  Since  the  accounting  system  reflects 
the  same  information  as  the  market  value,  it  is  possible  to  calculate  the  market  value  using 
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accounting  data.  Contrary  to  the  accounting  income,  the  accounting  valuation  contains  no  infor¬ 
mation  about  the  error  term.  The  expected  future  residual  income  measures  will  counterbalance 
the  error  made  in  the  accounting  valuation,  as  in  Feltham  and  Ohlson  (1995).  The  calculation  is  as 
follows: 

^[PVils  -I-  Sj]  =  Ai  +E[l2  -  Mi|e  -t  ai](l  -l-  -l-  £’[73  -  rA2\s  +  ei](l  -I-  r)~^.  (5) 

The  general  point  here  is  that  the  accounting  valuation  seldom  coincides  with  the  market 
valuation.  The  recognition  rules  of  accounting  dictate  a  more  conservative  information  processing 
compared  to  the  market  valuation.  When  pricing  the  securities  in  a  risk-free  market,  the  market 
participants  use  their  information  to  estimate  the  mean  value  and  are  not  constrained  by  account¬ 
ing  conventions. 

When  the  market  valuation  and  the  accounting  valuation  are  based  upon  the  same  information 
set,  it  is  possible  to  find  the  market  valuation  from  the  accounting  valuation,  making  it  important 
to  extract  the  information  about  the  permanent  error  term  from  the  reported  income.  This  analysis 
suggests  that  an  important  role  of  the  accounting  system  is  to  provide  information  about  the  error 
terms,  and  yet  how  to  fulfill  that  role  is  not  predominant  in  the  accounting  literature. 

When  the  market  valuation  is  based  on  more  information  than  the  information  contained  in 
the  accounting  reports,  it  becomes  more  difficult  to  analyze  the  role  of  the  accounting  information. 
The  errors  are  important,  as  suggested  by  Bayesian  updating,  but  the  question  remains  as  to 
whether  the  accounting  information  should  be  confined  to  transaction  information  or  whether 
market  information  should  be  part  of  the  accounting  system.  I  will  return  to  this  question  in 
Section  IV,  after  considering  the  effects  of  nonlinearities  in  accounting  in  Section  III  below. 

III.  NONLINEARITIES  IN  ACCOUNTING— COSTING 

Costing  is  an  old  art  of  accountants  and  the  ABC  literature  has  given  rise  to  a  new  interest  (cf. 
Christensen  and  Demski  1995).  One  of  the  important  insights  to  be  gained  from  the  activity -based 
accounting  system  is  that  the  accounting  system  is  improved  when  the  accounting  structure  re¬ 
flects  the  structure  of  production.  In  this  way,  ABC  is  better  able  to  produce  product  costs  that  are 
in  line  with  the  underlying  economic  cost.  The  basic  activity-based  construction  of  product  cost  is 
a  two-stage  linear  cost  allocation.  The  errors  of  ABC  have  been  the  subject  of  recent  research  such 
as  Cardinaels  and  Labro  (2008).  The  decision  problem  regarding  production  plans,  make-or-buy, 
or  acquisitions  might  take  on  many  disguises,  but  according  to  economics,  marginal  cost  or 
incremental  cost  is  the  statistic  that  guides  optimal  decisions  when  confronted  with  the  marginal 
revenue.  Furthermore,  cost  functions  are  often  nonlinear,  as  increasing  or  decreasing  returns  to 
scale  are  commonplace,  such  that  nonlinearity  is  often  cited  as  the  reason  for  a  particular  size  of 
an  operation  or  even  the  existence  of  multi-product  firms. 

Accounting  cost  usually  assumes  a  linear  cost  function.  The  accounting  construction  of  unit 
cost  is  a  prominent  example  of  this.  When  the  cost  function  is  nonlineai',  an  endogenous  error  in 
the  accounting  system  is  the  result. 

Consider  a  facility  that  produces  several  products  in  quantities  described  by  the  vector  q,  as  in 
Christensen  and  Demski  (1997).  Assume  three  indirect  cost  pools  named  after  the  production 
function:/,  g,  and  h.  Cost  pool /uses  the  inputs  x,  and  to  produce  the  products  in  the  quantity 
q.  The  products  use  the  output  from  the  cost  pool  in  a  Leontief  technology  described  by  A,.  This 
means  that  there  is  a  linear  use  of  the  resources  produced  by  /.  In  this  way,  A ,  illustrates  the  cost 
driver  for  pool  one,  as  each  unit  of  the  first  product  uses  the  number  represented  by  the  first 
element  of  A 1  of  the  resource/.  Cost  pool  g  is  similar  to  cost  pool /  Cost  pool  h  is  an  intermediate 
cost  pool.  The  output  of  h  is  also  Leontief,  as  it  produces  the  inputs  X5  and  Vg,  which  are  used  as 
inputs  in /and  g,  respectively.  The  production  function  h  uses  the  inputs  JC3  and  X4.  The  production 
functions  /,  g,  and  h  describe  the  transformation  process  of  inputs  to  outputs  for  each  of  the  cost 
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pools.  In  addition  to  these  cost  pools,  production  gives  rise  to  labor  cost,  L,  and  material  cost,  M. 
These  two  cost  categories  are  linear  by  default. 

The  cost  function  of  the  operation  is  derived  from  the  following  optimization: 


c{q)  =  Min 


4 

Lq  +  Mq  +  ^  PiXi 
i=I 


s.t. 


Aiq  </(xi,X5) 


A2q  ^  gix2,X(,) 


X5 +X6  ^ /z(x3,X4).  (6) 

The  marginal  cost  is  the  relevant  cost  statistic  for  incremental  decisions.  The  Lagrange  mul¬ 
tipliers  on  the  constraints  are  useful  in  characterizing  the  marginal  cost: 

MCfq)  =  Li  +  Ml  +  -I-  A,2(^M2!- 

The  Leontief  coefficients  fully  capture  the  differences  of  the  products’  marginal  costs,  and 
there  is  a  linear  relationship  between  the  marginal  costs  of  the  products.  It  is  noteworthy  that  the 
intermediate  cost  pool  is  not  present  in  this  marginal  cost  calculation,  though  it  is  accounted  for 
indirectly  through  Lagrange  multipliers  of  cost  pools  /and  g.  The  marginal  cost  is  not  necessarily 
linear  in  the  products  q.  That  depends  upon  the  production  functions  /,  g,  and  h,  as  the  Lagrange 
multipliers  are  not  necessarily  constant. 

The  costs  are  accumulated  in  the  three  obvious  cost  pools.  The  costs  are  exactly  as  suggested 
by  the  cost  functions  and  the  production  functions  as  in  Equation  (6).  There  are  no  error  terms  and 
there  are  no  inefficiencies.  Now  apply  cost  calculations  of  the  ABC  type  to  find  the  product  costs. 
First,  the  intermediate  cost  pool  is  allocated  to  the  indirect  cost  pools  /  and  g  using  the  available 
cost  drivers  describing  the  activity  of  the  intermediate  cost  pool.  Second,  the  total  cost  in  the 
indirect  cost  pools  is  allocated  to  the  cost  objects  using  the  Leontief  coefficients  of  the  production 
functions.  These  will  form  the  natural  and  available  cost  drivers.  The  calculations  are  summarized 
in  Equation  (8): 

Ch  =  P3X3+P4X4 


PlXi  +  - Ch 

X5  +X5 


A^q 


xe 

P2^2 + 

X5  +Xq 


^2^ 


Ci  =  Li  +  Mi  +  CfAii  +  C^2i- 

It  is  noteworthy  that  the  structure  of  the  ABC  calculation  is  identical  to  the  structure  of  the 
marginal  cost  calculation.  The  direct  cost  components  are  identical  and  the  Leontief  coefficients 
A^i  and  Aj/  are  also  found  in  the  ABC  calculation.  Thus,  the  differences  among  the  products  with 
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respect  to  cost  structure  are  precisely  reflected  in  the  ABC  system,  due  to  the  Leontief  structure  of 
the  production  function.  Furthermore,  any  discrepancy  between  ABC  cost  and  marginal  cost  must 
be  found  in  the  Lagrange  multipliers,  which  represent  the  marginal  costs  of  the  indirect  cost  pools. 
As  noted,  the  multipliers  indirectly  account  for  the  cost  of  the  intermediate  cost  pool  as  in  the  ABC 
system.  When  the  production  functions  /,  g,  and  h  are  all  linear,  the  ABC  approach  results  in  the 
marginal  cost  for  all  of  the  products  simultaneously. 

When  one  or  more  of  the  production  functions  show  increasing  returns  to  scale,  the  ABC 
system  will  estimate  the  marginal  cost  too  high,  whereas  the  opposite  is  the  case  for  decreasing 
returns  to  scale. 

The  activity-based  accounting  system  was  introduced  as  an  improvement  of  the  traditional 
accounting  system,  particularly  for  firms  that  produce  multiple  products.  In  comparison,  the  tra¬ 
ditional  accounting  system  is  a  single  cost  pool  system  using  direct  labor  as  the  cost  driver.  The 
product  cost  calculation  runs  according  to  the  following  pattern: 


Cz.= 


PiXi+P2J^2  +  P3^3+P4^4 

Lq 


Q  -  m  1  +  Cj)  +  Ml.  (9) 

This  costing  system  is  simpler  than  the  ABC  system,  as  it  has  only  one  independent  parameter 
to  describe  the  differences  of  the  product  cost  of  the  products  in  q.  The  input  of  labor,  L,  is  the 
only  source  of  assigning  overhead  costs  to  the  products. 

However,  as  the  ABC  system  is  also  an  approximation,  it  is  worthwhile  to  run  the  horse  race 
between  these  two  systems,  as  in  Christensen  and  Demski  (1997).  The  actual  marginal  cost  of  the 
products  is  used  to  evaluate  the  performance  of  the  two  accounting  systems.  The  performance  of 
an  accounting  system  is  measured  by  the  mean  squared  error  of  the  unit  cost  reported  by  the 
accounting  system.  Christensen  and  Demski  (1997)  found  that  the  traditional  accounting  system 
based  solely  on  labor  input  for  some  combinations  of  the  production  functions  outperformed  the 
more  refined  ABC  system.  The  picture  is  more  pronounced  when  the  individual  products  are 
evaluated.  For  some  products,  one  accounting  system  performs  best,  whereas  for  other  products, 
the  other  accounting  system  performs  best.  There  is  no  uniform  winner. 

This  finding  is  not  restricted  to  nonlinear  production  functions.  Datar  and  Gupta  (1994)  found 
similar  results  when  they  studied  the  combined  effect  of  an  erroneous  choice  of  cost  driver  and 
different  levels  of  aggregation.  They  found  that  the  two  factors  interact,  such  that  repairing  the 
system  on  one  dimension  does  not  necessarily  lead  to  an  overall  improvement.  They  also  used  the 
Euclidean  distance  measure  to  determine  the  performance  of  the  accounting  systems.  Labro  and 
Vanhoucke  (2007)  took  the  analysis  one  step  further  by  performing  a  large  simulation  analysis. 
The  underlying  “true”  economics  of  the  firm  is  linear  and  with  a  multi-layer  system  like  an  ABC 
system.  Their  setting  included  a  large  number  of  cost  pools  that  are  allocated  to  activity  cost  pools 
and  further  to  cost  objects.  All  the  allocations  are  linear,  using  pre-specified  cost  drivers.  In  their 
simulation,  Labro  and  Vanhoucke  (2007)  varied  the  number  of  cost  pools  both  at  the  resource  cost 
pool  level  and  the  activity  cost  pool  level.  They  also  varied  the  cost  drivers  used  in  the  cost 
allocations.  Results  supported  the  findings  of  Datar  and  Gupta  (1994),  revealing  interaction  effects 
between  errors.  An  incremental  refinement  of  the  accounting  system  does  not  always  lead  to  a 
superior  system. 

The  same  pattern  of  results  arises  when  the  accounting  system  is  used  to  induce  efficient 
behavior.  The  economic  relationships  in  the  division  are  as  described  in  Equation  (6).  It  is  as¬ 
sumed  that  part  of  the  production  from  this  division  is  sold  for  profit  in  the  market,  and  part  of  the 
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production  is  delivered  to  other  divisions  in  return  for  cost  reimbursement.  The  eost  reimburse¬ 
ment  is  based  on  the  realized  eost  and  observed  drivers,  as  measured  by  the  accounting  system. 

The  evaluation  of  the  division  is  based  upon  divisional  profit.  Inefficient  decisions  result  in 
excessive  costs,  but  perhaps  divisional  profit  will  increase.  The  question  is  whether  the  division  is 
able  to  pass  more  than  the  excessive  cost  on  to  the  other  division  through  clever  but  inefficient 
decision  making.  The  control  system  prevents  this  from  happening  to  some  extent.  Christensen 
and  Demski  (2003b)  found  that,  when  the  technology  exhibits  constant  returns  to  scale,  the 
ABC-type  accounting  system  will  always  induce  efficiency.  However,  if  the  indirect  cost  pools 
show  increasing  returns  to  scale  or  if  the  intermediate  cost  pool  has  non-constant  returns  to  scale, 
then  it  is  possible  for  the  division  to  profit  from  using  resources  inefficiently.  The  ineffieiency  is 
passed  on  to  the  other  division  via  the  cost  allocation.  The  scheme  relies  heavily  on  substitution  of 
factors  of  production.  The  scheme  allows  the  division  to  employ  the  more  productive  resources  for 
the  products  sold  for  profit,  whereas  the  less  productive  inputs  are  used  for  internal  transfers. 
There  is  nothing  in  the  system  itself  which  limits  the  amount  of  inefficiency.  Such  limits  must  be 
found  in  the  eontrol  system. 

When  the  possibility  of  such  inefficiencies  exists,  a  simpler  traditional  accounting  system 
might  be  better  at  promoting  efficiency.  Christensen  and  Demski  (2003b)  analyzed  the  horse  race 
between  a  labor-cost-based  allocation  system  and  an  activity-based  accounting  system.  The  pro¬ 
duction  technology  suggests  an  activity-based  system.  The  inefficiencies  of  an  ABC  system  are 
confined  to  the  indirect  cost  categories. 

If  labor  is  used  to  allocate  cost  and  if  it  is  possible  to  use  too  much  labor  for  the  internal 
division,  then  the  internal  division  absorbs  a  larger  share  of  the  indirect  cost.  The  inefficiency  is 
confined  to  the  direct  labor  and  also  in  this  case  the  control  system  limits  the  inefficiency. 

The  net  result  is  that  the  winner  of  the  contest  is  determined  by  the  control  system.  Are  the 
controls  better  at  limiting  the  ineffieiencies  in  the  labor  domain  than  in  the  indirect  cost  domain? 
Accounting  provides  a  mapping  and  illustration  of  the  economics  of  the  firm.  The  accounting 
model  has  many  parameters,  and  selecting  the  model  just  to  mirror  the  structure  of  the  firm  is  too 
simplistic.  The  underlying  economics  play  a  crucial  role.  The  other  dimension  is  the  control 
system,  which  might  result  in  one  seemingly  inferior  accounting  model  winning  the  horse  race 
against  a  more  refined  accounting  model.  Unlike  accounting,  the  world  is  hardly  ever  linear,  and 
the  analysis  of  accounting  has  to  take  that  reality  into  consideration. 

IV.  ALTERNATIVE  INFORMATION  SOURCES 

Traditionally,  accounting  has  been  based  upon  historical  cost  and  revenue  measures.  Only 
historical  information  or  realized  cash  flows  could  penetrate  the  recognition  rules  of  accounting. 
Accruals  also  have  a  historical  orientation,  but  contain  certain  forward-looking  elements.  The 
valuation  of  material  assets  starts  out  with  historical  cost  and  is  then  depreciated  according  to  a 
predetermined  scheme.  The  scheme  is  determined  at  the  time  of  acquisition.  Only  hard  evidence 
will  make  it  possible  to  deviate  from  the  scheme.  Recently,  there  has  been  a  trend  toward  substi¬ 
tuting  the  historical  valuation  of  assets  with  a  fair  market  valuation,  in  which  the  current  market 
value  of  the  asset  is  used  in  the  accounting  valuation  of  the  asset. 

Fair-value  accounting  changes  recognition  rules  to  include  market  information  in  the  accounts 
to  a  larger  extent.  The  implication  is  that  the  aggregation  within  the  accounting  system  reduces  the 
information  content  of  the  accounting  statement  with  respect  to  historical  information.  Conse¬ 
quently,  fair-value  accounting  poses  a  new  horse  race  concerning  introduction  of  alternative  in¬ 
formation  sources  into  the  accounting  system.  One  system  would  be  the  historical  accounting 
system,  which  is  transaction  based,  and  opposed  to  that  is  a  fair-value  accounting  system,  which 
allows  market  information  to  be  aggregated  into  the  accounting  system.  Market  information  is 
available  in  the  market  by  definition  and  market  participants  also  have  access  to  this  type  of 
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information.  Therefore,  the  horse  race  could  also  be  coined  into  a  question  of  who  is  best  at 
performing  the  aggregation  of  market-wide  information  with  historical  cost  information.  The 
contestants  for  this  competition  are  the  market  participants  or  the  accountants.  The  analysis  of  this 
problem  follows  Christensen  and  Frimor  (2007). 

Inherently,  the  value  of  a  firm  is  uncertain.  To  keep  the  presentation  simple,  suppose  that  the 
uncertainty  consists  of  two  components:  one  that  summarizes  the  internal  factors  and  the  other  that 
summarizes  the  external  factors  of  the  firm. 

V=Sa  +  Se-  (10) 

It  is  assumed  that  the  firm  observes  the  first  error  term  and  only  observes  the  second  error 
term  with  error.  Thus,  the  firm  and  its  auditor  observe: 


yA-^A^  yp-  ^AU-  (11) 

The  firm  and  its  accountant  are  confined  to  a  single  aggregated  report  stating  the  expected  value  of 
the  firm  conditional  on  the  information  that  is  allowed  by  the  recognition  rules.  There  are  two 
possibilities  for  the  recognition  rules  that  lead  to  two  reporting  options.  One  is  a  historical  cost 
model  in  which  the  recognition  rule  only  allows  the  internal  factors  of  the  firm  into  the  accounting 
valuation,  Vjj.  The  alternative  set  of  recognition  rules  allows  the  accountant  to  aggregate  the 
observed  external  factors  into  the  reported  value  of  the  firm.  This  set  of  recognition  rules  brings 
market  observations  into  the  accounting  valuation,  and  in  this  sense  it  has  the  flavor  of  fair-value 
accounting,  Vp.  In  both  cases,  the  accountant  is  disclosing  an  unbiased  report  subject  to  his/her 
information  constraint.  The  report  published  by  the  firm  under  the  two  regimes  can  be  calculated 
assuming  that  all  information  variables  are  normally  distributed  with  zero  means: 

yH  =  E{V\yA)=yA 


^F  =  E{V\yA,yE)=yA  +  -T^^J-yE-  (12) 

^E^  ^AU 

Investors  observe  the  accounting  report  prior  to  trading.  Each  investor  also  makes  his/her  own 
private  observation  of  the  external  factor,  and  the  error  term  for  each  investor  is  personal  but 
follows  the  same  distribution.  To  simplify,  it  is  assumed  that  all  the  error  terms  of  the  investors  are 
uncorrelated.  In  addition,  the  investors  observe  the  price,  P,  at  which  the  stock  of  the  firm  is 
traded.  Thus,  the  investors  observe: 

yE,i  =  SE+Si,P,Vi.  (13) 

The  security  is  traded  in  the  open  market,  and  the  price  is  formed  when  the  investors  meet  and 
decide  on  their  individual  trading.  There  is  uncertainty  in  the  market,  as  all  individual  endowments 
are  stochastic  and  assumed  normal  with  variance  oj.  This  structure  leaves  the  model  tractable  and 
prevents  prices  from  being  too  informative.  Supposedly,  the  investors  participate  in  an  anonymous 
trading  mechanism  transforming  the  individual  bids  and  forming  a  price.  The  investors  only 
disclose  their  private  information  through  the  pricing  mechanism,  and  there  is  no  direct  informa¬ 
tion  exchange  among  the  investors.  The  investors  are  rational,  and  when  entering  into  the  pricing 
game,  they  anticipate  the  reaction  of  their  co-traders.  The  market  clearing  price  reflects  this 
conjecture,  such  that  the  resulting  price  is  a  rational  expectations  equilibrium.  The  model  used  for 
this  analysis  stems  from  Hellwig  (1980). 

The  equilibrium  price  for  this  mechanism  reflects  the  market  uncertainty,  o^,  the  private 
information  of  the  investors,  and  the  information  content  in  the  accounting  report. 
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The  important  question  is  how  to  evaluate  the  two  accounting  options.  One  method  is  to 
analyze  how  the  accounting  valuation  is  aligned  with  the  market  valuation.  The  relevant  measure 
for  this  is  the  squared  difference  between  the  accounting  valuation  and  the  market  value.  This 
difference  is  often  used  in  empirical  research,  as  both  measures  are  readily  available  (Schipper  and 
Vincent  2003).  This  measure  would  clearly  favor  the  market-based  accounting  valuation  as  the 
accounting  valuation  includes  the  second  error  term. 

An  evaluation  of  the  accounting  system  is  closely  linked  to  the  ability  of  the  accounting 
system  to  provide  useful  information  for  decision  making.  The  final  outcome  of  the  decisions  is 
what  matters  to  the  investor. 

The  market  value  might  be  a  good  proxy  for  the  final  outcome.  At  the  same  time,  the  market 
value  works  as  an  information  aggregator,  compounding  all  information  available  to  investors, 
including  the  private  information  and  the  accounting  information.  The  aggregator  role  is  not 
highlighted  when  only  the  difference  between  the  accounting  value  and  the  market  value  is 
evaluated.  Using  the  market  value  as  a  measure  provides  only  a  partial  view  of  the  relative 
performance  of  the  two  accounting  systems.  Consequently,  market  value  is  not  necessarily  a  good 
measure  of  the  merits  of  the  accounting  system.  Furthermore,  another  measure  of  the  merits  of  the 
two  accounting  options  would  be  the  squared  difference  between  the  price  of  the  asset  and  the 
final  payout  from  the  investment.  This  is  a  measure  of  the  aggregate  informativeness  of  the 
accounting  system.  It  is  difficult  to  measure  the  final  payout  empirically,  as  the  final  dividend  only 
materializes  with  a  considerable  time  lag  for  most  investments.  In  the  present  setting  it  is  possible 
to  evaluate  this  measure  analytically.  The  measure  reflects  the  ability  of  the  price  to  reflect  the 
information  available  to  the  market  participants.  It  is  of  no  relevance  to  the  measure  how  the 
information  is  transferred  to  the  price.  As  a  result,  it  provides  a  measure  of  who  is  best  at 
aggregating  the  information:  the  accountant  or  the  market. 

It  is  not  easy  to  get  a  closed  form  solution  to  show  exactly  which  accounting  system  is 
optimal,  but  it  is  possible  to  calculate  the  value  of  the  valuation  error  as  a  variance,  Var(.).  The 
valuation  error  is  calculated  for  the  two  accounting  systems.  The  historical  accounting  system  is 
independent  of  the  accountant’s  ability  to  measure  the  market  component.  The  accounting  valua¬ 
tion  of  the  fair- value  accounting  system  depends  on  the  quality  of  the  accountant.  Consequently, 
the  variance  is  calculated  as  a  function  of  the  ability  of  the  accountant  to  estimate  the  “market” 
information  for  the  fair  value  accounting  system — i.e.,  as  a  function  of  the  variance  For 
example,  if  the  auditor  is  very  good  at  estimating  the  market  factor  Sg,  then  the  variance  is 
very  small. 

In  Figure  1,  the  performances  of  the  two  accounting  systems  are  calculated  when  market 
uncertainty  is  large — that  is,  when  the  variance  of  the  initial  wealth  of  the  investors  is  large.  In  that 
case,  it  is  expected  that  the  market  price  will  carry  little  information  from  the  individual  investors 
and,  consequently,  even  less-able  accountants  will  be  able  to  provide  useful  market  information  to 
the  price  mechanism.  The  graph  for  the  fair  value  accounting  system  shows  a  smaller  variance, 
even  for  relatively  large  cP^jj. 

The  calculations  are  repeated  for  smaller  market  uncertainty  in  Figure  2.  In  that  case,  it  is 
expected  that  the  market  price  will  reflect  the  investors’  private  information  to  a  large  extent,  and 
as  a  result  there  is  a  greater  demand  for  accounting  information  to  supplement  the  information  that 
is  already  available  in  the  market.  Therefore,  it  is  expected  that  the  historical  accounting  system 
will  outperform  the  fair-value  accounting  system.  A  comparison  of  the  graphs  in  Figure  2  reveals 
that  this  is  indeed  the  case.  The  two  graphs  intersect  at  a^y  =  0.02,  implying  that  the  accountant 
should  outperform  the  average  investor  50  times  when  it  comes  to  estimating  the  market  influence 
on  the  performance  of  the  firm. 

These  findings  support  the  view  that  the  optimal  accounting  system  depends  on  the  finer 
details.  In  some  cases,  it  is  optimal  to  aggregate  other  information  into  the  accounting  system,  and 
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FIGURE  1 

High  Market  Uncertainty 


in  other  cases,  it  is  optimal  to  let  the  market  do  the  aggregation.  The  other  message  is  that  the 
accounting  system  is  not  the  only  system  available  and  it  is  a  serious  error  to  pretend  that  the 
accounting  system  should  be  the  only  information  system. 

V.  CONCLUDING  REMARKS 

The  common  theme  of  this  discussion  has  been  the  errors  of  accounting  systems.  Errors  are 
important  and  often  neglected  when  the  design  of  accounting  systems  is  evaluated.  Accounting 
carries  information,  as  is  often  recognized,  but  the  full  impact  of  that  statement  is  not  endogenized 
in  the  analysis  (Demski  2004). 

First,  the  information  perspective  calls  for  using  the  accounting  information  to  update  our 
expectation  of  the  future  events  of  the  firm,  cost,  cash  flows,  etc.  The  important  element  of  this  is 
what  we  learn  from  the  accounting  statement  and  what  we  knew  beforehand.  The  quality  of  both 
pieces  of  information  is  important  for  consistent  updating  of  beliefs  in  a  manner  consistent  with 
Bayes’  theorem. 

The  accounting  mapping  is  a  linear  construction,  using  a  linear  aggregation  of  transactions  in 
the  accumulation  accounts,  in  the  allocation  of  cost  and  benefits  to  cost  objects,  and  in  the 
presentation  in  the  balance  sheet.  Yet,  the  world  is  hardly  linear.  Economies  of  scale  and  scope  are 
found  everywhere  and  are  often  the  reason  for  the  existence  and  size  of  firms.  Thus,  the  assumed 
linearity  in  accounting  results  in  structural  errors.  There  are  many  reasons  for  sticking  to  the 
linearity  assumption,  as  estimating  the  “correct”  functional  form  of  some  accounts  is  an  impos¬ 
sible  task.  The  result  is  that  the  errors  persist,  and  the  system  designer  can  counter  this  through  the 
allocation  mechanisms  employed.  Understanding  the  error  structure  is  important. 

Finally,  the  accounting  system  is  not  the  only  information  system  in  the  world.  Other  and 
perhaps  more  timely  information  sources  exist.  The  accounting  system  must  acknowledge  this  fact 
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FIGURE  2 

Low  Market  Uncertainty 


and  act  accordingly.  The  comparative  advantage  of  the  accounting  system  must  be  considered, 
with  some  tasks  better  left  for  other  information  sources.  The  debate  on  fair-value  accounting 
provides  an  example  of  this  reasoning.  The  accounting  system  has  a  non-disputed  advantage  when 
it  comes  to  reporting  of  firm  specific  information.  Aggregating  market  information  into  the  ac¬ 
counting  system  might  or  might  not  be  a  good  idea.  The  basic  question  here  is  who  is  best  at 
performing  the  aggregation  of  the  market  information.  The  market  mechanism  is  a  powerful 
information  aggregator,  and  it  might  be  better  suited  for  aggregating  firm-specific  information  with 
market  information. 
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ABSTRACT:  Previous  research  in  management  accounting  and  economics  has  noted 
the  potential  for  complementarities  between  the  firm’s  performance  measurement  sys¬ 
tem  and  its  other  organizational  design  choices.  We  add  to  this  literature  by  studying 
how  the  informativeness  and  incentive  properties  of  a  performance  metric  can  be  influ¬ 
enced  by  one  particular  organizational  design  choice — the  size  of  the  firm’s  inventory 
buffers.  We  model  a  manufacturing  setting  in  which  an  agent  manages  a  workstation 
that  processes  intermediate  units.  As  intermediate  units  arrive,  they  are  stored  in  an 
inventory  buffer  until  the  agent  can  process  them.  The  buffer  can  hold  a  maximum 
number  of  intermediate  units — its  buffer  size.  The  agent  is  compensated  on  the  basis  of 
his  workstation’s  throughput.  We  characterize  the  conditions  under  which  reducing  the 
inventory  buffer  enhances/degrades  the  informativeness  of  the  performance  metric 
and,  hence,  mitigates/exacerbates  the  agent’s  incentive  problem. 

Keywords:  agency  theory;  inventory  buffers;  incentive  complementarities;  organiza¬ 
tional  design;  siack. 


I.  INTRODUCTION 

Most  of  the  agency  theory  work  in  managerial'accounting  has  studied  the  relation  between 
performance  metric  properties  (e.g.,  informativeness,  precision,  congruity)  and  optimal 
incentive  compensation,  holding  fixed  the  firm’s  other  organizational  design  decisions 
(e.g.,  hierarchical  structure,  job  assignm.ent,  production  technology).  However,  by  holding  fixed 
these  other  organizational  design  choices,  one  cannot  study  whether  and  how  they  affect  the 
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performance  metric  properties.  As  Hemmer  (1998,  321-322)  notes,  “the  value  of  a  performance 
measure  is  determined  not  simply  by  its  congruity  and  precision  but  by  its  influence  on  the  optimal 
organizational  design  ...  Much  of  the  recent  theoretical  accounting  literature  ...  has  largely  ig¬ 
nored  complementarities  between  performance  measures  and  organizational  design.” 

In  this  study,  we  expand  on  Hemmer ’s  (1998)  observation  by  studying  how  the  informative¬ 
ness  and  incentive  properties  of  a  performance  metric  are  influenced  by  one  of  the  firm’s  organi¬ 
zational  design  choices — the  size  of  its  inventory  buffers.  The  introduction  of  just-in-time  (JIT) 
and,  more  generally,  lean  manufacturing  has  led  to  an  increased  emphasis  on  controlling  and 
reducing  inventory  levels.  Managerial  accounting  has  responded  to  this  reduction  in  inventory 
with  new  costing  techniques  such  as  backflush  costing  (Horngren  et  al.  2008).  However,  we  show 
that  the  choice  of  inventory  buffers  has  a  more  subtle  effect  on  the  design  of  the  managerial 
accounting  system,  in  that  it  affects  the  informativeness  of  performance  metrics  produced  by  the 
managerial  accounting  system. 

We  model  a  manufacturing  setting  in  which  an  agent  expends  effort  to  set  up  or  program  a 
workstation  that  processes  intermediate  units.  As  intermediate  units  arrive,  they  are  stored  in  an 
inventory  buffer  until  the  agent  can  process  them.  The  buffer  can  hold  a  maximum  number  of 
intermediate  units,  b — its  buffer  size.  The  buffer  size  represents  the  workstation’s  maximum  order 
backlog.  If  an  intermediate  unit  arrives  while  the  buffer  is  full,  then  the  unit  is  diverted  and  the 
agent  loses  the  opportunity  to  process  it.  If  the  workstation  is  ready  to  process  another  unit  but  the 
buffer  is  empty,  then  it  remains  idle  until  the  next  intermediate  unit  enters  the  buffer.  While  we 
frame  the  analysis  in  terms  of  a  manufacturing  setting,  the  analysis  is  also  applicable  to  service 
operations  such  as  call  centers  that  can  have  a  maximum  number  of  calls  awaiting  service. 

We  assume  that  the  agent  is  compensated  on  the  basis  of  his  realized  throughput;  i.e.,  the 
output  per  unit  of  time  of  his  workstation.*  We  show  that  the  principal’s  choice  of  buffer  size 
indirectly  affects  the  effectiveness  with  which  throughput  reflects  the  agent’s  chosen  effort  and 
consequently  affects  the  optimal  compensation  weight  placed  on  throughput.  The  buffer  size 
affects  both  the  probability  that  the  workstation’s  buffer  will  be  full  when  an  intermediate  unit 
arrives,  causing  it  to  lose  the  opportunity  to  complete  that  unit  (referred  to  as  blocking),  and  the 
probability  that  the  buffer  will  be  empty  when  the  workstation  is  free,  causing  it  to  forgo  further 
production  until  another  intermediate  unit  arrives  (referred  to  as  starving).  Both  probabilities 
affect  the  marginal  productivity  of  the  agent’s  setup  effort  with  respect  to  throughput  (his  perfor¬ 
mance  metric)  and,  hence,  the  extent  to  which  that  effort  is  reflected  in  throughput. 

Next  we  discuss  the  relevant  literature  on  organizational  design  and  incentives.  Section  II 
introduces  the  model  and  Section  III  presents  the  results.  Section  IV  discusses  extensions  and 
concludes. 

A  number  of  articles  in  both  the  accounting  and  economics  literatures  have  considered  the 
effect  of  organizational  design  choice  on  the  design  of  optimal  incentives.  Among  the  issues 
considered  are:  job  design  (Riordan  and  Sappington  1987;  Holmstrom  and  Milgrom  1991;  Bal- 
akrishnan  et  al.  1998);  the  choice  of  production  systems  and  technologies  (Hemmer  1995,  1998; 
Milgrom  and  Roberts  1995);  the  hierarchical  structuring  of  work  (Melumad  et  al.  1995);  routing 
schemes  for  product  rework  (Lu  et  al.  2009);  and  production  bottlenecks  (Datar  and  Rajan  1995; 
Gietzmann  and  Hemmer  2002).^ 


‘  Realized  throughput  is  measured  as  realized  total  output  during  the  shift  that  the  workstation  has  been  operating  divided 
by  the  length  of  the  shift. 

^  Also  somewhat  related  is  Cremer  (1995),  which  examines  the  incentive  effects  on  both  the  buyer  and  supplier  when  the 
buyer  goes  to  a  zero-inventory  system  for  the  supplier’s  product.  However,  the  two  motivations  are  different  in  that 
Cremer  (1995)  is  based  on  a  double  moral  hazard  model. 
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Similar  to  our  model,  Hemmer  (1995,  1998)  and  Gietzmann  and  Hemmer  (2002)  examine 
how  different  workflow  arrangements  between  agents  affect  the  information  available  for  contract¬ 
ing,  and  the  incentives  facing  agents.  Our  work  is  distinct  from  these  models  that  do  not  consider 
buffer  size  as  one  of  the  principal’s  choice  variables.  Nagar  et  al.  (2009)  examine  the  role  of 
inventory  buffers  in  agency  problems,  although  unlike  our  model,  in  their  setting  buffers  are  filled 
by  agents  in  order  to  signal  private  information  and  buffer  size  is  not  a  choice  variable.  Alles  et  al. 
(1995)  do  focus  on  the  effect  that  the  choice  of  buffer  size  can  have  on  the  informativeness  of 
performance  metrics.  The  major  difference  between  our  work  and  theirs  is  that  they  do  not 
formally  model  how  buffers  affect  performance  metrics,  but  instead  assume  a  monotonic  relation. 
In  contrast,  we  formally  model  the  inventory  process  and  derive  a  non-monotonic  relation  between 
buffer  size  and  the  informativeness  of  the  agent’s  performance  metric. 

II.  MODEL  AND  INITIAL  ANALYSIS 

The  model  consists  of  a  single-period  and  a  firm  comprised  of  a  risk-neutral  principal  and 
agent.  Contracting  takes  place  at  the  start  of  the  period  when  the  agent  is  hired  to  set  up  a 
workstation  to  process  incoming  intermediate  units  (e.g.,  test  completed  computers  or  weld  auto¬ 
motive  chassis).  The  intermediate  units  arrive  stochastically  into  the  workstation’s  incoming  in¬ 
ventory  buffer  at  a  commonly  known,  mean  arrival  rate  of  X.  The  buffer  has  capacity  £>  e  N, 
allowing  a  maximum  of  b  —  \  units  to  be  held  in  front  of  the  workstation  while  the  workstation 
processes  one  unit.  If  an  intermediate  unit  arrives  and  the  buffer  is  not  full,  then  the  unit  is  added 
to  the  inventory.  However,  if  the  buffer  is  full,  then  blocking  occurs  and  intermediate  units  cease 
arriving  until  there  is  space  in  the  buffer,  whereupon  the  intermediate  units  begin  arriving  at  the 
same  stochastic  rate  as  before.^ 

Once  parties  agree  to  the  contract,  but  before  the  workstation  begins  processing,  the  agent 
chooses  his  personally  costly  setup  effort  r,  which  affects  the  average  rate  at  which  his  workstation 
can  process  the  intermediate  units.  That  effort  is  unobserved  by  the  principal  and  is  subject  to 
moral  hazard.  The  arriving  intermediate  units  are  sufficiently  heterogeneous  that  the  agent’s  effort 
does  not  perfectly  control  the  rate  at  which  the  workstation  processes  the  incoming  intermediate 
units.  Rather,  the  intermediate  units  are  processed  at  a  stochastic  rate  with  mean  r(l  +  h),  where 
h  e  (—1,°°)  represents  the  productivity-enhancing  resources  the  principal  has  allocated  to  the 
agent.  Examples  of  the  latter  include:  machine  and  tool  upgrades,  improved  working  conditions, 
training,  etc.  As  the  workstation  finishes  processing  an  intermediate  unit,  the  agent  reaches  into  the 
buffer  for  the  next  unit  on  which  to  work.  If  the  buffer  is  empty,  then  starving  occurs  and  the 
workstation  is  idle  until  the  next  intermediate  unit  arrives.  Figure  1  provides  a  graphical  descrip¬ 
tion  of  the  process. 

The  principal  chooses  both  the  agent’s  compensation  scheme  and  the  buffer  size,  b.  We 
restrict  our  attention  to  linear  compensation  schedules  for  the  agent  consisting  of  a  fixed  payment, 
Wq,  plus  an  incentive  wage,  w,  based  on  realized  throughput — the  number  of  units  processed  per 
unit  of  time.^’^ 


^  Alternatively,  one  can  interpret  the  model  as  one  in  which  the  intermediate  units  continue  arriving  while  the  inventory 
is  blocked  but  are  diverted  elsewhere.  What  is  important  is  that  in  both  cases,  blocking  imposes  an  opportunity  cost  on 
the  agent  because  his  workstation  loses  the  chance  to  process  the  intermediate  units  that  might  have  been  added  to  the 
buffer  had  it  not  been  full. 

Given  that  the  basic  properties  of  queuing  systems  are  stated  in  terms  of  rates  (i.e.,  arrivals  per  unit  of  time,  number  of 
units  processed  per  unit  of  time)  we  assume  that  the  principal  compensates  the  agent  on  throughput  per  unit  of  time 
rather  than  total  throughput.  Linear  compensation  schemes  are  frequently  used  in  practice  (Berg  and  Fast  1975;  Hall  et 
al.  2000)  and  are  extensively  used  in  the  analytical  literature  (Feltham  and  Xie  1994;  Alles  et  al.  1995;  Hemmer  1995, 
1998;  Gietzmann  and  Hemmer  2002). 

^  In  Section  IV,  we  discuss  the  possibility  of  contracting  on  inventory  levels. 
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FIGURE  1 

The  Effect  of  Inventory  on  the  Production  Process 
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In  the  top  panel,  the  buffer  is  not  full,  incoming  units  are  added  to  the  buffer,  and  the  workstation  starts  process¬ 
ing  a  unit  from  the  buffer  once  it  completes  the  current  unit.  In  the  middle  panel,  the  buffer  is  full,  incoming 
units  are  blocked  from  entering  the  buffer  (blocking)  until  the  workstation  finishes  processing  its  current  unit 
and  the  agent  withdraws  the  next  intermediate  unit  from  the  buffer.  In  the  bottom-most  panel,  the  buffer  is 
empty,  implying  that  the  workstation  is  idle  (starved)  until  the  next  unit  arrives. 


Last,  we  assume  that  the  marginal  cost  associated  with  each  unit  of  buffer  capacity  is  c  >  0, 
the  agent’s  marginal  cost  of  effort  is  a  >  0,  and  we  normalize  the  marginal  cost  of  the  additional 
resources  provided  to  the  agent  by  the  principal,  h,  at  one  dollar  per  unit  of  resource. 

All  notation  and  assumptions  are  as  follows: 

X.  =  The  mean  arrival  rate  of  the  intermediate  units.  The  arrival  rate  is  Poisson 

distributed,  implying  that  the  time  between  arrivals  is  distributed  according  to  an 
exponential  distribution  with  mean  of 
r  =  The  level  of  effort  the  agent  exerts  at  private  cost,  ar. 
h  =  The  amount  of  productivity-enhancing  resources  allocated  to  the  agent.  The 
principal’s  out-of-pocket  cost  is  h. 

r(l  +  h)  =  The  mean  rate  at  which  the  agent’s  workstation  processes  the  intermediate  units. 

The  actual  processing  time  is  distributed  according  to  an  exponential  distribution. 
b  =  The  maximum  number  of  units  the  station  can  simultaneously  store  (buffer  size). 
The  principal  incurs  the  associated  capacity  cost,  cb,  each  period. 


®  For  an  exponential  distribution,  the  same  parameter  that  represents  the  mean  also  represents  the  variance.  For  simplicity 
we  refer  to  the  choice  of  this  parameter  as  the  choice  of  the  mean  arrival  rate. 
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^  -  The  steady-state  mean  throughput  per  period,  with  K  denoting  the  realized 
throughput. 

w,  Wq  =  The  incentive  payment  made  to  the  agent  per  unit  completed  per  period,  and  the 
fixed  salary  payment,  respectively. 

Note  that  in  our  model  there  are  two  sources  of  noise  that  make  it  difficult  for  the  principal  to 
infer  the  agent’s  effort  from  throughput.  As  is  common  in  agency  problems,  the  relation  between 
the  agent’s  effort  and  the  time  required  for  the  workstation  to  process  an  intermediate  unit  is 
stochastic.  In  addition,  the  rate  at  which  units  become  available  to  be  processed  is  also  stochastic. 
Thus,  even  if  the  relation  between  the  agent’s  effort  and  the  time  required  to  process  an  interme¬ 
diate  unit  were  deterministic,  instantaneous  throughput  would  still  be  stochastic  in  the  agent’s 
effort  because  of  the  stochastic  arrival  rate  of  the  intermediate  units.  Placing  an  inventory  buffer  in 
front  of  the  agent’s  workstation  is  one  way  of  dampening  out  the  effect  of  the  variation  in  arrivals; 
however,  as  we  will  show,  such  dampening  can  also  reduce  the  informativeness  of  throughput  with 
respect  to  the  agent’s  effort. 

Our  assumptions  regarding  arrival  and  processing  rates  follow  the  standard  M/M/l/b  queuing 
model  with  finite  buffers  used  in  the  operations  management  literature.^  Note  that  while  we  are 
modeling  a  single-period  in  that  the  agent  chooses  his  setup  effort  only  once  at  the  start  of  the 
period,  we  analyze  the  stochastic  evolution  of  the  workstation’s  throughput  over  the  entire  period. 
The  transient  behavior  of  throughput  in  queuing  systems  with  finite  buffers  cannot  be  described 

o 

analytically,  although  the  steady-state  behavior  can  be  described  in  closed  form.  As  a  result,  the 
operations  management  literature  typically  uses  simulations  to  analyze  the  transient  behavior  of 
queuing  models  with  finite  buffers  and  closed-form  analysis  to  examine  steady-state  behavior.  In 
contrast,  agency  models  have  emphasized  rewarding  transient  behavior,  but  simplify  the  produc¬ 
tion  process  (e.g.,  using  the  TEN  model)  to  achieve  closed-form  solutions.^  In  order  to  incorporate 
stochastic  intermediate  unit  arrival  rates,  stochastic  processing  rates,  and  finite  inventory  buffers  in 
our  analysis,  we  focus  on  the  problem’s  steady-state.  That  is,  we  assume  that  the  principal  is 
interested  in  maximizing  her  steady-state  expected  profit  and  that  the  agent  selects  his  effort  to 
maximize  his  steady-state  expected  utility.  This  is  a  reasonable  approximation  if,  as  we  assume, 
the  production  process  reaches  steady-state  fairly  quickly.  Our  assumption  that  the  agent’s  effort  is 
expended  in  setting  up  the  workstation  before  production  begins,  rather  than  managing  the  pro¬ 
duction  process  as  it  evolves,  is  also  consistent  with  our  emphasis  on  the  steady-state  behavior  of 
the  workstation.  An  example  of  our  modeled  setting  is  a  robotic  line  that  welds  automotive 
chassis.  The  agent’s  main  task  is  to  program  the  spot-welding  robots.  This  task  involves  identify¬ 
ing  the  optimal  position  of  the  chassis,  identifying  the  optimal  position  of  the  robotic  arms,  and 
making  sure  that  robot  paths  do  not  cross  each  other  or  the  chassis  itself  (so  that  nothing  is 
damaged  in  the  process)  while  ensuring  that  welding  is  done  in  the  least  possible  number  of  steps. 
All  of  this  planning  and  programming  setup  is  done  once  by  the  agent  at  the  beginning  of  the  shift 
and  thereafter,  as  long  as  the  workstation  is  in  control,  the  agent  does  not  need  to  intervene. 
Clearly,  the  actions  of  the  agent  in  programming  the  robots  affect  the  time  it  takes  to  weld  the 
chassis  (on  average),  but  the  agent  does  not  reprogram  the  robot  once  production  has  begun. 


^  The  M/M/l/b  model  is  the  most  widely  accepted  modeling  approach  for  production  systems  with  finite  buffers  (Berkley 
1992;  Groenevelt  1993;  Balsamo  et  al.  2001). 

*  The  process  attains  steady-state  when  the  probability  distributions  over  inventory  levels  and  throughput  are  no  longer 
functions  of  the  starting  conditions,  and  no  longer  vary  over  time.  That  is,  while  the  realized  inventory  level  and 
throughput  at  any  moment  are  still  uncertain,  the  probability  distributions  that  describe  them  do  not  vary  over  time. 

^  Queuing  models  similar  to  ours  have  been  used  to  analyze  the  role  of  cost  allocation  in  resolving  congestion  problems 
(Radhakrishnan  and  Balachandran  1995,  2004;  Balachandran  and  Radhakrishnan  1996). 

Lu  et  al.  (2009)  restrict  their  analyses  to  steady-state  for  the  same  reason. 
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The  workstation’s  realized  output  per  unit  of  time  or  throughput,  K,  is  stochastic,  though  the 
mean  steady-state  throughput,  K,  can  be  represented  as  (Hopp  and  Spearman  2001): 


K{b,r)  =  \{\ -Pb)  =  \ 


1  -(r(l 


(1) 


where  p^,  is  the  steady-state  probability  of  the  buffer  being  full;  i.e.,  the  steady-state  probability  of 
blocking.  The  workstation’s  steady-state  mean  throughput  is  thus  equal  to  the  unblocked  mean 
arrival  rate  of  intermediate  units,  X,  times  the  steady-state  probability  that  the  buffer  can  accept  an 
additional  unit  when  it  arrives,  1  -  pi,.  In  equilibrium,  the  workstation’s  mean  steady-state  through¬ 
put  is  increasing  in:  r,  the  agent’s  effort;  h,  the  level  of  resources  allocated  to  the  agent;  b,  the  size 
of  the  input  buffer;  and  X,  the  mean  arrival  rate  of  the  intermediate  units.  Equation  (1)  illustrates 
that  the  agent’s  effort  affects  throughput  via  its  effect  on  the  probability  of  blocking,  or  equiva¬ 
lently,  the  probability  of  starving.^* 

Note  that  the  mean  arrival  rate  of  incoming  intermediate  units  and  therefore  the  maximum 
mean  throughput  is  X.  However,  if  the  principal  induced  the  agent  to  choose  r  =  \/(l  +  h)  so  that 
the  workstation  was  set  to  process  the  intermediate  units  at  the  same  average  rate  at  which  they 
arrived,  then  the  instantaneous  probability  of  starving  the  workstation  would  be  >  0, 

which  in  turn  would  limit  the  workstation’s  mean  steady-state  throughput  to  A'  =  ^  <  X.  That  is, 
inducing  the  agent  to  set  the  workstation  to  process  the  intermediate  units  at  the  same  average  rate 
at  which  they  arrive  would  significantly  limit  steady-state  throughput  and  induce  a  non-trivial 
probability  of  starving.  Thus,  the  principal  has  a  natural  incentive  to  induce  the  agent  to  set  the 
workstation  to  process  units  at  an  average  rate  that  is  strictly  greater  than  the  average  rate  at  which 
the  intermediate  units  arrive,  as  commonly  assumed  in  the  operations  management  literature 
(Hopp  and  Spearman  2001). 

Assume  that  the  principal  earns  R  dollars  per  unit  of  throughput.  The  principal  chooses  the  set 
of  decision  variables,  {w,WQ,b),  and  a  target  level  of  agent  setup  effort,  r*,  which  maximize  her 
steady-state  objective  function: 


^(^Hw,WQ,b,r*)RK{b,r*)  -  wK{b,r^)  -WQ-cb-h. 

Given  (w,vv'o,Z?),  the  agent  chooses  his  setup  effort,  r,  to  maximize  his  steady-state  objective 
function  max^  wK{b,r)  -t-  wq  -  ar.  Varying  the  size  of  the  buffer,  b,  has  three  effects  on  the  princi¬ 
pal’s  problem:  (1)  it  incurs  a  carrying  cost  of  c  per  unit  of  buffer,  (2)  it  affects  the  principal’s 
expected  revenue  through  its  direct  effect  on  steady-state  throughput,  K{b,r),  and  (3)  it  indirectly 
affects  the  agent’s  effort  incentive  through  its  effect  on  the  expected  steady-state  throughput, 
K{b,r).  Effort  and  buffers  are  substitutes  in  throughput  and,  therefore,  the  principal’s  choice  of  b 
affects  the  wage,  w,  required  to  induce  the  agent  to  work.  Thus,  the  latter  two  effects  of  the 
principal’s  choice  of  b  directly  influence  the  effort  that  the  principal  wants  to  induce  the  agent  to 
take  and  the  cost  of  inducing  that  effort. 

As  noted  earlier,  we  are  interested  in  studying  the  incentive  implications  of  the  principal’s 
choice  of  the  buffer,  b.  To  isolate  that  effect,  we  will  analyze  the  problem  in  which  the  principal 
wants  to  minimize  her  expected  steady-state  cost  of  inducing  the  agent  to  take  a  particular  effort, 
r*.  This  allows  us  to  hold  constant  the  principal’s  desire  to  choose  b  so  as  to  directly  affect 
revenue  and  the  carrying  cost  associated  with  buffers,  and  instead  concentrate  on  her  desire  to 


The  two  probabilities  are  directly  related  in  that  the  probability  of  starving  is  1  -  ;:(]^(1  -p^,  allowing  (1)  to  be 
rewritten  as  K{b,r)  =  r{  \  +h){  \  -pX  Note  that  a  change  in  the  buffer  affects  the  probability  of  starving  in  the  same 
direction  (i.e.,  increase/decrease)  as  its  effect  on  the  probability  of  blocking. 
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choose  b  to  affect  the  cost  of  inducing  effort  from  the  agent.  Thus,  we  take  a  Grossman  and  Hart 
(1983)  approach  to  the  principal’s  expected  profit  maximization  problem.  This  is  without  loss  of 
generality  because  any  incentive  effects  of  inventory  buffer  size  found  in  this  analysis  will  also 
appear  in  the  principal’s  full  profit-maximization  problem.  Consequently,  our  subsequent  analy¬ 
sis  is  analogous  to  the  first  step  in  a  Grossman  and  Hart  (1983)  analysis — minimizing  the  princi¬ 
pal’s  expected  steady-state  cost  of  inducing  effort  r*  from  the  agent. 

The  problem  can  be  written  as; 


^^^{w,Wfy,b,r*)'^K{b,h,r*)  +  WQ  +  h  (Program  1) 

s.t. 

wk-i-WQ-ar^  y  k  (LL) 

r*  G  arg  ma.xfwK{b,r)  -t  Wq  -  ar) .  (IC) 


We  assume  that  the  labor  market  is  such  that  the  principal  must  leave  the  agent  with  a  minimum 
level  of  utility  regardless  of  realized  throughput  (constraint  (LL)).^^  The  incentive  compatibility 
constraint,  (IC),  ensures  that  the  agent  will  select  the  principal’s  desired  level  of  effort  r*.  In  the 
First-Best  case,  in  which  the  agent’s  effort  is  not  subject  to  moral  hazard,  (IC)  can  be  ignored  and, 
given  that  the  minimum  possible  realized  throughput  is  ^  =  0,  the  limited  liability  constraint,  (LL), 
implies  that  wq  =  ar*  and  w  =  0.  Hence,  the  size  of  the  buffer  b  plays  no  role  in  the  agent’s 
compensation  in  the  First-Best  solution.  In  the  Second-Best  setting,  (LL)  will  again  bind,  implying 
that  wq  =  ar*.  However,  incentive  compatibility  now  requires  a  strictly  positive  incentive  wage,  w 
>  0.  Thus,  w  represents  the  agency  cost  per  unit  of  throughput,  and  it  is  the  behavior  of  w  in  Z?  that 
captures  the  incentive  effect  of  b  in  the  Second-Best  setting.  Note  that  while  the  principal  could 
infer  the  agent’s  effort  from  the  mean  steady-state  throughput,  only  the  realized  throughput,  k,  is 
observed,  upon  which  the  principal  cannot  perfectly  infer  the  agent’s  choice  of  effort. 

Lemma  1  characterizes  this  incentive  effect  and  guarantees  the  validity  of  the  First-Order 
Approach  to  the  agent’s  choice  of  effort: 

Lemma  1:  The  agent’s  problem  is  concave  in  r.  Further,  the  optimal  incentive  wage  is: 


w{b,r*)  = 


which  is  non-negative  and  increasing  in  the  principal’s  choice  of  r*. 

The  optimal  incentive  wage  is  given  by  the  agent’s  marginal  cost  of  effort  divided  by  the 
sensitivity  of  mean  steady- state  throughput  to  his  effort.  The  more  sensitive  the  performance 
metric  is  to  the  agent’s  effort  (i.e.,  the  greater  the  agent’s  marginal  productivity),  the  lower  the 
piece-rate  required  to  induce  the  agent  to  implement  the  obedient  effort  and  the  lower  the  agency 
cost  of  doing  so.  The  basic  idea  is  that  the  more  sensitive  throughput  is  to  variations  in  the  agent’s 
setup  effort,  the  more  likely  it  is  that  the  principal  can  detect  any  deviation  from  the  desired  setup 
effort,  r*;  therefore,  the  lower  the  incentive  wage  necessary  to  dissuade  the  agent  from  shirking. 

dK{b,r) 

Henceforth,  we  refer  to  the  sensitivity  of  throughput  to  the  agent’s  effort,  as  the  informa- 


Later  we  will  discuss  more  fully  how  our  results  would  manifest  in  the  principal’s  full  expected  profit-maximization 
problem. 

See  Demougin  and  Garvie  (1991)  for  a  similar  assumption.  Alternatively,  we  could  assume  that,  because  of  the  agent’s 
limited  liability,  the  principal  cannot  use  negative  payments.  Our  results  remain  unchanged  with  this  alternative  formu¬ 
lation.  The  only  effect  is  on  the  optimal  fixed  wage. 
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tiveness  of  throughput  with  respect  to  the  agent’s  effort. As  illustrated  by  (1),  the  agent’s  ability 
to  affect  throughput  is  entirely  driven  by  his  ability  to  affect  the  probability  of  blocking  and 
starving  via  his  choice  of  effort.  Given  that  the  wage  required  to  induce  the  agent  to  take  the 
desired  action  is  inversely  related  to  the  informativeness  of  throughput,  we  turn  our  attention  to 
characterizing  the  relation  between  the  principal’s  choice  of  the  buffer  size  b  and  the  informative¬ 
ness  of  throughput  with  respect  to  the  agent’s  action. 

III.  RESULTS 

In  what  follows,  we  will  sometimes  refer  to  the  mean  rate  at  which  the  workstation  processes 
the  intermediate  units  relative  to  their  mean  arrival  rate,  r(l  -I-  h)l\,  as  p,  the  agent’s  relative  effort 
or  the  production  schedule.  We  parameterize  our  discussion  with  respect  to  the  agent’s  relative 
effort  since  the  amount  of  help  resources,  h,  and  the  mean  arrival  rate  of  the  intermediate  units,  X, 
only  affect  the  principal’s  incentive  problem  as  they  relate  to  the  agent’s  effort  via  p. 

As  noted  earlier,  the  principal  may  choose  to  avoid  excessive  blocking  and  starving  by 
inducing  the  agent  to  set  the  workstation  to  process  intermediate  units  at  a  rate  exceeding  that  at 
which  they  arrive.  Therefore,  consider  first  the  case  in  which  p  >  1.  In  the  agency  model,  the 
principal  compensates  the  agent  as  if  she  were  inferring  the  agent’s  effort  from  the  performance 
metric  (Holmstrom  1979).  But  recall  that  throughput  is  subject  to  two  sources  of  uncertainty  that 
make  inferring  the  agent’s  setup  effort  from  it  difficult:  the  stochastic  arrival  rate  of  the  interme¬ 
diate  units  and  the  workstation’s  stochastic  processing  rate.  If  the  buffer  size,  b,  is  sufficiently 
small,  then,  in  expectation,  the  probability  of  starving  is  large,  resulting  in  relatively  few  units  per 
period  available  for  the  workstation  to  process  into  throughput  (i.e.,  a  small  potential  sample  size). 
Therefore,  when  p  >  1  and  the  buffer  is  small,  throughput  is  relatively  uninformative  about  the 
agent’s  effort.  By  increasing  the  buffer,  b,  the  principal  can  increase  the  expected  number  of  units 
available  to  the  workstation,  thereby  making  throughput  less  sensitive  to  the  stochastic  arrivals  of 
the  intermediate  units  and  more  sensitive  to  the  agent’s  setup  effort.  Thus,  when  b  is  small,  the 
principal  can  improve  the  informativeness  of  throughput,  and  reduce  the  compensation  weight  on 
throughput,  by  increasing  b. 

The  above  intuition  is  based  on  a  small  increase  in  b  starting  at  a  buffer  that,  given  p  >  1  (i.e., 
r{\+h)  >  X),  is  sufficiently  small  to  severely  restrict  throughput  due  to  blocking  and  starving. 
However,  the  effect  of  increasing  the  buffer  on  the  informativeness  of  throughput  differs  once  the 
buffer  is  already  large  enough  so  that  blocking  and  starving  are  both  unlikely.  Increasing  buffer 
size  beyond  some  point  must  result  in  a  decrease  in  the  informativeness  of  throughput  with  respect 
to  the  agent’s  effort  choice.  To  see  this,  note  that  in  the  extreme,  when  the  buffer  is  infinite,  there 
is  zero  probability  of  blocking  and,  given  p  >  1,  the  mean  steady-state  throughput  isK  =  \,  which 
is  independent  of  the  agent’s  effort  in  excess  of  X.  Thus,  when  there  is  no  blocking,  throughput  is 
totally  insensitive  to  and  uninformative  about  any  agent  effort  beyond  the  cut-off  level  X.  This 
implies  that  some  probability  of  blocking  is  necessary  for  the  throughput  to  be  informative  with 
respect  to  the  agent’s  effort.  Therefore,  increasing  b  beyond  some  point  decreases  the  relative 
amount  of  information  about  the  agent’s  setup  effort.  So  when  b  is  relatively  large,  the  principal 
can  increase  the  informativeness  of  throughput  by  decreasing  b. 

The  intuition  above  suggests  that  increasing  the  size  of  the  buffer  may  cause  the  informative¬ 
ness  of  throughput  to  initially  increase  and  later  decrease.  The  following  proposition  confirms  our 
intuition  and  guarantees  that  once  the  informativeness  of  throughput  is  decreasing  in  the  size  of 
the  buffer,  a  larger  buffer  will  never  make  throughput  more  informative.’^ 


Our  notion  of  informativeness  is  closely  related  to  that  in  Laux  (2001)  and  Laffont  and  Martimort  (2002,  156). 
All  proofs  are  in  Appendix  B. 
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Proposition  1:  For  any  fixed  p  >  1,  the  informativeness  of  throughput  with  respect  to  the 
agent’s  choice  of  effort,  r,  is  either  decreasing  in  the  size  of  the  buffer,  b,  or 
initially  increasing  in  b  and  then  decreasing  in  b. 


Proposition  1  supports  the  preceding  intuition  by  establishing  that  when  buffers  are  relatively 
small  (large),  the  informativeness  of  throughput  is  increasing  (decreasing)  in  buffer  size.  For 


example,  consider  the  case  in  which  =  0,  X,  =  1,  and 

(  3K{b,r)  \  . 

V  r=,*/  1 


=  1.1.  The  informativeness  of 


throughput 


dr 


is  0.226151  for  b  =  1,  0.292075  for  b  =  2,  and  0.0286627  for  b  =  50. 


Thus,  when  h  =  0,\  =  1,  and  r*  =  1.1,  informativeness  is  initially  increasing  then  decreasing  in 
the  buffer  size.  The  proposition  also  establishes  the  possibility  that  the  informativeness  of  through¬ 
put  is  monotonically  decreasing  in  buffer  size.  This  case  arises  because  no  matter  how  small  the 
buffer  size  {b>  1),  there  always  exists  a  sufficiently  large  level  of  setup  effort  that  would  cause  the 
probability  of  blocking  and  starving  to  be  very  small,  in  which  case  the  informativeness  of 
throughput  would  always  decrease  in  buffer  size.  For  example,  consider  a  case  where  the  work¬ 
station  is  set  up  to  process  intermediate  units  twice  as  fast  as  they  arrive:  r*  =  2  and  h  =  0,  X  = 
1.  The  informativeness  of  throughput  is  0.111111  for  b  =  I,  0.102041  for  b  =  2,  and  1.088 
X  10“^"^  for  b  =  50;  i.e.,  the  informativeness  of  throughput  is  always  decreasing  in  buffer  size. 

The  intuition  for  Proposition  1  suggests  that,  when  a  relatively  large  buffer  causes  the  prob¬ 
ability  of  blocking  to  be  very  small,  the  principal  would  want  to  decrease  the  buffer,  while  when 
a  relatively  small  buffer  causes  the  probability  of  blocking  to  be  very  large,  the  principal  would 
want  to  increase  the  buffer.  Since  the  probability  of  blocking  is  always  decreasing  in  the  agent’s 
relative  effort,  p,  this  intuition  suggests  that  the  optimal  buffer  size  should  also  be  decreasing  in  p. 
This  logic  is  formalized  in  Proposition  2: 


Proposition  2:  For  p  >  1,  the  buffer  size,  b,  which  maximizes  the  informativeness  of 
throughput  with  respect  to  the  agent’s  choice  of  effort,  is  decreasing  in  the 
agent’s  relative  effort,  p. 


Lemma  1  and  Proposition  1  lead  to  the  empirically  testable  result  that  the  incentive  wage  rate 
(i.e.,  the  marginal  cost  of  effort  divided  by  the  informativeness  of  throughput)  should  be  either 
increasing  or  U-shaped  in  buffer  size.  Combining  Lemma  1  with  Proposition  2  implies  that  one 
would  expect  to  see  smaller  buffers  in  settings  with  relatively  aggressive  production  schedules; 
i.e.,  p  »  1. 

To  this  point,  we  have  only  addressed  the  case  in  which  the  principal  wants  the  agent  to  set 
the  workstation  to  operate,  on  average,  at  a  faster  rate  than  intermediate  goods  arrive  (p  >  1). 
When  p  <  1,  on  average,  the  workstation  processes  the  intermediate  units  at  a  slower  rate  than 
they  arrive.  Unlike  the  case  with  p  >  1,  when  p  <  1,  even  as  the  buffer  size  tends  to  infinity,  the 
marginal  effect  of  the  agent’s  effort  on  the  probability  of  blocking  does  not  go  to  zero,  because  no 
matter  how  large  the  buffer,  the  principal  can  never  eliminate  the  threat  of  blocking.  In  this  setting, 
the  principal  continues  to  increase  the  informativeness  of  throughput  by  increasing  b,  as  this 
lowers  the  probability  of  blocking,  which  in  turn  allows  the  agent  a  greater  opportunity  to  influ¬ 
ence  the  probability  of  blocking  and,  consequently,  throughput.  This  logic  is  summarized  in 
Corollary  1: 

Corollary  1:  When  p  <  1,  the  informativeness  of  throughput,  K,  with  respect  to  the  agent’s 
effort,  r,  is  everywhere  increasing  in  the  buffer  size,  b. 

The  above  corollary  suggests  that  minimizing  buffers,  as  advocated  by  the  JIT  literature 
(Schonberger  1986,  67)  exacerbates  the  incentive  problem  when  p  <  1.  However,  when  p  >  1, 
minimal  buffers  {b  =  \)  may  be  optimal  from  an  incentive  perspective  under  certain  conditions. 
Because  buffer  sizes  are  integer-valued,  when  p  >  1,  it  is  difficult  to  characterize  the  optimal 
buffer  in  closed  form,  though  Proposition  3  establishes  an  interval  in  which  the  optimal  b  must  lie. 
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Proposition  3:  When  p  >  1 : 

a.  the  optimal  b  lies  in  the  interval:  max(^^,2)  -  1  <  Z?  <  max(^,l). 

b.  p  >  2  is  a  sufficient  condition  for  minimal  buffers  (b  =  1)  to  be  optimal. 

The  first  part  of  Proposition  3  indicates  that  the  common  objective  in  the  JIT  literature  of 
minimizing  buffers  would  incur  unnecessary  agency  costs  if  the  agent  is  not  being  asked  to  work 
relatively  hard;  i.e.,  1  <  p  <  2.  More  generally,  our  results  point  to  the  complementarities  that  are 
created  between  the  choice  of  buffer  size  and  the  choice  of  the  agent’s  relative  effort,  p,  as  a  result 
of  the  effect  of  both  on  the  informativeness  of  throughput  as  the  performance  metric.'^  As  we  have 
just  shown,  changes  in  buffer  size  without  associated  changes  in  the  agent’s  desired  relative  effort 
can  lead  to  unintended  agency  costs. 

Our  analysis  of  the  effect  of  buffer  size  on  the  informativeness  of  throughput  is  based  on  the 
role  of  buffers  in  blocking  and  starving  the  production  process,  with  the  other  organizational 
design  variables  (r*,k,/z)  held  fixed.  However,  we  can  also  view  these  design  variables  in  terms  of 
their  potential  effects  on  the  informativeness  of  throughput  via  their  effects  on  the  probabilities  of 
blocking  and  starving.  Situations  can  exist  in  which  it  may  be  more  efficient  to  manipulate  the 
informativeness  of  throughput  by  varying  these  other  design  choices  than  by  varying  the  buffer 
size.  For  example,  increasing  buffer  size  has  space  utilization  implications  that  the  other  variables 
do  not;  i.e.,  one  cannot  increase  buffer  size  if  there  is  no  additional  space  available,  whereas  it  may 
be  possible  to  upgrade  the  workstation  machinery  (increase  h)  or  speed  up/slow  down  the  up¬ 
stream  workstations  (vary  X).  Similarly,  buffer  size  increments  come  in  discrete  steps  while  the 
other  design  variables  may  allow  for  finer  adjustments.  This  motivates  us  to  consider  the  effect  of 
these  other  design  choices  on  the  informativeness  of  throughput  while  holding  the  buffer  size 
fixed,  as  considered  next. 

Proposition  4:  Holding  the  buffer  fixed,  the  informativeness  of  throughput  with  respect  to 
the  agent’s  setup  effort  is: 

a.  increasing  in  the  mean  arrival  rate  of  intermediate  units,  X, 

b.  decreasing  in  the  principal’s  desired  level  of  agent  effort,  r*,  and 

c.  single-peaked  in  the  level  of  help  resources  allocated  to  the  agent,  h. 

Increasing  the  arrival  rate,  X,  decreases  the  likelihood  of  starving,  which  in  turn  reduces  the 
effect  of  the  variation  in  arrivals  on  throughput,  making  throughput  more  informative  about  the 
other  source  of  variation,  the  agent’s  effort,  r.  The  same  logic  holds  for  decreases  in  r.  The  more 
interesting  result  concerns  productivity-enhancing  investments,  h.  Proposition  4  implies  that  even 
ignoring  out-of-pocket  costs,  there  are  two  counter-acting  effects  to  increasing  productivity¬ 
enhancing  investments  for  the  agent.  The  direct  positive  effect  is  that  such  investments  make  the 
agent  more  productive,  thereby  increasing  his  ability  to  influence  throughput  via  blocking  and 
starving.  The  indirect,  negative  effect  is  that  such  investments  may  render  the  agent  so  productive, 
that  holding  effort  fixed,  the  threat  of  blocking  tends  to  zero  as  h  increases,  even  if  the  agent’s 
effort  is  only  negligible.  As  (1)  indicates,  throughput  only  reflects  effort  to  the  extent  that  the 
agent’s  effort  affects  the  probability  of  blocking.  Beyond  some  point,  increases  in  h  make  through¬ 
put  less  sensitive  to  the  agent’s  effort  and  therefore  less  informative  about  the  agent’s  effort, 
making  it  more  costly  to  motivate  him.  In  a  sense,  resources,  h,  play  the  same  role  as  buffer  size 
in  our  earlier  analysis.  Both  h  and  b  can  be  looked  at  as  productivity-enhancing  investments  and 


Thus,  our  results  are  consistent  with  the  complementarities  literature,  for  example,  Milgrom  and  Roberts  (1990,  1992). 
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both  can  be  the  source  of  additional  agency  costs  as  a  result  of  their  effects  on  the  informativeness 
of  throughput  as  a  performance  metric. 

One  can  interpret  our  results  as  highlighting  a  value  to  creating  bottlenecks,  either  by  choos¬ 
ing  a  finite  buffer  size  or  a  finite  amount  of  productivity-enhancing  investments.  Because  we  have 
abstracted  from  both  the  out-of-pocket  costs  associated  with  inventory  buffers,  (c  >  0)  and  the 
effect  of  the  principal’s  choice  of  buffer  size  on  the  expected  throughput  and  revenue,  these  results 
arise  solely  for  incentive  reasons.  Allowing  for  these  two  additional  effects  will  mitigate  the  force 
of  our  results;  however,  because  all  of  our  findings  are  obtained  using  arbitrary  parameter  values, 
the  incentive  issues  identified  would  persist.  Appendix  A  provides  a  numerical  example  where  we 
solve  the  principal’s  more  general,  expected  steady-state,  profit-maximizing  problem.  The  solution 
illustrates  that  the  incentive  effect  of  buffer  size  identified  will  continue  to  hold  in  this  more 
general  setting. 


IV.  CONCLUSION 

The  present  study  analyzes  the  relation  between  the  choice  of  the  firm’s  production  line 
buffers  and  optimal  incentives.  The  operations  management  literature  has  focused  on  the  impor¬ 
tance  of  minimizing  buffers  and  reducing  inventory.  A  major  insight  from  the  present  work  is  that 
we  identify  an  additional  incentive  value  associated  with  the  firm’s  choice  of  buffers.  We  show 
that  under  reasonable  conditions,  the  informativeness  of  throughput  as  a  performance  metric  is 
single-peaked  in  both  buffer  size  and  production-enhancing  resources,  such  that  it  is  optimal  to 
induce  non-zero,  albeit  bounded,  probabilities  of  blocking  and  starving.  We  show  further  that  the 
information-maximizing  buffer  is  decreasing  in  the  effort  that  the  principal  wants  to  induce  from 
the  agent.  Therefore,  we  would  expect  to  observe  smaller  buffer  sizes  in  settings  with  more 
aggressive  production  schedules  (i.e.,  p  :»  1).  Our  findings  thus  emphasize  the  incentive  comple¬ 
mentarity  between  buffer  size  and  production  schedules,  and  provide  a  possible  explanation  for  the 
mixed  empirical  evidence  regarding  the  profitability  of  work-in-process  inventory  reductions  (Na- 
gar  et  al.  2009). 

This  research  spans  two  different  disciplines,  managerial  accounting  and  operations  manage¬ 
ment,  each  with  its  own  nuances  and  paradigms.  It  represents  an  attempt  to  bridge  these  two 
literatures  by  weaving  together  some  of  the  basic  models  of  each.  To  do  so,  we  made  a  number  of 
modeling  assumptions.  We  ignored  some  of  the  factors  that  are  identified  in  the  operations  man¬ 
agement  literature  as  influencing  the  inventory  buffer  decision  in  order  to  focus  on  incentive  issues 
and  the  informational  effect  of  the  buffer  size  decision.  For  example,  some  of  the  claimed  benefits 
of  small  buffers  under  JIT  that  we  have  ignored  include  the  increased  ability  to  identify  production 
problems  early,  shorter  lead-times,  and  faster  responsiveness  to  changes  in  market  conditions. 
However,  we  are  confident  that  the  incentive  consequences  of  inventory  buffers  brought  to  light  by 
our  analysis  will  be  present  notwithstanding  these  other  factors. 

A  limitation  of  our  model  is  our  restricted  contracting  space.  While  we  limited  our  analysis  to 
linear  contracts  for  analytical  reasons,  the  crucial  assumption  underlying  our  results  is  the  mono¬ 
tonicity  of  compensation  in  the  agent’s  marginal  expected  productivity,  which  is  either  assumed  or 
shown  to  be  optimal  in  the  majority  of  agency  models.  Our  analysis  also  assumes  that  the  agent  is 
risk-neutral.  However,  recall  that  the  principal  chooses  h  based  on  the  information  content  of 
realized  throughput  with  respect  to  the  agent’s  effort.  Therefore,  to  the  extent  that  a  risk-averse 
agent  must  be  compensated  for  risk,  intuition  would  suggest  that  our  results  would  be  strengthened 
in  the  presence  of  risk-aversion.  However,  introducing  risk-averse  agents  imposes  several  analyti¬ 
cal  complications,,  and  we  leave  the  resolution  of  such  issues  to  future  research. 
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Another  simplifying  assumption  is  that  the  agent’s  compensation  is  based  on  a  single  perfor¬ 
mance  metric.  Clearly,  other  performance  metrics  are  possible,  including  realized  inventory 
levels.  However,  the  mean  steady-state  inventory  level  does  not  offer  any  information  above  and 
beyond  the  mean  steady-state  throughput,  K,  since  both  measures  are  functions  of  exactly  the  same 
variables.  On  the  other  hand,  if  we  allowed  for  continuous  monitoring,  as  with  radio  frequency 
identification  (RFID),  then  we  could  isolate  the  agent  from  the  stochasticity  of  the  arrival  process 
and  mitigate  the  agent’s  moral  hazard  problem.*®  In  this  case,  our  problem  reduces  to  the  tradi¬ 
tional  moral-hazard  setting  where  the  agent  always  has  work  to  process,  and  the  principal  infers 
the  agent’s  effort  via  stochastic  output.  It  would  therefore  be  interesting  to  empirically  analyze  the 
incentive  effect  of  buffers  both  before  and  after  the  installation  of  an  RFID  monitoring  application 
to  see  the  extent  to  which  they  are  used  as  substitutes  to  buffers  for  filtering  noise  in  available 
performance  measures.  We  leave  to  future  research  such  inquiries,  as  compensation  data  surround¬ 
ing  RFID  installations  are  not  yet  readily  available. 

APPENDIX  A 

Our  results  are  based  on  analyzing  a  model  in  which  the  principal  minimizes  her  cost  of 
inducing  the  agent  to  exert  a  particular  setup  effort.  In  particular,  our  analysis  of  the  optimal  buffer 
size  ignored  both  the  effect  of  buffers  on  the  out-of-pocket  cost  of  maintaining  inventory  and, 
more  importantly,  the  direct  revenue  effect  of  the  choice  of  buffer  size.  In  this  appendix  we  show 
that  the  insights  and  qualitative  results  derived  from  our  analysis  continue  to  hold  when  we  take 
into  consideration  both  of  these  additional  effects.  To  do  so,  we  provide  a  numerical  example  in 
which  the  principal’s  objective  is  to  maximize  her  steady-state  expected  profit.  As  before,  her 
choice  variables  are  buffer  size,  b,  and  the  compensation  contract,  (wo,w),  determining  the  desired 
level  of  agent  effort,  r*,  which  is  subject  to  moral  hazard.  Including  the  explicit  revenue  and 
maintenance  cost  effects  associated  with  the  choice  of  buffer  size,  the  principal’s  problem  is  now: 

-  wK{b,r'^)  -WQ-cb 
s.t. 


wK  -I-  Wq  -  ar*  >  0  V  AT  (LL) 

r*  e  arg  ma.xXwK{b,r)  +  Wq  -  ar)  (IC) 

where  we  again  assume  a  linear  demand  schedule  for  throughput.  We  first  solve  the  above  problem 
to  find  the  compensation-buffer  size-effort  quadruple  {w,WQ,b^^,r*).  We  then  solve  the  same 
problem  where  we  again  induce  the  agent  to  choose  r*,  but  we  assume  that  the  agent’s  effort  is 
contractible  and  therefore  not  subject  to  moral  hazard.  Thus,  in  this  second  problem  we  are  only 
solving  for  the  optimal  compensation  and  buffer  size.  We  label  the  optimal  buffer  size  in  this 
second,  no  moral  hazard  problem,  Comparing  the  two  optimal  buffer  sizes  allows  us  to 

isolate  the  incentive  effects  of  the  principal’s  choice  of  buffer  size  within  this  more  general 
profit-maximizing  setting.  The  example  demonstrates  that  our  findings  continue  to  hold.  In  par¬ 
ticular,  the  incentive  effect  of  buffers  continues  to  exist  and  can  bias  the  principal’s  profit- 
maximizing  buffer  away  from  the  no  moral  hazard  solution. 


We  thank  a  reviewer  for  raising  this  issue. 

RFID  involves  “an  integrated  circuit  with  an  antenna,  known  as  a  ‘tag,’  attached  to  a  ...product.  Product  information  as 
well  as  other  relevant  information  can  be  stored  in  the  tag  ...  Using  wireless  technologies,  readers  can  be  set  up  to  read 
the  information  on  the  tags”  (Lee  and  Ozer  2007,  40). 
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FIGURE  2 

Second-Best  Profit  and  Informativeness 
r  =  2.11 
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Second-Best  Profit  and  Informativeness,  r  =  2.11 
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For ./?  =  3  50,  c  =  1 0,  a  =  1 ,  X.  =  1 ,  the  solution  to  the  profit-maximizing  problem  with  moral  hazard  sets  the  agent’s 
effort  at  r*  =  p  =  2.1 1  and  the  optimal  buffer  size  at  =  3,  though  a  buffer  of  size  =  \  maximizes  the  infor¬ 
mativeness  of  throughput  when  p  =  r*  =  2. 1 1 .  For  the  no  moral  hazard  case  in  which  the  principal  chooses  p  =  r*  = 
2. 1 1 ,  her  profit  (not  drawn)  is  maximized  at  the  buffer  size  b^’^^  =  4. 


The  example  assumes  that  the  principal’s  revenue  per  unit  of  throughput  is  R  dollars.  In  our 
example  we  set  R  =  350,  c  =  10,  a  =  l,/z  =  0  and  X  =  1.  In  the  moral  hazard  problem,  the 
principal  finds  it  optimal  to  induce  the  agent  to  work  relatively  hard  (p  =  r*  =  2.11)  and  to  set  the 
buffer  size  at  =  3. 

Figure  2  plots  both  the  expected  steady-state  profits  for  the  moral  hazard  case  and  the  infor¬ 
mativeness  of  throughput,  while  fixing  the  induced  effort  at  r*  =  2.11  and  varying  the  buffer  size. 
Consistent  with  Proposition  3  and  the  fact  that  p  =  r*  =  2. 1 1  >  2,  informativeness  is  everywhere 
decreasing  in  the  size  of  the  buffer,  implying  that  the  optimal  buffer  from  an  incentive  perspective, 
which  we  label  b^,  is  equal  to  1.  However,  unlike  in  Proposition  3,  the  optimal  buffer  size  is 
f^MH  _  2  jjjg  reason  for  this  difference  is  that  we  are  now  trading  off  the  revenue  and  holding  cost 
effects  of  buffer  size  (the  smaller  the  buffer  size,  the  smaller  the  expected  revenue  and  the  lower 
the  holding  costs)  against  the  agency  cost  (the  smaller  the  buffer  size,  the  more  informative  is 
throughput  and,  hence,  the  smaller  the  agency  cost).  The  logic  of  Proposition  3,  however,  indicates 
that  when  r*  =  2.11,  the  optimal  buffer  size  in  the  moral  hazard  case  should  be  smaller  than  in  the 
no  moral  hazard  case.  In  fact,  that  is  exactly  what  we  find:  the  profit-maximizing  buffer  size  in  the 
no  moral  hazard  case  is  b^^^  =  4,  as  indicated  in  Figure  2. 


APPENDIX  B 


Lemma  0 

The  function  f  e  C\a,b^  has  a  root  of  multiplicity  ^  at  r  if  and  only  if: 
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0  =f{r)  =f{r)  =fir)  =  ...  but  /«(r)  ^  0. 

Proof  of  Lemma  0 

See  Theorem  3.20  in  Vinberg  (2001)  and  Corollary  3.21  in  Artin  (1991,  508).  ■ 

Proof  of  Lemma  1 

The  agent’s  utility  is  given  by  m  =  wK{b,r)  +  Wq  ~  Hence: 

du  dK 

—  =  w —  -  a 
dr  dr 


d^u  d^K 
dr^  dr^ 

Substituting  the  functional  form  of  K  from  (1)  yields: 

=  (1  +  hy'{w){b  +  l)(X/r(l  + 

{jb{r{\  +h)-  \){\{\lr{\  +  h)Y  +  r(l  +  /;))  +  2r(l  +  h)\{{\lr{\  +  h)f  -  1)) 

{\{\lr{\+h)f-r{\+h)f 

and  the  denominator  is  non-positive  if  and  only  if  r(l  +  h)  >  \.  Further, 

(h')(& -I- 1)(\ /  r(l -I- /z))^"^^  >  0.  Hence,  if  we  can  show  that 
(Z?(r(l  + /z)  -  X)(\(X  /  r(  1 -I- /z))^ -1- r(l -I- /z)) -I- 2r(l  + /z)X((X  /  r(  1 -I- /z))^  -  1))  is  non-negative  if 
and  only  if  r(l  +  h)  >  X,  we  will  then  have  established  that  0  <  0. 

To  this  end,  we  can  re-write  the  expression  as: 

(K1  +  h)r\b{r{\  +  +  (r(l  -t  h)y^\-  b\  -  2X)  +  r(l  -t  +  b)- 

labeling  the  bracketed  expression,  N.  According  to  Descartes’  rule,  N  can  have  at  most  three  roots 
in  r.  Using  Lemma  0,  note  that: 


Wl  X  = 

\+h 


m 

dr 


dr^ 

'l+h 


\ 

''\+h 


=  0<b{l+  h)\‘’-'{2  +  3b  +  b^)  = 


dr^ 


\ 

'\+h 


which  shows  that  all  three  roots  are  at  r  =  X/(l  -I-  h).  Finally,  since  the  third  derivative  is  positive, 
the  Extremum  Test  assures  that  N  has  a  saddle  point  at  r  =  X/(l  -I-  /z);‘^  therefore  N  alternates  in 
sign  as  r  crosses  X/(l  -f  h).  However,  since  N  is  negative  at  r(l  +  h)  =  Q,N  must  be  non-negative 
for  r(l  +  h)  >\>  0. 

The  constraint  (IC)  can  be  expressed  as: 


dr 


\  -  {r{\  +  h)l\)-^ 
'^1  -(r(l  +/z)/X)-(^+’^ 


\-ar 


=  0, 


implying  that: 


For  an  explanation  of  the  Extremum  Test, 


see  http://mathworld.wolfram.com/ExtremumTest.html. 
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„  ,  ldK{b,r)Y  a{{r{\+h)l\f^^-\f 

w(b,r)  =  a\ -  = - ; - 

\  dr  )  {\+h){l-{r{\+h)l\)\l-b{\-r{\+h)l\))y 

which  is  both  positive  and  increasing  in  r. 


Proof  of  Proposition  1 

Because  we  have  defined  informativeness  as  a  constant  times  the  inverse  of  the  incentive 
wage,  it  suffices  to  show  that  the  incentive  wage  is  either  U-shaped  in  buffer  size,  or  increasing  in 
b.  In  particular,  we  will  show  that  if  the  wage  is  increasing  in  b,  at  say  b,  then  the  wage  continues 

to  increase  for  all  b>  b.  Then  we  will  show  that  there  exists  a  finite  b  beyond  which  the  wage 
must  be  increasing  in  b. 

In  general,  we  can  write: 


w{b,p) 


-  1)^ 

(l  +  h)ibp^*^-(b+l)p’^-l)’ 


(2) 


where,  henceforth  to  simplify,  we  omit  the  constant 


.  Hence: 


db 


(p  -  1)(P  -  1)  +  (Hp  -  1)(1  +  P  )  -  pip  -  2)  -  l)log(p) 

(bp’’^^  -  {b  +  l)p^  +  if 


(3) 


We  first  claim  that  the  denominator  of  (3)  is  strictly  positive.  To  see  this,  note  that 
bp^'^^  -  {b  +  l)p^  -H  1  has  a  root  at  p  =  1;  its  first  derivative  evaluated  at  p  =  1  is  zero  and  its 
second  derivative  is  strictly  positive  for  all  p  >  1,  implying  that  the  denominator  is  strictly  positive 
for  all  p  >  1.  Thus,  for  p  >  1,  the  sign  of  -^w{b,p)  is  determined  by  the  function  t{b,p),  where: 

tib,p)  =  -  (p  -  l)(p^'"'  -  1)  -1-  {b{p  -  1)(1  -t  p^+^)  -  p(p^  -  2)  -  l)log(p). 


We  will  show  that  if  t{b,p)  is  positive,  it  is  increasing  in  b,  implying  that  once  the  wage  is 
increasing  in  buffer  size  b,  it  will  continue  to  increase  in  b  for  larger  buffers.  To  this  end,  note  that 
the  first  term  in  t{b,p)  is  strictly  negative  for  p  >  1.  We  claim  that  the  second  term  in  t{b,p) 
is  strictly  positive  for  p  >  1.  To  see  this,  note  that  we  can  write  the  coefficient  of  log(p) 
as  -  b  +  {2  + b)p- {b  +  ■\-bp^'^^,  which  at  p  =  1,  is  both  equal  to  0  and  increasing 

in  p.  Moreover,  the  second  derivative  in  p  is  strictly  positive  for  p  >  1,  since 

b{l+b){p'^-\2p  +  b{p-\)-\)  >0. 

Thus,  if  t{b,p)  is  positive,  it  must  be  the  case  that: 

,  (p-i)(p“-i) 


To  show  that  if  t{b,p)  >  0,  t{b,p)  is  increasing  in  b,  we  difference  t{b,p)  in  b: 

t(b  +  hp)  -  t{b,p)  =  {p-  1)(-  (p  -  Dp'^"'  +  {\-{b  +  l)p'+^  +  {b+  l)p2+^)log(p)). 


(5) 


The  coefficient  of  log(p)  in  (5)  is  always  positive,  hence  we  can  use  the  bound  for  log(p)  from  (4) 
to  form  a  lower  bound  for  t{b  -f  l,p)  —  t{b,p): 


t{b+  l,p)-t(^,p)  =  (p-  1)(-  (p-  l)p^+*  +  (l  -  {b  +  +  {b  +  l)p^^^)logip)) 

(p l)(p^+i l) 


>  (p  -  1)  -  (p  -  +{l-(b+  +  {b+  l)p2^^)- 


Mp-i)(i+p"^')-p(p"-2)-u 


-  1  +  (3  +  2b) p^^^  -  (3  +  2b) p^^^  +  p^+^^ 
-l-b  +  p(2  +  b)-{b+  l)pi'"^  +  &p^+^  ■ 
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The  denominator  above  is  identical  to  that  in  (4),  which  we  have  already  shown  to  be  strictly 
positive  for  p  >  1 .  Thus,  we  need  only  sign  the  numerator,  which  has  at  most  three  roots  according 
to  Descartes’  rule.  However,  in  accordance  with  Lemma  0,  the  numerator  has  all  three  roots  at  p 
=  1,  and  since  it  has  a  strictly  positive  third  derivative  in  p,  the  expression  is  strictly  positive  for 
p  >  1,  implying  that  if  at  some  b  =  b  the  wage  is  increasing  in  buffer  size  {t{b,p)  >  0),  then  it  will 
continue  to  be  increasing  in  buffer  size  for  larger  buffers. 

To  prove  that  the  wage  is  eventually  increasing  in  buffer  size,  note  that  for  an  arbitrary,  finite 
p  >  1,  lim  t{b,p)  -  °o,  implying  that  there  exists  a  finite  b  for  which  t{b,p)  >0.  | 

fe— >00 

Proof  of  Proposition  2 

In  order  to  characterize  the  most  informative  buffer  size,  it  suffices  to  characterize  the  lowest 
incentive  compatible  wage,  w.  We  will  show  that  if  the  wage  is  minimized  at  b  when  p  =  p,  then 
the  wage  will  be  increasing  m  b  dd  b  when  p>  p;  thus  by  Proposition  1,  the  wage-minimizing 
buffer  for  p  >  p  is  less  than  or  equal  to  b. 

Suppose  the  wage  is  minimized  at  b  when  p  =  p.  Then,  by  definition: 


w{b,p)  -  w{b  -  \  ,p)  <0  <  w{b  +  l,p)  -  w{b,p). 


To  complete  the  proof,  we  will  prove  that  w{b  +  l,p)  -  w{b,p)  >  0  implies 
w(^  -I-  1  ,p)  -  w{b,p)  >  0  for  p>  p. 

To  this  end,  consider  the  difference  w{b  +  l,p)  —  w{b,p): 


,l+b-  3p{b  +  1)  +  3p^+^  +  p^+^  -  (2  +  b)p^^^^  +  bp'^^^’’ 
ip-  1)P  (1  _(^+  i)p^  +  ^pi+fc)(l  _(2  +  ^)pi+^  +  (Z,+  l)p2+i') 


(6) 


We  first  show  that  the  denominator  of  (6)  is  always  positive.  Using  Descartes’  rule,  the  first  term 
has  at  most  two  roots.  The  term  and  its  first  derivative  in  p  evaluated  at  p  =  1  are  both  equal  to 
0.  Therefore,  Lemma  0  implies  that  its  only  positive  root  is  p  =  1  and  since  its  leading  coefficient 
is  positive,  the  term  is  strictly  positive  for  p  >  1.  The  same  argument  shows  that  the  second  term 
in  the  denominator  is  strictly  positive  for  p  >  1. 

Thus,  the  sign  of  (6)  is  determined  by  the  numerator.  Applying  Descartes’  rule,  the  numerator 
has  at  most  four  positive  roots  in  p.  Because  the  numerator  and  both  its  first  and  second  derivatives 
in  p  are  equal  to  0  when  p  =  1,  Lemma  0  implies  that  p  =  1  is  a  root  with  multiplicity 
three.  Because  the  third  derivative  of  the  numerator  in  p  evaluated  at  p  =  1  is  negative 
i-3b^-9b-6)  but  the  leading  coefficient  is  positive,  the  final  root  in  p,  which  we  label  p{b), 
must  be  greater  than  1 . 

Now,  suppose  w{h  +\,p)  -  w{b,p)  >  0.  Then  it  must  be  the  case  that  p  is  greater  than  or  equal 
to  the  final  root,  pib);  therefore,  if  p  >  p,  then  p  >  pib),  in  which  case  w{b+  l,p)  -  w{b,p)  >  0, 
and  thus  the  wage-minimizing  buffer  for  p  >  p  must  be  no  larger  than  b.  | 

Proof  of  Corollary  1 

The  proof  is  similar  to  that  of  Proposition  2  and  is  omitted.  H 

Proof  of  Proposition  3 

By  definition,  the  information-maximizing  b  will  equivalently  minimize  the  wage,  w.  We 
identify  a  sufficient  upper  bound  on  b,  as  parameterized  by  p,  for  w{b  +  1  ,p)  ~  w{b,p)  to  be 
positive,  and  a  sufficient  lower  bound  on  b  for  w{b  -I-  l,p)  “  w{b,p)  to  be  negative.  Combining  the 
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two  bounds,  we  identify  a  region  for  b  where  w{b  +  l,p)  —  w{b,p)  crosses  zero  in  b,  defining  the 
wage-minimizing  buffer  size. 

We  begin  with  two  additional  Lemmas. 


Lemma  2 

Let  f{x)  =  a^  +  ...  +aix  +  Oq  be  a  real  polynomial  with /(I)  =  0.  Furthermore,  assume  that 
there  exists  a.  k  <  n  —  1  such  that  for  i  >  k,  ai  >  0  and  for  i  <  k,  a,  <  0.  Then  ffx)  >  0  for 
X  >  1  and/(x)  <  0  for  x  <  1. 

Proof  of  Lemma  2 

For  all  X  >  1  we  have  /(x)  >  (a„ -t- -t-  ...-afjx^.  Since 

/(I)  =  0,  the  sum  of  the  terms  in  the  parentheses  are  equal,  and  we  denote  the  sum  (either  in  the 
first  or  the  second  set  of  parentheses)  by  s.  Clearly,  5  >  0  and  ^x^(x-  1)  >  0  for  x  >  1;  hence, 
f[x)  >  0  when  x  >  1.  Similarly,  for  all  0  <  x  <  1,  we  have  /(x)  <  -  sx'^  =  sx^{x  -  1)  <  0.  This 

concludes  the  proof  of  the  lemma.  ■ 


Lemma  3 

Let  b  he  a  natural  number  and  p  >  1,  then; 


2  +  l)p* 

2  i=o 

2  (j+  i)p' 

i=0 


b+l 


1 

4-  - 

b 


for  p  >  1,  and  the  opposite  weak  inequalities  hold  when  p  <  1. 


Proof  of  Lemma  3 

First  consider  the  inequality  on  the  right-hand  side.  Set: 

/pi  ^ 

/(p)  =  ^~p + ~  2  («■ + i)p'  -  2  (« +  i)p‘- 
V  b  /=o 


We  want  to  show  thatffp)  >  0  for  p  >  1  and/(p)  <  0  for  p  <  1.  Thus,  it  is  sufficient  to  show  that 
ffp)  satisfies  the  hypothesis  of  the  previous  lemma.  First,  to  show  that /(I)  =  0,  we  have: 


/(1)  = 


b-l  b 

2  (i  +  1)  ~  2  ji  +  1)  = 

1=0  1=0 


We  must  now  find  a  k  as  in  the  prior  lemma.  If  f{p)  =  anP'^+  ...OiP  +  Oq,  then  n  =  b  -  I 
(although  there  is  a  p*  term  in  the  construction  of f(p),  note  that  it  has  a  coefficient  of  zero)  and  for 
arbitrary  m>  1,  the  coefficient  a„  is  given  by: 


b+\ 

b 


m  +  —{m  -f- 1)  -  (m  -I- 1)  = 
b 


2m  -  (P  -  1) 
b 


Hence,  >  0  if  and  only  if  m  >  ^.  In  particular-,  a  threshold  k  exists  when  n  =  P  -  1  >  ^, 
which  is  valid  for  P  >  2.  If  P  =  1,  then  ffp)  =  ^(p-  1),  which  is  trivially  greater  than  zero  for 
p  >  1  and  less  than  zero  for  p  <  1.  Hence,  Lemma  2  holds  and  we  have  established  Lemma  3  for 
the  right-hand  side  inequality. 

Next,  consider  the  left-hand  side  inequality  in  the  statement  of  the  lemma.  Let: 
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b 


g(p)  =  2  (*+  l)p'  - 
1=0 


b-\ 


2  {i  +  l)p‘- 
1=0 


Note  that: 


b-\ 


§(1)  -  2  (*  +  1)  ~  (  ^  +  1  (*  +  1)  =  0- 


1=0 


i=0 


Moreover,  the  leading  coefficient  of  g{p)  is  positive  for  ^  >  1  and  all  the  remaining  coefficients 
are  always  negative;  hence,  the  conditions  of  Lemma  2  are  always  satisfied  fox  b  >  1.  If  b  =  1, 
then  g{p)  =  2(p  —  1),  which  is  again  positive  for  p  >  1  and  negative  for  p  <  1,  as  was  to  be 
shown.  This  concludes  the  proof  of  the  lemma.  | 

Continuing  with  the  proof  of  Proposition  3,  when  p  >  1,  we  can  rewrite  w{b,p)  without  the 
constant  multiplier  as: 


(p^^^-1)^  p^^  +  2p^^-^+  ...  +  (Z7+l)p^  +  Zpp^-^+  ...  +  2p+l 

{bp^'^^  -  {b  +  l)p^  +  1)  bp^~^ +  (b-l)p^~^  +  ib-2)p^~^  +  ...  +  2p+ I 


To  see  this,  we  begin  by  rewriting  the  numerator: 


(p*+’-l)2=((p^+l  +  p^  +  p*-‘+ . 

..  +  p)-(p*  +  p^ 

=  ((p-l)(p^+p^-'+ . 

=  (p-  l)^(p*  +  p*-'+  . 

..+  1)2 

/  b  \2 

=  p^*  +  2p^^  '  +  ...  +  ib+l)p^  +  bp^  '  +  . . .  +  2p  +  1 . 
The  denominator  can  be  rewritten  as: 


bp’’^^  -(b+\)p^+l=  bip^^^  -  p*)  -  (p^  -  1) 

b 

=  &p*(p-l)-(p-l)2p*~' 

1=1 

=  {p-\)[bp^-^p^-^ 

b 

=  (p-i)X  (p‘-p'’-') 

1=1 

b-l 

=  (P-  1)^2  (i+  l)p' 

1=0 

=  bp^-^  +  {b-  l)p^-2  +  {b-  2)p^-3  +  . . .  +  2p  +  1 . 
In  light  of  this  representation,  we  can  continue  simplifying  to  obtain: 
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p^^  +  2p^^~^+  ■■■  +  (&+ l)p^  +  Z7p^~^+  ■■■  +  2p+l 
^  +  {b-\)p^-'^  +  {b-l)p^-^  +  ...  +  2p+l 

_ p2fc  +  2p^^-^+  ...  +  {b+\)p'’ 

~  bp^-^ +  {b-\)p^-^  +  {b-2)p^-'^  +  ...  +  2p+  1 

_  2b _ 1  +  2p  ^  +  .  ■ .  +  (Z?  +  l)p  ^ _ 

bp^-^  +  ib-l)p^-^  +  (b-2)p^-^+  ...  +  2p+l  ‘ 

Using  (7),  w{b  +  l,p)  >  w{b,p)  is  equivalent  to: 


7f,^9l+2p  ...  +  {b  +  2)p  ^  ^  +2p  ■■■  +  (&+  l)p  ^ 

(Z7+ l)p^+  ...  +  2p+ 1  &p*“^  +  ...  +  2p+ 1 

2 1  +  2p  ^  +  . . .  +  (Z?  +  2)p  ^  ^  1  +  2p  ^  +  . . .  +  (Z?  +  l)p  ^ 

(Z7+ l)p*+  ...  +  2p+ 1  ^  V-’+...  +  2p+l 

J  (b+l)p^+  ...  +  2p+l  \/l  +  2p-^+  ...  +  (b  +  2)p-^-^\ 

\l  +  2p-'+  ...  +  (Z?+l)p-V\  (Z7+l)pH  ...  +  2p+l  / 
/  (Z?  +  l)p^  +  . . .  +  2p  +1  \  /  1  +  2p  ^  +  . . .  +  (Z>  +  l)p 

^  \  1 +  2p~^  +  ...  +  (Z7+ l)p''^/ \  Z?p^“^  +  ...  +  2p+ 1  / 


b+l  b 

20'+i)p~'  .  20’+i)p‘ 

+  2p  '  +  ...  +  (Z?  +  2)p  2 '=0  (Z>+l)p  +  ...  +  2p+l  i=o _ 

l+2p'' +...  +  (/>+ Dp-"  "'’7'  ^  (,p»-'+ ...  +  2p+l  “^7 

2  O’ + Dp-'  2o+Dp' 

1=0  i=0 


(8) 


We  can  use  Lemma  3  with  p  >  1  on  the  right-hand  side  of  (8)  to  conclude  that  it  is  less  than  or 
equal  to  ^p  -l-  Since  we  have  assumed  that  p  >  1,  p"^  <  1,  and  we  can  again  apply  Lemma  3, 
replacing  p  with  p“^  on  the  left-hand  side  of  (8),  tq  conclude  that  it  is  greater  than  or  equal  to 
p2(^p-i  +  ^).  Thus,  a  sufficient  condition  on  b  and  p  for  w(b  +  l,p)  >  w{b,p)  is  given  by: 


1  b  +  2 
P  + 


b+l 


b+l 


b+l  1 

p  > p  +  —  ^ 


1  „  b+2  b+l  1  ^ 

- p2+ - p- - p-->0. 

Z?-l-L  Zj4-1  b  b 


Solving  the  second  quadratic  above  for  equality  gives  two  roots  in  p:  -1  and  Since  the 
leading  coefficient  is  positive,  we  know  that  (8)  holds  whenever  p  >  -^,  or  a  sufficient  condition 
for  the  wage  to  be  increasing  in  b  is  given  by  Z>  >  ^.  Because  our  analysis  applies  to  all  p  >  1, 
(8)  implies  that  for  p  >  2,  the  wage-minimizing  buffer  is  given  by  Z?  =  1,  the  smallest  feasible 
buffer  size.  This  establishes  part  (b). 

Next  we  establish  a  sufficient  upper  bound  for  the  wage  to  be  decreasing  in  b.  When  a  >  0 
and  p  >  1,  p"‘'<l;  thus.  Lemma  3  implies  that  the  left-hand  side  of  (8)  is  less  than 
+  i),  whereas  the  right-hand  side  is  bounded  below  by  ^P+  !•  Hence,  a  sufficient  con¬ 
dition  for  (8)  to  be  violated  (for  the  wage  to  be  decreasing  in  b)  is  given  by: 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


1858 


Baiman,  Netessine,  and  Saouma 


2 

b+l 


p  +  p2 


2 

-p-l<0. 

b 


(9) 


2  /  4 

The  two  roots  to  the  polynomial  on  the  left-hand  side  of  (9)  are  given  by  p  = - ^ - and 

b{b+\)^ 

p=  2  ’  since  the  latter  is  negative  and  the  polynomial  has  a  positive  leading 

coefficient;  a  sufficient  condition  for  the  wage  to  be  decreasing  in  b  is  given  by 

2  ^  /  4  ^  ^  2  I  4  ~  2 

bib+^)^ '\]  b^{b+iy^  b{b+l)'''yj  b'^(b+l)^^‘^  b{b+\)*^  , 

1  <  P  < - 2 - •  ^  lower  bound  for - ^ -  is  given  by  — ^ —  =  1  + 

and  since  b{b  +  1)  <  (Z? -l- 1)^,  a  sufficient  condition  for  w{b,p)  to  be  decreasing  is  thus  p<  1 

+  or  -  l>  b.  The  lower  bound  on  b  is  redundant  once  p  >  ^,  as  Z?  is  bounded  below  by 

1.  ""  ■ 


Proof  of  Proposition  4 

In  order  to  prove  that  the  sensitivity  of  K  to  r  is  increasing  in  X,  decreasing  in  r,  and 
single-peaked  in  h,  we  will  show  that  the  wage  w  is  decreasing  in  X,  increasing  in  r  and  U-shaped 
in  h.  Differentiating  w  with  respect  to  h  yields: 

dw  _  (-1-1-  p^(l  +2b  +  b^)  -  p^^^{Ab  -I-  b^)  -t-  \-\-Ab)-  \+bf-t-  p^^^^b^) 

dh~''  (p2(i+p^(p^_/,_l)))2  •  (10) 

The  denominator  is  unambiguously  non-negative,  whereas  Descartes’  rule  implies  the  numerator 
has  at  most  five  positive  roots.  Because  the  numerator  and  the  first  three  derivatives  with  respect 
to  p  have  a  root  at  p  =  1,  only  one  root  remains  unidentified.  If  Z?  =  1,  then  the  fourth  derivative 
of  the  numerator  in  (10)  is  zero  at  p  =  1,  and  thus  the  fifth  root  is  in  fact  p  =  1.  If  Z?  >  1,  then  the 
fourth  derivative  of  the  numerator  in  (10)  is  strictly  positive  at  p  =  1,  hence,  by  the  Extremum 
Test,  p  =  1  is  a  relative  minimum  and  there  exists  a  sufficiently  small  e  >  0  such  that  the 
numerator  is  positive  at  p  =  1  —  e  when  Z?  >  1.  However,  because  the  numerator  in  (10)  is  strictly 
negative  at  p  =  0,  the  final  root  must  belong  to  the  interval  (0,1). 

Differentiating  w  with  respect  to  r  yields: 


dw  _  (1  +  Z?)p^  '(1  -  P^'^^)(-  b  +  p{b  +  2)  -  p^'*''(Z?  +  2)  +  p^'^^b) 

dr~~^  \{\-p\\+b)  +  bp^^^f  ■ 

The  denominator  of  (11)  is  non-negative.  Descartes’  rule  implies  that  in  addition  to  p  =  1, 
(11)  may  have  up  to  three  additional  positive  roots.  Using  Lemma  0,  the  expression 
-Z?  -t-  p(Z?  -I-  2)  -  p^'^'(Z7  -I-  2)  -I-  p^^'^b  and  its  first  two  derivatives  have  a  root  at  p  =  1,  whereas  the 
third  derivative  is  strictly  positive  at  p  =  1,  implying  that  the  expression  has  a  saddle  point  at  p  = 
1.  Since  the  pre-multiplier,  1  -p*+\  also  alternates  in  sign  at  p  =  1,  the  entire  expression,  is 
non-positive.  Finally,  differentiating  w  with  respect  to  X  yields  ^  =  and  since  ^  is  non- 

positive,  ^  must  be  non-negative.  H 
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ABSTRACT:  In  a  setting  in  which  corporate  headquarters  dictates  total  sales  targets, 
we  study  how  supervisors  allocate  sales  targets  to  individual  stores.  Specifically,  we 
analyze  whether  supervisors  strategically  use  discretion  in  the  target-setting  process  to 
address  compensation  contracting  issues.  We  first  examine  whether  supervisors  use 
discretion  to  manage  compensation  risk.  The  results  are  consistent  with  the  agency- 
theoretic  prediction  that  supervisors  provide  easier  targets  to  stores  facing  higher  levels 
of  store-specific  risk.  Next,  we  examine  whether  discretion  is  used  to  mitigate  fairness 
concerns.  The  results  suggest  that,  consistent  with  behavioral  arguments,  supervisors 
use  discretion  to  deal  with  fairness  issues,  even  if  the  area  of  the  supervisor’s  discretion 
is  not  the  source  of  the  fairness  concerns.  Finally,  we  analyze  whether  supervisors  use 
discretion  in  the  target-setting  process  to  reduce  their  potential  confrontation  costs. 
Consistent  with  research  in  psychology,  we  find  that  supervisors  provide  easier  targets 
to  store  managers  with  relatively  higher  hierarchical  status. 

Keywords:  discretion;  sales  target;  target-setting  process;  compensation  risk;  fair¬ 
ness  concerns;  confrontation  costs. 
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I.  INTRODUCTION 


his  study  examines  supervisor  discretion  in  target  setting.  Specifically,  in  a  context  in 


which  overall  regional  sales  targets  are  dictated  by  corporate  headquarters,  we  analyze  how 


-1-  regional  directors  (supervisors)  use  discretion  to  allocate  regional  sales  targets  to  individual 
stores  in  the  region.  Understanding  how  performance  targets  are  set  is  important  because  such 
targets  play  a  key  role  in  many  aspects  of  management  accounting  and  control.  For  instance,  in  the 
area  of  performance  evaluation,  targets  communicate  supervisors’  expectations  and  directly  influ¬ 
ence  employee  incentives,  as  meeting  or  exceeding  targets  leads  to  increased  short-  and/or  long¬ 
term  compensation.  Despite  the  importance  of  targets,  however,  our  knowledge  of  supervisor 
target-setting  behavior  is  limited. 

A  considerable  amount  of  research  examines  the  related  topic  of  goal  setting.  Much  of  this 
literature  addresses  the  relationship  between  target  achievability  and  employee  effort  or  perfor¬ 
mance  (see,  e.g.,  Locke  and  Latham  1990).  The  results  from  this  goal-setting  literature,  however, 
do  not  provide  clear  insight  into  supervisors’  target-setting  behavior.  For  example.  Merchant  and 
Manzoni  (1989)  find  that  supervisors  set  targets  at  substantially  more  attainable  levels  (i.e.,  at 
levels  with  a  higher  probability  of  achievement)  than  the  goal-setting  theory  predicts.  The  authors’ 
interview  data  indicate  that  the  targets  are  more  attainable  than  predicted  because  eliciting  high 
levels  of  employee  performance  is  not  supervisors’  only  eoncem.  Other  factors,  such  as  increasing 
the  predictability  of  budgets  or  discouraging  earnings  management,  also  influence  supervisors’ 
target-setting  decisions.  To  date,  however,  there  exists  little  empirieal  evidence  on  the  array  of 
factors  influencing  supervisors’  target-setting  decisions  and  methods  (Ittner  and  Larcker  2001; 
Anderson  et  al.  2010).  We  address  this  gap  in  the  literature  by  analyzing  how  directors  strategi¬ 
cally  use  discretion  in  the  target- setting  process  to  deal  with  compensation  contracting  issues. 

Specifieally,  we  begin  by  examining  whether  directors  use  discretion  to  manage  compensation 
risk.  Consistent  with  predictions  based  on  agency  theory,  we  hypothesize  that  directors  provide 
more  attainable  targets  to  stores  that  face  higher  levels  of  store-specific  risk.  Next,  we  study 
whether  discretion  is  used  to  mitigate  fairness  concerns.  Drawing  on  behavioral  theory,  we  predict 
that  directors  will  provide  more  attainable  targets  to  stores  when  other  elements  of  the  compen¬ 
sation  plan  are  taken  to  be  unfair.  Finally,  we  analyze  whether  directors  use  discretion  in  the 
target-setting  process  to  minimize  confrontation  costs.  Consistent  with  research  in  psychology,  we 
hypothesize  that  directors  provide  easier  targets  to  store  managers  with  relatively  higher  hierar¬ 
chical  status. 

We  test  our  predictions  using  archival  data  across  multiple  years  and  organizational  business 
units.  In  particular,  we  employ  a  rich  dataset  from  the  population  of  103  business  units  (referred 
to  as  stores)  managed  by  a  postal  service  provider  in  Korea  for  the  period  2000  to  2003. ‘  Our 
research  setting  offers  important  advantages  in  examining  the  use  of  discretion  in  target  setting. 
First,  our  data  include  initial  store  targets,  which  are  calculated  by  applying  an  established  formula 
to  the  regional  targets  set  by  corporate  headquarters,  and  final  store  targets,  which  are  set  by 
regional  directors.  Any  differences  between  the  initial  and  the  final  targets  allow  us  to  draw 
inferences  on  how  directors  use  discretion  in  the  target-setting  process.  Second,  in  our  setting,  the 
sum  of  the  store  targets  within  each  region  must  equal  the  total  regional  target  set  by  corporate 
headquarters;  that  is,  when  a  director  adjusts  one  store’s  target  downward,  one  or  more  other 
stores’  targets  must  be  adjusted  upward.  As  a  result,  directors  must  be  especially  assiduous  in 


We  use  the  same  research  setting  as  Matsumura  and  Shin  (2006),  but  we  analyze  a  different  dataset  and  a  different  time 
period. 
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using  their  discretion  and,  hence,  the  factors  driving  the  use  of  discretion  should  be  particularly 
informative  in  our  setting.  Moreover,  the  zero-sum  nature  of  the  adjustments  creates  a  situation  in 
which  directors,  after  considering  store  managers’  proposals,  must  choose  which  store  managers  to 
satisfy  and  which  to  displease.  This  allows  us  to  examine  the  social  construct  of  conflict  avoidance 
that  can  arise  in  such  a  situation.  Third,  consistent  with  reported  practice,  store  managers  in  our 
setting  participate  in  the  target- setting  process  by  proposing  targets  and  providing  supporting 
documentation  (see,  e.g.,  Umapathy  1987;  De  With  and  Dijkman  2008;  Shastri  and  Stout  2008). 
Our  setting  therefore  allows  us  to  expand  the  study  of  target  setting  beyond  much  of  the  prior 
literature  that  focuses  on  unilaterally  determined  targets  (e.g.,  Locke  and  Latham  1990). 

We  find  a  positive  and  significant  relation  between  downward  target  adjustments  and  our 
proxies  for  risk  and  fairness.  These  results  support  the  view  that  directors  strategically  use  their 
discretion  in  the  target-setting  process  to  manage  compensation  risk  and  mitigate  fairness  con¬ 
cerns.  We  further  find  a  significantly  positive  relation  between  downward  target  adjustments  and 
store  managers’  hierarchical  status  relative  to  both  their  director  and  other  store  managers.  This 
evidence  is  consistent  with  directors  using  their  discretion  to  avoid  confrontations  with  higher- 
status  store  managers. 

This  study  makes  several  important  contributions  to  the  literature.  First,  ours  is  among  the 
first  studies  to  take  a  broader  perspective  on  the  supervisor’s  role  in  the  target- setting  process. 
Prior  research  focuses  on  the  relationship  between  target  levels  and  employee  effort,  but  does  not 
take  into  account  the  fact  that  the  target-setting  process  can  also  be  strategically  used  to  manage 
other  elements  essential  in  compensation  contracting,  such  as  compensation  risk.  In  a  setting  in 
which  standardized  compensation  contracts  cover  multiple  units  but  there  is  some  discretion  in 
target  levels,  we  find  evidence  suggesting  that  supervisors  manage  compensation  risk  by  using 
their  discretion  in  the  target- setting  process  to  compensate  stores  for  additional  store-specific  risk. 

Second,  this  study  is  the  first  to  provide  field  data  evidence  consistent  with  supervisors 
exercising  discretion  in  the  target-setting  process  to  counter  other  elements  of  the  incentive  con¬ 
tract  that  may  be  considered  unfair.  In  particular,  our  results  are  consistent  with  supervisors  using 
their  discretion  to  improve  fairness  perceptions,  even  if  the  area  in  which  the  supervisor  is  granted 
discretion  is  not  the  source  of  the  fairness  concerns.  Our  results  suggest  that  supervisors  view 
different  elements  of  the  compensation  contract  as  interrelated.  When  supervisors  are  not  granted 
discretion  to  adjust  every  element  of  the  compensation  contract,  discretion  on  one  dimension  is 
used  to  improve  compensation  contracting  on  other  dimensions.  More  importantly,  this  evidence 
on  the  strategic  use  of  discretion  to  harmonize  elements  of  the  compensation  contract  highlights 
the  importance  of  studying  the  compensation  system'  as  a  whole. 

Third,  this  study  contributes  to  the  literature  by  examining  the  participative  target-setting 
process  from  the  supervisor’s  perspective.  In  contrast  to  much  of  the  target-setting  literature  that 
focuses  on  motivating  subordinates’  decisions  or  effort  levels,  we  shed  light  on  how  supervisors’ 
individual  incentives  and  preferences  influence  target-setting  decisions.  Results  suggest  that  su¬ 
pervisors  strategically  make  discretionary  target  adjustments  to  reduce  confrontation  costs.  This 
implies  that  supervisors’  personal  preferences  and  incentives  can  result  in  not  only  biased  perfor¬ 
mance  evaluations  at  the  end  of  the  performance  period,  but  also  distorted  target  levels  at  the 
beginning  of  the  contract  period. 

We  describe  our  research  setting  in  Section  II  and  develop  our  hypotheses  in  Section  III.  In 
Section  IV  we  discuss  our  sample  and  variables.  Section  V  presents  the  results  of  our  main 
analyses  and  Section  VI  presents  the  results  of  additional  analyses  and  robustness  tests.  Section 
VII  summarizes  and  concludes. 

11.  RESEARCH  SETTING 

Our  research  setting,  which  we  refer  to  as  POST,  is  the  state-owned  provider  of  comprehen¬ 
sive  postal  services  in  Korea.  POST  is  a  monopolistic  service  provider  with  respect  to  the  handling 
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and  delivery  of  regular  mail.  However,  it  competes  against  several  large  domestic  and  interna¬ 
tional  couriers,  such  as  Federal  Express  and  DHL,  in  the  expedited  delivery  market.  Although 
expedited  delivery  services  are  subject  to  more  intense  competition,  their  volume  is  more  control¬ 
lable  from  a  store  manager’s  perspective  because  it  can  be  influenced  in  part  by  store  managers’ 
actions.  For  instance,  an  individual  store  manager  significantly  increased  the  store’s  mail  volume 
by  targeting  marketing  efforts  for  express  parcels  to  families  in  rural  areas  that  were  identified  as 
having  children  who  were  a  significant  distance  from  home.  Regular  mail  volume  (mostly  business 
bulk  mail),  on  the  other  hand,  is  largely  determined  by  factors  beyond  store  managers’  control. 
Because  expedited  delivery  services  constitute  most  of  a  store’s  “strategic  mail  services,”  we 
henceforth  refer  to  expedited  delivery  services  simply  as  strategic  mail  services.^ 

POST  consists  of  eight  regional  headquarters,  214  Business  Unit  (BU)  stores,  and  2,579  small 
windows-only  stores.^  At  the  beginning  of  2001,  POST  restructured  the  tasks  and  responsibilities 
of  the  store-level  administrative  staff  to  reduce  the  number  of  redundant  administrative  tasks 
performed.  Some  administrative  tasks  were  delegated  to  other  BU  stores  and  some  were  moved  to 
the  marketing  and  sales  department.  At  the  same  time,  several  BU  stores  merged  in  order  to  realize 
administrative  efficiency  gains.  Although  this  did  not  reduce  the  number  of  store  locations  (the 
number  of  BU  stores  remained  unchanged  at  214),  for  administrative  purposes  POST  now  only 
recognizes  103  separate  stores.  We  refer  to  these  103  higher-level  BU  stores  as  Evaluation  Unit 
(EU)  stores.  Beginning  in  2001,  performance  evaluation  was  performed  at  this  more  aggregated 
level  (see  Eigure  1). 

The  Incentive  Plan 

POST’s  compensation  system  provides  a  fixed  salary  plus  a  bonus  to  all  EU  store  employees, 
including  the  store  manager.  The  fixed  salary  is  based  on  individual  qualifications  and  seniority, 
while  the  bonus  depends  on  store  performance.  Bonuses  range  from  30  percent  to  150  percent  of 
the  employee’s  monthly  salary.  This  means  that  the  maximum  annual  bonus  constitutes  approxi¬ 
mately  12.5  percent  of  the  employee’s  annual  compensation.  Management  believes  that  this  in¬ 
centive  system  should  motivate  workers  because  such  bonuses  are  unusually  high  among  Korean 
state-owned  companies. 

As  of  2001,  the  following  three  performance  dimensions  are  measured  and  rewarded  in 
POST’s  incentive  plan:  profitability,  sales,  and  quality  of  service  (see  Table  1  for  definitions).  The 
incentive  weights  on  the  different  dimensions  are  60  percent,  20  percent,  and  20  percent,  respec¬ 
tively.  The  profitability  dimension  contains  two  equally  weighted  parts:  profitability  compared  to 
the  store’s  own  profitability  in  the  previous  year  and  profitability  compared  to  the  profitability  of 
a  reference  group.  This  relative  performance  evaluation  (RPE)  was  introduced  in  1998  in  an  effort 
to  create  a  more  fair  reward  system  that  takes  into  account  various  exogenous  factors  likely  to 
affect  performance.''  Specifically,  POST  used  cluster  analysis  to  classify  stores  into  reference 
groups  of  stores  with  similar  business  environments.  In  a  continued  effort  to  improve  fairness, 
POST  revised  the  reference  groups  in  1999.  Nevertheless,  in  June  2000,  POST  recognized  that 
some  employees  still  perceived  considerable  unfairness  in  the  reward  system  (Matsumura  and 
Shin  2006). 


^  POST’s  definition  of  strategic  mail  services  includes  domestic  express  mail,  international  express  mail,  domestic  reg¬ 
istered  parcel,  philately  products,  and  congratulatory  and  condolence  card  services.  According  to  an  internal  document 
prepared  in  2003,  POST  estimates  that  their  market  shares  of  parcel  and  express  mail  services  in  the  Korean  market  are 
10.1  percent  and  30.1  percent,  respectively. 

^  We  use  the  term  Business  Unit  (BU)  store  to  refer  to  stores  that  are  equipped  to  perform  the  complete  set  of  tasks 
(ranging  from  mail  reception  to  final  delivery),  as  opposed  to  small  windows-only  stores,  which  are  only  able  to  provide 
a  limited  set  of  services. 

Moreover,  management  also  wanted  to  address  gaming  behavior  that  occurred  in  response  to  a  previous  incentive 
system  that  placed  considerable  weight  on  improvements  from  the  previous  year  (Matsumura  and  Shin  2006). 
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FIGURE  1 

Organizational  Chart  of  POST 


POST  consists  of  214  Business  Unit  (BU)  stores  and  2,579  small  windows-only  stores.  For  administrative  and 
performance  evaluation  purposes,  POST  aggregates  the  BU  stores  and  windows-only  stores  into  103  higher- 
level  BU  stores,  which  we  refer  to  as  Evaluation  Unit  (EU)  stores. 


The  incentive  plan’s  sales  dimension  contains  two  unequally  weighted  parts,  one  for  regular 
mail  services  (with  an  incentive  weight  of  6  percent)  and  one  for  strategic  mail  services  (with  an 
incentive  weight  of  14  percent).^  In  both  parts,  the  ratio  of  actual  sales  to  target  sales  determines 
performance. 

Finally,  quality  of  service  consists  of  three  dimensions;  management  efficiency,  customer 
satisfaction,  and  mail  service  quality.  Customer  satisfaction  and  mail  service  quality  are  measured 


^  One  might  wonder  why  sales  are  evaluated  separately  when  a  significantly  large  weight  is  already  placed  on  profitabil¬ 
ity.  The  reason  behind  the  separate  measure  is  that  the  store’s  revenue  used  in  calculating  the  profitability  measure  is 
“allocated”  revenue,  which  is  calculated  by  comparing  the  store’s  contribution  to  the  entire  value  chain.  The  rationale  is 
that  mail  service  involves  multiple  steps,  ranging  from  receiving  and  processing  mail  to  final  delivery,  and  these  steps 
are  usually  performed  by  different  stores  that  together  form  the  value  chain.  In  contrast,  sales  revenue  simply  refers  to 
the  revenue  earned  on  sales  that  originate  at  the  particular  store. 
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TABLE  1 

Description  of  Incentive  Plan,  2001-2003 

Performance 


Dimension 

Performance  Measure 

Weight 

Profitability 

Store  Profitability{f) 

Store  Profitabilityit-l) 

30% 

Store  Profitabilityit) 

Reference  Group  Profitability{t) 

30% 

Sales  Target 

Actual  Sales  of  Regular  Mail  Service{t) 

Target  Sales  of  Regular  Mail  Service{t) 

6% 

Actual  Sales  of  Strategic  Mail  Service{t) 

Target  Sales  of  Strategic  Mail  Service{t) 

14% 

Quality  of  Service 

Management  Efficiency 

5% 

Customer  Satisfaction 

10% 

Mail  Service  Quality 

5% 

Store  Profitability  (t)  =  store-level  allocated  revenue  divided  by  operating  cost  in  year  t; 

Reference  Group  Profitability  (t)  =  arithmetic  mean  of  profitability  of  stores  that  belong  to  the  specific  reference 

group  in  year  t; 

Sales  of  Regular  Mail  Service  (t)  -  sales  revenue  from  regular  mail  volume  received  by  the  store  at  year  r; 

Sales  of  Strategic  Mail  Service  (t)  =  sales  revenue  from  strategic  mail  volume  received  by  the  store  at  year  t. 


by  an  independent  outside  agency  contracted  by  corporate  headquarters,  and  management  effi¬ 
ciency  is  assessed  by  regional  headquarters.  All  three  dimensions  receive  high  scores  and  the 
scores  traditionally  exhibit  little  variation  across  stores.  POST’s  attention  has  therefore  focused  on 
increasing  sales  while  maintaining  cost  effectiveness  in  order  to  increase  profitability. 

At  the  end  of  the  year,  a  store’s  performance  score  is  calculated  using  the  performance 
dimensions  and  weights  in  the  incentive  plan  (see  Table  1).  The  stores  are  then  ranked  based  on 
their  performance  scores.  The  store’s  rank  determines  the  size  of  the  bonus  for  each  store.  Hence, 
POST’s  incentive  plan  employs  a  two-stage  RPE  system  that  involves  (1)  RPE  measures  relative 
to  a  reference  group  of  stores  and  (2)  a  tournament  in  which  the  size  of  the  bonus  depends  on  the 
store’s  performance  score  rank  among  all  the  stores.  When  the  incentive  plan  was  initially  intro¬ 
duced  in  1998,  only  the  top  50  percent  of  the  stores  received  a  bonus.  Because  a  large  number  of 
managers  considered  this  unfair  (Matsumura  and  Shin  2006),  POST  expanded  the  bonus  eligibility 
to  100  percent  of  the  stores  in  2001.  Although  all  stores  now  receive  a  bonus,  the  size  of  the  bonus 
continues  to  depend  on  the  store’s  rank.^  Consequently,  some  of  the  weaker  stores  still  feel  that  the 
system  puts  them  at  a  disadvantage. 


POST  also  made  other  notable  changes  to  the  incentive  plan  introduced  in  1998  (Matsumura  and  Shin  2006).  The  most 
relevant  change  from  the  perspective  of  this  study  is  that  as  of  2001,  POST  replaced  the  cash  flow  measures  with 
store-level  sales  targets.  POST  also  eliminated  the  productivity  dimension  due  to  the  high  correlation  between  profit¬ 
ability  and  productivity  measures.  Instead,  the  weight  placed  on  the  profitability  growth  measure  was  significantly 
increased,  leading  to  an  equal  weight  of  30  percent  on  both  the  relative  profitability  and  the  profitability  growth 
measures. 
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We  focus  this  study  on  sales  targets  because  these  are  the  only  major  incentive  plan  design 
measures  over  which  regional  directors  have  discretion  (described  further  below).  While  the  20 
percent  weight  placed  on  the  sales  target  measures  might  appear  relatively  low,  we  believe  that 
more  attainable  sales  targets  are  considered  relevant  by  the  stores  because  of  POST’s  two-stage 
RPE  system  that  ultimately  bases  the  size  of  each  store’s  bonus  on  its  performance  rank  compared 
to  other  stores.^ 

The  eight  regional  directors  also  receive  a  fixed  salary  and  a  bonus,  where  the  bonus  is 
determined  by  averaging  the  bonus  rate  (as  a  percent  of  monthly  salary)  of  all  stores  under  their 
direct  supervision.  Regional  directors  are  not  rewarded  for  meeting  the  regional  sales  target.  By 
tying  the  regional  directors’  bonuses  to  their  stores’  bonus  levels,  POST  ensures  that  the  regional 
directors  have  incentives  to  consider  how  sales  targets  affect  store  managers’  and  workers’  moti¬ 
vation. 

The  Target-Setting  Process 

The  company’s  overall  sales  target  plays  an  important  role  within  POST,  as  the  sales  targets 
of  the  eight  regional  headquarters,  and  ultimately  of  the  103  EU  stores,  are  based  upon  it.  Re¬ 
sponsibility  for  determining  the  company’s  overall  sales  target  lies  with  a  corporate  task  force 
team,  which  meets  each  November  to  set  out  the  key  strategic  directions  of  the  firm.  Based  on  the 
overall  macro-economic  prospects,  the  business  environment,  and  the  intensity  of  competition,  the 
team  forecasts  the  coming  year’s  mail  volume,  operating  expenses,  and  sales.  The  proposed  targets 
are  reviewed  by  the  CEO  in  December,  and  after  the  board  of  governors  approves  the  targets,  they 
serve  as  the  formal  financial  targets  for  the  coming  year. 

Using  the  company’s  overall  financial  targets,  the  corporate  task  force  prepares  operating  and 
capital  budgets  for  the  eight  regional  headquarters.  Targets  for  the  regional  headquarters  are  set  in 
a  top-down  manner.  In  early  January,  the  CEO  presides  over  a  target  review  committee  meeting 
held  at  the  corporate  headquarters.  In  this  meeting,  the  regional  headquarters’  budgets,  including 
their  sales  targets,  are  presented  to  the  regional  directors  and  discussed  in  detail. 

The  regional  directors  then  have  responsibility  for  turning  the  total  regional  sales  targets  into 
store-specific  sales  targets.  Although  headquarters  asks  the  regional  directors  to  develop  the  initial 
targets  based  on  a  formula,  the  regional  directors  have  discretion  in  the  setting  the  targets.  The 
only  restriction  headquarters  imposes  is  that  the  sum  of  all  individual  stores’  targets  equals  the 
regional  sales  target.  Because  the  store  managers  have  superior  information  about  local  market 
conditions,  POST  designed  the  target- setting  process  to  be  participative.  Store  managers  can 
potentially  influence  the  sales  targets  they  receive  by  sharing  their  private  information.  Specifi¬ 
cally,  the  steps  in  the  target-setting  process  are  as  follows.  First,  the  regional  director  privately 
determines  the  initial  sales  targets  for  each  store  by  allocating  the  regional  sales  target  to  the  stores 
according  to  the  relative  weight  of  the  stores’  prior  sales.  At  the  same  time,  without  knowledge  of 
the  regional  director’s  initial  sales  estimates,  the  store  managers  are  requested  to  submit  proposed 
sales  targets  for  their  stores,  as  well  as  supporting  documentation  to  the  regional  director.  When 
proposing  sales  targets  and  providing  documentation,  the  store  managers  have  no  knowledge  of 
the  total  regional  sales  targets  or  of  the  store- specific,  formula-based  initial  targets. 

When  the  regional  director  has  received  all  store  managers’  information,  s/he  compares  the 
formula-based  initial  estimates  to  the  targets  proposed  by  the  stores;  based  on  this  information  the 
regional  director  arrives  at  his/her  targets.  In  cases  in  which  there  is  a  large  discrepancy  between 


''  See,  for  example,  Cichello  et  al.  (2009)  for  evidence  on  the  importance  of  small  performance  changes  in  tournament 
structures. 

®  The  regional-level  sales  targets  are  shared  with  the  store  managers  at  the  regional  target  review  meeting  in  late  January. 
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the  director’s  target  and  the  target  the  store  proposes,  the  director  solicits  further  information  from 
the  store  before  setting  the  final  targets.  By  the  end  of  January  (about  two  weeks  after  the 
proposals  are  submitted),  the  regional  director  and  all  his/her  store  managers  meet  at  the  regional 
headquarters,  and  the  regional  director  presents  and  discusses  the  sales  targets. 

III.  HYPOTHESES 

In  the  budgeting  literature,  considerable  attention  is  directed  to  studying  targets.  The  focus  of 
many  of  these  studies  is  on  budgetary  slack,^  in  particular,  on  how  subordinates  are  able  to  obtain 
easier  targets  (i.e.,  create  slack).  The  most  robust  finding  from  this  literature  is  the  relation  be¬ 
tween  private  information  and  slack.  Subordinates  are  found  to  use  their  information  advantage  to 
negotiate  performance  targets  that  are  more  easily  achieved  (Schiff  and  Lewin  1968;  Merchant 
1985;  Blanchard  et  al.  1986;  Lukka  1988).  In  addition,  when  information  asymmetry  allows  slack 
creation,  subordinates’  efforts  to  create  slack  are  related  to  their  marginal  returns  from  effort.  For 
example,  Anderson  et  al.  (2010)  find  that  performance  targets  are  lower  when  subordinates’  career 
horizon  and  economic  dependence  on  the  job  are  greater.  This  relation  suggests  that  subordinates 
expend  greater  effort  to  create  slack  when  the  returns  from  such  effort  are  higher.  Other  factors 
that  are  relevant  for  slack  creation  include  subordinates’  risk  preferences  and  preferences  for 
honesty  (e.g..  Young  1985;  Dunk  and  Nouri  1998;  Evans  et  al.  2001). 

Research  that  examines  the  target-setting  process  from  the  supervisor’s  side  mainly  focuses 
on  how  target  achievability  is  related  to  subordinates’  performance.  The  main  prediction  of  this 
goal-setting  literature  is  that  there  is  a  strong  positive  relation  between  target  difficulty  and  per¬ 
formance  (Locke  and  Latham  1990).  Although  the  goal-setting  literature  consistently  finds  evi¬ 
dence  indicating  that  difficult  goals  motivate  better  performance  than  easier  goals,  (e.g.,  Locke  and 
Latham  1990),  when  Merchant  and  Manzoni  (1989)  re-examine  these  conclusions  in  a  budgeting 
setting,  they  find  that  budget  targets  are  more  attainable  than  the  goal-setting  literature  would 
predict.  Interviews  that  Merchant  and  Manzoni  (1989)  conduct  with  managers  suggest  that  targets 
are  generally  attainable  because  eliciting  high  levels  of  employee  performance  is  not  supervisors’ 
only  concern;  their  target-setting  decisions  are  also  affected  by  factors  such  as  increasing  the 
predictability  of  budgets  and  discouraging  earnings  management.  Nevertheless,  to  date  there  exists 
little  empirical  evidence  on  the  array  of  factors  influencing  supervisors’  target-setting  decisions 
and  methods  (Ittner  and  Larcker  2001;  Anderson  et  al.  2010).  As  a  result,  an  understanding  of 
supervisors’  target- setting  behavior  is  far  from  complete. 

We  contribute  to  the  knowledge  on  target  setting  by  analyzing  supervisors’  target-setting 
decisions.  Our  study  differs  from  Merchant  and  Manzoni  (1989)  in  that  we  do  not  examine  the 
general  attainability  of  budget  levels.  Rather,  in  our  setting,  the  levels  of  the  regional  sales  targets, 
as  well  as  the  form  of  the  incentive  plan,  are  dictated  by  corporate  headquarters,  and  we  examine 
how  supervisors  use  discretion  to  allocate  the  regional  sales  targets  to  specific  stores.  Specifically, 
we  study  how  supervisors  use  discretion  in  the  target- setting  process  to  manage  compensation  risk, 
address  fairness  concerns,  and  minimize  confrontation  costs.  Below  we  develop  our  predictions 
with  respect  to  each  of  these  three  elements. 

Compensation  Risk 

Much  of  principal-agent  theory  focuses  on  situations  with  outcome  uncertainty,  information 
asymmetry  between  the  principal  and  agent,  and  compensation  contracts  based  on  measures  that. 


^  Some  notable  exceptions  are  Murphy  (2000)  and  Indjejikian  and  Nanda  (2002),  who  study  the  performance  standards 
that  companies  use  to  evaluate  performance.  Murphy  (2000)  examines  the  factors  associated  with  the  choice  of  an 
internal  versus  an  external  performance  standard  and  Indjejikian  and  Nanda  (2002)  examine  whether  firms  fully  adjust 
performance  standards  for  past  performance. 
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due  to  uncertainty,  are  imperfectly  informative  about  the  agent’s  effort.  Linking  pay  to  perfor¬ 
mance  induces  the  agent  to  expend  effort,  but  also  transfers  some  risk  from  the  principal  to  the 
agent.  The  principal  must  compensate  the  agent  for  bearing  this  risk.  The  optimal  compensation 
contract,  therefore,  trades  off  the  costs  and  benefits  of  imposing  compensation  risk  on  the  agent 
(Holmstrom  1979;  Lambert  2001). 

At  POST,  the  form  of  stores’  compensation  contracts  is  standardized  and  set  by  corporate 
headquarters.  When  the  regional  directors  determine  each  store’s  initial  sales  estimates  based  on 
the  regional  targets  set  by  headquarters,  the  resulting  contracts  likely  imposes  excessive  risk  on 
specific  stores’  managers  and  employees.  Because  the  regional  directors  cannot  raise  store  man¬ 
agers’  base  salary  to  compensate  for  bearing  more  risk,  we  predict  that  directors  will  increase 
expected  compensation  to  cover  a  risk  premium  by  adjusting  the  sales  targets  downward  for  stores 
that  face  greater  environmental  uncertainty.  This  prediction  is  in  line  with  prior  budgeting  research 
that  finds  a  positive  relation  between  slack  (more  attainable  targets)  and  uncertainty  (Merchant 
1985;  Govindarajan  1986;  Schoute  and  Wiersma  2007).^^ 

In  sum,  consistent  with  agency  theory,  we  hypothesize  that  supervisors  use  their  discretion  in 
the  target-setting  process  to  manage  store- specific  compensation  risk.  More  formally; 

HI:  Downward  discretionary  target  adjustments  are  positively  associated  with  store-specific 
compensation  risk. 

Fairness  Concerns 

Behavioral  research  shows  that  the  perceived  fairness  of  a  compensation  plan  has  a  significant 
impact  on  employee  incentives  (e.g.,  Akerlof  and  Yellen  1988;  Lindquist  1995;  Konovsky  2000; 
Libby  2001).  If  an  employee  perceives  his/her  compensation  outcome  to  be  unfair,  then  s/he  will 
have  a  negative  behavioral  reaction  toward  the  compensation  system  and  as  a  result  will  not  be 
motivated  by  it  (Cohen-Charash  and  Spector  2001;  Colquitt  et  al.  2001).  These  negative  behav¬ 
ioral  reactions  are  especially  strong  when  employees  believe  their  outcomes  would  have  been 
better  if  decision  makers  had  implemented  different  procedures  (Cropanzano  and  Folger  1989). 
Considering  the  significant  influence  on  employee  motivation,  supervisors  have  clear  incentives  to 
take  fairness  concerns  into  account  when  managing  a  compensation  plan. 

Incentive  compensation  payouts  at  POST  are  partly  based  on  a  store’s  performance  relative  to 
its  reference  group  (see  Section  II).  Although  the  reference  groups  were  created  with  the  intention 
of  increasing  fairness  in  performance  evaluation  by  grouping  stores  with  similar  operating  envi¬ 
ronments,  interview  data  reveal  that  some  stores  still  consider  the  system  unfair,  as  they  believe 
that  they  are  competing  against  much  stronger  stores  (Matsumura  and  Shin  2006).  Top  manage¬ 
ment  therefore  remains  concerned  about  dysfunctional  behavior  due  to  perceived  unfairness  and 
encourages  regional  directors  to  take  fairness  concerns  seriously.  Regional  directors  also  have 
direct  incentives  to  mitigate  fairness  concerns  because  decreased  worker  motivation  impacts  their 
own  bonuses.  Consequently,  we  expect  regional  directors  to  address  fairness  concerns.  However, 
because  they  are  unable  to  directly  adjust  the  RPE  element  of  the  bonus  system  (the  source  of  the 
unfairness  complaints),  we  predict  that  they  respond  to  these  fairness  issues  by  using  the  discretion 
they  do  have,  which  relates  to  the  target-setting  process.  In  particular,  we  predict  that,  by  making 
sales  targets  easier  to  achieve  for  the  stores  facing  more  challenging  reference  groups,  supervi- 


Indjejikian  and  Matejka  (2006)  find  no  such  relation. 

"  Recall  that  the  sum  of  the  targets  must  equal  the  regional  target  dictated  by  central  headquarters,  so  reducing  one  store’s 
target  requires  raising  one  or  more  other  targets. 
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sors  can  make  total  incentive  compensation  more  equitable  in  the  sense  of  attainability.  This 
prediction  is  in  line  with  previous  research  that  finds  that  more  attainable  targets  help  foster  a 
sense  of  fairness  (see,  e.g.,  Merchant  and  Manzoni  1989). 

In  sum,  we  hypothesize  that  supervisors  strategically  use  discretion  in  the  target-setting  pro¬ 
cess  to  help  restore  the  perception  of  fairness  among  those  managers  and  store  employees  who  feel 
that  the  composition  of  the  reference  groups  impairs  the  overall  fairness  of  the  incentive  system. 
More  formally: 

H2:  Downward  discretionary  target  adjustments  are  positively  associated  with  fairness  con¬ 
cerns  arising  from  other  elements  of  the  incentive  system. 

Conflict  Avoidance 

Supervisors  are  often  nonresidual  claimants  whose  incentives  are  not  perfectly  aligned  with 
those  of  the  company’s  owners.  Hence,  when  supervisors  tend  to  their  own  incentives  in  using 
their  discretion  during  the  compensation  contracting  process,  their  decisions  will  not  necessarily 
be  firm-value  maximizing  (e.g.,  Bol  2009).  In  a  target- setting  context,  this  suggests  that  some  of 
the  supervisors’  target  adjustments  will  reflect  their  own  utility  preferences,  where  utility  encom¬ 
passes  both  pecuniary  and  nonpecuniary  elements.  One  way  supervisors  can  reduce  personal  costs 
is  by  using  their  discretion  in  the  target-setting  process  to  avoid  painful  conflicts  or  confrontations, 
as  these  conflicts  are  stressful  and  hence  costly  to  supervisors  (Murphy  and  Cleveland  1995). 

In  our  setting,  where  a  downward  adjustment  to  one  initial  target  necessitates  an  upward 
adjustment  in  one  or  more  other  targets,  conflict  is  inevitable.  Nevertheless,  supervisors  can  reduce 
confrontation  costs  by  strategically  determining  which  subordinates  to  disappoint  and  which  to 
(partly)  satisfy.  By  disappointing  those  subordinates  who  are  less  likely  to  confront  their  superior, 
supervisors  avoid  conflict  as  much  as  possible. 

One  factor  influencing  how  much  “push  back”  a  supervisor  will  likely  receive  from  a  subor¬ 
dinate  is  the  subordinate’s  status,  that  is,  the  amount  of  respect  and  prominence  an  individual  holds 
within  a  social  group  (Anderson  et  al.  2001;  Anderson  and  Berdahl  2002).  Because  of  their  social 
standing,  high-status  individuals  are  more  likely  to  challenge  their  supervisor’s  decision.  Conse¬ 
quently,  communicating  unfavorable  information  to  relatively  high-status  individuals  is  expected 
to  be  more  costly  than  communicating  unfavorable  information  to  relatively  low-status  individuals 
who  are  more  likely  to  accept  the  decision  without  a  challenge  (Rahim  1986).  Accordingly,  we 
predict  that  supervisors  are  more  likely  to  adjust  targets  downward  for  those  subordinates  who 
hold  a  relatively  high  status  within  the  organization.  This  prediction  is  consistent  with  empirical 
research  from  psychology  that  suggests  that  managers  resolve  conflicts  differently  depending  on 
the  status  of  the  individuals  involved:  supervisors  tend  to  compromise  with  those  who  have  similar 
status,  whereas  they  tend  to  impose  decisions  on  those  with  lower  status  (e.g.,  Rahim  1986; 
Phillips  and  Cheston  1979;  Yukl  and  Falbe  1990).  This  prediction  is  also  in  line  with  findings  that 
show  that,  consciously  or  not,  individuals  concede  more  to  higher-status  individuals  in  negotia¬ 
tions  (Thye  2000;  Thye  et  al.  2006). 

We  note  that  a  related  investigation  in  the  accounting  domain  is  Anderson  et  al.  (2010),  who 
examine  whether  job  tenure  influences  target  attainability.  The  authors  find  a  negative  association 
between  subordinates’  job  tenure  and  target  levels.'^  Although  job  tenure  could  proxy  for  status, 
Anderson  et  al.  (2010)  argue  that  in  their  setting,  the  negative  association  between  job  tenure  and 
target  levels  is  more  likely  the  result  of  bargaining  power  from  increased  local  knowledge,  not 
status.  In  contrast,  the  source  of  status  that  is  especially  pertinent  in  our  setting  is  job  grade  within 


Indjejikian  and  Matejka  (2006)  incorporate  business-unit  controller  job  tenure  as  a  control  variable  but  find  no  evidence 
that  it  is  related  to  more  attainable  targets. 
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the  organization’s  hierarchy.  This  source  is  not  correlated  with  increased  knowledge  of  the  local 
environment,  as  POST  maintains  a  policy  of  frequent  job  rotation. Hence,  supervisors’  consid¬ 
eration  of  hierarchical  status  arises  from  individual  influence,  not  from  private  information. 

In  sum,  we  predict  that  supervisors  use  their  discretion  in  the  target- setting  process  to  de¬ 
crease  the  sales  targets  of  high-status  store  managers  to  reduce  confrontation  costs.  More  formally; 

H3:  Downward  discretionary  target  adjustments  are  positively  associated  with  higher  relative 
hierarchical  status. 


IV.  RESEARCH  DESIGN 

Sample 

To  study  the  target- setting  process,  we  collected  annual  performance  evaluation  data  from 
2000  (the  first  year  sales  targets  were  part  of  the  incentive  plan)  through  2003  for  POST’s  103  EU 
stores.  We  obtained  financial  and  operational  data  of  individual  stores  from  POST’s  performance 
measurement  system,  and  control  variables  such  as  demographics  and  individual  store  character¬ 
istics  from  one  of  the  company’s  databases  that  tracks  information  on  individual  stores.  To  gain  an 
understanding  of  the  incentive  plan  and  the  company’s  background,  we  smdied  a  variety  of 
company  documents  and  interviewed  corporate  headquarters’  managers,  including  one  of  POST’s 
incentive  plan  designers. 

Dependent  Variable 

To  capture  the  regional  directors’  use  of  discretion  in  the  target- setting  process,  we  measure 
the  extent  to  which  regional  directors  adjust  the  initial  formula-based  sales  estimate.  Recall  from 
Section  II  that  the  regional  directors  allocate  the  regional  sales  targets  set  by  corporate  headquar¬ 
ters  into  store-level  sales  targets.  The  regional  director  first  calculates  the  initial  sales  estimate  by 
allocating  the  regional  sales  target  according  to  the  relative  weight  of  the  stores’  prior- year  sales 
and  then,  after  receiving  information  from  the  individual  stores,  adjusts  this  estimate  upward  or 
downward  to  arrive  at  the  final  sales  target.  We  use  the  size  of  the  adjustment  (ADJUST),  mea¬ 
sured  by  subtracting  the  final  store-level  sales  target  {TAR_SALES)  from  the  regional  director’s 
initial  formulaic  estimate  (TAR_SALES_SUP),  as  our  proxy  for  regional  directors’  use  of  discretion 
at  each  store.  We  designate  ADJUSTJREG  and  ADJUST_STRA  as  the  adjustments  for  regular 
sales  and  strategic  sales,  respectively. 

Explanatory  Variables 

Compensation  Risk  Due  to  Environmental  Uncertainty 

To  examine  whether  regional  directors  use  discretion  in  the  target-setting  process  to  manage 
compensation  risk,  we  identify  stores  that  face  different  levels  of  compensation  risk.  We  do  so  by 
capturing  store- specific  levels  of  environmental  uncertainty.  In  our  setting,  regular  and  strategic 
mail  volumes  received  by  a  store  are  summary  measures  that  capture  external  changes  in  the  local 
business  environment.  These  measures  allow  us  to  use  the  change  in  regular  and  strategic  mail 


We  contacted  the  103  stores  directly  and  requested  store  managers’  tenure  data  (as  of  January  of  2001,  2002,  and  2003, 
when  the  target-setting  process  occurs).  We  received  218  store-year  managers’  tenure  data  (out  of  309  possible  obser¬ 
vations).  The  mean  (median)  store  manager  tenure  was  11  (9)  months  and  the  maximum  store  manager  tenure  was  35 
months.  We  also  obtained  a  list  of  the  22  former  regional  directors  who  served  at  a  specific  regional  headquarters  from 
1970  to  2003  and  found  that  the  mean  tenure  of  a  regional  director  was  19  months.  The  short  tenure  of  store  managers 
and  regional  directors  results  from  the  traditional  rank-in-person  system  of  the  Korean  government,  where  one  is  hired 
not  for  a  specific  position  but  as  an  individual  suited  to  occupy  multiple  positions.  This  system  has  been  criticized  for 
limiting  employees  from  developing  expertise  in  specialized  fields.  However,  this  is  not  the  case  for  POST’s  store 
managers  because  only  the  location  and  generally  not  the  job  function  changes. 
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volume  to  capture  the  volatility  of  the  business  environment  faced  by  each  store.  More  specifi¬ 
cally,  we  use  the  time-series  standard  deviation  of  the  logarithm  of  regular  (strategic)  mail  volume 
received  by  a  particular  store  over  the  most  recent  four  years  to  create  N01SE_MV_REG 
iNOISE_MV_STRA)  (Holthausen  et  al.  1995;  Nagar  2002). 


Fairness  Concerns 

To  analyze  whether  store  managers’  concerns  about  the  fairness  of  the  incentive  plan  influ¬ 
ence  regional  directors’  use  of  discretion  in  the  target-setting  process,  we  create  a  proxy  for  store 
managers’  fairness  perceptions.  As  described  in  Section  II,  one  of  POST’s  main  performance 
measures  is  based  on  a  comparison  to  a  reference  group  of  similar  stores.  Matsumura  and  Shin 
(2006)  document  that  store  managers  who  feel  that  their  reference  group  is  too  challenging  are  not 
motivated  to  exert  effort  because  they  consider  the  incentive  system  unfair.  We  create  a  proxy  that 
captures  such  perceptions  of  unfairness  by  measuring  how  challenging  a  store’s  reference  group  is. 
To  capture  the  difficulty  of  the  reference  group,  we  examine  a  store’s  profitability  as  compared  to 
its  reference  group  in  the  prior  year.  However,  because  weak  performance  relative  to  the  reference 
group  could  be  due  to  reasons  other  than  a  challenging  reference  group  (e.g.,  poor  management  or 
lack  of  effort),  we  also  incorporate  the  store’s  performance  compared  to  the  prior  year’s  fixed  sales 
targets.  We  reason  that  weak  performance  relative  to  the  reference  group  but  strong  performance 
compared  to  the  individual  target  indicates  a  challenging  reference  group. 

Consistent  with  Matsumura  and  Shin  (2006),  we  define  UNEAIR_REG  as  the  difference 
between  a  store’s  yearly  rank  on  its  RPE  profitability  measure  (profitability  divided  by  the  refer¬ 
ence  group  mean)  and  the  store’s  yearly  rank  on  the  regular  sales  target  measure  in  the  prior  year. 
That  is; 


UNFAIR_REG  =  Rank 


Store  Profitability 
Reference  Group  Profitability 


-  Rank 


Actual  Sales  of  Regular  Mailf-i 
Target  Sales  of  Regular  Maili_i 


UNFAIR_STRA  is  defined  similarly  for  strategic  mail,  using  the  actual  and  target  sales  of  strategic 
mail.  Higher  values  of  the  UNFAIR  variables  indicate  higher  unfairness.'"^  That  is,  we  expect  an 
increased  perception  of  unfairness  when  a  store’s  profitability  relative  to  its  peers  is  low  even 
though  it  meets  or  beats  its  sales  targets. 


Conflict  Avoidance 

To  examine  whether  directors  use  discretion  in  target  setting  to  reduce  confrontation  costs  by 
avoiding  conflicts  with  relatively  higher-status  store  managers,  we  use  job  grade  level,  which  is 
one  of  the  most  important  indicators  of  status  at  POST.'^  All  BU-level  store  managers  of  POST  are 
level  4  or  5  employees  and  regional  directors  are  level  1,  2,  or  3  employees  of  the  Korean 
government’s  pay  schedule.  We  use  this  information  to  create  the  indicator  variable  STATUS, 


Thus,  the  possible  rank  differences  range  from  -102  to  102.  Suppose  store  A  is  ranked  93rd  (out  of  103  stores)  on  the 
RPE  profitability  measure  but  ranked  5th  on  the  regular  sales  target  measure  in  2000.  This  indicates  that  while  store  A’s 
actual  sales  far  exceeded  its  regular  mail  sales  target,  its  profitability  relative  to  peer  stores  is  very  low.  The  difference 
in  ranking  for  this  store  is  88,  resulting  in  a  value  of  88  on  our  UNFAIR_REG  variable.  In  contrast,  suppose  store  B  is 
ranked  1st  on  the  RPE  profitability  measure  but  ranked  98th  on  the  regular-  sales  target  measure  in  2002.  Store  B’s 
difference  in  ranking  is  —97,  resulting  in  a  value  of  —97  on  our  UNFAIR_REG  variable. 

Similar  to  the  General  Schedule  of  U.S.  Federal  employees,  the  Korean  government  uses  job  grades  that  conform  to  the 
government’s  pay  schedule.  The  pay  schedule  assigns  a  grade  level  from  10  to  1  (with  1  the  highest)  to  each  job 
according  to  the  minimum  level  of  education  and  experience  that  the  position  requires.  An  employee’s  salary  increases 
as  that  employee’s  level  on  the  scale  increases.  Typically,  an  employee  moves  up  the  grade  scale  as  his/her  seniority 
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which  equals  1  if  the  difference  in  job  grade  level  between  the  regional  director  and  the  store 
manager  is  2  or  less,  and  0  otherwise. Hence,  STATUS  is  1  if  the  status  difference  between  the 
regional  director  and  the  store  manager  is  small. 

Control  Variables 

We  include  a  set  of  proxy  variables  that  control  for  differences  in  target- setting  behavior. 
First,  we  control  for  the  level  of  information  asymmetry  between  regional  directors  and  store 
managers,  as  the  budgeting  literature  has  long  recognized  the  potential  effect  of  information 
asymmetry  in  a  participative  budgeting  process  (e.g.,  Schiff  and  Lewin  1970;  Christensen  1982; 
Baiman  and  Evans  1983;  Indjejikian  and  Matejka  2006).  Drawing  on  the  performance  evaluation 
literature,  we  identify  divisional  complexity  as  an  important  source  of  information  asymmetry 
(Holthausen  et  al.  1995;  Bushman  et  al.  1995;  Keating  1997;  Bushman  et  al.  2004;  Berry  et  al. 
2006).  We  capture  the  complexity  of  the  different  EU  stores  by  including  the  number  of  small 
windows-only  stores  (N_SUB)  and  the  number  of  BU  stores  added  to  an  EU  store  for  performance 
evaluation  purposes  since  2001  {N_MERGE)}^ 

Second,  we  include  the  number  of  store  employees  {N_EMP)  as  a  proxy  for  size  (Indjejikian 
and  Matejka  2006).  Because  the  status  variable  is  likely  correlated  with  store  size,  it  is  important 
to  control  for  size  to  eliminate  potential  confounding  effects. 

Third,  we  control  for  the  competitive  environment  of  the  area  (COMPET)  in  which  the  store 
operates,  as  prior  research  documents  that  the  degree  of  competition  influences  budget  misrepre¬ 
sentation  (Bruggen  and  Luft  2009).  The  variable  COMPET  takes  the  value  of  1  for  urban  areas 
and  0  for  rural  areas.  We  distinguish  between  urban  and  rural  areas  because  major  competitors  of 
POST,  such  as  domestic  and  international  couriers,  mainly  operate  in  profitable  urban  areas  (see 
Banker  and  Mashruwala  [2007]  for  a  similar  distinction  in  a  retail  industry  setting).  This  variable 
also  captures  the  economic  viability  of  the  area  in  which  the  store  operates. 

Finally,  in  addition  to  year  indicator  variables,  we  include  seven  region  indicator  variables 
(REGION)  to  control  for  unspecified  supervisor  and  region-specific  differences  in  the  target¬ 
setting  process. 

The  above  discussion  leads  to  the  following  specifications: 


(1) 


-I-  REGION JNDICATORS  +  YEARJNDICATORS  +  e,, 


(2) 


-I-  REGION  JNDICATORS  +  YEARJNDICATORS  +  s,- ^ 


where  Equation  (1)  pertains  to  regular  mail.  Equation  (2)  pertains  to  strategic  mail,  i  indicates 


within  the  firm  increases,  although  above-average  job  performance  can  accelerate  this  progress.  We  obtained  store 
managers’  salary  data  for  1997  and  found  that  the  average  salary  for  a  level  4  store  manager  is  significantly  higher  than 
that  of  a  level  5  store  manager  (p  =  0.07). 

The  mean  (median)  job  grade  difference  between  a  store  manager  and  a  regional  director  in  our  sample  is  2.18  (2.00). 
Thus,  we  use  the  median  job  grade  difference  (2)  as  our  cut-off  to  create  the  indicator  variable  STATUS. 

Prior  research  also  documents  budget  emphasis  and  the  degree  of  budget  participation  as  factors  that  influence  subor¬ 
dinates’  target-setting  behavior  (e.g..  Merchant  1985;  Dunk  1993;  Dunk  and  Nouri  1998).  However,  because  the  em¬ 
phasis  on  targets  and  the  degree  of  participation  in  the  target-setting  process  are  consistent  over  our  sample,  we  do  not 
separately  control  for  these  factors. 
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stores  (/  =  1,...,103),  and  t  indicates  time  {t  =  2001,  2002,  2003)/^  We  estimate  Equations  (1) 
and  (2)  using  OLS  regression  analysis  with  Huber- White  robust  standard  errors  clustered  by  store. 
These  standard  errors  are  robust  to  both  serial  correlation  and  heteroscedasticity  (Rogers  1993).*^ 

V.  RESULTS 

Descriptive  Statistics  and  Univariate  Correlations 

Table  2  presents  descriptive  statistics  of  the  variables  used  in  the  analyses.  The  average 
store-level  sales  target  for  regular  mail  services  {TAR_SALES_REG)  is  7.81  billion  Korean  Won, 
while  the  average  sales  target  for  strategic  services  {TAR_SALES_STRA)  is  2.27  billion  Korean 
Won.  The  mean  of  ADJUST  is  0  by  design  because  the  target-setting  process  at  the  regional  level 
is  a  zero-sum  game.  The  mean  of  NOISE_MV_STRA  is  higher  than  that  of  NOISE_MV_REG, 
which  indicates  that  strategic  mail  volume  displays  higher  temporal  variability  than  regular  mail 
volume.  The  mean  of  UNFAIR  is  also  0  by  construction  because  this  measure  is  based  on  the  raw 
difference  between  a  store’s  rank  on  the  relative  profitability  measure  and  its  rank  on  the  sales 
target  measure.  The  mean  of  STATUS  is  0.57;  that  is,  57  percent  of  the  store  managers  have  a  job 
grade  that  is  within  2  levels  of  their  regional  supervisor.  On  average,  EU  stores  are  responsible  for 
approximately  25  windows-only  stores  and  two  BU  stores  (the  mean  of  N_MERGE  is  1.07).  While 
not  reported  in  Table  2,  regional  headquarters  are  responsible  for  supervising  22  EU-level  stores 
on  average.  Consistent  with  Merchant  and  Manzoni’s  (1989)  field-based  findings  that  targets  are 
set  to  be  achievable  80  to  90  percent  of  the  time,  POST  stores’  actual  sales  in  regular  (strategic) 
mail  exceed  target  sales  73  percent  (83  percent)  of  the  time.^'  Table  3  provides  details  on  Pearson 
correlations  among  the  variables  used  in  the  analyses. 

Determinants  of  Supervisors’  Target  Adjustments 

Columns  (1)  and  (2)  of  Table  4  present  the  estimation  results  of  Equations  (1)  (regular  mail) 
and  (2)  (strategic  mail),  which  examine  regional  directors’  use  of  discretion  in  the  target-setting 
process.  Consistent  with  HI,  the  coefficient  on  NOISE_MV,  our  proxy  for  environmental  uncer¬ 
tainty,  is  positive  and  significant  at  the  5  percent  level  for  strategic  mail.  This  is  consistent  with 
regional  directors  using  their  discretion  in  the  target-setting  process  to  manage  compensation  risk 
arising  from  environmental  uncertainty.  The  coefficient  on  NOISE_MV  is  not  significant  for  regu¬ 
lar  mail.  A  possible  explanation  for  this  finding  is  that  the  lower  differential  compensation  risk  for 
regular  mail  (the  temporal  variability  of  mail  volume  is  lower  for  regular  mail  than  for  strategic 
mail,  as  seen  in  Table  2)  reduces  the  need  for  risk  adjustments  to  a  level  where  our  tests  are  not 
powerful  enough  to  detect  them  empirically.  The  low  environmental  uncertainty  in  conjunction 
with  the  much  lower  weight  placed  on  the  regular  mail  sales  target  (i.e.,  6  percent  for  regular  mail 


Because  the  BU  store-level  sales  targets  are  determined  at  the  beginning  of  the  year,  all  explanatory  variables  are  lagged 
values. 

Because  the  sum  of  all  store-level  adjustments  that  a  regional  director  makes  in  a  given  year  must  be  zero,  we  check 
whether  our  findings  are  robust  to  potential  error  dependence  among  stores  in  the  same  region  for  a  given  year  by  using 
standard  errors  clustered  by  24  region-year  pairings  (8  regions  for  3  years).  In  addition,  we  run  yearly  regressions  of 
Equations  (1)  and  (2)  using  standard  errors  clustered  by  region.  The  results  are  similar  to  those  reported  in  Table  4. 
Descriptive  statistics  are  based  on  309  store-year  observations  from  2001  to  2003  for  multivariate  analyses. 

We  compare  the  relative  forecast  accuracy  of  the  final  targets  {TAR_SALES)  and  the  director’s  initial  formula-based 
estimate  {TAR _S ALES _SUP)  with  respect  to  predicting  actual  sales.  Consistent  with  the  measurement  of  analyst  forecast 
error  in  financial  archival  research,  we  construct  a  measure  of  forecast  accuracy  by  taking  the  absolute  value  of  the 
difference  between  the  final  sales  target  (initial  sales  estimate)  and  actual  sales,  scaled  by  actual  sales.  We  find  that  the 
regional  director’s  initial  formula-based  estimate  is  a  more  accurate  predictor  (i.e.,  exhibits  smaller  forecast  error)  of 
actual  sales  for  both  regular  and  strategic  mail.  The  paired  t-tests  suggest  that  the  difference  in  forecast  accuracy  is 
statistically  significant  at  the  1  percent  level.  This  finding  lends  support  to  the  conjecture  that  supervisors’  strategic  and 
self-serving  use  of  discretion  in  target  setting  may  result  in  targets  that  are  less  predictive  of  future  performance. 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


Supervisor  Discretion  in  Target  Setting:  An  Empirical  Investigation 


1875 


TABLE  2 


Descriptive  Statistics 


Variable 

Mean 

Std.  Dev. 

Ql 

Median 

Q3 

n 

TAR_SALES_REGt 

7.81 

7.29 

3.84 

5.97 

8.67 

309 

TAR_SALES_STRAt 

2.27 

1.86 

1.29 

1.83 

2.48 

309 

ACT_SALES_REGi 

8.42 

7.38 

4.01 

6.46 

9.63 

309 

ACT _S ALES _STRA, 

2.46 

1.78 

1.49 

2.02 

2.84 

309 

TAR__SALES_REG_SUPt 

7.81 

6.83 

3.72 

6.01 

9.05 

309 

TAR__SALES_STRA_SUPt 

2.27 

1.70 

1.31 

1.83 

2.61 

309 

ADJUST_REGi 

0.00 

1.15 

-0.23 

-0.03 

0.28 

309 

ADJUST_STRA, 

0.00 

0.32 

-0.08 

0.02 

0.12 

309 

NOISE_MV_REGt^i 

0.11 

0.13 

0.04 

0.07 

0.12 

309 

NOISE_MV_STRA,_i 

0.16 

0.11 

0.09 

0.14 

0.20 

309 

UNFAIR_REGt_i 

0.00 

40.79 

-26.00 

-2.00 

29.00 

309 

UNFAIR_STRAt_i 

0.00 

40.84 

-28.00 

1.00 

26.00 

309 

STATUSt_i 

0.57 

0.50 

0.00 

1.00 

1.00 

309 

N_SUB,_^ 

25.11 

13.93 

14.00 

22.00 

35.00 

309 

N_MERGE,_^ 

1.07 

1.01 

0.00 

1.00 

2.00 

309 

N_EMPt_x 

2.50 

0.86 

1.88 

2.43 

3.05 

309 

COMPETt.i 

0.61 

0.49 

0.00 

1.00 

1.00 

309 

Variable  Definitions: 

TAR_SALES_REG 

TAR_SALES_STRA 

ACT_SALESJtEG 

ACT_SALES_STRA 

TAR_SALES_REG_SUP 

TAR_SALES_STRA_SUP 

ADJUST_REG 

ADJUST_STRA 

NOISE_MV_REG 

NOISEJAV_STRA 

UNFAIRJIEG 


UNFAIR_STRA 


STATUS 

N_SUB 

NJAERGE 

N_EMP 

COMPET 


=  store-level  sales  target  for  regular  mail  (in  billion  Korean  Won); 

=  store-level  sales  target  for  strategic  mail  (in  billion  Korean  Won); 

=  store-level  actual  sales  from  regular  mail  (in  billion  Korean  Won); 

=  store-level  actual  sales  from  strategic  mail  (in  billion  Korean  Won); 

=  regional  director’s  initial  sales  estimate  for  regular  mail  (in  billion  Korean  Won); 

=  regional  director’s  initial  sales  estimate  for  strategic  mail  (in  billion  Korean  Won); 

=  TAR^SALES_REG_SUP  -  TAR_SALES_REG; 

=  TAR_SALES_STRA_SUP  -  TAR_SALES_STRA; 

=  time-series  standard  deviation  of  the  logarithm  of  regular  mail  volume  received  by  the 
store  over  the  most  recent  four  years; 

=  time-series  standard  deviation  of  the  logarithm  of  strategic  mail  volume  received  by  the 
store  over  the  most  recent  four  years; 

=  difference  between  the  store’s  rank  on  relative  profitability  and  rank  on  the  sales  target 
measure  for  regular  mail,  with  higher  values  indicating  stores  that  exhibited  a  high 
achievement  of  regular  mail  sales  compared  to  their  target,  but  low  relative  profitability; 
=  difference  between  the  store’s  rank  on  relative  profitability  and  rank  on  the  sales  target 
measure  for  strategic  mail,  with  higher  values  indicating  stores  that  exhibited  a  high 
achievement  of  strategic  mail  sales  compared  to  their  target,  but  low  relative  profitability; 
=  indicator  variable  that  takes  the  value  of  1  if  the  difference  in  job  grade  level  between 
the  regional  director  and  the  store  manager  is  2  or  less,  and  0  otherwise; 

=  number  of  small  windows-only  stores  the  EU  manager  is  responsible  for; 

=  number  of  BU  stores  added  to  an  EU  store  for  performance  evaluation  purposes; 

=  number  of  employees  (in  hundreds);  and 

=  indicator  variable  that  takes  the  value  of  1  for  stores  in  urban  areas  and  0  for  stores  in 
rural  areas. 


versus  14  percent  for  strategic  mail)  make  it  less  likely  that  the  costs  of  making  the  adjustment 
outweigh  the  benefits. 
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TABLE  4 

Determinants  of  Supervisors’  Target  Adjustments 
{t  =  2001-2003) 


ADJUSTi^  =  ao  +  aiNOISE_MVi^_i  +  a2UNFAIRi^_i  +  aj,STATUSi^_2  +  a^N_SVBit_i 
+  asN_MERGEi^_i  +  a^_EMPi^_i  +  ajCOMPETi^_i 

+  REGION JNDICATORS  +  YEARJNDICATORS  + 


(1)  (2) 


ADJUST 


Variable 

Predicted 

Sign 

ADJUST REG 

ADJUST STRA 

Coefficient 

(t-statistic) 

Coefficient 

(t-statistic) 

Constant 

7 

0.944** 

0.378*** 

(2.15) 

(2.95) 

NOISE_MVt_y 

(  +  ) 

0.154 

0.305** 

(0.30) 

(2.02) 

UNFAIR,_^ 

(  +  ) 

0.006*** 

0.001* 

(2.64) 

(1.40) 

STATUSt_^ 

(  +  ) 

0.604*** 

0.107** 

(2.95) 

(1.65) 

N_SUBt_x 

(  +  ) 

0.069*** 

0.022*** 

(3.47) 

(3.30) 

N_MERGE,_^ 

(  +  ) 

0.143 

0.097** 

(1.09) 

(2.92) 

N_EMPt_^ 

7 

-1.024*** 

-0.357*** 

(-4.65) 

(-3.45) 

COMPETt_^ 

7 

0.351** 

0.053 

(2.11) 

(1.09) 

n 

309 

309 

Adjusted 

’  23.5% 

31.6% 

*,  **,  ***  Significant  at  the  0,10,  0.05,  and  0.01  levels,  respectively  (one-tailed  test  if  a  signed  prediction  is  provided, 
two-tailed  otherwise). 

For  parsimony,  coefficients  on  region  and  year  indicators  are  not  reported. 

Reported  t-statistics  are  based  on  the  Rogers  (1993)  standard  errors. 


Variable  Definitions: 

ADJUST 

ADJUST_REG 

ADJUST_STRA 

NOISE_MVJiEG 

NOISE_MV_STRA 


ADJUST_REG  or  ADJUST_STRA; 

TAR_SALES_REG_SUP  -  TAR_SALES_REG-, 

TAR_SALES_STRA_SUP  -  TAR_SALES_STRA\ 

time-series  standard  deviation  of  the  logarithm  of  regular  mail  volume  received  by  the 
store  over  the  most  recent  four  years; 

time-series  standard  deviation  of  the  logarithm  of  strategic  mail  volume  received  by  the 
store  over  the  most  recent  four  years; 


(continued  on  next  page) 
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TABLE  4  (continued) 


UNFA1R_REG  = 


UNFAIR_STRA  = 


STATUS  = 

N_SUB  = 
N_MERGE = 

N_EMP  = 
COM  PET  = 


difference  between  the  store’s  rank  on  relative  profitability  and  rank  on  the  sales  target 
measure  of  regular  mail,  with  higher  values  indicating  stores  that  exhibited  a  high 
achievement  of  regular  mail  sales  compared  to  their  target,  but  low  relative  profitability; 
difference  between  the  store’s  rank  on  relative  profitability  and  rank  on  the  sales  target 
measure  of  strategic  mail,  with  higher  values  indicating  stores  that  exhibited  a  high 
achievement  of  strategic  mail  sales  compared  to  their  target,  but  low  relative  profitability; 
indicator  variable  that  takes  the  value  of  1  if  the  difference  in  job  grade  level  between 
a  regional  director  and  a  store  manager  is  2  or  less,  and  0  otherwise; 
number  of  small  windows-only  stores  the  EU  manager  is  responsible  for; 
number  of  BU  stores  added  to  an  EU  store  for  performance  evaluation  purposes  since 
2001; 

number  of  employees  (in  hundreds);  and 

indicator  variable  that  takes  the  value  of  1  for  stores  in  urban  areas  and  0  for  stores  in 
rural  areas. 


Next,  as  predicted  by  H2,  the  results  in  Table  4  suggest  that  regional  directors  use  their 
discretion  to  address  apparent  unfairness  of  other  elements  of  the  incentive  contract.  The  coeffi¬ 
cients  on  UNFAIR  are  positive  and  significant  at  the  1  and  10  percent  levels  for  regular  and 
strategic  mail,  respectively.  Recall  from  Section  II  that  from  a  store  manager’s  perspective,  regular 
mail  is  less  controllable  than  strategic  mail.  This  difference  is  relevant,  as  competing  against  a 
reference  group  of  stronger  stores  will  likely  be  even  more  frustrating  when  the  manager  has  little 
control  over  the  specific  dimension.  When  addressing  fairness  concerns,  regional  directors  are 
therefore  more  likely  to  adjust  targets  on  the  less  controllable  dimension  than  on  the  more  con¬ 
trollable  dimension.  Hence,  the  stronger  result  for  regular  mail,  which  is  less  controllable,  is 
consistent  with  our  fairness-based  predictions.^^ 

Table  4  also  shows  that,  consistent  with  H3,  the  coefficients  on  STATUS  are  positive  and 
significant  at  the  1  percent  level  for  regular  mail  and  at  the  5  percent  level  for  strategic  mail.  This 
finding  suggests  that  supervisors  assign  more  attainable  targets  to  store  managers  of  more  similar 
social  status  to  reduce  confrontation  costs. 

Finally,  our  results  corroborate  the  findings  of  prior  literature  on  budgetary  slack,  suggesting 
that  complexity  of  operations  creates  information  asymmetry  that  subordinates  use  to  their  advan¬ 
tage  to  obtain  more  attainable  targets.  Specifically,  the  coefficients  on  N_SUB  are  positive  and 
significant  at  the  1  percent  level  for  both  regular  and  strategic  mail  and  the  coefficient  on 
N_MERGE  is  positive  and  significant  at  the  5  percent  level  for  strategic  mail. 


As  discussed  in  Section  II,  a  regional  director’s  bonus  rate  is  the  average  bonus  rate  of  all  stores  in  the  same  region. 
Hence,  assigning  more  (less)  attainable  sales  targets  to  stores  when  poor  (favorable)  RPE  performance  is  anticipated 
may  also  be  consistent  with  a  regional  director’s  economic  incentive.  If  a  regional  director’s  economic  incentive  drives 
our  results,  however,  we  would  expect  stronger  results  for  strategic  mail  than  for  regular  mail  because  strategic  mail 
receives  a  higher  incentive  weight  (14  percent  as  compared  to  6  percent  of  regular  mail)  and  therefore  provides  an  easier 
way  to  create  an  impact.  This  is  not  what  the  results  show  (see  Table  4).  Furthermore,  if  a  regional  director  solely  tries 
to  maximize  his/her  bonus  payout  when  setting  store-level  sales  targets,  s/he  would  only  consider  a  store’s  prior  year 
performance  on  the  RPE  incentive  element  rather  than  comparing  the  store’s  RPE  performance  to  its  sales  target 
performance.  When  we  replace  UNFAIRJREG  and  UNEAIR_STRA  with  a  store’s  rank  on  relative  profitability  in  the 
prior  year  to  better  capture  a  director’s  self-interest,  we  find  that  the  coefficient  on  RPE  rank  is  insignificant  (p  =  0.13 
(0.35)  one-tailed)  for  regular  (strategic)  mail  in  contrast  to  the  results  reported  in  Table  4.  Overall,  we  conclude  that  our 
results  are  more  consistent  with  our  fairness-based  predictions. 
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VI.  ADDITIONAL  ANALYSES  AND  ROBUSTNESS  TESTS 
Related  Dependent  Variables:  Attainability 

In  the  main  analysis,  we  use  the  extent  to  which  the  regional  directors  adjust  their  formulaic 
initial  estimate  upward  or  downward  for  final  target  determination  as  a  proxy  for  supervisors’  use 
of  discretion  in  target  setting.  Because  of  the  focus  in  the  budgeting  literature  on  the  related  issue 
of  target  attainability,  and  because  target  attainability  is  one  of  the  factors  that  regional  directors 
consider  as  they  adjust  initial  targets  upward  or  downward,  we  compare  our  findings  to  results 
using  two  measures  of  target  attainability  as  the  dependent  variable. 

First,  in  the  target-setting  process,  supervisors  likely  make  a  private  estimate  of  future  store 
sales  that,  unlike  the  initial  sales  estimate,  is  based  not  only  on  past  sales,  but  also  on  the  regional 
director’s  knowledge  of  the  store’s  characteristics  and  competitive  environment.  We  try  to  capture 
this  private  estimate  by  using  a  prediction  model  that  includes  past  sales,  store  characteristics,  and 
environmental  characteristics  available  in  POST’s  databases  (see  the  Appendix  for  details).  We  use 
the  store-year-level  predicted  values  of  sales  resulting  from  the  prediction  model  to  proxy  for  the 
regional  directors’  private  estimates  of  future  sales.  We  then  compute  target  attainability  {AT¬ 
TAIN  1)  by  taking  the  difference  between  the  proxy  for  the  directors’  estimates  of  sales  and  the 
final  sales  targets.  A  positive  (negative)  value  means  that  the  target  is  set  lower  (higher)  than  the 
level  predicted  by  the  exogenous  factors  influencing  the  store-level  sales,  which  we  interpret  as 
implying  that  the  sales  target  is  more  (less)  attainable. 

Second,  following  prior  studies  (e.g..  Merchant  and  Manzoni  1989;  Indjejikian  and  Nanda 
2002;  Indjejikian  and  Matejka  2006;  Anderson  et  al.  2010)  we  use  the  difference  between  actual 
sales  and  target  sales  as  an  additional  proxy  for  target  attainability  {ATTAIN!). 

Table  5  presents  the  results  from  estimating  Equations  (1)  and  (2)  when  we  use  our  proxies 
for  target  attainability,  ATTAIN  I  and  ATTAIN!. In  contrast  to  the  results  of  our  main  analysis 
concerning  HI,  the  coefficient  on  NOISEJdV  is  positive  and  significant  at  the  1  percent  level  for 
regular  mail  but  is  not  significant  for  strategic  mail  when  ATTAIN  I  is  used  as  a  dependent  variable. 
When  we  use  ATTAIN!  as  a  dependent  variable,  however,  the  results  are  similar  to  those  reported 
in  Table  4. 

Consistent  with  the  results  of  our  main  analysis  presented  in  Table  4,  we  find  support  for  H2 
and  H3  when  using  the  alternative  dependent  variables.  The  coefficient  on  UNFAIR  is  positive  and 
significant  at  the  5  percent  level  or  less  except  in  column  (4),  where  the  coefficient  is  not  signifi¬ 
cant  (p  =  0.46,  one-tailed).  The  coefficient  on  STATUS  is  also  positive  and  significant  at  the  5 
percent  level  except  in  column  (4),  where  the  coefficient  is  not  significant  (p  =  0.12,  one-tailed). 
We  also  confirm  the  effect  of  information  asymmetry  on  target  attainability.  In  all  four  regressions, 
the  coefficients  on  N_SUB  are  positive  and  significant.  Overall,  we  are  able  to  confirm  the  findings 
from  our  main  analysis  when  we  replace  the  extent  of  regional  directors’  discretionary  target 
adjustments  with  measures  of  target  attainability,  suggesting  that  target  attainability  appears  to 
be  a  key  factor  underlying  regional  directors’  target  adjustments.  This  result  is  also  consistent  with 
findings  in  prior  research  that  supervisors  consider  a  number  of  factors  when  they  determine 
business  units’  targets  (Merchant  and  Manzoni  1989). 


As  expected,  our  proxies  for  target  attainability  are  positively  correlated  with  our  measure  of  supervisors’  target 
adjustments  (ADJUST)  at  the  1  percent  level.  The  correlation  between  ADJUSTJREG  (STRA)  and  ATTAINIJREG 
(STRA)  is  0.30  (0.36)  and  the  correlation  between  ADJUST_REG  (STRA)  and  ATTAINIJREG  (STRA)  is  0.64  (0.81). 
By  comparing  actual  performance  against  target  performance  over  time,  prior  research  documents  that  exceeding  the 
target  in  the  current  year  is  positively  associated  with  the  probability  of  exceeding  the  target  in  the  next  year  (Indjejikian 
and  Nanda  2002;  Indjejikian  and  Matejka  2006).  This  suggests  that  our  measures  of  target  attainability,  and  ATTAIN!  in 
particular,  could  be  serially  correlated.  To  control  for  potential  serial  correlation  we  include  the  lagged  value  of  ATTAIN  1 
and  ATTAIN!  in  the  estimation  models;  this  does  not  change  our  inferences. 
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TABLE  5 

Determinants  of  Target  Attainability 
{t  =  2001-2003) 


ATTAIN =  ao  +  a^NOISE_MVi^_i  +  ajUNFAIRi^.^  +  a^TATUS^.i  +  a^N_SUBi^_i 
+  a^N  _MERGEij_i  +  afJS_EMP^_i  +  ajCOMPETy_i 
+  REGION JNDICATORS  +  YEARJNDICATORS  +  £y 


0) _ (2) _ (3) _ W 

Alternative  Dependent  Measure 


ATTAINl  ATTAIN! 


rreaiciea 

Sign 

REG 

STRA 

REG 

STRA 

Constant 

? 

0.178 

0.738** 

0.884** 

0.235** 

(0.49) 

(2.16) 

(2.20) 

(2.36) 

N01SE_MV,_i 

(+) 

1.646*** 

0.531 

0.429 

0.352** 

(2.48) 

(1.26) 

(0.62) 

(2.17) 

UNFAlRt^i 

(+) 

0.012*** 

0.005*** 

0.006** 

0.000 

(4.99) 

(3.38) 

(1.97) 

(0.10) 

STATU 

(+) 

0.708** 

0.399** 

0.292** 

0.064 

(2.13) 

(2.14) 

(1.99) 

(1.18) 

N_SUB,^i 

(+) 

0.067*** 

0.038*** 

0.034** 

0.017*** 

(3.07) 

(2.38) 

(2.09) 

(3.41) 

N  MERGE, 

(+) 

0.013 

0.051 

-0.171 

0.060** 

(0.09) 

(0.61) 

(-1.43) 

(2.12) 

N_EMPt-x 

? 

-0.655** 

-0.701*** 

-0.129 

-0.210*** 

(-2.13) 

(-2.60) 

(-1.05) 

(-2.85) 

COM  PET, 

? 

1.161*** 

0.480*** 

0.450*** 

0.068* 

(6.28) 

(3.86) 

(2.97) 

(1.51) 

n 

309 

309 

309 

309 

Adjusted 

42.3% 

23.4% 

32.3% 

24.3% 

*,  **,  ***  Significant  at  the  0.1,  0.05,  and  0.01  levels,  respectively  (one-tailed  test  if  a  signed  prediction  is  provided, 
two-tailed  otherwise). 

For  parsimony,  coefficients  on  region  and  year  indicators  are  not  reported. 

Reported  t-statistics  are  based  on  the  Rogers  (1993)  standard  errors. 

Variable  Definitions: 

ATTAIN  =  ATTAIN _REG  or  ATTAIN _STRA; 

ATTAIN  1_REG  =  store  and  year-specific  predicted  value  obtained  by  estimating  Equation  (3)  in  the 
Appendix  minus  TAR_SALES_REG\ 

ATTAINl _STRA  =  store  and  year-specific  predicted  value  obtained  by  estimating  Equation  (4)  in  the 
Appendix  minus  TAR_SALES_STRA; 

ATTAIN2_REG  =  ACT_SALES_REG  -  TAR_SALES_REG\ 

ATTAIN2_STRA  =  ACT _S ALES _STRA  -  TAR_SALES_STRA- 
NOISE_MV_REG  =  time-series  standard  deviation  of  the  logarithm  of  regular  mail  volume  received  by  the 
store  over  the  most  recent  four  years; 


(continued  on  next  page) 
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TABLE  5  (continued) 


NOISE_MV _STRA  —  time-series  standard  deviation  of  the  logarithm  of  strategic  mail  volume  received  by  the 
store  over  the  most  recent  four  years; 

UNFAIR_REG  =  difference  between  the  store’s  rank  on  relative  profitability  and  rank  on  the  sales  target 
measure  for  regular  mail,  with  higher  values  indicating  stores  that  exhibited  a  high 
achievement  of  regular  mail  sales  compared  to  their  target,  but  low  relative  profitability; 

UNEAIR_STRA  =  difference  between  a  store’s  rank  on  relative  profitability  and  rank  on  the  sales  target 
measure  for  strategic  mail,  with  higher  values  indicating  stores  that  exhibited  a  high 
achievement  of  strategic  mail  sales  compared  to  their  target,  but  low  relative  profitability; 

STATUS  =  indicator  variable  that  takes  the  value  of  1  if  the  difference  in  job  grade  level  between 
the  regional  director  and  the  store  manager  is  2  or  less,  and  0  otherwise; 

N_SUB  =  number  of  small  windows-only  stores  the  EU  manager  is  responsible  for; 

N_MERGE  =  number  of  BU  stores  added  to  an  EU  store  for  performance  evaluation  purposes  since 
2001; 

N_EMP  =  number  of  employees  (in  hundreds);  and 
COMPET  =  indicator  variable  that  takes  the  value  of  1  for  stores  in  urban  areas  and  0  for  stores  in 
rural  areas. 


Robustness  Tests 

Alternative  Measures  of  Environmental  Uncertainty 

To  test  the  robustness  of  our  findings  related  to  compensation  risk  due  to  environmental 
uncertainty,  we  capture  environmental  uncertainty  in  alternative  ways.  Specifically,  in  line  with 
Nagar  (2002),  we  use  the  average  percentage  growth  rate  in  regular  and  strategic  mail  volume 
received  by  a  store  over  the  past  three  years  and  the  prior  year’s  percentage  growth  rate.  All 
inferences  remain  unchanged  when  using  these  alternative  measures. 

Alternative  Measures  of  Unfairness 

We  explore  the  robustness  of  our  findings  related  to  fairness  concerns  by  re-estimating  Equa¬ 
tions  (1)  and  (2)  using  alternative  measures  of  unfairness.  First,  we  use  an  indicator  variable  that 
takes  the  value  of  1  if  the  rank  difference  between  the  RPE  profitability  measure  and  the  sales 
target  measure  is  in  the  top  10  percent  (50  percent),  and  0  otherwise.  Second,  we  take  the 
difference  between  the  store’s  yearly  rank  on  its  RPE  profitability  measure  and  the  store’s  yearly 
rank  on  its  profitability  growth  measure  (instead  of  the  sales  target  measure)  in  year  t-\.  Finally, 
we  use  the  raw  value  of  the  RPE  profitability  measure  in  year  t-\.  All  inferences  remain  un¬ 
changed  under  the  first  and  second  alternative  measures.  When  we  use  the  third  alternative  mea¬ 
sure,  the  coefficient  is  significantly  negative  at  the  5  percent  level  for  regular  mail,  but  is  insig¬ 
nificant  for  strategic  mail  (p  =  0.21,  one-tailed). 

Alternative  Measures  of  Status 

We  also  examine  the  robustness  of  our  findings  related  to  conflict  avoidance  using  an  alter¬ 
native  measure  of  status.  Specifically,  we  use  a  measure  that  captures  a  store  manager’s  status 
relative  to  other  store  managers  in  the  same  region,  rather  than  relative  to  the  regional  director. 
This  measure  is  constructed  by  creating  an  indicator  variable  that  equals  1  for  store  managers 
whose  job  grade  level  corresponds  to  grade  4,  and  0  for  store  managers  whose  Job  grade  level 
corresponds  to  grade  5.  All  inferences  remain  unchanged  when  we  re-estimate  Equations  (1)  and 
(2)  using  this  alternative  measure  of  status. 

Potential  Scale  Effects 

To  ensure  that  the  results  are  robust  to  potential  scale  effects,  we  deflate  the  dependent 
variables  ADJUST,  ATTAIN  1,  and  ATTAIN2  by  target  sales  and  re-estimate  all  regression  equa- 
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tions.  The  re-estimated  results  (not  tabulated)  are  similar  to  those  reported  in  Tables  4  and  5. 
Additional  Controls 

We  also  perform  additional  tests  to  control  for  two  alternative  explanations  for  our  results. 
First,  more  experienced  regional  directors  may  have  greater  ability  to  unravel  the  slack-building 
behavior  of  store  managers  with  private  information.  To  test  for  this  possibility,  we  create  the 
indicator  variable  EXPER,  which  takes  the  value  of  1  for  directors  who  are  internally  promoted 
and  0  for  directors  hired  from  outside  of  POST,  and  interact  this  variable  with  N_SUB,  one  of  our 
information  asymmetry  variables.  The  results  (not  tabulated)  show  that  the  interaction  term  be¬ 
tween  EXPER  and  N_SUB  is  insignificant  for  regular  mail  (p  =  0.39,  one-tailed)  and  only  mar¬ 
ginally  significantly  negative  for  strategic  mail  (p  =  0.08,  one-tailed).  All  other  inferences  remain 
unchanged. 

Second,  we  examine  whether  unknown  individual  store  manager  traits  that  are  correlated  with 
store  characteristics  drive  our  results.  We  analyze  this  by  calculating  the  average  store  observation 
for  all  variables  and  using  these  newly  created  variables  to  estimate  Equations  (1)  and  (2).  Because 
the  mean  store  manager  tenure  is  less  than  one  year,  averaging  yearly  observations  for  a  given 
store  eliminates  any  potential  effect  of  store  manager  traits  on  target  adjustments.  The  results  are 
robust  to  averaging  yearly  observations  for  a  given  store. 

VII.  CONCLUDING  REMARKS 

Using  2001  to  2003  annual  performance  evaluation  data  on  103  postal  stores  in  Korea,  we 
analyze  how  regional  directors  use  their  discretion  to  allocate  regional  sales  targets  set  by  corpo¬ 
rate  headquarters  to  individual  stores  in  the  region.  Our  findings  are  consistent  with  predictions 
based  on  agency  and  behavioral  theories  that  regional  directors  strategically  use  discretion  in  the 
target-setting  process  to  manage  compensation  risk  and  address  fairness  concerns.  The  results  are 
also  consistent  with  our  prediction  based  on  research  in  psychology  that  regional  directors  provide 
easier  targets  to  store  managers  with  relatively  higher  hierarchical  status  to  reduce  confrontation 
costs. 

Our  study  has  several  important  implications.  First,  our  results  suggest  that  supervisor  discre¬ 
tion  is  used  strategically  to  improve  incentive  contracting.  In  our  setting,  some  stores’  initial  sales 
targets  based  on  the  regional  targets  set  by  corporate  headquarters  may  impose  excessive  risk 
because  of  higher  store-specific  levels  of  environmental  uncertainty.  We  find  evidence  that  super¬ 
visors  manage  this  risk  by  using  their  discretion  in  the  target- setting  process  to  assign  more 
attainable  targets  to  stores  facing  higher  levels  of  store-specific  risk.  These  adjustments  effectively 
provide  a  risk  premium  for  bearing  more  risk. 

Our  study  also  suggests  that  supervisors  strategically  use  discretion  in  the  target-setting  pro¬ 
cess  to  deal  with  fairness  concerns.  Importantly,  this  study  is  the  first  to  provide  field  data  evidence 
consistent  with  supervisors  exercising  discretion  in  the  target-setting  process  to  counter  other 
elements  of  the  incentive  contract  that  may  be  considered  unfair.  This  strategic  use  of  discretion  to 
harmonize  the  elements  of  the  compensation  contract  underscores  the  importance  of  studying  the 
compensation  system  as  a  whole.  While  the  majority  of  studies  that  examine  compensation  con¬ 
tracts  focus  on  one  or  two  elements  in  isolation,  our  results  suggest  that  supervisors  view  different 
elements  of  the  compensation  contract  as  interrelated.  Thus,  when  a  manager  has  no  influence  over 
a  certain  dimension  of  the  compensation  contract  but  has  discretion  over  others,  spillover  effects 
are  likely  to  occur  (Bol  and  Smith  2009). 


See  footnote  13. 
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Our  findings  also  emphasize  the  importance  of  considering  supervisors’  personal  preferences 
and  incentives  when  analyzing  the  target-setting  process.  We  provide  evidence  consistent  with 
supervisors  using  their  discretion  in  target  setting  not  only  to  improve  compensation  contracting, 
but  also  to  decrease  the  personal  costs  involved  in  managing  the  performance  evaluation  process. 
Our  results  suggest  that  supervisors  reduce  confrontation  costs  by  strategically  determining  who  to 
disappoint  with  relatively  harder- to-attain  targets  and  who  to  satisfy  with  more  attainable  targets. 
In  particular,  supervisors  appear  to  disappoint  those  subordinates  who  are  less  likely  to  confront 
them  and  as  a  result  avoid  painful  conflicts  as  much  as  possible.  Viewed  from  a  decision-theory 
perspective,  the  supervisors  seem  to  try  to  maximize  their  expected  utility  over  both  pecuniary  and 
nonpecuniary  elements. 

More  generally,  our  study  illustrates  that  discretion  is  relevant  at  more  than  just  the  perfor¬ 
mance  evaluation  stage.  Most  studies  that  examine  the  role  of  supervisor  bias  focus  on  supervisor 
discretion  in  the  evaluation  phase  of  the  compensation  process.  In  contrast,  our  study  shows  that 
biasing  behavior  may  also  take  place  when  agreeing  on  the  specifics  of  the  incentive  system. 
Future  research  could  examine  what  drives  principals  to  provide  supervisors  with  discretion  in 
target  setting  in  certain  situations,  while  allowing  discretion  in  evaluation  in  other  settings.  Simi- 
larly,  in  a  setting  where  supervisors  are  allowed  multiple  forms  of  discretion,  future  research  could 
analyze  when  supervisors  prefer  to  use  their  discretion  in  the  target-setting  process  compared  to 
their  discretion  in  ex  post  evaluation. 

This  study  is  subject  to  several  limitations,  the  most  important  of  which  is  the  fact  that  the 
observations  come  from  a  single  firm,  limiting  the  extent  to  which  we  can  generalize  the  results. 
Another  important  caveat  is  the  lack  of  access  to  the  store  managers’  sales  target  proposals, 
because  these  proposals  likely  impact  regional  directors’  use  of  discretion  in  setting  targets.  While 
we  attempt  to  capture  the  effect  of  store  managers’  proposals  by  controlling  for  information 
asymmetry,  as  store  managers’  informational  advantage  is  expected  to  be  reflected  in  their  target 
proposals,  we  are  unable  to  capture  this  dimension  directly  because  of  data  limitations.  Future 
research  could  collect  more  complete  information  on  the  participative  element  of  the  target- setting 
process  and  investigate  how  store  managers’  proposals  affect  supervisors’  use  of  discretion  in  the 
target-setting  process. 

APPENDIX 

To  develop  a  reliable  prediction  model  for  future  sales,  we  use  a  comprehensive  set  of 
independent  variables  that  are  likely  to  explain  much  of  the  variation  in  store-level  sales.  Our  main 
explanatory  variable  is  prior-year  sales  because  it  is  likely  the  key  input  in  estimating  this  year’s 
sales.  In  addition,  we  include  the  variables  that  POST  used  in  their  cluster  analysis  to  create  the 
RPE  reference  groups  (Matsumura  and  Shin  2006).  POST  believes  that  these  variables  are 
predictive  of  store  profitability;  because  profitability  and  sales  are  highly  correlated,  it  is  reason¬ 
able  to  assume  that  these  variables  are  also  predictive  of  sales.  We  use  all  explanatory  variables 
measured  in  year  t-\  under  the  assumption  that  the  regional  directors  will  make  their  own  fore¬ 
casts  based  on  the  information  available  at  the  beginning  of  year  t.  Because  there  are  two  sales 
targets,  regular  and  strategic,  we  estimate  the  following  cross-sectional  regressions  using  the 
dependent  variables  ACT_SALES_REG  and  ACT _S ALES _STRA\ 

ACT_SALES_REGij  =aQ+  aiACT_SALES_REGit-i  +  a2SERVICE_AREAit_x  +  a^POPij_^ 

-t  o:^N_BANKi  j_^  -t-  oc^METROi  f_\  -l-  oi(^NEW_MGRi  +  ctjDISTi  (_i 
+  a^SKILLi,_^  +  YEARJNDICATORS  +  (3) 


Among  the  variables  POST  used  in  their  cluster  analysis,  we  do  not  use  the  number  of  employees  {N_EMP)  in  our 
prediction  model  because  that  variable  is  used  as  a  predictor  when  estimating  columns  (1)  and  (2)  of  Table  5. 
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ACT_SALES_STRAii=  ao  +  a^ACT_SALES_STRAi,_^  +  ajSERVlCE^AREAit^^  +  a^POPi^t-i 

+  a^N_BANKi^,_^  +  a^METRO^^i  +  a^EW_MGRij_i  +  ajDISTi,_^ 

+  a^SKlLLi^,_^  +  YEARJNDICATORS  +  £,■ ,  (4) 


where; 


ACT_SALES_REG  =  store-level  actual  sales  from  regular  mail  (in  billion  Korean  Won); 
ACT _S ALES _STRA  =  store-level  actual  sales  from  strategic  mail  (in  billion  Korean  Won); 
SERVICE_AREA  =  area  that  the  store  is  responsible  for  (in  hundreds  of  square  kilometers); 
POP  =  natural  log  of  the  population  in  the  service  area; 

N_BANK  =  number  of  financial  institutions  in  the  service  area  (in  hundreds); 
METRO  =  1,  2,  3,  and  4  for  rural  area,  small-sized  city,  middle-sized  city,  and 
metropolitan  area,  respectively; 

NEW_MGR  =  indicator  variable  that  is  equal  to  1  if  the  new  manager  is  appointed 
in  the  given  year,  and  0  otherwise; 

DIST  =  average  daily  delivery  distance  per  mail  carrier  (in  hundred  kilometers); 
SKILL  =  natural  log  of  average  labor  cost  per  employee. 


We  use  the  store-year-level  predicted  values  of  sales  to  proxy  for  regional  directors’  sales  esti¬ 
mates.  The  adjusted  R^s  from  estimating  Equations  (3)  and  (4)  are  94  percent  and  70  percent, 
respectively,  suggesting  that  a  substantial  part  of  the  variation  in  sales  can  be  explained  by  the  set 
of  store-specific  information  available  to  the  regional  directors  at  the  beginning  of  year  t. 
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ABSTRACT:  We  investigate  the  determinants  of  hedge  fund  internal  controls  and  their 
association  with  the  fees  that  funds  charge  investors.  Hedge  funds  are  subject  to  mini¬ 
mal  regulation.  Hence,  hedge  fund  managers  voluntarily  implement  internal  controls, 
and  managers  and  investors  freely  contract  on  fees.  We  find  that  internal  controls  are 
stronger  in  funds  with  higher  potential  agency  costs.  Further,  internal  controls  are  stron¬ 
ger  in  funds  domiciled  in  jurisdictions  that  provide  investors  with  limited  legal  redress  for 
fraud  and  financial  misstatements.  Short  selling  funds,  however,  are  more  likely  to 
protect  information  about  their  investment  positions  by  implementing  weaker  internal 
controls.  With  respect  to  fees,  we  find  that  the  percentage  of  positive  profits  that  the 
manager  receives  increases  in  the  strength  of  the  fund’s  internal  controls.  Finally,  re¬ 
moving  the  manager  from  setting  and  reporting  the  fund’s  official  net  asset  value,  along 
with  reputational  incentives  and  monitoring  by  leverage  providers,  are  all  associated 
with  lower  likelihoods  of  future  regulatory  investigations  of  fraud  and/or  financial  mis¬ 
statement. 
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1.  INTRODUCTION 

Hedge  funds  encompass  a  diverse  range  of  privately  managed  investment  vehicles  that  are 
exempt  from  a  broad  range  of  federal  acts  regulating  investment  vehicles.  They  are 
typically  not  exchange  traded  and  not  registered  with  the  Securities  and  Exchange  Com¬ 
mission  (SEC).  Hence,  hedge  funds  are  in  general  exempt  from  securities  regulations  that  dictate 
internal  controls  that  managers  must  implement  and  maintain,  fees  that  managers  can  charge 
investors,  and  disclosures  that  fund  managers  must  make  to  investors.  Therefore,  hedge  funds  are 
opaque  investment  vehicles  that  expose  investors  to  the  risk  of  substantial  losses  arising  from 
fraud  and/or  financial  misstatements.  Hedge  funds  thereby  provide  a  setting  to  examine  the  extent 
to  which  fund  managers  voluntarily  implement  internal  controls  to  limit  agency  costs  arising  from 
fraud  and/or  financial  misstatements,  and  the  extent  to  which  investors  value  such  internal  controls 
when  determining  the  fees  that  they  are  willing  to  pay  fund  managers. 

In  this  study,  we  examine  the  determinants  of  hedge  fund  internal  controls  and  the  association 
between  internal  controls  and  the  fees  that  fund  managers  charge  investors.  We  use  a  broad 
definition  of  internal  controls  that  encompasses  mechanisms  designed  to  decrease  the  likelihood  of 
fraud,  and  to  increase  the  accuracy  of  asset  valuations  and  performance  disclosures  made  to 
investors.  Some  examples  include  the  independent  pricing  of  investment  positions,  signature  pro¬ 
tocols  for  transferring  funds  from  bank  and  prime  brokerage  accounts,  and  the  use  of  reputable 
service  providers,  such  as  auditors  and  administrators.  The  risks  that  hedge  fund  internal  controls 
monitor  and  reduce  could  be  more  important  than  financial  risks  in  determining  hedge  fund 
performance  (Lo  2001,  30-31;  Kundro  and  Feffer  2003). 

To  investigate  the  use  of  internal  controls  and  their  association  with  fees,  we  use  a  proprietary 
database  of  due  diligence  reports  prepared  by  The  Hedge  Fund  Due  Diligence  Group  at  Analytical 
Research  (HedgeFundDueDiligence.com).  These  reports  contain  an  extensive  array  of  details  re¬ 
garding  fund  characteristics  and  internal  controls,  investment  contract  terms  and  provisions,  in¬ 
vestment  style  and  portfolio  characteristics,  and  fund  and  manager  backgrounds.  Investors  com¬ 
missioned  these  due  diligence  reports  to  evaluate  whether  to  invest  in  the  funds.  Therefore,  our 
sample  represents  a  set  of  hedge  funds  that  were  actively  seeking  investors  and  for  which  investors 
initiated  due  diligence  to  evaluate  the  backgrounds,  contract  terms,  and  internal  controls  of  the 
funds. 

We  find  substantial  variation  in  internal  controls.  Funds  domiciled  in  the  Caribbean  are  more 
likely  to  incorporate  stricter  signature  authority  for  the  transfer  of  funds,  implement  pricing 
mechanisms  that  involve  external  oversight,  and  employ  more  reputable  outside  service  providers. 
These  findings  are  consistent  with  investors  demanding  greater  internal  controls  for  funds  domi¬ 
ciled  in  jurisdictions  that  provide  investors  with  limited  legal  redress  for  fraud  and  financial 
misstatements.  Furthermore,  and  also  consistent  with  managers  implementing  internal  controls  to 
reduce  agency  conflicts,  we  find  that  younger  funds  are  more  likely  to  use  more  reputable  external 
administrators  and  levered  funds  are  more  likely  to  implement  more  stringent  external  oversight  of 
pricing  and  use  more  reputable  auditors.  Short  selling  funds,  however,  are  more  likely  to  protect 
proprietary  information  by  implementing  weaker  internal  controls  that  limit  external  oversight  of 
investment  position  pricing. 

Next,  we  argue  that  when  considering  a  hedge  fund  investment,  investors  estimate  potential 
agency  costs  arising  from  fraud  and  financial  misstatements,  and  their  expectations  of  these 
agency  costs  decrease  in  the  quality  of  a  fund’s  internal  controls.  Consequently,  we  posit  that 
managers  of  funds  with  more  stringent  internal  controls  can  charge  higher  fees,  all  else  equal. 
Consistent  with  this  argument,  we  observe  that  internal  controls  that  reduce  managers’  opportuni¬ 
ties  to  manipulate  reported  performance  and/or  commit  fraud  are  positively  associated  with  the 
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percentage  of  investment  profits  received  by  the  manager.  Therefore,  investors  appear  to  mitigate 
moral  hazard  costs  arising  from  fraud  and/or  manipulated  reported  returns  when  managers  have 
greater  discretion. 

We  supplement  the  above  analysis  of  fees  by  examining  the  association  between  restatements 
and  fees.  Relevant  to  our  setting,  restatements  are  predetermined  at  the  point  of  contracting.  We 
find  that  managers  of  funds  that  have  restated  prior  performance  receive  a  significantly  lower 
percentage  of  assets  under  management  for  managing  the  funds,  providing  further  support  for  our 
prediction  that  investors  protect  against  the  risk  of  future  misstatements  by  paying  lower  fees. 

Finally,  we  investigate  whether  internal  controls  are  associated  with  future  regulatory  inves¬ 
tigations  of  fraud  and/or  financial  misstatements  on  the  part  of  the  fund  and  its  managers.  Such 
investigations  can  be  considered  an  extreme  form  of  poor  performance  because  they  are  typically 
associated  with  fund  liquidations  and  deeply  discounted  investor  redemptions.  We  find  that  ex¬ 
cluding  the  manager  from  setting  and  reporting  the  fund’s  official  net  asset  value  (NAV)  to 
investors  reduces  the  probability  of  such  investigations  by  over  one-third.  Moreover,  we  find  that 
proxies  for  reputational  incentives  and  proxies  for  explicit  monitoring  and  screening  by  leverage 
providers  are  associated  with  lower  likelihoods  of  future  regulatory  investigations  of  fraud  and/or 
financial  misstatements. 

We  contribute  to  the  internal  control  and  hedge  fund  literatures.  First,  this  study  differs  from 
previous  research  on  internal  controls  by  examining  the  supply  and  demand  of  actual  internal 
controls  determined  within  a  market  as  opposed  to  the  disclosure  of  internal  control  weaknesses.^ 
Further,  research  is  generally  based  on  public  firms  subject  to  securities  regulations  that  limit  the 
risks  that  investors  face  from  weak  internal  controls.  In  contrast  to  public  companies,  internal 
control  failures  can  be  more  important  than  financial  risks  in  determining  hedge  fund  performance 
(Lo  2001,  30-31).  Moreover,  research  that  investigates  internal  control  choice  in  non-regulated 
settings  focuses  primarily  on  auditor  choice  (e.g.,  Chow  1982;  Watts  and  Zimmerman  1983; 
Blackwell  et  al.  1998;  Fortin  and  Pittman  2007).  We  also  contribute  by  investigating  a  broad  range 
of  internal  controls  in  a  voluntary  setting  with  the  potential  for  substantial  agency  costs.  Our 
empirical  results  provide  several  insights.  We  find  in  a  voluntary  setting  that  fund  reputation 
substitutes  for  explicit  internal  controls  and  that  legal  domicile  influences  the  choice  of  internal 
controls  employed.  We  also  find  that  weaker  internal  controls  are  associated  with  a  lower  percent¬ 
age  of  investment  profits  received  by  the  manager.  Further,  manager  involvement  in  setting  and 
reporting  the  fund’s  official  NAV  increases  the  likelihood  of  future  regulatory  investigation  for 
fraud  and  financial  misstatement.  But,  we  find  no  support  for  higher  quality  auditors  reducing  the 
likelihood  of  future  regulatory  investigations. 

Second,  we  extend  the  literature  on  hedge  funds  by  examining  their  internal  operations. 
Research  on  hedge  funds  generally  concentrates  on  investment  returns  (see  Lo  [2008]  for  a 
review).  We  extend  this  research  by  directly  observing  the  internal  controls  employed  by  funds  to 
reduce  agency  costs,  and  how  these  controls  are  associated  with  fees.  Moreover,  we  demonstrate 
that  weak  internal  controls  are  associated  with  future  regulatory  investigations  of  fraud  and  finan¬ 
cial  misstatements  and  that  reputational  incentives  and  leverage  providers  reduce  the  likelihood 
that  managers  carry  out  actions  that  lead  to  such  regulatory  investigations. 

Finally,  we  contribute  to  the  recent  debate  over  the  regulation  of  hedge  funds.  In  general,  the 
SEC  regards  internal  controls  as  a  critical  element  of  investor  protection,  and  it  recently  increased 
its  regulatory  focus  on  hedge  funds,  proposing  regulations  that  include  mandatory  disclosures  and 
other  internal  controls  (Smith  2006a,  2006b;  Oesterle  2006).  Hedge  fund  advocacy  groups  re- 


'  Some  examples  are  Ashbaugh-Skaife  et  al.  (2007),  Ashbaugh-Skaife  et  al.  (2009),  Doyle  et  al.  (2007),  Ge  and  McVay 
(2005),  Hammersley  et  al.  (2008),  Kim  et  al.  (2008),  and  Ogneva  et  al.  (2007). 
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sponded  to  these  proposals  by  suggesting  that  funds  follow  “best  practice”  industry  standards  that 
give  consideration  to  the  particular  characteristics  and  circumstances  of  each  fund  (Managed 
Funds  Association  [MFA]  2005).  Consistent  with  the  advocacy  groups’  proposals,  we  find  that 
funds  systematically  vary  their  internal  controls  to  address  potential  agency  costs.  Furthermore,  we 
show  that  internal  controls  are  positively  associated  with  investor  fees,  suggesting  that  investors 
evaluate  and  price  the  risks  arising  from  internal  control  failures. 

IL  HEDGE  FUND  SETTING 

Hedge  funds  are  managed  investment  vehicles.  They  are  often  privately  held,  generally  com¬ 
prised  of  wealthy  individuals  and  institutional  investors,  and  typically  organized  in  the  U.S.  as  a 
limited  partnership  or  offshore  as  a  corporation  (Fung  and  Hsieh  1999).^  There  has  been  substan¬ 
tial  growth  in  the  hedge  fund  industry,  both  in  the  number  of  funds  and  in  assets  under  manage¬ 
ment  (Brown  et  al.  1999;  Fung  and  Hsieh  1999;  Lo  2008).  As  of  the  first  quarter  of  2008,  hedge 
funds  held  more  than  $2.8  trillion  in  assets  under  management  (HedgeFund.net  [HFN]  2008). 
Although  hedge  funds  have  grown  tremendously  and  are  under  intense  scrutiny  regarding  their 
operations  and  potential  contribution  to  systemic  risk,  they  are  opaque,  and  therefore  little  is 
known  about  how  they  operate  (Lo  2001;  Chan  et  al.  2006). 

Unlike  other  investment  vehicles,  hedge  funds  are  structured  to  be  exempt  from  the  public 
offering  requirements  of  the  Securities  Act  of  1933,  the  periodic  reporting  obligations  of  the 
Securities  Exchange  Act  of  1934,  and  the  registration  requirements  of  the  Investment  Company 
Act  of  1940  (Oesterle  2006).^  For  example.  Section  5  of  the  Securities  Act  of  1933  requires 
registration  of  the  sale  of  securities  unless  the  issue  qualifies  for  a  “private  placement”  exemption.'^ 
Rule  506  of  Regulation  D  provides  exemption  for  hedge  funds  if  they  restrict  their  offerings  to 
only  accredited  investors,  namely  investment  companies,  or  individuals  with  more  than  $1  million 
or  income  exceeding  $200,000  in  the  two  most  recent  years  or  joint  income  with  a  spouse  of  over 
$300,000,  and  up  to  35  other  U.S.  purchasers.^  Additionally,  to  qualify  for  exemption,  all  non- 
accredited  investors  must  be  determined  by  the  fund  to  be  sophisticated,  with  knowledge  and 
experience  to  evaluate  the  prospective  investment.  Therefore,  a  more  descriptive  definition  of  a 
hedge  fund  is  an  investment  fund  exempt  from  a  list  of  specific  federal  acts  regulating  investment 
vehicles  (Oesterle  2006).®  To  ensure  exemption  from  SEC  regulation,  hedge  funds  cannot  under¬ 
take  any  form  of  general  solicitation  or  advertising  for  their  services  or  sale  of  securities  to  the 
general  public,  and,  therefore,  must  solicit  investments  through  private  placements  only  to  those 
who  are  sophisticated  enough  to  evaluate  the  investment  and  have  sufficient  wealth  to  bear  the  risk 
of  the  investment  (American  Bar  Association  [ABA]  2005,  214-215;  Lhabitant  2008).^ 

This  minimal  regulatory  environment  provides  hedge  funds  substantially  more  flexibility  than 
regulated  investment  vehicles,  such  as  mutual  funds.  For  example,  hedge  funds  have  greater 
discretion  regarding  valuation  and  reporting  of  their  investments  (SEC  2003;  McVea  2008).  Un¬ 
like  other  investment  vehicles  that  are  registered  under  the  Investment  Company  Act  of  1940, 


^  Although  the  early  hedge  funds  generally  held  offsetting  long  and  short  positions  (leading  to  the  title  “hedge  funds”), 
hedge  funds  are  not  limited  to  strict  long-short  positions. 

^  For  a  more  detailed  discussion  of  the  regulation  of  hedge  funds  see  Fung  and  Hsieh  (1999,  314-317),  ABA  (2005),  and 
Lhabitant  (2008,  37-84). 

Although  issuers  of  private  placements  are  required  to  file  a  Form  D  with  the  SEC  within  15  days  after  the  first  sale,  this 
form  contains  very  little  detailed  information  about  the  issuance  (Lhabitant  2008,  41). 

^  Regulation  S  provides  detailed  exemptions  related  to  issuances  to  non-U. S.  investors. 

^  Providing  a  definition  of  a  hedge  fund  based  on  operating  characteristics  is  problematic  given  its  wide  variation  in 
investment  asset  class,  sector,  duration,  region,  and  organizational  structure  (ABA  2005,  1;  Lhabitant  2008,  1). 

’’  Violation  of  this  solicitation  rule  could  evoke  registration  of  the  sale  of  securities  under  the  Securities  Act  of  1933, 
registration  as  a  mutual  fund  under  the  Investment  Company  Act  of  1940,  and  registration  as  an  investment  advisor 
under  the  Investment  Advisers  Act  of  1940. 
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hedge  funds  have  greater  discretion  to  use  leverage  to  finance  their  investment  positions  and  can 
undertake  substantial  short  selling  (Lhabitant  2008,  46).®  In  addition,  hedge  funds  can  charge  fees 
based  on  performance,  whereas  other  investment  vehicles,  such  as  mutual  funds,  are  restricted  to 
fees  based  solely  on  assets  under  management  (Fung  and  Hsieh  1999;  ABA  2005,  331-334). 

When  evaluating  an  investment  in  a  hedge  fund,  investment  advisors  or  accredited  investors 
solicit  information  about  the  fund,  with  the  investment  terms  provided  in  an  offering  circular  or 
“private  placement  memorandum”  (PPM).  The  PPM,  and  the  subsequent  executable  limited  part¬ 
nership  agreements  and  subscription  agreements,  lay  out  the  fund’s  operations  and  the  investor’s 
contractual  rights:  the  fund’s  investment  strategy,  the  fees  agreed  to  be  paid,  the  terms  under  which 
the  investor  can  invest  and  withdraw  funds,  the  investor’s  ability  to  monitor  the  fund,  the  manner 
and  frequency  in  which  the  fund  will  estimate  and  report  performance,  and  the  investor’s  remedy 
rights  in  the  case  of  a  dispute  (ABA  2005,  96-98).  Because  hedge  funds  are  substantively  exempt 
from  securities  regulations,  the  contractual  terms  laid  out  in  the  PPM  and  subsequent  executable 
agreements  represent  the  primary  mechanisms  in  place  to  protect  the  investor’s  investment. 

III.  PREDICTIONS 

Hedge  funds  are  opaque  due  to  the  limited  mandatory  disclosure  requirements  and  the  pro¬ 
prietary  nature  of  their  operations.  Consequently,  there  is  the  potential  for  substantial  agency  costs 
and  information  asymmetries  between  investors  and  fund  managers.  Moreover,  hedge  funds  often 
engage  in  complex  trading  strategies  that  involve  large  numbers  of  securities  and/or  trades, 
thereby  increasing  the  potential  for  fraud  on  the  part  of  employees.  Therefore,  this  setting  presents 
strong  incentives  to  implement  internal  controls  that  increase  the  monitoring  of  managers,  reduce 
the  likelihood  of  fraud,  and  increase  the  accuracy  of  asset  valuations  and  performance  disclosures 
made  to  investors.®  Consistent  with  the  importance  of  hedge  fund  internal  controls,  Kundro  and 
Feffer  (2003)  find  that  the  breakdown  of  internal  systems  was  a  major  determinant  of  failure  for 
more  than  50  percent  of  the  100  failed  funds  they  investigated. 

The  fund  manager  is  the  residual  claimant  on  the  fund.  By  implementing  internal  controls  to 
reduce  the  agency  costs  between  investors  and  the  fund,  managers  can  increase  the  value  of  their 
residual  claim  if  the  benefits  of  implementation,  such  as  obtaining  greater  fees,  are  greater  than  the 
costs  of  implementing  and  maintaining  internal  controls.  Whereas  there  are  likely  to  be  benefits  to 
having  particular  internal  controls  in  place,  not  all  hedge  fund  managers  will  invest  or  commit  to 
using  internal  controls,  as  they  are  costly  to  use  and,  in  many  cases,  have  a  large  fixed  component 
(SEC  2003).  For  example,  external  oversights  can  inhibit  the  manager’s  ability  to  quickly  respond 
to  trading  opportunities.  Further,  there  are  explicit  costs  to  retain  external  service  providers  for 
operational  oversight  and  support,  and  the  use  of  external  service  providers  requires  the  manager 
to  disclose  to  the  service  providers  proprietary  information  about  investment  strategies  and  posi¬ 
tions. 

We  predict  that  ex  ante  agency  costs  are  higher  for  younger  funds  that  lack  valuable  reputa¬ 
tions,  offshore  funds  that  are  subject  to  a  weaker  legal  system,  and  unlevered  funds  that  are  not 
subject  to  creditor  monitoring.  Managers  of  established,  higher  quality  funds  have  valuable 
reputations  and  potentially  superior  investing  technologies,  implying  that  they  hold  more  valuable 


*  For  example,  open-end  investment  companies  are  restricted  to  bank-sourced  leverage  only,  with  this  amount  capped  at 
300  percent  of  the  asset  coverage  (Section  18(f)(1)  Investment  Company  Act  of  1940). 

®  Note  that  even  in  the  absence  of  agency  problems,  internal  controls  can  increase  the  precision  of  asset  values  disclosed 
to  investors  and,  therefore,  reduce  uncertainty  about  the  cash  flows  that  investors  expect  to  receive. 

We  present  our  predictions  and  empirics  using  the  fund  as  the  unit  of  analysis,  because  any  one  manager  can  manage 
both  levered  and  unlevered  funds,  and  both  offshore  and  onshore  funds.  Further,  prior  literature  exclusively  employs 
fund-level  variables  in  empirical  analyses.  While  both  fund  and  manager  characteristics  could  potentially  affect  internal 
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residual  claims  from  operating  their  funds  than  do  managers  of  lower  quality  funds.  Therefore, 
managers  of  higher  quality  funds  have  lower  incentives  to  commit  fraud  and  greater  incentives  to 
provide  investors  with  more  accurate  performance  reports  (Klein  and  Leffler  1981),  reducing  the 
benefit  of  explicit  internal  controls.  Fund  quality  can  be  signaled  by  past  investment  performance 
and  capital  flows  (Fung  and  Hsieh  1997).  For  new  funds,  there  is  greater  uncertainty  about  the 
quality  of  the  manager  and  the  likelihood  that  the  manager  will  commit  fraud  or  misstate  perfor¬ 
mance.  This  uncertainty  should  increase  the  benefit  to  new  funds  from  enacting  more  stringent 
internal  controls  and  using  reputable  outside  service  providers.  For  example,  Mansi  et  al.  (2004) 
find  that  auditor  quality  and  tenure  are  negatively  related  with  the  cost  of  debt  financing,  especially 

for  firms  with  riskier  grades  of  debt.  Consequently,  ceteris  paribus,  we  predict  a  negative  asso- 

11  12 

ciation  between  internal  controls  and  fund  age.  ’ 

Although  onshore  and  offshore  funds  are  generally  exempt  from  U.S.  securities  regulations, 
investors  in  onshore  funds  can  use  the  U.S.  legal  system  to  redress  fraud  and  financial 
misstatements.  Although  many  offshore  funds  are  domiciled  in  countries  that  regulate  invest¬ 
ment  vehicles  and  provide  investors  with  legal  redress,  U.S.  legal  contract  enforcement  mecha¬ 
nisms  are  generally  considered  superior.  Most  offshore  hedge  funds  are  domiciled  in  Caribbean 
islands  that  have  historically  been  associated  with  secret  bank  accounts  and  money  laundering 
(Suss  et  al.  2002).  Fund  managers  who  operate  in  a  regulatory  environment  that  allows  secret  bank 
accounts  and  enables  money  laundering  can  find  that  the  lax  banking  environment  makes  it  easier 
to  commit  fraud,  and,  conditional  on  committing  fraud,  easier  to  hide  and  protect  such  fraudulent 
gains  from  defrauded  investors  (for  a  discussion,  see  Blum  et  al.  1998). 

There  could  be  systematic  variation  in  the  types  of  investors  that  invest  in  onshore  versus 
offshore  funds. In  particular,  when  U.S.  tax-exempt  entities,  such  as  pension  funds  and  charitable 
trusts,  invest  in  offshore  funds,  they  can  avoid  recognizing  unrelated  business  taxable  income 
arising  from  investments  in  leveraged  portfolios  (ABA  2005,  360;  Lhabitant  2008,  88).  Neverthe¬ 
less,  it  is  not  clear  why,  holding  other  investor  characteristics  constant,  U.S.  tax-exempt  investors 
would  have  monitoring  incentives  that  differ  significantly  from  those  of  other  investors,  which 
suggests  that  this  potential  difference  has  limited  implications  for  hedge  fund  internal  controls. 
Therefore,  ceteris  paribus,  we  predict  that  funds  domiciled  offshore  will  have  better  internal 
controls  because  investors  have  reduced  legal  recourse  to  redress  fraud  and  financial  misstate¬ 
ments. 


control  choice,  inclusion  of  both  manager  and  fund  characteristics  results  in  multicollinearity  arising  from  the  fact  that 
many  managers  run  only  one  fund  in  our  sample.  In  our  “Robustness”  section  we  examine  the  choice  of  using  either  the 
fund  or  manager  as  the  unit  of  analysis,  and  including  manager  characteristics  in  our  tests. 

"  Smaller  funds  can  be  considered  to  be  of  lower  quality  because  they  have  had  less  capital  inflows.  Therefore,  they  are 
likely  to  have  greater  benefits  from  implementing  internal  controls  than  larger  funds.  However,  larger  funds  have  greater 
incentives  to  implement  internal  controls  because  the  magnitude  of  potential  agency  costs  increases  in  fund  size.  For 
example,  research  has  consistently  shown  a  positive  association  between  better  internal  controls  and  organization  size 
(Chow  1982;  Doyle  et  al.  2007).  Therefore,  ex  ante,  it  is  unclear  what  the  association  between  fund  size  and  better 
internal  controls  will  be. 

Note  that  this  prediction  is  based  on  the  assumption  either  that  internal  control  technology  has  improved  over  time  and 
older  funds  find  it  too  costly  to  implement  newer  technologies,  or  that  over  time  funds  receive  decreasing  benefits  from 
internal  controls  and  therefore  unravel  internal  controls. 

If  the  manager  is  onshore  but  the  fund  is  offshore,  investors  can  sue  the  fund  manager  through  the  U.S.  legal  system. 
Nevertheless,  any  judgments  against  the  fund  would  be  executed  through  the  legal  system  of  the  fund’s  offshore 
domicile. 

Although  the  Taxpayer  Relief  of  1997  eliminated  the  historical  primary  tax  rationale  for  offshore  hedge  fund  investing, 
namely  the  avoidance  of  U.S.  taxation  of  non-U. S.  investors  on  their  capital  gains,  non-U. S.  investors  could  still  prefer 
offshore  funds  to  limit  their  exposure  and  retain  their  anonymity  from  U.S.  regulatory  authorities,  such  as  the  IRS  (ABA 
2005,  98-99  and  360;  Lhabitant  2008,  88).  Like  previous  hedge  fund  research,  we  are  unable  to  identify  specific 
investors  in  our  sample  funds.  The  extent  to  which  we  cannot  control  for  investor  is  a  limitation  of  the  research. 
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Some  funds  use  leverage  to  increase  their  invested  assets.  Funds  generally  obtain  leverage 
from  prime  brokers  who  provide  managers  with  trade  execution  and  custodial  services  (Lhabitant 
2008).  Leverage  creates  another  stakeholder  in  the  fund’s  performance  and  operations.  Therefore, 
leverage  providers  are  likely  to  demand  that  funds  implement  strong  internal  controls  as  precon¬ 
dition  for  providing  credit,  leading  to  a  positive  relation  between  leverage  and  internal  controls. 
However,  to  the  extent  that  the  prime  broker  has  an  incentive  and  the  ability  to  monitor  the  fund 
manager  because  it  typically  executes  the  fund’s  trades,  leverage  could  dampen  this  hypothesized 
association,  given  potential  substitutability  between  formal  internal  control  mechanisms  and  moni¬ 
toring  on  the  part  of  debt  holders. 

Finally,  investment  style  can  affect  the  costs  associated  with  a  fund’s  internal  control  choices. 
It  is  generally  accepted  that  a  hedge  fund  investment  is  essentially  a  bet  on  the  fund  manager’s 
proprietary  investment  strategies  (Edwards  and  Caglayan  2001;  Lo  2008).  Hence,  a  fund  can  incur 
substantial  costs  if  outsiders  acquire  proprietary  information  about  the  fund’s  investment 
positions. These  costs  are  particularly  relevant  for  funds  that  employ  a  short  bias  investment 
style.  Outsiders  who  learn  a  fund’s  short  positions  can  profit  because  the  fund  eventually  has  to 
unwind  its  short  positions,  and,  therefore,  informed  outsiders  can  trade  against  the  fund  when  fund 
has  to  cover  its  short  positions  (Dechow  et  al.  2001).  If  short  bias  funds  take  actions  to  protect 
proprietary  information  about  investment  positions,  then  they  should  limit  the  use  of  objective 
external  verification  of  fund  operations,  such  as  the  valuation  of  fund  assets.'^  Consequently, 
ceteris  paribus,  we  predict  a  negative  association  between  short  selling  and  internal  controls  that 
involve  outside  service  providers. 

In  summary,  we  predict  that  younger  funds  and  offshore  funds  are  more  likely  to  adopt 
stronger  internal  controls  and  reporting  mechanisms,  as  these  funds  are  riskier  along  the  dimen¬ 
sions  of  fraud  and  financial  misstatements  and,  therefore,  internal  controls  should  allow  managers 
of  these  funds  to  charge  higher  fees.  In  addition,  we  predict  that  leverage  affects  managers’ 
internal  control  decisions  and  that  short  bias  funds  are  less  likely  to  use  external  parties  to  value 
investment  positions. 


IV.  SAMPLE 

To  investigate  the  determinants  of  internal  controls  and  their  association  with  fees,  we  use  a 
database  of  hedge  fund  due  diligence  reports  prepared  by  HFDD.  The  reports  were  commissioned 
by  investors  evaluating  whether  to  invest  in  the  funds. Therefore,  the  sample  represents  a  set 
of  hedge  funds  in  which  investors  were  actively  seeking  to  invest  capital.  HFDD  specializes 
exclusively  in  hedge  fund  due  diligence,  and  it  obtained  the  information  included  in  these  reports 


Note  that  hedge  funds  are  required  under  Section  13(d)  and  Section  13(g)  of  the  Securities  Act  of  1933  to  report  long 
positions  relevant  to  corporate  control  and  transfer,  namely  those  with  more  than  5  percent  of  a  class  of  equity  security 
registered  under  Section  12  of  the  Securities  Exchange  Act  1934.  Also,  hedge  funds  with  discretion  over  $100  million 
in  assets  are  required  to  disclose  their  long  positions  on  a  quarterly  basis.  However,  these  requirements  are  of  limited 
proprietary  risk  given  the  disclosures  are  aggregated,  delayed,  and  restricted  to  long  positions  only  (Lhabitant  2008,  44). 
Although  service  providers  can  and  do  implement  mechanisms  to  protect  the  release  and  use  of  clients’  proprietary 
information,  such  systems  are  not  perfect.  For  example,  Geczy  and  Yan  (2006)  find  that  such  mechanisms  do  not 
completely  prevent  the  transmission  of  proprietary  information  between  brokers  and  market  makers,  and  Ivashina  and 
Sun  (2007)  find  that  institutional  investors  trade  on  proprietary  information  that  they  receive  from  participating  in  loan 
syndicates. 

All  HFDD  investigations  were  initiated  and  paid  for  by  potential  investors.  No  investigations  were  commissioned  and/or 
paid  for  by  the  investigated  hedge  funds. 

HFDD  is  not  the  only  organization  that  carries  out  hedge  fund  due  diligence.  Due  diligence  is  performed  by  other 
third-party  providers,  and  in-house  by  many  fund-of-funds.  Like  other  due  diligence  providers,  HFDD  clients  are 
primarily  major  institutional  investors  and  high  net  worth  individuals  who  use  their  services  on  a  repeated  basis.  Instead 
of  undertaking  due  diligence,  investors  can  invest  in  funds  of  hedge  funds  that  specialize  in  evaluating  and  monitoring 
fund  managers  (Brown  et  al.  2008). 
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from  several  sources  including  on-site  visits  and  interviews  with  key  staff,  discussions  with  ser¬ 
vice  providers  (auditors  and  administrators),  reviews  of  offering  memorandums,  examinations  of 
public  filings  and  registrations,  media  scans,  background  checks  of  fund  managers  and  staff, 
reviews  of  financial  statements,  and  reviews  of  internal  control  protocols.  Consequently,  this 
database  overcomes  potential  concerns  related  to  commercial  hedge  fund  databases  that  are  based 
on  self-reported  fund  performance  and  manager  characteristics.^^  The  HFDD  reports  provide 
extensive  detail  regarding  fund  characteristics,  internal  controls,  contract  terms,  and  manager 
backgrounds. 

Table  1  provides  the  descriptive  statistics  for  the  funds  and  managers.  The  sample  consists  of 
427  funds  run  by  358  unique  managers  investigated  from  2003  to  2007.  Panel  A  shows  the  sample 
distribution  by  domicile.  Sample  funds  are  domiciled  primarily  in  the  Cayman  Islands  (25 1  funds). 
United  States  (70  funds),  British  Virgin  Islands  (50  funds),  and  Bermuda  (34  funds).  Panel  B 
provides  a  summary  of  fund  characteristics.  The  mean  (median)  fund  has  $305  million  ($107 
million)  in  assets  under  management  and  is,  on  average,  less  than  three  years  old  (1,020  days)  at 
the  time  of  due  diligence.  Of  our  sample  funds’  managers,  36  percent  are  domiciled  offshore; 
however,  in  contrast  to  manager  location,  most  funds  (84  percent)  are  located  offshore.  Examining 
the  overlap  between  offshore  managers  and  offshore  funds,  more  than  half  (56.8  percent)  of  the 
sampled  offshore  funds  are  managed  by  onshore  managers;  however,  no  onshore  funds  are  man¬ 
aged  by  offshore  managers.  For  multivariate  analyses,  we  create  two  indicator  variables  to  reflect 
the  joint  distribution  of  offshore  funds  and  managers,  and  capture  potential  variation  from  the 
separation  of  fund  and  manager  domicile.  The  first  variable  is  coded  as  1  if  both  the  manager  and 
the  fund  are  offshore,  and  0  otherwise.  The  second  variable  is  coded  as  1  if  the  fund  is  offshore 
and  the  manager  is  onshore,  and  0  otherwise. 

Table  1  also  provides  descriptive  detail  on  the  funds’  investment  styles,  asset  classes,  and 
portfolio  characteristics.  We  observe  that  54  percent  of  the  funds  use  leverage  to  finance  their 
investments.  The  due  diligence  firm  provided  the  fund  investment  styles,  which  are  based  quan¬ 
titative  and  qualitative  analyses  of  funds’  portfolio  holdings  and  returns.  Our  sample  spans  a  broad 
range  of  invested  assets  including  equity,  fixed  income,  and  foreign  exchange.  With  respect  to 
investment  style,  20  percent  of  the  funds  were  identified  as  having  a  short  selling  bias,  whereas  36 
percent  were  identified  as  having  a  long  bias.  In  addition,  there  is  substantial  variation  in  the 
number  of  investment  positions  typically  held  in  the  funds’  portfolios;  41  percent  hold  less  than  40 
positions,  and  3  percent  hold  1,000  or  more  positions.  There  is  also  variation  in  the  typical  holding 
period  for  the  investments  made  by  the  sample  funds,  with  13  percent  holding  investment  posi¬ 
tions  for  only  days  and  32  percent  holding  positions  for  longer  than  a  year. 

Hedge  fund  managers  generally  receive  two  types  of  fees.  First,  they  receive  an  annual 
management  fee  calculated  as  a  percentage  of  assets  under  management.  For  the  sample,  the  mean 
(median)  management  fee  is  1.52  percent  (1.50  percent)  of  assets  under  management.  Second, 
managers  receive  a  performance  fee  calculated  as  a  percentage  of  investment  profits.  For  the 
sample,  the  mean  (median)  performance  fee  is  19.34  percent  (20.00  percent).^'  Finally,  10  percent 
of  our  sample  funds  have  restated  asset  values  and  performance  disclosures  that  were  made  to 
investors.  Hedge  fund  restatements  are  potentially  more  serious  than  restatements  of  public  firms. 


One  concern  with  the  database  is  that  it  consists  of  funds  willing  to  be  subject  to  due  diligence.  According  to  HFDD, 
such  refusals  are  rare  and  clients  typically  do  not  invest  in  funds  that  refuse. 

The  remaining  funds  can  engage  in  some  short  or  long  selling;  however,  based  on  quantitative  analyses  of  trading 
activities,  the  relative  magnitude  of  active  short/long  selling  is  substantially  greater  for  these  funds. 

A  large  proportion  of  management  fees  are  set  at  1.00  percent,  1.50  percent,  and  2.00  percent,  whereas  the  majority  of 
performance  fees  are  clustered  at  20.0  percent.  To  address  the  kinks  in  the  distributions  of  management  fees  and 
performance  fees,  in  robustness  tests,  we  replace  fee  measures  with  ordinal  measures  coded  as  0  for  below  the  mode  of 
the  distribution,  1  equal  to  the  mode,  and  2  for  above  the  mode.  The  results  are  invariant  to  this  alternative  specification. 
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because  hedge  fund  restatements  involve  NAV s  that  determine  fees  and  redemptions.  Overall,  the 
characteristics  of  the  sample  are  consistent  with  previous  empirical  evidence  on  hedge  funds 
(Aragon  2007). 


V.  RESULTS 

Descriptive  Statistics  of  Internal  Controls  in  Hedge  Funds 

We  report  a  comprehensive  set  of  internal  controls  used  in  hedge  funds.  The  first  group 
consists  of  the  signatures  required  to  make  transfers  out  of  the  fund’s  bank  and  prime  brokerage 
accounts.  The  use  of  more  than  one  signature  authority  and  the  use  of  external  parties  to  authorize 
fund  movement  and  withdrawals  provide  greater  control  of  the  cash  held  by  the  fund,  thereby 
decreasing  the  likelihood  of  fraud.  The  second  group  relates  to  the  mechanisms  used  to  estimate 
and  disclose  the  fund’s  performance  to  investors.  This  group  can  be  further  broken  down  into  three 
subgroups:  who  prices  the  portfolio;  what  is  the  source  of  prices  used  to  value  individual  asset 
performance;  and  who  calculates  the  portfolio’s  NAV  that  is  reported  to  investors.  The  use  of 
external  parties  and  objective  sources  to  value  invested  assets  and  the  use  of  an  independent  party 
without  manager  involvement  to  report  and  verify  the  NAV  to  investors  provide  the  most  objective 
measurement  of  fund  position  and  performance,  thereby  reducing  the  likelihood  of  fraud  and 
increasing  the  precision  of  asset  valuations  disclosed  to  investors. 

The  third  group  of  internal  controls  consists  of  the  quality  and  reputation  of  the  fund’s  service 
providers,  specifically  its  auditor  and  administrator.  Similar  to  engagements  with  operating  firms, 
auditors  generally  undertake  annual  audits  of  hedge  funds  to  ensure  that  the  financial  statements 
furnished  to  fund  investors  comply  with  relevant  accounting  standards.  However,  although  many 
auditors  review  the  consistency  of  how  funds  value  investment  positions,  they  do  not  typically 
review  the  validity  of  valuations  (Lhabitant  2008).  Nevertheless,  auditors  with  good  reputations 
have  incentives  to  decrease  the  likelihood  of  fraud  or  financial  misstatement  on  the  part  of  their 
clients. The  role  of  fund  administrators  varies  substantially  across  engagements;  however,  they 
generally  provide  “back-office”  support,  such  as  performing  day-to-day  administrative  operations, 
accounting  and  valuation  services,  and  serving  as  the  interface  with  investors.  In  many  cases  the 
administrator  also  calculates  the  NAV  using  data  provided  by  either  the  fund  custodian  or  prime 
broker  (Lhabitant  2008).  The  practitioner  literature  strongly  advocates  against  allowing  fund  man¬ 
agers  to  perform  final  valuations  and/or  to  communicate  valuations  to  an  administrator,  except  in 
highly  exceptional  circumstances,  and  even  then,  such  occasions  are  recommended  to  be  made 
with  auditor  approval  and  fully  disclosed  to  investors  ^Lhabitant  2008,  100).  Similar  to  auditors, 
we  observe  that  several  administrators  serve  more  than  one  of  our  sample  hedge  funds.  Therefore, 
we  assume  that  administrators  with  valuable  reputations  work  to  decrease  the  likelihood  of  fraud 
or  financial  misstatement. 

Table  2,  Panel  A  reports  the  descriptive  statistics  for  the  internal  controls  implemented  by  the 
funds  in  the  sample.  We  find  that  the  type  and  number  of  signatures  required  to  transfer  funds  out 
of  the  bank  or  prime  broker  account  vary  substantially  across  our  sample.  We  observe  27.7  percent 
of  the  funds  require  multiple  signatures  from  both  internal  and  external  parties,  and  24.2  percent 
require  only  an  external  signature.  Some  funds  require  only  internal  signatures,  with  24.6  percent 
requiring  at  least  two  internal  signatures  and  23.5  percent  requiring  only  one  internal  signature  to 
transfer  money  out  of  the  fund’s  bank  and  prime  brokerage  accounts. 


Note  that  in  contrast  with  much  of  the  auditing  literature,  we  assume  that  the  reputation  as  opposed  to  the  size  of  the 
auditor  and/or  administrator  determines  the  quality  of  services  provided  (see  Weber  et  al.  2008  for  a  discussion). 
Although  hedge  fund  service  providers  can  be  sued,  the  reputational  loss  due  to  fraud  or  misstatement  is  probably  larger 
than  any  damages  that  can  be  recovered  especially  for  funds  and  service  providers  located  offshore,  especially  those 
located  in  tax  havens. 
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There  is  also  substantial  variation  as  to  who  prices  the  funds’  portfolios.  We  observe  that  9.2 
percent  of  funds  use  at  least  one  internal  service  and  one  external  service  to  price  investment 
positions,  whereas  62.9  percent  use  only  external  services  to  price  the  portfolio.  We  also  find  that 
11.7  percent  of  funds  use  collaborative  pricing,  whereby  an  external  pricing  service  collaborates 
with  the  fund  manager  to  determine  the  value  of  the  portfolio.  Finally,  16.2  percent  of  the  funds  in 
the  sample  price  the  portfolio  internally. 

The  HFDD  reports  list  all  price  sources  that  the  funds  use  to  value  their  invested  assets.  Given 
that  some  funds  use  multiple  pricing  sources,  we  report  all  sources  and  the  least  objective  source 
used  by  each  fund.  The  most  objective  source,  exchange  quotes,  is  used  (solely  used)  by  69.9 
percent  (34.8  percent)  of  the  funds.  Over-The-Counter  (OTC)  quotes  are  used  (the  fund’s  least 
objective  source)  for  15.8  percent  (5.9  percent)  of  the  funds,  whereas  dealer  quotes  are  used  (the 
fund’s  least  objective  source)  for  40.2  percent  (33.2  percent).  Regarding  proprietary  based  valua¬ 
tion  sources,  we  find  14.8  percent  (12.7  percent)  of  our  sample  uses  (as  their  least  objective 
source)  model-based  prices.  Finally,  13.4  percent  of  our  sample  generates  price  internally.  Internal 
pricing  is  likely  to  be  advantageous  if  the  asset  if  difficult  to  value,  allowing  managers  to  use  their 
superior  judgment  and  information  to  provide  the  best  estimate  of  the  investment  value.  However, 
it  provides  managers  with  greater  opportunities  to  manipulate  reported  performance.  We  use  the 
fund’s  least  objective  price  source  for  our  empirical  analyses. 

The  entity  that  sets  and  reports  to  investors  the  NAV  of  the  hedge  fund’s  investment  portfolio 
differs  from  that  which  prices  the  individual  investment  positions.  Pricing  is  done  on  an  asset-by- 
asset  basis  and  occurs  on  a  routine,  frequent  basis,  whereas  NAV  represents  the  net  asset  value  of 
the  entire  portfolio  and  is,  therefore,  an  aggregation  of  the  individual  prices  determined  by  the 
fund’s  pricing  systems.  NAV  is  typically  estimated  and  reported  to  investors  at  month  end.  The 
majority  of  funds  (85.9  percent)  have  no  manager  involvement  in  the  determination  of  the  NAV, 
which  represents  the  most  unbiased  approach  to  its  calculation.  However,  the  remaining  14.1 
percent  have  some  manager  involvement,  with  most  giving  the  manager  sole  determination  of  the 
setting  and  reporting  of  NAV. 

We  now  describe  the  use  of  service  providers  by  our  funds.  All  funds  in  the  sample  are 
audited,  but  there  is  heterogeneity  in  auditor  quality  and  reputation.  Whereas  a  Big  4  auditor  is 
typically  used  in  accounting  research  to  represent  auditor  quality  or  reputation  (Hogan  1997; 
Mansi  et  al.  2004;  Fortin  and  Pittman  2007),  we  use  hedge  fund  industry-specific  rankings  of  audit 
firms  obtained  from  Institutional  Investor’s  Alpha  Survey,  because  hedge  fund  auditing  requires 
specialized  skills  that  differ  from  those  required  to  audit  publicly  traded  firms.^^  The  Alpha  survey 
is  based  on  voting  by  industry  participants,  and  is  similar  to  Institutional  Investor’s  ranking  of 
equity  analysts  used  by  researchers  (Shekel  1992;  Hong  and  Kubik  2003).  Of  the  funds  in  the 
sample,  77.0  percent  are  audited  by  an  auditor  that  is  ranked  in  the  Alpha  survey.  We  also  use  the 
Alpha  survey  to  determine  the  quality  of  the  fund  administrator  and  find  that  31.6  percent  of 
sample  use  a  ranked  administrator.^"^ 

Table  2,  Panel  A  also  reports  internal  control  use  for  all  domiciles  with  more  than  ten  obser¬ 
vations.  We  observe  variation  across  all  the  internal  controls  investigated  within  domicile  group, 
suggesting  that  funds  have  discretion  on  all  these  internal  control  choices.  Further,  examining  the 
regulation  of  our  sample  domiciles,  we  find  no  mandatory  requirements  that  would  require  internal 
control  adoption  investigated  in  this  study  (ABA  2005;  PricewaterhouseCoopers  2006).  For  ex¬ 
ample,  the  Cayman  Islands  have  a  requirement  that  hedge  fund  service  providers  are  from  an 


The  correspondence  between  Big  4  and  the  hedge  fund  industry-specific  measure  is  85  percent. 

^  We  also  considered  a  prime  broker  as  a  potential  service  provider.  We  found,  however,  that  almost  all  the  funds  in  our 
sample  utilized  a  prime  broker  that  was  ranked  by  the  Alpha  survey. 
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accredited  registered  list,  but  this  list  includes  many  service  providers  not  ranked  by  Alpha  Inves¬ 
tors.  Consistent  with  this  fact,  we  observe  non-ranked  auditors  and  administrators  used  by  Cayman 
Islands  domiciled  funds.  Further,  we  observe  a  higher  likelihood  of  engaging  a  ranked  auditor  and 
administrator  across  non-Cayman  offshore  funds  than  in  U.S.  domiciled  funds,  suggesting  that  any 
offshore  findings  are  not  solely  attributable  to  being  domiciled  in  the  Cayman  Islands. 

Within  each  group,  we  numerically  rank  each  internal  control  by  quality.  These  ranks  are 
provided  in  Table  2,  Panel  A.  Auditor  and  administrator  ranks  are  based  on  the  Alpha  Magazine 
rankings.  For  the  remaining  controls,  we  use  HFDD’s  rank  ordering,  which  is  based  on  its  deter¬ 
mination  of  each  control’s  objectivity.  HFDD  includes  these  rank  orderings  in  the  due  diligence 
reports  it  prepares  for  its  clients.  Furthermore,  to  measure  the  overall  association  between  more 
effective  internal  controls  and  their  determinants,  for  each  fund  we  sum  the  ranks  of  the  individual 
internal  controls  to  form  an  overall  index  score.  An  advantage  of  this  approach  is  that  it  provides 
a  measure  of  overall  internal  control  use  while  allowing  for  substitutes  and  complements.  Table  2, 
Panel  B  presents  descriptive  statistics  for  the  summary  measures  for  each  group  and  our  overall 
internal  control  index. 

Table  3  reports  the  correlations  between  each  of  the  internal  controls  investigated,  the  overall 
internal  control  measure,  the  independent  variables,  and  hedge  fund  fees.  The  six  internal  controls 
we  examine  are  all  significantly  positively  correlated  (at  p  <  0.10,  two-tailed),  aside  from  the 
signature-administrator  pairing.  We  also  find  that  better  internal  controls  are  significantly  posi¬ 
tively  correlated  with  funds  and  managers  that  are  offshore,  levered  funds,  and  management  and 
performance  fees,  and  negatively  correlated  with  fund  age  and  short  bias  in  investment  strategy. 

Determinants  of  Hedge  Fund  Internal  Controls 

Table  4  presents  regressions  modeling  the  determinants  of  internal  controls  used  by  the  funds 
in  the  sample.  To  represent  fund  characteristics,  we  use  the  following  independent  variables:  the 
natural  logarithm  of  the  fund’s  assets  under  management,  the  natural  logarithm  of  the  fund’s  age 
in  days,  and  indicators  for  whether  the  fund  and  fund  manager  are  offshore  and  for  whether  the 
fund  is  offshore  but  the  manager  is  onshore.  To  proxy  for  portfolio  characteristics  and  investment 
style,  we  include  indicator  variables  for  whether  the  fund  uses  leverage,  whether  the  portfolio  has 
a  long  bias,  and  whether  the  portfolio  has  a  short  bias.  To  control  for  the  asset  class  and  liquidity 
of  the  portfolio’s  investments,  we  include  indicator  variables  for  typical  holding  periods  of  port¬ 
folio  investments,  indicator  variables  for  typical  number  of  positions  held  by  the  fund,  and  year 
fixed  effects.^^  Note  that  variables  related  to  investment  style  and  liquidity  also  control  for  varia¬ 
tions  in  fund  complexity. 

Although  previous  research  that  investigates  the  liquidity  of  hedge  fund  investment  portfolios 
employ  a  measure  based  on  the  serial  correlation  of  self-reported  monthly  fund  returns  (Getman¬ 
sky  et  al.  2004),  we  use  the  series  of  indicators  representing  typical  number  and  length  of  positions 
to  avoid  a  mechanical  association  between  the  serial  correlation  measure  and  internal  controls. 
Namely,  if  internal  controls  affect  the  serial  correlation  of  reported  performance  by  influencing  the 


In  unreported  results,  we  attempted  to  reduce  the  dimensionality  of  the  internal  control  variables  through  principal 
component  factor  analysis.  Although  the  six  internal  controls  were  positively  correlated  with  each  other,  as  shown  in 
Table  3,  the  largest  factor  obtained  explained  only  33.9  percent  of  the  overall  variance  in  our  internal  controls. 

“  In  unreported  analyses  we  investigate  whether  the  internal  control  index  increases  through  our  sample  period.  We  find 
that  the  mean  level  of  the  index  monotonically  increases  each  year  from  2002  through  2007,  implying  a  secular 
improvement  in  hedge  fund  internal  controls. 
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TABLE  3 

Correlation  Matrix^ 


1 

2 

3 

4 

5 

6 

7 

8 

1.  Signatures  Index 

2.  Who  Prices  Index 

0.37 

3.  Source  of  Prices  Index 

0.09 

0.24 

4.  NAV  No  Manager 

0.19 

0.54 

0.14 

5.  Ranked  Auditor 

0.22 

0.13 

0.10 

0.13 

6.  Ranked  Administrator 

0.07 

0.15 

0.09 

0.18 

0.16 

7.  Internal  control 

0.64 

0.72 

0.68 

0.50 

0.35 

0.34 

8.  Ln(Fund  Age) 

-0.02 

-0.04 

-0.13 

-0.07 

0.05 

-0.04 

-0.12 

9.  Off.  Mgr.  and  Fund 

0.17 

0.19 

0.08 

0.21 

0.15 

0.21 

0.25 

0.01 

10.  Fund  offshore 

0.03 

-0.01 

-0.04 

0.05 

0.07 

-0.03 

-0.01 

-0.07 

11.  Leverage 

0.12 

0.13 

0.12 

0.03 

0.13 

0.16 

0.17 

0.00 

12.  Short  bias 

0.02 

-0.05 

-0.06 

-0.07 

-0.03 

-0.11 

-0.09 

0.00 

13.  Long  bias 

0.02 

0.01 

-0.11 

0.06 

-0.02 

0.16 

-0.02 

0.05 

14.  Ln(Fund  AUM) 

0.12 

0.04 

-0.17 

0.02 

0.17 

0.16 

0.02 

0.49 

15.  Management  fee 

0.05 

0.15 

0.12 

0.13 

0.09 

0.04 

0.16 

-0.05 

16.  Performance  fee 

0.07 

0.06 

0.36 

0.04 

0.08 

0.07 

0.23 

0.02 

17.  Restatement 

-0.02 

-0.02 

0.05 

0.00 

0.04 

-0.06 

-0.01 

0.04 

9 

10 

11 

12 

13 

14 

15 

16 

10.  Fund  offshore 

-0.71 

11.  Leverage 

0.08 

-0.02 

12.  Short  bias 

-0.04 

0.03 

0.01 

13.  Long  bias 

0.05 

0.00 

0.18 

-0.22 

14.  Ln(Fund  AUM) 

0.09 

0.04 

0.11 

-0.03 

0.17 

15.  Management  fee 

0.17 

-0.04 

0.09 

-0.04 

0.06 

-0.02 

16.  Performance  fee 

0.01 

0.10 

0.13 

0.01 

-0.04 

0.05 

0.09 

17.  Restatement 

-0.01 

0.06 

-0.04 

-0.01 

-0.06 

0.01 

-0.15 

0.00 

“  Variables  are  defined  in  Tables  1  and  2.  Correlations  in  bold  are 

statistically  significant  at  the  p= 

=0.10  level. 

ability  of  the  manager  to  smooth  reported  performance,  then  the  serial  correlation  measure  of 
liquidity  is  likely  to  be  correlated  with  the  quality  of  the  fund’s  internal  controls.^^ 

Table  4,  Panel  A  presents  probit  regressions  for  the  signatures  required  to  transfer  funds  from 
the  bank  accounts  and  prime  broker  accounts.^^  We  find  that  single  internal  signature  authority  is 
significantly  less  likely  to  be  used  by  offshore  funds,  regardless  of  manager  domicile,  consistent 
with  offshore  funds  providing  higher  quality  internal  controls.  Larger  funds  are  also  more  likely  to 


27 

The  strength  of  internal  controls  can  have  an  analogous  effect  on  observed  return  volatility  and  observed  tracking  error. 
These  effects  imply  that  observed  return  volatility  and  tracking  error  is  likely  to  be  an  outcome,  in  addition  to  a 
determinant,  of  the  fund’s  internal  controls.  Given  the  potential  reverse  causality  between  these  variables  and  internal 
controls,  we  do  not  include  observed  return  volatility  or  observed  tracking  error  as  determinants  of  internal  controls. 
Unlike  an  ordered  probit,  the  probit  regressions  are  not  constrained  to  ordering  assumptions  related  to  internal  control 
quality.  An  alternative  approach  is  to  present  a  multinomial  probit  for  each  pair-wise  combination  of  internal  controls. 
One  issue  with  the  multinomial  specification  is  that  the  reported  findings  (coefficients)  are  presented  as  a  difference 
between  each  alternative  internal  control,  resulting  in  many  reported  results.  For  example,  for  pricing  sources  (Table  4, 
Panel  C),  there  would  be  ten  potential  differences  (columns). 
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use  single  signature  authority.  As  shown  from  the  External  Only  and  Dual/Triple  Entity  models, 
offshore  funds  and  offshore  managers  are  also  more  likely  to  require  external  signatures.  Trans¬ 
action  costs  appear  to  affect  signature  authority,  with  funds  that  hold  more  than  1,000  positions 
less  likely  to  require  external  signatures.  If  these  funds  move  positions  regularly  and  are  relatively 
liquid,  then  the  requirement  to  have  an  external  party  authorize  the  transfer  of  funds  on  each 
occasion  could  be  overly  burdensome  and  costly.  These  funds  are,  however,  more  likely  to  require 
at  least  two  internal  signatures,  because  large  numbers  of  positions  increase  the  likelihood  of 
fraud. 

The  right-most  column  of  Table  4,  Panel  A  presents  an  ordered  probit  regression  in  which  the 
dependent  variable  takes  on  the  index  values  (0,  1,  2,  3),  representing  Single  Internal,  Double 
Internal,  External  Only,  and  Dual/Triple  Entity  signature  authority,  respectively,  enabling  us  to 
determine  the  overall  association  between  more  effective  signature  internal  controls  and  the  inde¬ 
pendent  variables.  Under  this  specification  we  observe  that  larger  funds  and  funds  that  are  domi¬ 
ciled  offshore  are  significantly  associated  with  increased  control  over  signatures  required  to  trans¬ 
fer  funds  out  of  bank  and  prime  broker  accounts. 

Table  4,  Panel  B  presents  a  series  of  probit  regressions,  modeling  who  prices  the  hedge  fund’s 
investment  positions.  We  find  that  offshore  funds  are  significantly  less  likely  to  internally  price 
their  portfolios  and  significantly  more  likely  to  use  external  pricing  services.  Levered  funds  are 
also  significantly  less  likely  to  have  the  manager  price  investment  positions  and  more  likely  to  use 
external  pricing  services.  Furthermore,  funds  with  a  short  bias  in  their  investment  positions  are 
more  likely  to  use  internal  pricing  and  less  likely  to  use  external  pricing.  This  finding  is  likely 
driven  by  the  proprietary  nature  of  short  selling  and  the  potential  for  outsiders  to  profit  from 
knowing  that  the  fund  has  to  unwind  its  short  positions.  The  right-most  column  of  the  panel 
presents  an  ordered  probit  regression  of  overall  quality  with  respect  to  who  prices  individual 
investment  positions.  The  coefficients  in  this  specification  are  consistent  with  the  inferences  from 
the  individual  pricing  regressions,  with  offshore  funds  and  levered  funds  more  likely  to  rely  on 
external  rather  than  internal  pricing,  and  those  funds  with  a  short  bias  in  their  investment  strategy 
more  likely  to  rely  on  internal  pricing  than  external  pricing. 

Table  4,  Panel  C  presents  probit  regressions  modeling  the  source  of  prices  used  to  value  the 
funds’  portfolios.  Consistent  with  the  results  for  who  prices  the  portfolio,  offshore  funds  are  less 
likely  to  use  internal  and  model-based  sources  of  prices  and  more  likely  to  use  more  dealer  and 
exchange-based  sources  to  value  their  portfolios.  Funds  that  use  leverage  are  less  likely  to  use 
internal  sources  and  more  likely  to  use  dealer  sourced  prices,  consistent  with  hedge  funds  with 
greater  potential  agency  costs  using  relatively  more  objective  pricing  sources.  Further,  we  observe 
that  funds  with  some  bias  in  their  investment  style,  long  or  short,  are  less  likely  to  use  external 
sources,  such  as  dealer  and  exchange-based  prices,  and  are  more  likely  to  use  internal  price 
sources.  The  right-most  column  of  Panel  C  provides  the  ordered  probit  regression  of  overall 
pricing  source  quality.  We  again  observe  that  funds  with  bias  in  their  investment  style  making  less 
use  of  objective  price  sources,  and  some  evidence  that  small  funds  use  less  objective  sources.  We 
also  observe  that  funds  with  managers  domiciled  offshore  make  greater  use  of  more  objective 
pricing  source  measures. 

The  left-most  colunrn  of  Table  4,  Panel  D  presents  probit  regressions  modeling  whether  the 
NAV  is  determined  without  manager  involvement.  We  observe  that  managers  of  offshore  funds, 
regardless  manager  location,  are  more  likely  to  determine  and  report  NAV  without  manager 
involvement.  These  findings  are  consistent  with  the  pricing  findings  observed  in  Panels  B  and  C. 
The  remaining  columns  from  Panel  D  model  whether  the  fund  uses  an  auditor  or  an  administrator 
ranked  by  Institutional  Investor’s  Alpha  Survey.  Larger  funds  and  offshore  funds  are  more  likely 
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to  use  a  higher  quality  auditor  and  administrator.  Additionally,  funds  that  use  leverage  are  more 
likely  to  use  a  higher  quality  auditor,  and  younger  funds  are  more  likely  to  use  a  higher  quality 
administrator. 

Table  4,  Panel  E  provides  the  ordinary  least-squares  regression  modeling  overall  internal 
control  quality  as  a  function  of  fund  characteristics.  Overall,  the  evidence  is  consistent  with 
offshore  funds  and  managers  adopting  stronger  internal  control  mechanisms  to  decrease  the  like¬ 
lihood  of  fraud  and  financial  misstatements,  incorporating  stricter  signature  authority  for  the 
transfer  of  funds,  using  independent  pricing  mechanisms,  and  employing  more  reputable  outside 
service  providers.  These  findings  suggest  that  investors  demand  greater  internal  controls  for  funds 
that  are  not  subject  to  the  U.S.  legal  infrastructure.  There  is  also  evidence  that  funds  that  use 
leverage,  thereby  increasing  fund  risk,  also  employ  stronger  internal  control  mechanisms.  Overall, 
internal  control  quality  is  negatively  associated  with  fund  age,  consistent  with  younger  funds 
having  greater  incentives  to  implement  stronger  internal  controls  to  reduce  greater  concerns  with 
adverse  selection  and  moral  hazard.  Finally,  short  bias  funds  have  lower  quality  internal  controls, 
driven  in  part  by  these  funds  reducing  proprietary  costs  that  can  arise  from  external  parties 
learning  the  funds’  investment  positions. 

Determinants  of  Hedge  Fund  Fees 

Managers  have  incentives  to  deflate,  inflate,  and  smooth  reported  performance  and  assets 
under  management.  First,  when  the  investor  cashes  out,  the  value  of  the  investor’s  asset  is  revealed 
by  the  hedge  fund.  At  this  point,  the  manager  has  an  incentive  to  deflate  the  revealed  value  of  the 
investor’s  claim,  thereby  increasing  the  value  of  assets  retained  by  the  hedge  fund.  Managers  also 
have  incentives  to  inflate  reported  performance  because  they  collect  management  fees  based  on 
reported  assets  under  management  and  reported  performance  is  a  determinant  of  capital  flows 
(Fung  et  al.  2008).  Finally,  they  have  incentives  to  smooth  reported  performance  to  reduce  the 
perceived  volatility  of  the  fund  and  therefore  increase  its  perceived  risk  adjusted  performance 
(Getmansky  et  al.  2004). 

We  posit  that  more  extensive  internal  controls  improve  the  accuracy  and  precision  of  asset 
valuations  and  returns  disclosures  made  to  investors,  and  investors  are  willing  to  pay  higher 
management  fees  to  funds  with  more  extensive  internal  controls  because  of  the  decreased  likeli¬ 
hood  and  magnitude  of  losses  arising  from  managers’  manipulation  of  reported  performance. 
Therefore,  ceteris  paribus,  we  predict  that  internal  controls  that  provide  greater  opportunity  to 
manipulate  reported  performance  are  negatively  associated  with  the  management  fees  paid  to  fund 
managers. 

Aside  from  any  variations  in  management  fees,  the  use  of  internal  controls  can  also  affect 
performance  fees.  The  incentives  to  manipulate  the  reported  performance  and  valuations  increase 
with  the  sensitivity  of  the  manager’s  compensation  relative  to  reported  values  and  performance. 
An  obvious  mechanism  to  vary  this  sensitivity  is  through  the  proportion  of  fees  rewarded  from 
investment  profits,  namely  the  performance  fee.  Therefore,  ceteris  paribus,  we  predict  that  internal 
controls  that  provide  greater  opportunity  to  manipulate  reported  performance  will  be  negatively 
associated  with  the  performance  fees  paid  to  fund  managers. 

Table  5  presents  regressions  of  the  determinants  of  hedge  fund  fees.  For  these  regressions,  we 
use  the  same  independent  variables  as  in  the  “Determinants  of  Hedge  Fund  Internal  Controls” 
section.^^  Panel  A  presents  ordinary  least-squares  regressions  that  model  the  determinants  of  the 


The  use  of  the  same  independent  variables  in  the  determinants  of  internal  control  use  and  the  hedge  fund  fee  models  is 
equivalent  to  using  a  two-stage  approach,  in  which  the  first  stage  estimates  internal  control  use,  and  the  second  stage 
regresses  fees  on  the  residuals  from  the  first  stage. 
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management  fee  paid  to  the  hedge  fund  manager,  with  the  left-most  model  presenting  the  results 
with  controls  only,  and  the  second  model  with  controls  and  our  measure  of  internal  control  quality. 
For  ease  of  interpretation,  we  multiply  the  dependent  variable,  the  management  fee,  by  100. 
Examining  our  overall  measure  of  internal  controls  on  management  fees  reveals  that  there  is  no 
significant  effect.  Taken  by  itself,  this  evidence  suggests  that  our  investigated  internal  controls  are 
not  priced  through  the  management  fees. 

Table  5,  Panel  B  presents  ordinary  least-squares  regressions  that  model  the  determinants  of 
the  performance  fee  paid  to  the  manager.  As  with  the  management  fee,  we  multiply  the  perfor¬ 
mance  fee  by  100.  With  respect  to  the  control  variables,  performance  fees  are  positively  associated 
with  funds  that  have  higher  turnover  in  their  investment  portfolios.  Examining  the  internal  control 
coefficients,  we  find  that  better  internal  controls  are  significantly  positively  associated  with  per¬ 
formance  fees  {/3  =  0.261;  p  <  0.05).  In  terms  of  economic  significance,  the  standard  deviation  of 
the  index  is  2.78,  which  approximately  represents  going  from  internal  pricing  to  dual/triple  entity 
pricing.  In  terms  of  fees,  a  one  standard  deviation  increase  in  the  index  is  associated  with  a  0.726 
percentage  point  increase  in  performance  fee. 

The  significant  coefficient  on  the  internal  control  index  is  consistent  with  investors  protecting 
against  potential  misstatement  as  a  result  of  inadequate  internal  controls  through  lower  perfor¬ 
mance  fees.  It  is  also  consistent  with  investors  and  hedge  funds  reducing  the  costs  from  moral 
hazard  and  the  motivation  for  managers  to  manipulate  reported  performance.  This  finding  sug¬ 
gests  that,  as  the  opportunity  for  manipulation  of  fund  performance  increases,  the  reliance  on  fund 
performance  as  an  incentive  mechanism  decreases. 

Endogeneity 

A  potential  concern  is  that  our  internal  control  measures  are  endogenous.  Although  we  include 
the  variables  that  predict  internal  control  quality  in  the  model  determining  fees,  we  recognize  that 
endogeneity  could  affect  our  inferences  because  of  simultaneity  and/or  correlated  omitted  vari¬ 
ables.  Below  we  discuss  these  endogeneity  concerns  and  how  we  address  them. 

First,  our  motivation  for  adopting  our  empirical  design  is  driven  by  the  ordering  of  the  various 
parameters  investigated  in  this  study.  Given  that  fund  managers  have  specialized  expertise  in 
particular  areas  of  investment  (Brown  and  Goetzmann  2003),  it  is  unlikely  that  they  change  fund 
characteristics  over  time,  justifying  the  assumption  that  these  characteristics  are  predetermined.^' 
Internal  controls  are  then  likely  chosen  as  a  function  of  these  fund  characteristics.  The  manage¬ 
ment  and  performance  fees  are  detailed  in  the  PPM  to  investors.  Investors  first  observe  these 
predetermined  characteristics  of  the  fund  and  the  internal  controls  used  by  the  fund,  and  then 
decide  whether  to  invest  in  the  fund  based  on  the  terms  and  fees  offered.  Furthermore,  given  that 
the  observed  internal  controls  and  fees  are  determined  based  on  those  offered  to  all  fund  investors, 

32 

the  likelihood  that  internal  controls  and  fees  vary  by  individual  investor  characteristics  is  low. 


We  also  perform  a  bootstrap  analysis  using  an  equal  sample  of  funds  with  20  percent  performance  fees  randomly 
selected  without  replacement  with  the  non-20  percent  performance  fee  funds  (n  =  67)  to  examine  the  extent  to  which 
our  findings  and  power  could  be  influenced  by  the  high  proportion  of  funds  with  20  percent  performance  fees.  This 
selection  process  was  repeated  for  1 ,000  iterations  to  estimate  the  parameter  and  level  of  significance.  We  again  observe 
a  significantly  positive  association  between  internal  controls  and  performance  fees,  suggesting  our  results  are  invariant 
to  this  alternative  sampling  procedure. 

Moreover,  investors  have  the  ability  and  the  incentives  to  monitor  the  fund’s  internal  controls  on  an  ongoing  basis, 
thereby  limiting  a  manager’s  ability  to  weaken  the  fund’s  internal  controls. 

We  acknowledge  that  some  investors  have  greater  bargaining  power  and  are,  therefore,  able  to  negotiate  special  fees  and 
terms.  But,  such  fees  and  terms  are  laid  out  in  “side  letters”  that  are  separate  from  the  PPM.  Further,  some  investors 
could  negotiate  a  “favored  nation”  clause,  which  would  result  in  any  favorable  terms  contracted  with  other  investors  also 
granted  to  them,  suggesting  that  the  presence  of  such  a  clause  would  increase  the  overall  cost  to  the  fund  when  offering 
more  favorable  terms  to  some  investors. 
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Second,  to  investigate  the  likelihood  that  unobserved  fund  characteristics  determine  both 
internal  controls  and  fees,  we  adopt  the  approach  developed  by  Rosenbaum  (2002)  to  estimate  a 
bound  on  the  extent  to  which  omitted  variables  would  have  to  be  correlated  with  our  measures  to 
drive  the  results.  We  use  Krauth’s  (2007)  calculation  to  estimate  how  large  the  correlation  between 
unobserved  heterogeneity  and  the  internal  control  index  would  have  to  be  compared  to  the  corre¬ 
lation  between  the  internal  control  index  and  the  other  independent  variables.  For  the  95  percent 
confidence  interval  on  the  internal  control  index  coefficient  to  include  zero,  the  correlation  be¬ 
tween  unobserved  heterogeneity  and  the  index  would  have  to  be  greater  than  56  percent  of  the 
correlation  between  the  index  and  the  other  independent  variables.  Although  such  a  correlation  is 
possible,  we  believe  it  to  be  unlikely  given  the  range  of  fund  characteristics  included  in  our 
independent  variables  and  the  inclusion  of  relevant  variables  used  in  published  hedge  fund  re¬ 
search.  Nevertheless,  the  extent  to  which  such  omitted  variables  exist  is  a  limitation  of  our  study. 

Third,  an  alternative  approach  to  address  potential  endogeneity  when  examining  the  determi¬ 
nants  of  internal  controls  and  fees  is  to  hold  the  investment  characteristics  of  the  fund  constant, 
thereby  ruling  out  that  the  internal  controls  are  determined  by,  or  are  capturing,  the  underlying 
investment  characteristics  of  the  fund.  To  implement  this  approach,  we  perform  our  analyses  only 
for  those  funds  that  typically  held  1  to  39  positions,  the  investment  positions  category  with  the 
greatest  number  of  sample  funds.  For  these  172  funds,  we  observe  the  determinants  of  internal 
controls  and  the  association  between  internal  controls  and  performance  fees  are  consistent  with  the 
presented  results  for  the  full  sample  (/3  =  0.502;  se  =  0.147;  p  <  0.01).  Additionally,  we  perform 
our  analyses  only  for  those  hedge  funds  that  typically  held  their  positions  for  one  year  and  greater, 
the  holding  period  category  with  the  greatest  number  of  sample  funds.  Again,  we  observe  the 
determinants  of  internal  control  use  and  the  association  between  internal  controls  and  fees  to  be 
consistent  with  the  overall  sample  (/3  =  0.622;  se  =  0.231;  p  <  0.01). 

Finally,  we  consider  an  alternative  measure  based  on  outcomes,  namely  restatements.  Restate¬ 
ments  are  not  chosen  by  hedge  funds  at  the  time  of  due  diligence,  making  them  predetermined. 
Examining  the  right-most  column  of  Table  5,  Panel  A,  we  observe  a  statistically  significant 
negative  relationship  between  management  fees  and  whether  the  fund  has  restated  the  performance 
reported  to  investors  (yS  =  —0.227;  p  <  0.05).  This  relationship  between  restatements  and  man¬ 
agement  fees  is  also  economically  significant.  For  the  mean  fund,  a  restatement  is  associated  with 
$692,000  less  in  management  fees  per  annum,  which  is  equivalent  to  a  15  percent  ($692,000/ 
$4,608,633)  reduction.  The  observation  of  restated  performance  by  the  hedge  fund  increases  the 
likelihood  that  future  fund  performance  be  restated,  thereby  reducing  confidence  in  the  accuracy  of 
present  and  future  reported  fund  performance.  Therefore,  this  evidence  demonstrates  a  significant 
association  between  restatements,  a  predetermined  and  observable  event  that  occurred  prior  to 
contracting,  and  management  fees  paid  by  investors,  providing  further  support  for  the  conclusion 
that  investors  protect  against  the  risk  of  future  misstatements  by  paying  lower  fees.  However,  in 
contrast  to  the  management  fee  findings,  we  find  little  evidence  of  a  relationship  between  restate¬ 
ments  and  performance  fees. 

VI.  ANALYSIS  OF  FRAUD  INVESTIGATIONS 

To  investigate  a  performance  consequence  from  the  use  of  internal  controls,  we  examine 
whether  internal  controls  are  associated  with  future  regulatory  investigations  of  fraud  and/or 
financial  misstatements  on  the  part  of  the  fund  and  its  managers.  We  use  investigations  instead 


We  do  not  investigate  the  association  between  internal  controls  and  risk-adjusted  returns  because  their  predicted  asso¬ 
ciation  is  ambiguous.  While  stronger  internal  controls  could  lead  to  better  performance  for  hedge  fund  investors,  prior 
research  finds  evidence  consistent  with  hedge  fund  managers  intentionally  smoothing  self-reported  returns  to  increase 
observed  risk-adjusted  performance  (for  a  discussion,  see  Getmansky  et  al.  2004).  Internal  controls  can  reduce  oppor- 
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of  the  resolution  of  legal  proceedings  because  of  the  long  duration  between  the  commencement  of 
an  investigation  and  the  conclusion  of  legal  proceedings.  Such  investigations  can  be  considered  an 
extreme  form  of  poor  performance  because  they  are  typically  associated  with  fund  liquidations 
and  deeply  discounted  investor  redemptions. 

For  the  funds  in  the  sample  we  search  both  Factiva  and  the  SEC’s  website  for  investigations 
of  fraud  and/or  financial  misstatements.  For  Factiva  we  use  a  keyword  search  that  captures  fraud 
and/or  financial  misstatement  events  and  carry  out  the  search  over  the  period  starting  the  month 
after  completion  of  the  due  diligence  report  and  ending  in  March  2009.^"^  For  the  SEC  we  search 
the  SEC’s  website  for  the  name  of  every  fund  in  our  sample.  We  then  read  every  Factiva  article 
and  SEC  document  to  determine  whether  the  fund  is  under  investigation  for  fraud  and/or  financial 
misstatements.  We  identify  12  funds  in  our  sample  as  investigated  or  under  investigation  of  fraud 
and/or  financial  misstatements.  These  events  are  not  trivial.  All  12  funds  were  investigated  for 
fraud  and/or  intentional  financial  misstatement,  and  all  were  liquidated  with  investors  incurring 
substantial  losses. 

Table  6  presents  probit  regressions  where  the  dependent  variable  is  coded  as  1  if  the  fund  is 
subject  to  investigations,  and  0  otherwise.  The  independent  variables  are  the  same  variables  we 
use  to  examine  the  determinants  of  the  internal  controls,  thereby  allowing  us  to  investigate 
whether  deviations  from  expected  internal  controls  are  associated  with  future  allegations  and/or 
investigations. 

The  first  column  presents  the  baseline  probit  regression  model,  which  is  statistically  signifi¬ 
cant  at  p  <  0.001.  Levered  funds  are  significantly  less  likely  to  be  subject  to  future  investigations 
(marginal  effect  =  —1.00  percentage  points),  suggesting  that  leverage  providers  either  carry  out 
explicit  monitoring  or  implement  extensive  screening  prior  to  providing  credit.  Funds  that  hold 
their  investment  positions  for  quarters  (marginal  effect  =  —0.43  percentage  points)  are  signifi¬ 
cantly  less  likely  to  have  been  investigated  or  under  investigation  for  fraud  and/or  financial 
misstatements.  Consistent  with  greater  opportunities  to  commit  fraud  and/or  financial  misstate¬ 
ments,  we  find  that  funds  holding  thousands  of  investment  positions  are  significantly  more  likely 
to  be  subject  to  future  regulatory  investigations  (marginal  effect  =  5.94  percentage  points).  Fur¬ 
ther,  the  coefficient  on  fund  age  is  negative  and  significant.  Recall  from  Table  4,  Panel  E  that  we 
find  that  the  internal  control  index  decreases  in  fund  age,  suggesting  that  older  funds  have  valuable 
reputations  that  substitute  for  explicit  internal  controls  by  providing  incentives  against  carrying  out 
actions  that  would  lead  to  investigations  of  fraud  and/or  financial  misstatements. 

In  the  second  column  of  Table  6  we  introduce  the  internal  control  index.  Consistent  with 
higher  levels  of  internal  controls  being  associated  with  a  lower  probability  of  investigation,  the 
coefficient  on  the  index  is  negative,  though  it  is  not  statistically  significant.  Next  we  examine  the 
effect  of  individual  internal  controls  on  future  regulatory  investigations  of  fraud  and/or  financial 
misstatement.  When  we  introduce  the  indicator  variable  for  whether  the  manager  is  not  involved 
in  setting  and  reporting  the  fund’s  official  NAV  to  investors,  its  coefficient  is  negative  and  sig¬ 
nificant  (yS  =  — 1.076;  p  <  0.05),  with  a  marginal  effect  of  1.14  percentage  points.  With  respect  to 
economic  significance,  the  unconditional  probability  of  a  sample  fund  being  investigated  is  3 
percent,  implying  that  not  having  the  fund  manager  involved  in  setting  and  reporting  the  NAV 
reduces  the  likelihood  of  being  investigated  for  fraud  by  over  one-third. 


tunistic  manipulation,  suggesting  that  funds  with  weaker  internal  controls  are  likely  to  have  higher  (artificial)  observed 
risk-adjusted  performance.  Given  these  conflicting  predictions,  we  therefore  examine  fraud  and  misstatements  for  which 
there  is  an  unambiguous  prediction. 

We  used  the  following  keywords:  wind  w/5  down;  fraud*;  restate*;  bankrupt*;  death;  liquidat*;  blow*  w/5  up;  failure; 
fine*;  defraud*;  guilty;  conspir*;  scandal;  deceive*;  plea;  missing;  insolven*;  collapse*;  probe*;  prosecut*;  fled;  and 
indict*. 
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TABLE  6 


Determinants  of  Fraud  Investigations 


Independent  Variables 

Base 

With  Index 

With  NAV  No  Manager 

Coefficient 

Std. 

Error 

Coefficient 

Std. 

Error 

Coefficient 

Std. 

Error 

Internal  Control  Index 

-0.040 

0.049 

NAV  No  Manager 

-1.076** 

0.403 

Ln(Fund  Age) 

-0.252** 

0.113 

-0.267** 

0.111 

-0.309*** 

0.118 

Off.  Mgr.  and  Fund 

0.060 

0.422 

0.214 

0.491 

0.540 

0.543 

Fund  Offshore  Only 

•  0.298 

0.426 

0.471 

0.499 

0.766 

0.535 

Leverage 

-1.011*** 

0.338 

-1.024*** 

0.361 

-1.157*** 

0.399 

Short  Bias 

-0.357 

0.461 

-0.386 

0.495 

-0.314 

0.463 

Long  Bias 

0.515 

0.330 

0.548 

0.356 

0.535 

0.355 

Ln(Fund  AUM) 

-0.051 

0.070 

-0.049 

0.070 

-0.060 

0.075 

Years 

-1.009* 

0.601 

-1.037* 

0.580 

-1.471** 

0.591 

Quarters 

-0.856** 

0.352 

-0.913** 

0.376 

-1.001** 

0.399 

Weeks 

-0.001 

0.441 

-0.001 

0.444 

-0.291 

0.454 

Days 

-0.167 

0.427 

-0.148 

0.413 

-0.172 

0.437 

1000+  Positions 

1.376** 

0.599 

1.228** 

0.584 

1.517** 

0.628 

200—999  Positions 

0.396 

0.501 

0.377 

0.506 

0.527 

0.490 

100—199  Positions 

-0.001 

0.526 

-0.076 

0.518 

0.104 

0.547 

1—39  Positions 

-0.589 

0.383 

-0.696* 

0.397 

-0.738* 

0.407 

Intercept 

-2.875** 

1.111 

-2.677** 

1.189 

-1.617 

1.209 

Year  Fixed  Effects 

Yes 

Yes 

Yes 

Pseudo 

0.309 

0.316 

0.353 

Log-Likelihood 

-32.322*^ 

-31.952*^ 

-30.244* 

** 

n 

401 

397 

400 

*,  **,  ***  Denotes  significance  at  p  =  0.10,  p  =  0.05,  and  p  =  0.01,  respectively,  (based  on  a  two-tailed  test). 

Table  reports  probit  estimation.  Variables  are  defined  Tables  1  and  2.  Omitted  indicator  variables  are  Months  and  40-99 
Positions.  Standard  errors  for  all  regressions  are  clustered  at  the  manager  level.  Dependent  variable  is  an  indicator  for 
whether  there  allegations  and/or  investigations  of  fraud  and/or  financial  misstatements  for  the  fund. 


Overall,  we  observe  some  evidence  consistent  with  the  choice  of  internal  controls  being 
associated  with  a  critical  performance  consequence,  namely  fraud  investigations  against  the  hedge 
fund.  Moreover,  we  find  that  reputational  incentives  as  proxied  by  fund  age  and  explicit  external 
monitoring  and/or  screening  by  leverage  providers  are  associated  with  lower  likelihoods  of  future 
regulatory  investigations  of  fraud  and/or  financial  misstatements. 

VII.  ROBUSTNESS 

Unit  of  Analysis 

We  perform  the  analyses  of  the  determinants  of  internal  controls  and  investor  fees  at  the  fund 
level  and  cluster  the  standard  errors  by  manager  to  address  any  manager-related  cross-correlations. 
Alternatively,  we  could  perform  the  analyses  at  the  manager  level.  To  investigate  the  robustness  of 
our  results,  we  repeat  all  analyses  using  three  alternative  selection  criteria  with  only  one  observa¬ 
tion  per  manager.  The  criteria  are:  (1)  the  first  investigated  fund  by  manager,  (2)  the  last  investi¬ 
gated  fund  by  manager,  and  (3)  an  average  of  the  dependent  and  independent  variables  by  man- 
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ager.  For  all  three  approaches,  the  results  (untabulated)  for  both  the  determinants  of  internal 
controls  and  their  relationship  to  fees  are  consistent  with  the  presented  findings. 

The  reported  analyses  employ  fund  characteristics  associated  with  assets  under  management 
and  fund  age,  which  have  been  commonly  used  by  researchers  to  capture  cross-sectional  variation 
in  fund  characteristics  (Aragon  2007;  Brown  et  al.  2008).  The  extent  to  which  the  use  of  fund- 
level  characteristics,  rather  than  manager-level  characteristics,  influences  our  findings  is  an  em¬ 
pirical  question.  We  consider  two  manager-based  measures,  namely  the  manager’s  assets  under 
management  and  the  natural  logarithm  of  the  number  of  funds  that  the  manager  manages.  The 
Pearson  correlations  between  the  fund’s  assets  under  management  and  the  natural  logarithm  of  the 
fund’s  age  in  days  with  the  manager’s  assets  under  management  are  0.61  (p  <  0.01)  and  0.25  (p 

<  0.01),  respectively.  The  Pearson  correlations  between  the  fund’s  assets  under  management  and 
the  natural  logarithm  of  the  fund’s  age  in  days  with  the  natural  logarithm  of  the  number  of  funds 
that  the  manager  manages  are  0.16  (p  <  0.01)  and  0.03  (p  <  0.56),  respectively.  Using  the 
manager  characteristics  in  the  specifications  presented  in  Table  5,  we  again  observe  a  significantly 
positive  association  between  overall  internal  control  quality  and  performance  fees  (yS  =  0.236;  p 

<  0.05)  and  no  significant  association  between  internal  control  quality  and  management  fees  {p  = 
0.009;  se  =  0.011). 

Internal  Control  Index 

To  examine  the  extent  that  the  choice  and  weighting  of  index  items  significantly  affects  our 
inferences,  we  examine  an  alternative  index  to  capture  overall  hedge  fund  internal  control  quality. 
In  particular,  we  adopt  an  index  based  solely  on  the  extent  that  the  internal  controls  did  or  did  not 
include  manager  involvement.  For  signature  authority  to  transfer  funds,  who  prices  the  portfolio, 
and  the  pricing  source  of  the  portfolio,  we  code  0  for  each  component  if  the  manager  was 
involved,  and  1  if  the  manager  was  not  involved.  For  this  alternative  index,  we  score  each  fund 
between  0  and  3  (mean  =  2.010;  standard  deviation  =  0.896).  Consistent  with  the  reported  hedge 
fund  quality  index  results,  we  again  observe  no  significant  association  between  internal  controls 
and  management  fees  (yS  =  0.012;  se  =  0.036),  and  a  significantly  positive  association  between 
internal  controls  and  performance  fees  (y8  =  0.586;  se  =  0.287;  p  <  0.05).  In  terms  of  economic 
significance,  a  one  standard  deviation  increase  in  the  alternative  index  is  associated  with  a  0.526 
percentage  point  increase  in  the  performance  fee.  The  magnitude  of  this  effect  is  similar  to  the 
magnitude  for  the  reported  internal  control  index  (0.526  versus  0.726). 

Self-Selection  in  Reporting  Performance 

We  compare  our  sample  hedge  funds  that  do  (n  =  232)  and  do  not  (n  =  195)  self-report 
performance  to  the  Lipper  TASS  database,  which  is  the  primary  database  used  in  academic  re¬ 
search  on  hedge  funds  (see  Lo  [2008]  for  a  discussion  of  the  database).  Similar  to  our  sample, 
Lipper  TASS  and  other  hedge  fund  databases  do  not  comprise  the  complete  universe  of  hedge 
funds,  as  these  databases  consist  of  information  self-reported  from  hedge  funds  that  choose  to  be 
included  in  the  databases.  However,  these  datasets  are  an  order  of  magnitude  greater  in  number  of 
reported  funds  than  our  database,  and  the  extent  of  self-selection  is  helpful  for  evaluating  the 
implications  of  our  findings  to  the  broader  hedge  fund  population. 

We  perform  univariate  tests  for  differences  across  fund  age,  fund  and  manager  location,  use  of 
leverage,  investment  style,  management  and  performance  fees,  and  internal  controls.  Across  all 
these  variables,  the  only  significant  difference  is  that  older  sample  funds  have  a  significantly 
higher  likelihood  of  reporting  to  Lipper  TASS  (p  <  0.01),  with  the  mean  number  of  days  being 
1,252  (742)  for  sample  funds  reporting  (not  reporting)  to  Lipper  TASS.  An  explanation  for  the 
difference  across  fund  age  is  the  potential  for  self-selection  based  on  performance  for  funds  that 
choose  to  report  to  Lipper  TASS.  Specifically,  if  older  surviving  funds  generally  have  better 
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performance,  then  they  have  greater  incentives  to  self-report  to  database  vendors  for  promotion 
and  marketing  purposes.  It  should  be  noted  that  this  significant  difference  is  above  any  other 
potential  self-selection  bias  arising  from  HFDD  being  appointed  to  perform  due  diligence  on  a 
fund.  In  addition,  we  note  that  investors  were  actively  considering  an  investment  in  the  funds 
included  in  our  sample,  implying  that  the  observed  trade-offs  reflect  market  conditions  at  the  point 
of  due  diligence.  Given  these  results,  we  expect  our  findings  to  be  representative  across  a  broad  set 
of  funds. 

Further  Controlling  for  Investment  Style 

Although  we  control  for  asset  class  and  investment  style  by  including  typical  holding  period, 
number  of  positions,  and  the  use  of  long  or  short  bias,  we  examine  the  extent  that  further  controls 
for  asset  class  and  investment  style  influences  our  observed  findings.  For  our  232  sample  obser¬ 
vations  that  report  to  Lipper  TASS,  we  include  indicator  variables  for  each  fund’s  investment  style 
as  classified  by  Lipper  TASS.  These  investment  styles  include:  convertible  arbitrage,  dedicated 
short  seller,  emerging  markets,  equity  market  neutral,  event-driven,  fixed  income  arbitrage,  global 
macro,  and  long/short  equity.  In  unreported  results,  all  findings  for  the  determinants  of  internal 
controls  and  the  determinants  of  investor  fees  are  robust  to  the  inclusion  of  the  Lipper  TASS 
investment  style  indicator  variables  and  consistent  with  our  empirical  predictions,  with  the  excep¬ 
tion  of  our  investment  position  variables.  Replicating  the  results  from  Table  4,  Panel  E  with 
investment  style  variables,  we  observe  that  both  1000-1-  positions  {p  —  —0.511,  originally 
—2.259)  and  1-39  positions  {p  =  0.243,  originally  —0.547)  are  no  longer  significantly  negatively 
associated  with  our  overall  internal  control  index.  One  interpretation  of  these  findings  is  that  our 
investment  position  variables  are,  to  some  extent,  capturing  the  underlying  investment  style  of  the 
funds,  as  based  on  Lipper  TASS  classifications.  Finally,  we  note  that  the  coefficient  on  1000-1- 
positions  (/3  =  —0.985,  originally  —0.606)  remains  significant  and  negatively  associated  with 
greater  signature  authority  with  the  inclusion  of  investment  style  variables  (Table  4,  Panel  A). 

Offshore  Results 

In  the  reported  analyses,  we  categorize  the  domicile  of  the  hedge  fund  into  on-  and  offshore. 
We  examine  the  robustness  of  our  findings  by  replacing  the  fund  domicile  variables  in  the  reported 
analyses  with  indicator  variables  for  each  country  with  ten  or  more  funds.  Our  results  are  not 
altered  by  the  inclusion  of  country  indicator  variables. 

VIIL  CONCLUSION 

We  investigate  the  determinants  and  internal  controls  of  hedge  funds  and  their  association 
with  the  fees  that  funds  charge  investors  using  proprietary  due  diligence  reports  commissioned  by 
sophisticated  investors.  The  multifaceted  nature  of  this  data  provides  substantial  detail  regarding 
fund  characteristics,  operations,  and  internal  controls.  These  reports  are  based  on  interviews, 
analyses  of  contract  terms  and  financial  position,  and  third-party  sources,  in  contrast  to  other 
hedge  fund  data  sets  that  are  solely  based  on  self-reported,  unverified  information. 

Consistent  with  our  predictions,  we  find  that  managers  of  funds  domiciled  offshore  adopt 
stronger  mechanisms  to  decrease  the  likelihood  of  fraud  and  financial  misstatements,  incorporate 
stricter  signature  authority  to  transfer  funds,  use  external  pricing  services,  and  use  more  reputable 
outside  service  providers.  We  also  observe  that  levered  funds  and  younger  funds  are  more  likely  to 
have  stronger  internal  controls.  Further,  we  find  that  funds  that  have  a  short  bias  investment 
strategy  are  less  likely  to  employ  independent  pricing  sources.  This  finding  likely  reflects  greater 
proprietary  costs  from  others  trading  against  the  short  positions  of  these  funds. 

With  respect  to  outcomes  from  internal  control  weaknesses,  we  find  that  funds  that  have 
restated  performance  charge  lower  management  fees.  Further,  we  find  a  positive  association  be- 
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tween  the  quality  of  internal  controls  and  the  performance  fees  rewarded  to  managers,  which  is 
consistent  with  investors  protecting  against  potential  financial  misstatements  by  placing  less  em¬ 
phasis  on  the  reported  performance  when  internal  controls  are  less  likely  to  detect  or  prevent 
managers  from  manipulating  reported  performance.  Finally,  we  find  that  funds  in  which  the  man¬ 
ager  is  removed  from  setting  and  reporting  the  official  NAV  to  investors  are  significantly  less 
likely  to  be  subject  to  future  regulatory  investigations  for  fraud  and/or  financial  misstatements. 
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ABSTRACT:  This  study  examines  discretionary  bonus  payments  by  firms  to  senior- 
level  executives.  Interpreting  discretionary  bonuses  as  the  result  of  implicit  incentive 
contracts,  I  analyze  an  analytical  model  that  includes  a  contractible  and  a  non- 
contractible  performance  measure.  The  model  yields  the  primary  hypothesis  that  dis¬ 
cretionary  bonuses  occur  when  the  outcome  of  the  contractible  measure  is  either  low  or 
high,  but  not  when  the  contractible  outcome  falls  in  the  medium  range.  Based  on  a 
sample  collected  from  public  sources,  I  find  empirical  support  for  the  notion  that  discre¬ 
tionary  bonuses  are  paid  based  on  non-contractible  performance  measures  that  are 
related  to  future  financial  performance.  Moreover,  discretionary  bonus  payments  occur 
significantly  more  often  when  the  contractible  performance  measure  falls  in  the  tails  of 
the  distribution.  In  contrast,  I  do  not  find  support  for  the  predictions  that  discretionary 
bonus  payments  are  related  to  the  manipulability  of  the  contractible  performance  mea¬ 
sures  or  that  discretionary  bonus  payments  are  related  to  the  power  of  the  executives  in 
the  companies. 

Keywords:  discretionary  bonus;  incentive  compensation;  implicit  contract;  non- 
contractible  performance  measure;  managerial  power. 

Data  Availability:  All  data  used  in  this  study  are  publicly  available. 

L  INTRODUCTION 

This  study  examines  discretionary  bonus  payments.  Discretionary  bonus  payments  are  bonus 
payments  that  are  not  determined  by  a  strict  formulaic  approach.  For  example.  Sun  Micro¬ 
systems’  chief  executive  officer  Scott  McNealy  -did  not  receive  a  bonus  according  to  his 
bonus  formula  for  fiscal  year  2005;  however,  he  was  paid  a  $1.1  million  discretionary  bonus.  This 
study  addresses  the  question  of  why  companies  pay  discretionary  bonus  payments  to  their  execu¬ 
tives. 

First,  I  investigate  when  discretionary  payments  may  occur  in  an  optimal  incentive  contract 
using  an  analytical  model.  I  hypothesize  that  discretionary  bonuses  reflect  the  board’s  assessment 
of  non-contractible  performance  measures.  Therefore,  my  model  includes  not  only  a  contractible 
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performance  measure,  but  also  a  non-contractible  performance  measure/  The  model  is  based  in  a 
conventional  moral  hazard  setting  but  differs  from  traditional  agency  models,  which  only  include 
contractible  performance  measures.  The  model  developed  for  this  study  extends  Raj  an  and  Reich- 
elstein  (2009),  yielding  the  prediction  that  discretionary  bonus  payments  occur  when  the  outcome 
of  the  contractible  performance  measure  is  either  low  or  high,  but  not  when  the  outcome  of  the 
contractible  performance  measure  falls  in  the  medium  range. 

Baker  et  al.  (1994)  also  develop  a  model  that  includes  a  contractible  and  a  non-contractible 
performance  measure.  In  contrast  to  the  one-period  model  developed  in  this  study,  the  model  in 
Baker  et  al.  (1994)  is  based  in  an  infinite  horizon  setting.  Their  model  yields  the  prediction  that 
discretionary  bonuses  are  more  likely  to  be  used  when  the  contractible  measure  is  more  suscep¬ 
tible  to  manipulation. 

The  predictions  from  the  analytical  models  are  contrasted  in  the  empirical  analysis  with  a 
prediction  based  on  managerial  power  theory.  Managerial  power  theory  predicts  a  positive  rela¬ 
tionship  between  discretionary  bonus  payments  and  the  power  of  the  executives  in  the  company. 
Thus,  managerial  power  theory  suggests  that  discretionary  bonus  payments  are  more  likely  to 
occur  in  companies  with  weak  corporate  governance  structures.  According  to  managerial  power 
theory,  executives  who  have  considerable  power  over  the  board  of  directors  and  the  compensation 
committee  receive  compensation  above  the  level  explained  by  standard  economic  factors  alone 
(e.g.,  Lambert  et  al.  1993;  Boyd  1994;  Core  et  al.  1999;  Bebchuk  et  al.  2008).  Discretionary  bonus 
payments  may  be  one  way  in  which  powerful  executives  achieve  such  excess  pay  (Bebchuk  et  al. 
2002). 

I  test  the  hypotheses  using  a  sample  collected  from  public  sources.  The  data  are  gathered  from 
companies’  Forms  8-K  and  proxy  statements  starting  in  August  2004  when  the  updated  SEC 
regulation  regarding  the  disclosure  of  discretionary  bonus  payments  in  Forms  8-K  became  effec¬ 
tive.  This  regulation  requires  companies  to  disclose  the  amounts  of  discretionary  bonuses  paid  to 
the  top  five  executives.  Thus,  the  sample  includes  explicit  information  on  the  amount  of  discre¬ 
tionary  bonus  payments  over  the  period  from  August  2004  to  September  2006,  and  consists  of  234 
firm-years. 

The  results  of  the  analyses  provide  support  for  the  hypothesis  that  discretionary  bonuses 
reflect  implicit  incentive  contracts  based  on  non-contractible  performance  measures.  Specifically, 
discretionary  bonuses  are  incrementally  informative  about  future  financial  performance  after  con¬ 
trolling  for  current  period  financial  performance,  indicating  that  discretionary  bonuses  are  paid 
based  on  forward-looking  performance  measures  that  are  only  observable  to  the  contracting  parties 
and  that  are  likely  to  be  non-contractible.  The  results  also  support  the  prediction  derived  from  the 
model  that  companies  are  more  likely  to  pay  discretionary  bonuses  when  the  contractible  outcome 
falls  in  the  tails  of  the  distribution.  Conversely,  the  results  do  not  support  the  hypotheses  that  there 
is  a  positive  relationship  between  the  occurrence  of  discretionary  bonuses  and  the  manipulability 
of  the  contractible  measures  or  the  power  of  the  executives  in  the  companies.  Overall,  the  results 
support  the  notion  that  companies  use  discretionary  bonus  payments  in  a  way  that  is  consistent 
with  optimal  incentive  contracting. 

The  paper  contributes  to  the  literature  on  discretion  in  bonus  plans  in  several  ways.  First,  I 
provide  evidence  that  companies  are  more  likely  to  pay  discretionary  bonuses  when  the  contract- 


*  Examples  of  contractible  performance  measures  that  are  frequently  used  are  accounting-based  measures  such  as  earn¬ 
ings,  revenue,  or  cash  flow.  Examples  of  non-contractible  performance  measures  include  subjective  assessments  of  the 
agent’s  performance  by  the  principal:  whether  the  agent  negotiated  new  deals  with  major  customers  or  suppliers  that  are 
likely  to  pay  off  in  the  future;  whether  the  agent  implemented  important  strategic  initiatives;  whether  the  agent  worked 
well  as  a  team  member;  whether  the  agent  was  a  good  leader;  assessment  of  the  agent’s  initiative,  or  whether  the  agent 
trained  subordinates  well. 
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ible  performance  is  either  very  low  or  very  high.  The  model  I  develop  shows  that  this  finding  is 
consistent  with  optimal  incentive  contracting.  Second,  this  study  is  one  of  the  first  to  link  discre¬ 
tionary  bonus  payments  to  corporate  governance.  Although  shareholder  activists  and  the  press 
have  portrayed  discretionary  bonus  payments  as  a  form  of  rent  extraction,  researchers  have  not 
examined  the  relationship  between  discretionary  bonus  payments  and  executive  power.  Finally,  the 
sample  used  in  this  study  improves  upon  existing  empirical  work  by  using  an  explicit  measure  of 
discretion  for  companies  in  a  wide  array  of  industries. 

The  next  section  develops  the  analytical  model  and  the  hypotheses.  Section  III  describes  the 
sample  and  measures  used  in  the  empirical  analysis.  Section  IV  discusses  the  research  design  and 
the  empirical  results.  Section  V  provides  a  summary  and  conclusion. 

II.  ANALYTICAL  MODEL  AND  HYPOTHESES  DEVELOPMENT 
Economic  Theory 

Analytical  research  has  modeled  discretionary  bonus  payments  as  the  result  of  implicit  incen¬ 
tive  contracts  based  on  non-contractible  performance  measures  (e.g.,  Baiman  and  Rajan  1995).^ 
The  model  below  follows  that  approach.  In  order  to  link  the  empirical  analysis  to  the  theoretical 
work,  it  is  desirable  to  provide  support  for  the  modeling  approach  of  capturing  discretionary  bonus 
payments  as  the  result  of  non-contractible  performance  measures.  Since  implicit  incentive  con¬ 
tracts  used  in  firms  are  unobservable,  I  cannot  directly  test  whether  discretionary  bonus  payments 
are  indeed  paid  based  on  non-contractible  performance  measures. 

However,  Hayes  and  Schaefer  (2000)  have  introduced  an  approach  to  indirectly  test  whether 
the  performance  metrics  underlying  the  bonus  schemes  are  likely  to  be  non-contractible  in  nature. 
In  particular,  Hayes  and  Schaefer  (2000)  regress  future  financial  performance  on  current  cash 
compensation  and  on  current  financial  performance.  They  find  that  current  compensation  is  incre¬ 
mentally  informative  about  future  financial  performance  after  controlling  for  current  financial 
performance.  Their  findings  are  consistent  with  the  notion  that  companies  employ  performance 
metrics  that  are  only  observable  to  the  contracting  parties  and  that  are  correlated  with  future 
financial  performance. 

Hayes  and  Schaefer  (2000)  estimate  their  regression  using  the  sum  of  salary  and  bonus  as  the 
measure  of  current  compensation.  My  data  set  allows  a  separation  of  the  bonus  payment  into  its 
discretionary  and  formula-based  components.  Since  the  formula-based  portion  of  the  bonus  pay¬ 
ment  is  expected  to  be  based  on  performance  metrics  such  as  accounting  earnings,  I  expect  that  the 
formula-based  bonus  component  is  not  incrementally,  informative  about  future  financial  perfor¬ 
mance.  In  contrast,  I  hypothesize  that  the  discretionary  bonus  components  are  based  on  non- 
contractible  performance  measures,  which  are  likely  to  be  correlated  with  future  performance  and 
unobservable  to  outside  parties.  In  other  words,  I  hypothesize  that  the  discretionary  bonus  com¬ 
ponents  are  incrementally  informative  about  future  financial  performance.  These  arguments  lead  to 
my  first  hypothesis.  (All  hypotheses  are  stated  in  the  alternative  form). 

HI:  The  discretionary  components  of  bonus  payments  are  incrementally  informative  about 
future  financial  performance,  whereas  the  formula-based  components  are  not. 

In  order  to  derive  predictions  regarding  the  occurrence  of  discretionary  bonus  payments  under 
optimal  incentive  contracting,  I  interpret  discretionary  bonuses  as  the  result  of  implicit  incentive 
contracts  based  on  non-contractible  performance  measures.  By  definition,  non-contractible  perfor- 


^  Prior  empirical  studies  that  have  analyzed  discretion  in  bonus  plans  include  Murphy  and  Oyer  (2003)  and  Gibbs  et  al. 
(2004). 

^  Implicit  contracts  are  agreements  that  are  not  legally  enforceable.  They  are  sustained  through  economic  mechanisms  and 
constitute  equilibria  in  a  game-theoretic  sense  (Bull  1987). 
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mance  measures  are  unverifiable,  such  that  it  is  impossible  to  write  explicit  incentive  schemes 
based  on  such  metrics.  Thus,  in  order  to  credibly  incorporate  non-contractible  information,  the 
principal  has  to  resort  to  an  economic  enforcement  mechanism. 

One  such  mechanism  is  for  the  principal  to  commit  to  paying  out  a  fixed  bonus  pool  that  is 
independent  of  the  outcome  of  the  non-contractible  performance  measure  (MacLeod  2003;  Raj  an 
and  Reichelstein  2006;  Rajan  and  Reichelstein  2009).  The  implicit  agreement  between  the  con¬ 
tracting  parties  specifies  how  the  bonus  pool  will  be  divided  between  the  agent  and  a  third  party, 
depending  on  the  realization  of  the  non-contractible  signal."^  Since  the  principal  is  ex  post  indif¬ 
ferent  as  to  the  recipient  of  the  bonus  pool,  the  presumption  is  that  she  will  follow  the  implicit 
contract  that  was  agreed  upon  ex  ante. 

Rajan  and  Reichelstein  (2009;  hereafter  RR)  also  analyze  a  setting  where,  in  addition  to  the 
non-contractible  performance  measure,  the  principal  also  has  access  to  a  contractible  signal.  This 
setting  is  particularly  relevant  for  my  empirical  setting  since,  aside  from  the  discretionary  pay¬ 
ments,  the  companies  that  comprise  my  sample  also  have  formal  bonus  plans  in  place  that  are 
largely  based  on  financial,  and  therefore  contractible,  performance  measures. 

The  following  analysis  builds  on  the  model  analyzed  in  RR.  In  particular,  the  principal  seeks 
to  implement  a  given  action,  a^,  which  is  more  costly  for  the  agent  than  a  less  productive  action, 
aK  The  principal  can  use  both  a  contractible  performance  measure,  x,  and  a  non-contractible 
performance  measure,  y,  in  designing  the  contract.  Both  performance  measures  are  informative 
about  the  agent’s  action  and  the  outcomes  of  both  measures  are  binary,  i.e.,  x  e  {x' ,  x^}  and  y 
e  {y' ,  y^}.  Moreover,  the  monotone  likelihood  ratio  property  (MLRP)  is  assumed  to  hold  for  both 
performance  measures.^ 

The  analysis  in  RR  is  restricted  to  the  case  for  which  the  two  performance  measures  are 
independent  (conditional  on  a  given  action).  In  contrast,  I  consider  the  more  general  setting  where 
the  two  metrics  may  be  conditionally  dependent.  For  example,  in  many  scenarios  it  seems  rea¬ 
sonable  to  expect  the  contractible  and  non-contractible  measures  to  be  positively  correlated.  In 
particular,  a  positive  macroeconomic  shock  may  result  in  a  strong  accounting  performance  and 
may  also  lead  to  a  favorable  subjective  assessment  of  the  agent’s  performance  since  factors  such 
as  strategic  initiatives  and  new  customers  are  likely  to  be  influenced  by  the  overall  economic 
environment  as  well. 

Thus,  following  RR,  the  contractible  measure  reflects  the  agent’s  action  in  the  following  way: 

p\  =  Prob[x  =  x^|a  =  =  Prob[x  =  x^|a  =  a^] . 

In  contrast  to  the  setup  in  RR,  the  probability  of  observing  a  high  outcome  on  the  non-contractible 
measure,  given  that  the  agent  took  action  a\  may  depend  on  the  realization  of  the  contractible 
measure  x.  In  particular: 

=  Prob[y  =  y^\a^  a‘,x  =  x^] , 
where  i  ,j  g  {I  ,h}. 

The  remaining  structural  elements  follow  the  setup  in  RR.  In  particular,  the  bonus  pool 
corresponding  to  the  contractible  outcome  j  is  denoted  by  m^.  The  payment  that  the  agent  receives 
for  contractible  outcome  j  and  non-contractible  outcome  k  is  denoted  by  s^^,  which  satisfies  the 
inequality  wJ  >  maxjy^ ,  s^^}.  Any  remaining  difference  between  the  bonus  pool  and  the  payment 


''  One  interpretation  of  the  third  party  is  a  charity.  More  generally,  the  third  party  can  be  thought  of  as  any  agent  (inside 
or  outside  the  organization)  that  is  not  involved  in  the  contracting  relationship. 

^  (fl^) 

In  the  context  of  my  model,  the  density  pj{a)  satisfies  the  MLRP  if  is  monotone  decreasing  in  j.  The  MLRP  implies 
that  higher  outcomes  of  the  performance  measure  are  indicative  of  higher  effort  on  the  part  of  the  agent. 
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to  the  agent,  (u^  -  5^^),  is  diverted  to  the  third  party.  Moreover,  the  risk- averse  agent  is  assumed  to 
have  additively  separable  preferences  over  wealth,  U{-)  and  cost  of  effort,  e(-)  and  the  principal’s 
optimization  problem  can  be  formulated  as  follows: 

Program  PI: 


Min[w^  ■  pI  +  ■  {I  -pI)] 


subject  to: 

(i)  E[Uisj'^)\a^]-e{a^)  >  U, 

(ii)  E[U{sj’^)  I  a^]  -  e{a^)  >  E[U{s^'^)  \  a']  -  e(aO, 

(iii)  for  all  j  ,k:  >  0. 

The  non-contractible  performance  measure  is  said  to  be  used  if  the  payment  to  the  agent  depends 
on  the  outcome  of  y.  Specifically,  the  non-contractible  measure  is  said  to  be  used  for  contractible 
outcome  jc  =  if  the  solution  to  PI  is  such  that  Min{s^^  ,  <  vW;  the  non-contractible  measure  is 
said  to  not  be  used  for  contractible  outcome  x  =  x^  if  the  solution  to  PI  is  such  that 

Min{sJ^  ,  =  vW. 

RR  identify  necessary  and  sufficient  conditions  for  the  non-contractible  measure  to  optimally 
be  used  for  the  low-contractible  outcome  (RR,  Proposition  1  and  Observation  1).  Since  the  setting 
in  RR  is  a  special  case  of  the  setting  analyzed  here  (i.e.,  it  follows  directly  that 

there  exist  circumstances  where  the  optimal  contract  for  the  present  setup  is  such  that  the  non- 
contractible  measure  is  used  for  the  low-contractible  outcome. 

In  contrast  to  the  findings  in  RR,  the  following  result  shows  that,  if  the  assumption  of 
conditional  independence  is  relaxed,  it  is  optimal  for  the  principal  to  employ  the  non-contractible 

indicator  for  the  high-contractible  outcome  in  certain  circumstances.  Let  =  — f  and 

^  ^  ^  Hlh 

=  where  ,Q^  ^  (1  ,  °°)-  and  reflect  the  informativeness  of  the  non-contractible 
performance  measure  for  the  low-  and  high-contractible  outcome,  respectively.  I  also  use  the 
notation: 


p*  — 


where: 


1  -r 


J-  e{a^)  ■  Ph- e{a^)  ■  p[\ 

^  /  -  e{a^)  ■  {I  - p[)  -  e{d)  ■  {\  - pt)\ 

i  (p^pi)  ' 

and  V{-)  represents  the  derivative  of  the  inverse  of  the  agent’s  utility  function,  [/“K  •  )■  The  result 
identifies  sufficient  conditions  for  the  non-contractible  performance  measure  to  optimally  be  used 
for  the  high  outcome  of  x.  All  proofs  are  in  Appendix  A. 

Proposition  1:  The  non-contractible  performance  measure,  y,  is  used  when  the  outcome  of 
the  contractible  performance  measure,  x,  is  high  as  long  as  the  following 
conditions  are  satisfied: 
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Qfi>  p*  . 


>Q^. 


pA^-p[) 

-Ph-i^ -Ph)- 


(1) 


Following  RR,  the  intuition  for  this  result  is  that  using  the  non-contractible  performance 
measure  in  the  incentive  scheme  entails  an  additional  agency  cost  for  the  principal  since  she  has 
to  divert  money  to  a  third  party  with  positive  probability.  Thus,  the  principal  will  only  use  the 
non-contractible  metric  if  doing  so  is  sufficiently  gainful  and  she  will  use  it  in  a  way  that  is  most 
beneficial.  When  the  conditions  in  (1)  are  satisfied,  the  non-contractible  signal  for  the  high- 
contractible  outcome  is  sufficiently  informative  such  that  the  principal  finds  it  beneficial  to  use  the 
non-contractible  metric  for  the  high-contractible  outcome. 

Having  established  that  the  non-contractible  performance  measure  may  optimally  be  used  for 
either  the  low  or  the  high  outcome  of  the  contractible  measure  in  the  binary  outcome  case,  to 
deterrmne  whether  the  non-contractible  signal  would  ever  be  used  for  intermediate  values  of  the 
contractible  measure,  I  next  allow  three  possible  contractible  performance  measure  outcomes.^ 
The  setup  of  the  model  is  the  same  as  before,  except  now  jc  g  {x' ,  x"* ,  x^}.^  The  distributions  of  x 
are  now  given  by: 


p'j  =  Prob[x  =  x-'ja  =  a'] 

where  i  e  {/,/?}  and;  e  {l,m,h}.  Analogous  to  the  case  where  the  contractible  performance  measure 
has  two  outcomes,  the  probability  that  the  outcome  of  the  non-contractible  performance  measure 
is  high  is  denoted  by  where,  now,  i  e  {l,h]  and  j  e  {l,m,h}.  The  principal’s  optimization 
problem  now  becomes: 

Program  P2: 


subject  to: 

(i)  E[U{sj’^)\a>^]-eia^)>U, 

(ii)  £[f/(y^)  I  a^]  -  e{a>^)  >  £[f/(^A)  |  «/]  _ 

(iii)  for  all  j,  k:  >  0  (six  constraints). 

Proposition  2  characterizes  sufficient  conditions  for  the  non-contractible  metric  not  to  be  used 
when  the  contractible  measure  takes  on  the  intermediate  value.  To  represent  the  informativeness  of 
the  non-contractible  performance  measure  for  the  intermediate  outcome  of  the  contractible  mea- 

sure,  lete"=7%. 

Proposition  2:  When  the  contractible  performance  measure,  x,  has  three  possible  outcomes, 
the  non-contractible  performance  measure,  y,  is  not  used  for  the  intermediate 
outcome  of  the  contractible  performance  measure  if  >  Q^. 

As  an  alternative  to  the  condition  in  Proposition  2,  consider  the  following  case.  It  seems 
plausible  that  the  probability  that  the  principal  infers  a  high  subjective  performance  may  be 
increasing  in  the  outcome  of  the  contractible  measure  in  the  sense  that  and 

-  (iLh  -  ^\h-  These  conditions  imply  that  x  and  y  are  positively  correlated.  As  discussed  earlier,  a 


®  I  restrict  attention  to  the  case  where  the  contractible  performance  measure  has  three  outcomes  since  the  ultimate  goal  is 
to  derive  predictions  for  the  empirical  analysis.  The  case  where  the  contractible  performance  measure  has  n  outcomes 
adds  little  insight  for  the  empirical  predictions  and  would  complicate  the  analysis. 

In  particular,  the  agent’s  action  and  the  outcome  of  the  subjective  performance  measure  are  still  binary  and  the  MLRP 
is  again  assumed  for  both  performance  measures. 
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positive  macroeconomic  shock  could  yield  both  a  strong  accounting  performance  and  a  favorable 
subjective  assessment  of  the  agent’s  performance. 

A  second  interpretation  of  the  conditions  and  q\^  >  q^^^  ^  q\i^  is  that  the 

subjective  performance  evaluation  technology  is  better  at  reflecting  the  agent’s  action  when  the 
contractible  outcome  more  accurately  reflects  the  action  taken  by  the  agent.  Stated  differently, 
q^^^  >  indicates  that  the  degree  to  which  the  subjective  performance  evaluation  technol¬ 

ogy  (correctly)  yields  a  favorable  assessment  of  the  agent’s  performance  is  increasing  in  the 
contractible  outcome.  Analogously,  q^^^  >  q^^^  >  q\^  indicates  that  the  degree  to  which  the  subjec¬ 
tive  performance  evaluation  technology  (correctly)  yields  an  unfavorable  assessment  of  the  agent’s 
performance  is  higher  when  the  contractible  outcome  is  lower. 

The  following  analysis  shows  that  if  we  make  the  additional  assumption  that  the  improvement 
in  the  subjective  performance  evaluation  technology  described  above  is  non-decreasing,  i.e., 

and  then  we  also  obtain  the  result  Aat  the 

non-contractible  measure  is  not  used  for  intermediate  values  of  the  contractible  metric.  In  par¬ 
ticular,  Appendix  B  summarizes  the  results  of  Monte  Carlo  simulations  that  have  been  conducted 
in  order  to  characterize  the  optimal  incentive  contract  in  this  setting.  In  running  the  simulations,  I 
follow  the  prior  literature  (e.g.,  Baiman  et  al.  1995;  Baiman  and  Rajan  1995;  Rajan  and  Reichel- 
stein  2006,  2009)  in  choosing  the  form  of  the  agent’s  utility  function  and  the  parameter  spaces  for 
e(aO,  e(a^),  and  U.  Specifically,  I  assume  that  the  agent  has  a  power  utility  function  of  the  form 
U{s)  =  The  simulation  results  indicate  that,  for  the  chosen  parameter  spaces,  the  non- 

contractible  signal  is  not  used  when  the  contractible  signal  takes  on  the  intermediate  value.®  The 
simulations  also  indicate  that  the  situations  where  the  non-contractible  measure  is  used  for  the 
low-contractible  outcome  are  more  frequent  than  the  situations  where  the  non-contractible  signal 
is  used  for  the  high-contractible  outcome. 

The  result  in  Proposition  2  and  the  simulation  analysis  forms  the  basis  for  the  next  prediction: 


H2:  Discretionary  bonus  payments  are  more  likely  to  occur  when  the  outcome  of  the  con¬ 
tractible  performance  measure  is  either  low  or  high  than  when  the  outcome  of  the 
contractible  performance  measure  falls  in  the  medium  range. 

The  model  described  above  focuses  on  the  principal’s  committing  to  a  bonus  pool  as  the 
mechanism  that  enforces  the  implicit  contract.  Furthermore,  prior  literature  has  proposed  alterna¬ 
tive  mechanisms  that  can  be  employed  to  credibly  incorporate  subjective  information  into  incen¬ 
tive  contracts.  In  particular.  Baker  et  al.  (1994)  analyze  the  interplay  of  contractible  and  non- 
contractible  performance  measures  in  a  setting  with  repeated  interaction  between  the  principal  and 
the  agent,  where  the  implicit  contract  is  enforced  by  the  principal’s  concern  for  her  reputation.  In 
their  model.  Baker  et  al.  (1994)  focus  on  trigger  strategies  as  the  equilibrium  concept,  which 
stipulates  that  the  principal  can  never  again  employ  the  non-contractible  metric  once  she  has 
reneged  on  the  implicit  contract.  Thus,  in  deciding  whether  to  honor  the  implicit  contract,  the 
principal  compares  the  firm’s  future  profits  when  both  measures  can  be  used  for  contracting  to  the 


9 

10 


The  conditions  qlh  — — ‘ihh  — ‘Imh  — ‘}\h’  9mi)  —  ?»i)>  (‘7m  —  9;/,)  (which  are 

statements  for  a  given  action  t^en  by  the  agent)  together  imply  that  the  informativeness  of  the  non-contractible  measure 
(which  is  defined  across  the  two  possible  actions)  is  not  highest  for  the  intermediate-contractible  outcome.  Stated 
differently,  the  conditions  imply  that  either  and/or  Q^>Q^.  Intuitively,  the  subjective  performance  evalua¬ 

tion  technology  is  so  accurate  at  identifying  the  low  action  for  the  low-contractible  outcome  and  at  identifying  the  high 
action  for  the  high-contractible  outcome  that  the  informativeness  of  the  subjective  signal  will  be  highest  in  one  or  both 


of  the  tails. 

In  future  research,  it  would  be  desirable  to  explore  whether  this  result  can  also  be  shown  analytically. 

Other  papers  that  analyze  implicit  contracts  in  infinite  horizon  settings  include  Bull  (1987),  Pearce  and  Stacchetti 
(1998),  Levin  (2003),  and  Budde  (2007). 
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profits  that  can  be  generated  when  only  the  contractible  measure  remains  credible.  The  authors 
show  that,  when  the  contractible  performance  measure  is  less  “distorted,”  this  difference  in  future 
profits  is  smaller,  and  the  non-contractible  measure  becomes  less  attractive  for  contracting  pur¬ 
poses.  Therefore,  the  optimal  incentive  contract  is  such  that  the  weight  placed  on  the  non- 
contractible  measure  is  increasing  in  the  “distortion”  of  the  contractible  measure.** 

Baker  et  al.  (1994)  capture  the  quality  of  the  contractible  performance  measure  by  the  vari¬ 
ance  of  the  marginal  product,  which  maps  the  agent’s  action  into  the  performance  measure  out¬ 
come.  In  the  model,  the  agent  receives  private  information  regarding  the  marginal  product  before 
choosing  his  action.  The  authors  interpret  a  measure  with  a  higher- variance  marginal  product  as  a 
performance  measure  that  is  more  “distorted”  and  that  is  associated  with  a  higher  level  of  dys¬ 
functional  behavior.  Along  those  lines,  prior  empirical  work  has  interpreted  the  variance  of  the 
marginal  product  in  the  Baker  et  al.  (1994)  model  as  a  performance  measure’s  susceptibility  to 
manipulation  (Gibbs  et  al.  2004),  and  the  potential  for  earnings  management  (Bushman  et  al. 
1996).  For  the  purpose  of  this  study,  I  follow  these  existing  empirical  studies  and,  given  that  the 
vast  majority  of  companies  use  a  measure  of  accounting  profits  in  their  bonus  plans  (e.g.,  Murphy 
2000),  interpret  the  distortion  of  the  contractible  metric  as  the  manipulability  of  the  company’s 
earnings.  This  leads  to  the  following  hypothesis: 

H3:  Discretionary  bonus  payments  are  more  likely  to  occur  when  the  company’s  earnings  are 
more  susceptible  to  manipulation. 

Managerial  Power  Theory 

The  preceding  hypotheses  are  based  on  the  notion  that  discretionary  bonus  payments  are  an 
outcome  of  optimal  incentive  contracts.  Alternatively,  discretionary  bonus  payments  may  consti¬ 
tute  managerial  rent  extraction  stemming  from  governance  limitations  in  the  companies.  In  other 
words,  discretionary  bonus  payments  could  conceivably  occur  in  settings  where  management  has 
considerable  power  over  the  board  of  directors  and  the  compensation  committee. 

Prior  empirical  studies  have  documented  that  measures  of  CEO  power  are  associated  with 
several  compensation  patterns  that  can  be  interpreted  as  rent  extraction.  For  example.  Core  et  al. 
(1999)  document  that  CEO  power  is  positively  related  to  excessive  pay  levels  (i.e.,  compensation 
above  the  level  explained  by  standard  economic  factors).  Moreover,  Bebchuk  et  al.  (2008)  show 
that  powerful  CEOs  are  more  likely  to  receive  a  “lucky”  option  grant  at  the  lowest  price  of  the 
month  and  that  they  are  more  likely  to  be  rewarded  for  luck  due  to  industry-wide  shocks. 

Discretionary  bonus  payments  may  be  one  way  in  which  powerful  executives  extract  rents 
from  their  companies.  Bebchuk  et  al.  (2002)  argue  that  an  important  factor  in  executives’  ability  to 
extract  rents  is  the  amount  of  “outrage”  that  a  proposed  pay  package  would  create.  If  the  outrage 
over  a  pay  package  is  sufficiently  high  and  widespread,  then  it  imposes  significant  costs,  which 
limit  the  amount  of  rents  that  the  executives  can  extract.  Bebchuk  et  al.  (2002)  argue  further  that 
the  outrage  costs  depend  on  the  extent  to  which  the  rent  extraction  can  be  easily  and  distinctly 
identified.  The  authors  argue  that,  under  the  managerial  power  approach,  executives  prefer  com¬ 
pensation  structures  that  enable  the  extraction  of  rents  to  be  camouflaged  as  optimal  contracting. 

As  suggested  in  the  preceding  section,  under  certain  conditions,  discretionary  bonus  payments 
will  be  part  of  the  optimal  incentive  contract.  Moreover,  the  incentive  plans  that  shareholders 


Baker  et  al.  (1994)  obtain  the  result  that  the  weight  placed  on  the  non-contractible  performance  measure  decreases  with 
the  distortion  of  the  contractible  measure  in  the  case  that  the  firm  has  to  shut  down  after  the  firm  reneges  on  the  implicit 
contract.  However,  this  case  is  unlikely  to  reflect  my  empirical  setting  since  I  would  expect  that  companies  can  still 
generate  positive  returns  when  restricted  to  using  contractible  measures  such  as  accounting  earnings  and  the  company’s 
stock  price  in  incentive  contracting. 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


Discretion  in  Bonus  Plans 


1929 


approve  typically  allow  for  the  payment  of  discretionary  bonuses.  Stated  differently,  companies 
can  usually  pay  discretionary  bonuses  without  an  additional  approval  of  shareholders.  Therefore, 
it  is  difficult  for  outsiders  to  determine  whether  a  discretionary  bonus  payment  is  paid  in  accor¬ 
dance  with  the  optimal  incentive  contract  or  whether  the  discretionary  payment  constitutes  rent 
extraction.  Following  the  arguments  provided  by  Bebchuk  et  al.  (2002),  discretionary  bonus  pay¬ 
ments  may  be  an  effective  way  for  powerful  executives  to  extract  rents.  Hypothesis  4  follows  from 
this  argument: 

H4:  Discretionary  bonus  payments  are  more  likely  to  occur  in  companies  where  management 
has  considerable  power. 

III.  SAMPLES  AND  MEASURES 

I  define  “discretionary  bonuses”  as  payments  that  would  not  have  been  paid  according  to  the 
explicit  contract  entered  into  before  the  agent’s  actions  are  taken.  This  definition  includes  bonuses 
that  are  paid  according  to  implicit  incentive  contracts  based  on  non-contractible  performance 
measures.  The  definition  excludes  bonuses  that  are  paid  according  to  explicit  incentive  contracts 
based  on  contractible  performance  measures.  In  addition,  the  definition  excludes  payments  such  as 
signing  bonuses,  severance  payments,  and  guaranteed  minimum  payments  that  are  based  on  ex¬ 
plicit  contractual  agreements. 

Sample 

The  sample  used  in  this  study  is  collected  from  public  sources  and  consists  of  two  sub¬ 
samples:  (1)  a  sample  of  companies  that  paid  discretionary  bonuses  (“discretionary  sample”),  and 
(2)  a  control  sample  that  did  not  make  these  types  of  payments  to  executives  (“non-discretionary 
sample”). 

The  discretionary  sample  is  collected  by  taking  advantage  of  the  updated  SEC  regulation 
regarding  disclosure  requirements  in  Forms  8-K  that  became  effective  in  August  2004.  In  particu¬ 
lar,  companies  are  required  to  disclose  payments  of  material  cash  bonuses  if  the  bonuses  are  not 
paid  in  accordance  with  the  bonus  formula.  Since  companies  may  disclose  the  required  infor¬ 
mation  in  a  subsequent  proxy  statement  if  they  fail  to  file  a  Form  8-K,  the  discretionary  sample 
consists  of  companies  that  disclose  discretionary  bonus  payments  in  either  a  Form  8-K  or  a  proxy 
statement  between  August  23,  2004  and  September  20,  2006.^^  The  sample  is  constructed  based  on 
a  keyword  search  of  Forms  8-K  and  proxy  statements  contained  in  the  database  “Lexis/Nexis 
Research”  using  several  different  search  strings.'"^  To  be  included  in  the  discretionary  sample,  the 
company  must  disclose  a  dollar  amount  for  the  discretionary  bonus  payment  in  the  Form  8-K  or 
proxy  statement.  The  final  discretionary  sample  consists  of  100  firm-years  from  the  search  of 
Forms  8-K  and  134  firm-years  from  the  search  of  proxy  statements  for  a  total  of  234  firm-years 
and  211  firms. 

The  non-discretionary  sample  consists  of  companies  that  do  not  disclose  information  about 
discretionary  bonus  payments  in  either  a  Form  8-K  or  a  proxy  statement  over  the  period  between 
August  23,  2004  and  September  20,  2006.  Since  some  of  the  tests  require  information  on  the  target 
bonus,  the  non-discretionary  sample  is  constructed  by  applying  several  search  strings  related  to 


More  precisely,  according  to  a  statement  by  the  Division  of  Corporation  Finance  of  the  SEC  dated  November  23,  2004: 
“Payment  of  a  cash  bonus  must  be  disclosed  under  Item  1.01  of  Form  8K  if  the  registrant  exercised  discretion  to  pay  the 
bonus  even  though  the  specified  performance  criteria  were  not  satisfied.” 

More  precisely,  if  companies  fail  to  timely  file  a  Form  8-K,  a  safe  harbor  rule  applies  as  long  as  the  information  is 
disclosed  in  a  Form  10-Q  or  a  Form  10-K  relating  to  the  period  in  which  the  event  occurred. 

I  searched  the  Forms  8-K  and  proxy  statements  using  keywords  such  as  “discretionary  bonus,  special  bonus, 
“additional  bonus,”  “discretionary  award,”  “discretionary  cash  award,”  etc. 
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target  bonuses  to  the  proxy  statements  contained  in  “Lexis/Nexis  Research”  that  were  not  identi¬ 
fied  by  Lexis/Nexis  when  constructing  the  discretionary  sample.  To  be  included  in  the  non¬ 
discretionary  sample,  the  company  must  disclose  the  level  of  either  the  target  bonus  or  the  maxi¬ 
mum  bonus.  Moreover,  in  order  to  minimize  the  influence  of  signing  bonuses,  documents  where 
the  managers  had  a  different  position  in  the  year  before  are  also  excluded  from  the  sample.  For  a 
subset  of  50  companies  in  the  non-discretionary  sample,  I  extensively  searched  the  proxy  state¬ 
ment  and  the  Forms  8-K  for  information  regarding  discretionary  bonuses.  Using  several  broad 
keyword  searches,  I  could  not  find  any  information  regarding  the  payment  of  discretionary  bo¬ 
nuses.  The  final  non-discretionary  sample  consists  of  349  firm-years  and  279  firms.  The  final  total 
sample  consists  of  583  firm-years  and  490  firms.  Table  1  provides  further  details  on  the  sample 
collection  process. 

Measures 

Bonus  Payment  Components 

Total  bonus  payment  is  measured  by  the  sum  of  the  formula-based  bonus  and  the  discretion¬ 
ary  bonus,  scaled  by  salary  (TOTALBONUS),  and  its  formula-based  and  discretionary  components 
are  denoted  by  (FORMULABONUS)  and  (DISC BONUS).  Tests  that  concern  the  occurrence  of 
discretionary  bonus  payments  are  estimated  using  an  indicator  variable  that  is  set  equal  to  1  if  a 
discretionary  bonus  was  paid,  and  0  otherwise  (DISCIND). 

Financial  Performance 

The  first  hypothesis  predicts  that  discretionary  bonus  payments  are  predictive  of  future  finan¬ 
cial  performance  after  controlling  for  current  financial  performance.  Similar  to  the  measures  used 
in  Hayes  and  Schaefer  (2000),  I  use  the  firm’s  return  on  assets  in  year  t+l  (ROA,+i)  as  the  measure 
of  future  financial  performance.  To  measure  the  firm’s  current  performance,  I  employ  the  current 
year’s  return  on  assets  {ROA^,  stock  return  {RET),  and  log  of  sales  (log  SALES). 

Outcome  of  Contractible  Performance  Measures 

Hypothesis  2  requires  a  measure  of  the  outcome(s)  of  the  contractible  performance  mea- 
sure(s).  I  measure  this  construct  in  two  ways:  (1)  using  the  bonus  payment  that  was  made  accord¬ 
ing  to  the  bonus  formula  as  a  percentage  of  the  target  bonus  (FBONUS),  and  (2)  using  the  change 
in  eamings-per-share  (AEPS). 

Using  the  formula-based  bonus  payment  seems  natural  since  there  is  a  one-to-one  mapping 
between  the  outcome(s)  of  the  performance  measure(s)  used  in  the  bonus  formula  and  the  bonus 
payment  based  on  the  outcome(s).  A  limitation  of  the  measure  FBONUS  is  that  its  value  is 
influenced  by  how  the  target  bonus  is  set.  Prior  empirical  studies  have  provided  evidence  suggest¬ 
ing  that  the  performance  target  corresponding  to  the  target  bonus  may  not  necessarily  reflect  an 
unbiased  performance  expectation.  In  particular,  using  various  settings  and  samples,  studies  have 
documented  that  (1)  actual  performance  exceeds  targeted  performance  in  the  majority  of  cases 
(Merchant  and  Manzoni  1989;  Leone  and  Rock  2002;  Indjejikian  and  Nanda  2002),  (2)  perfor¬ 
mance  target  revisions  follow  certain  patterns  (Leone  and  Rock  2002;  Indjejikian  and  Nanda  2002; 
Indjejikian  and  Matejka  2006),  and  (3)  performance  targets  may  be  based  on  peer  groups,  timeless 
standards,  or  the  company’s  cost  of  capital  (Murphy  2000).  Because  of  this  limitation,  I  follow 


I  used  keywords  such  as  “target  bonus”  or  “maximum  bonus.” 
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TABLE  1 
Sample  Collection 

I.  Companies  that  Paid  Discretionary  Bonuses  (“discretionary  sample”) 

1.  a.  Forms  8-K 

Documents  identified  by  Lexis/Nexis  1164 

No  discretionary  bonus  was  paid  854 

Employment  agreement,  separation  agreement,  promotion  agreement  83 

Not  clear  whether  bonus  is  discretionary  88 

Proxy  statement  not  available  32 

Not  a  named  officer  _ 7_ 

Final  Firm- Years  100 

I.  b.  Proxy  Statements 

Documents  identified  by  Lexis/Nexis  854 

No  discretionary  bonus  was  paid  472 

Employment  agreement,  separation  agreement,  promotion  agreement  56 

Not  clear  whether  bonus  is  discretionary  137 

Already  in  8-K  Sample  55 

Final  Firm- Years  134 

I.  c.  Final  Discretionary  Sample 

Final  Firm- Years  234 

Final  Firms  211 

II.  Companies  That  Did  Not  Pay  Any  Discretionary  Bonuses  (“non-discretionary  sample”) 

Documents  identified  by  Lexis/Nexis  952 

Amount  of  target/maximum  bonus  not  available  470 

Manager  had  a  different  position  in  the  prior  year  133 

Final  Firm- Years  349 

Final  Firms  279 

III.  Total  Sample 

III.  a.  Firm- Years 

Companies  that  paid  discretionary  bonuses  234 

Companies  that  did  not  pay  any  discretionary  bonuses  ^  349 

Final  Firm- Years  383 

III.  b.  Firms 

Companies  that  paid  discretionary  bonuses  211 

Companies  that  did  not  pay  any  discretionary  bonuses  279 

Final  Firms 


prior  empirical  studies  (Sloan  1993;  Core  et  al.  2003)  and  also  conduct  the  analyses  using  the 
change  in  EPS. 

Manipulability  of  Earnings 

Hypothesis  3  predicts  that  discretionary  bonuses  are  more  likely  to  occur  when  a  company’s 
earnings  are  more  susceptible  to  manipulation.  I  measure  this  construct  using  a  variable  that 
captures  the  volatility  of  the  company’s  accruals.  More  precisely,  I  calculate  the  standard  deviation 
of  the  scaled  total  accruals  over  the  prior  five  years  {SDTA).  This  measure  captures  the  spirit  of  the 
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earlier  discretionary  accruals  models  (Healy  1985;  DeAngelo  1986).*^  Moreover,  accrual  volatility 
is  correlated  with  the  measure  of  accrual  quality  that  is  proposed  by  Dechow  and  Dichev  (2002, 
48).*^  Companies  with  higher  levels  of  accrual  volatility  are  expected  to  have  earnings  that  are 
more  susceptible  to  manipulation. 

Executive  Power 

Hypothesis  4  requires  a  measure  of  executive  power.  I  employ  the  measure  “CEO  pay  slice” 
{CPS),  which  is  defined  as  the  percentage  of  the  total  compensation  paid  to  the  top  five  executives 
that  goes  to  the  CEO.  This  measure,  introduced  by  Bebchuk  et  al.  (2008),  has  received  increasing 
attention  from  the  corporate  governance  literature.  Bebchuk  et  al.  (2008)  show  that  this  measure  is 
related  to  the  odds  of  awarding  “lucky”  option  grants  at  the  lowest  price  of  the  month  and  to  the 
tendency  to  reward  the  CEO  for  luck  due  to  positive  industry-wide  shocks.  CPS  is  expected  to 
reflect  the  power  and  influence  of  the  CEO  in  the  organization,  meaning  that  CEOs  with  a  higher 
CPS  are  expected  to  have  a  greater  influence  on  the  compensation-setting  process.  With  respect  to 
my  empirical  setting  with  a  relatively  small  sample,  this  measure  also  has  the  advantage  that  it  can 
easily  be  computed  from  a  company’s  compensation  table  and  does  not  require  additional  data 
sources. All  measures  are  defined  and  summarized  in  Table  2. 

Descriptive  Statistics 

Table  3  provides  descriptive  statistics  for  the  variables  as  they  are  used  in  the  analyses.  Panel 
A  provides  descriptive  statistics  for  the  discretionary  subsample  and  Panel  B  provides  descriptive 
statistics  for  the  non-discretionary  subsample. 

The  executives  in  the  companies  that  made  discretionary  payments  experience  increases  in 
their  relative  bonus  payments  from  the  prior  year,  as  reflected  in  the  positive  mean  and  median  of 
(ATOTALBONUS).  However,  a  closer  look  reveals  that  the  median  increase  in  the  formula-based 
bonus  component  (AFORMULABONUS)  is  zero.  Thus,  the  executives  receive  higher  total  bonus 
payments,  on  average,  as  compared  to  the  prior  year  because  of  the  discretionary  bonus  compo¬ 
nent. 

The  descriptive  statistics  for  the  measures  reflecting  financial  performance  indicate  that,  in  the 
year  for  which  the  discretionary  bonuses  were  paid,  the  companies,  on  average,  experience  growth 
in  sales  and  ROA  but  a  decline  in  market  returns.  Moreover,  the  companies  experience  further 
growth  in  ROA  in  the  year  following  the  discretionary  payments. 

The  descriptive  statistics  for  FBONUS,  which  captures  the  ratio  of  (formula  bonus/target 
bonus),  indicate  that  the  executives  in  the  companies  that  comprise  the  discretionary  sample,  on 
average,  do  not  achieve  their  targeted  performance.  Conversely,  the  executives  in  the  companies 
that  do  not  pay  discretionary  bonuses  on  average  achieve  a  performance  around  the  target  level. 
The  mean  (median)  values  for  FBONUS  are  0.84  (0.83),  and  1.06  (1.00)  for  the  discretionary  and 
non-discretionary  sample,  respectively.  In  contrast  to  the  descriptive  statistics  for  FBONUS,  the 
descriptive  statistics  for  the  change  in  EPS  suggest  that  the  performance  is  comparable  across  the 
two  samples. 


Estimating  more  sophisticated  discretionary  accrual  models  such  as  the  Jones  (1991)  model  would  result  in  a  high 
proportion  of  missing  data  points  in  my  relatively  small  sample. 

Dechow  and  Dichev  (2002)  focus  on  working  capital  accruals.  This  study  focuses  on  total  accruals  due  to  missing  data 
problems  in  calculating  working  capital  accruals  for  my  relatively  small  sample.  As  discussed  by  Dechow  and  Dichev 
(2002),  the  notion  of  accrual  quality  differs  from  the  concept  of  discretionary  accruals.  However,  the  “distortion”  in 
contractible  performance  measures  used  in  the  model  in  Baker  et  al.  (1994)  could  also  be  interpreted  such  that  higher 
“distortion”  is  related  to  lower  accrual  or  earnings  quality. 

The  results  are  robust  to  using  alternative  proxies  for  executive  power  that  are  based  on  things  like  the  percentage  of 
outside  directors  that  were  appointed  by  existing  inside  directors,  the  number  of  directors  on  the  board,  whether  the 
company  has  a  staggered  board,  etc.  Using  these  alternative  proxies  would  result  in  a  smaller  sample. 
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TABLE  2 
Measures 

Measures  of  Bonus  Payment  Components 

TOTALBONUS  ((Formula-based  bonus  +  discretionary  bonus)/salary) 

FORMULABONUS  (Formula-based  bonus/salary) 

DISCBONUS  (Discretionary  bonus/salary) 

DISCIND  Indicator  variable  that  is  equal  to  1  if  a  discretionary  bonus  was  paid; 

0  otherwise 

Measures  of  Financial  Performance 

ROA  (Net  income/average  total  assets) 

RET  Market  return 

SALES  Sales 

Measures  of  Contractible  Performance 

FBONUS  (Formula  bonus/target  bonus); 

for  companies  that  only  provide  information  on  the  maximum  bonus: 
(Formula  bonus/0.5  *  maximum  bonus) 

ALPS  {{EPS,  -  £P5,_i)  /  Share  Pricef_i) 

Measure  of  Manipulability  of  Earnings 

SDTA  Log  of  the  standard  deviation  of  (total  accruals/average  total  assets)  over  the 

prior  five  years;  where 

Total  accruals  =  (Income  before  extraordinary  items  —  operating  cash  flows) 

Measure  of  Executive  Power 

CPS  (Total  CEO  compensation)/(Aggregate  total  compensation  for  the  five 

highest-paid  executives),  where: 

Total  compensation  =  (Salary  +  bonus  +  other  annual  pay  +  Black-Scholes 
value  of  options  granted  +  value  of  restricted  stock  grants  +  value  of 
long-term  incentive  payout  +  all  other  compensation) 


Contrary  to  the  prediction  in  H3,  the  discretionary  subsample  does  not  exhibit  greater  accrual 
volatility  than  the  non-discretionary  subsample  (the  mean  and  median  values  for  SDTA  are  -3.18 
and  -3.19  for  the  discretionary  subsample  and  -3.23  and  -3.43  for  the  non-discretionary  sub¬ 
sample).  Moreover,  the  values  for  the  variable  capturing  the  CEO’s  pay  slice  do  not  suggest 
significant  differences  with  regard  to  executive  power  in  the  two  samples. 

IV.  RESEARCH  DESIGN  AND  EMPIRICAL  RESULTS 
Future  Financial  Performance 

Hypothesis  1  predicts  that  the  discretionary  bonus  components  are  incrementally  informative 
about  future  financial  performance  but  that  the  formula-based  components  are  not  predictive  of 
future  success  after  controlling  for  the  current  period’s  performance. 

I  follow  the  basic  approach  suggested  by  Hayes  and  Schaefer  (2000)  of  regressing  future 
financial  performance  on  current  compensation  and  current  financial  performance  in  a  first- 
difference  form.  In  particular,  I  estimate  the  following  model: 


The  inferences  are  consistent  when  future  stock  return  is  considered  as  the  dependent  variable. 
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Measures  of  Contractible  Performance 

FBONUS  349  1.06  0.70  0.65  1.00  1.41 

AEPS  302  0.01  0.01  -0.01  0.01  0.03 
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I^ROAj+x  =  Pq  +  /Si(ACompensation  variable,)  +  yS2A/?0A,  +  +  p^^Log{Sales^  +  e 

(i) 

In  terms  of  the  independent  variables,  I  focus  on  the  financial  performance  in  year  t  in  order  to 
economize  on  the  number  of  parameters  that  have  to  be  estimated. 

Table  4  presents  the  results  for  the  regression  analyses.^°  I  first  estimate  the  regression  using 
the  change  in  total  bonus  payment  as  the  measure  of  current  compensation  (model  I).  The  coef- 


TABLE  4 

Future  Financial  Performance 

Dependent  Variable^ 


Independent  Variables® 

I 

II 

Intercept 

-0.01 

0.02 

(-0.57) 

(1.08) 

ATOTALBONUSi 

0.01 

(1.03) 

AFORMULABONUS, 

0.00 

(0.29) 

LogiDlSCBONUS,) 

0.01** 

(2.49) 

AROA, 

-0.24* 

-0.25* 

(-1.68) 

(-1.92) 

ARET, 

0.01 

0.01 

(0.77) 

(1.05) 

A  Log{SALES^ 

-0.07 

-0.08 

(-1.25) 

(-1.56) 

Number  of  observations 

98 

98 

Adj.  r2 

16.76% 

21.21% 

*,  **,  ***  Significant  at  10  percent,  5  percent,  and  1  percent,  respectively,  all  based  on  two-tailed  tests. 

Reported  are  the  coefficients  from  the  models  using  heteroscedastic-consistent  standard  errors;  t-statistics  are  in 

parentheses. 

“  ATOTALBONUS  is  the  change  in  the  salary-scaled  total  bonus  payment.  AFORMULABONUS  is  the  change  in  the 
salary-scaled  formula-based  portion  of  the  bonus  payment.  Log{DISCBONUS,)  is  the  log  of  the  salary-scaled  discre¬ 
tionary  component  of  the  bonus  payment.  AROA  is  the  change  in  return  on  assets,  ARET  is  the  change  in  market  returns, 
and  AlogiSALES)  is  the  change  in  the  log  of  sales.  In  order  to  reduce  the  effect  of  extreme  observations,  I  drop 
observations  that  fall  in  the  1st  and  99th  percentile  of  the  distribution  of  the  dependent  variable.  In  addition  to  the 
variables  listed,  I  include  industry  fixed  effects  based  on  one-digit  SIC  codes. 

The  variables  are  defined  in  Table  2. 


Some  companies  disclose  in  their  From  8-K  or  Proxy  Statement  that  the  discretionary  bonus  was  paid  in  relation  with 
the  achievement  of  a  “special  occasion”  such  as  an  IPO,  a  sale  of  company  assets,  a  merger  or  an  acquisition,  a  change 
in  position,  or  the  refinancing  of  debt.  Table  4  is  estimated  using  the  sample  that  excludes  such  “special  occasions.”  The 
results  are  qualitatively  unchanged  if  the  models  in  Table  4  are  estimated  on  the  full  sample. 
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ficient  on  (ATOTALBONUS)  is  positive  but  not  statistically  significant.^’  As  expected,  the  coeffi¬ 
cient  on  the  change  in  ROA  is  significantly  negative. 

In  the  second  step,  I  decompose  the  measure  (ATOTALBONUS)  into  the  two  components 
(AFORMULABONUS)  and  (Log(DlSCBONUS))-?'^  (ATOTALBONUS)  is  defined  as 
[(formula  bonus^  -i-  discretionary  bonus^)  /  salary J  -  [formula  bonuS;_i  /  salary ,_i],  which  can  be 
separated  into  [formula  bonus^/ salary f]  -  [formula  bonus, _i  /  salary f_i]  (= AFORMULABONUS) 
and  [(discretionary  bonus,)  /  salary,]  (=DISCBONUS).  When  the  regression  is  estimated  by  in¬ 
cluding  the  two  components  of  the  change  in  total  bonus  in  the  regression  separately,  the  coeffi¬ 
cient  on  the  discretionary  portion  is  positive  and  significant,  whereas  the  coefficient  on  the 
formula-based  portion  is  insignificant.^^  These  findings  suggest  that  discretionary  bonuses  are 
incrementally  predictive  of  future  financial  performance  but  that  formula-based  bonuses  are  not.  In 
a  broader  sense,  the  results  in  Table  4  provide  support  for  the  notion  that  discretionary  bonuses  are 
based  on  non-contractible  performance  indicators. 

In  comparing  Table  4  to  the  results  in  Hayes  and  Schaefer  (2000),  I  view  the  analysis  in  this 
study  as  identifying  one  of  the  components  of  total  cash  compensation  that  drive  the  results  in 
Hayes  and  Schaefer  (2000). 

Determinants  of  Discretionary  Bonus  Payments 

Hypotheses  2  through  4  concern  the  occurrence  of  discretionary  bonus  payments.  To  inves¬ 
tigate  the  predictions,  I  run  a  logit  model  using  the  indicator  variable  DISCIND  as  the  dependent 
variable.  Hypothesis  2  predicts  that  discretionary  bonuses  are  more  likely  to  occur  when  the 
outcome  of  the  contractible  measure  falls  in  the  tails  of  the  distribution  than  when  the  contractible 
outcome  is  in  the  medium  range.  I  use  indicator  variables  in  order  to  capture  whether  a  company’s 
contractible  performance  has  an  extreme  value.  In  particular,  I  first  employ  an  indicator  variable, 
TAILS,  which  is  set  equal  to  1  if  the  contractible  outcome  falls  in  the  tails  of  the  distribution. 
Specifically,  when  the  contractible  performance  is  measured  by  FBONUS  (= (formula  bonus/target 
bonus)),  TAILS  is  set  equal  to  1  if  FBONUS  is  0  or  if  FBONUS  falls  in  the  top  15  percent  of  the 
distribution.^'’  When  the  contractible  performance  is  measured  by  AEPS  (=((EPSt 
-EPSt^i)  /  Share  Price,_i)),  TAILS  is  set  equal  to  1  if  AEPS  falls  in  the  bottom  or  top  15  percent 
of  the  distribution.^^  In  the  second  step,  I  estimate  the  logit  model  by  splitting  the  indicator 
variable  TAILS  into  its  two  components,  LOW  and  HIGH,  which  capture  whether  the  contractible 
outcome  falls  into  the  bottom  or  top  end,  respectively. 

Hypotheses  3  and  4  are  tested  by  including  the  variables  capturing  the  manipulability  of 
earnings  (SDTA),  and  executive  power  (CPS)  in  the  logit  analyses.  Specifically,  I  estimate  the 
following  models: 

DISCIND  =  I3q  +  PJAILS  +  I32SDTA  +  P^CPS  -l-  e;  (ii) 


Hayes  and  Schaefer  (2000)  run  a  related  regression  and  obtain  a  significantly  positive  coefficient  for  the  change  in  cash 
compensation.  This  discrepancy  may  be  attributable  to  the  much  larger  sample  that  is  used  in  Hayes  and  Schaefer 
(2000). 

Since  the  distribution  of  DISCBONUS  is  highly  skewed,  I  employ  a  log-transformation.  The  results  are  qualitatively 
unchanged  if  DISCBONUS  is  used  without  the  log-transformation.  Moreover,  DISCBONUS  is  not  differenced  since 
there  are  no  firms  in  the  sample  that  disclosed  payment  of  discretionary  bonuses  in  two  adjacent  years. 

Untabulated  results  indicate  that  when  the  sample  used  in  Table  4  is  split  into  three  groups  based  on  the  company’s 
contractible  performance  as  measured  by  (AEPS),  the  discretionary  bonus  payment  is  significantly  more  predictive  of 
future  financial  performance  when  the  company’s  contractible  performance  falls  in  the  bottom  third  and  when  the 
company’s  contractible  performance  falls  in  the  top  third. 

I  have  chosen  15  percent  since  the  number  of  observations  for  which  FBONUS  is  zero  comprise  15  percent  of  the 
sample.  In  other  words,  the  number  of  observations  in  the  two  tails  is  chosen  to  be  equal. 

15  percent  is  chosen  in  order  to  facilitate  comparison  to  the  results  using  FBONUS. 
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DISCIND  =  ySo  +  /3ilLOW  +  P^hHIGH  +  p^SDTA  +  p^CPS+e.  (iii) 

Panel  A  of  Table  5  shows  the  results  of  the  logit  regression  analysis  when  the  outcome  of  the 
contractible  performance  metric  is  measured  by  FBONUS  (= formula  bonus/target  bonus),  and 
Panel  B  provides  the  findings  when  the  contractible  metric  is  measured  by  A  EPS  {={EPSt 
-EPS,_i)  /  Share  Price, Models  la  and  Ib  are  estimated  using  the  TAILS  variable  and  models 
Ila  and  Ilb  are  estimated  using  the  two  components  of  TAILS,  LOW,  and  HIGH. 

For  the  respective  specifications  la  and  Ib,  the  coefficients  on  TAILS  are  positive  and  statis¬ 
tically  significant,  indicating  that  companies  are  more  likely  to  pay  discretionary  bonuses  when  the 
outcome  of  the  contractible  performance  measure  falls  in  the  tails  of  the  distribution.  The  odds 
ratios  indicate  that  the  probability  of  paying  a  discretionary  bonus  rises  by  approximately  4  (2) 
when  the  company’s  contractible  performance  falls  in  the  tails  of  the  distribution  as  measured  by 
FBONUS  {AEPS).  Evaluated  at  mean  values  for  SDTA  and  CPS  (in  model  Ib),  the  probability  of 
paying  a  discretionary  bonus  increases  from  approximately  18  percent  to  50  percent  (25  percent  to 
40  percent)  when  the  company’s  contractible  performance  falls  in  the  tails  of  the  distribution  as 
measured  by  FBONUS  {AEPS).  These  findings  provide  some  initial  support  for  H2. 

The  results  for  models  Ila  and  lib  indicate  that  companies  are  more  likely  to  pay  discretionary 
bonuses  when  their  contractible  performance  is  in  either  the  bottom  or  the  top  tail  of  the  distri¬ 
bution.  Specifically,  the  coefficients  on  LOW  and  HIGH  are  positive  and  statistically  significant  in 
all  specifications.  The  results  in  Panel  A  that  are  obtained  when  using  EBONUS  as  the  measure  of 
contractible  performance  indicate  that  the  odds  ratio  of  paying  a  discretionary  bonus  rises  by  6-7 
when  the  company’s  contractible  performance  is  in  the  lower  tail.  When  the  contractible  perfor¬ 
mance  is  in  the  upper  tail,  the  odds  ratio  of  paying  a  discretionary  bonus  increases  by  a  factor  of 
approximately  2.  Evaluated  at  mean  values  for  SDTA  and  CPS  (in  model  lib),  the  probability  of 
paying  a  discretionary  bonus  increases  from  approximately  18  percent  to  60  percent  (35  percent) 
when  the  company’s  contractible  performance  falls  in  the  bottom  (top)  tail  of  the  distribution.  As 
can  be  observed,  companies  are  more  likely  to  pay  discretionary  bonuses  when  their  contractible 
performance  is  low  than  when  their  contractible  performance  is  high.  The  model  analyzed  above 
does  not  predict  that  discretionary  bonuses  are  equally  likely  to  occur  for  the  low  and  for  the 
high-contractible  outcome.  Specifically,  the  results  of  the  simulation  analysis  indicate  that,  for  the 
parameter  ranges  considered,  it  is  more  likely  to  observe  discretionary  bonus  payments  for  the 
low-contractible  outcome  than  for  the  high-contractible  outcome.  Thus,  the  discrepancy  in  the 
findings  with  respect  to  LOW  and  HIGH  when  EBONUS  is  used  to  capture  the  outcome  of  the 
contractible  performance  measure  is  consistent  with  the  model  that  is  analyzed  above. 

The  results  in  Panel  B  that  are  obtained  when  using  AEPS  differ  from  the  results  in  Panel  A 
in  that  they  indicate  that  there  are  no  significant  differences  between  the  bottom  and  top  tail. 
Specifically,  the  odds  ratio  of  paying  a  discretionary  bonus  rises  by  approximately  2  when  the 
company’s  contractible  performance  is  in  either  the  lower  or  upper  tail.  Evaluated  at  mean  values 
for  SDTA  and  CPS  (in  model  lib),  the  probability  of  paying  a  discretionary  bonus  increases  from 
approximately  25  percent  to  40  percent  when  the  company’s  contractible  performance  is  either 
very  low  or  very  high. 

Overall,  I  interpret  the  findings  in  Table  5  as  providing  support  for  the  premise  that  discre¬ 
tionary  bonuses  are  more  likely  to  occur  when  the  outcome  of  the  contractible  performance 
measure  falls  in  the  tails  of  the  distribution. 


Analogous  to  Table  4,  Table  5  is  estimated  using  the  sample  that  excludes  “special  occasions.”  The  results  are  qualita¬ 
tively  unchanged  if  the  models  in  Table  5  are  estimated  on  the  full  sample. 
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TABLE  5 
Logit  Analysis® 

Panel  A:  Contractible  Performance  is  Measured  by  F BONUS  =  (Formula  bonus/target  bonus) 
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Panel  B;  Contractible  Performance  is  Measured  by  AEPS  =  ((EPS,  -  EPSt-i)  I  Share  Price, _i) 
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the  top  or  bottom  15  percent  of  the  distribution.  LOW  is  set  equal  to  1  if  A£P5  falls  in  the  bottom  15  percent  of  the  distribution.  HIGH  is  set  equal  to  1  if  SEPS  falls  in  the  top 
15  percent  of  the  distribution.  SDTA  is  the  log  of  the  standard  deviation  of  the  total  accruals  over  the  prior  five  years.  CPS  is  the  proportion  of  the  total  compensation  of  the  top 
five  executives  that  is  captured  by  the  CEO.  In  addition  to  the  variables  listed,  I  include  industry  fixed  effects  based  on  one-digit  SIC  codes. 
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The  coefficients  on  the  variable  capturing  accrual  volatility  (SDTA)  are  insignificant  across  all 
model  specifications.  Thus,  the  empirical  setting  used  in  the  study  does  not  provide  support  for  the 
prediction  based  on  the  implicit  contracting  model  in  Baker  et  al.  (1994)  that  discretionary  bo¬ 
nuses  are  used  more  frequently  when  the  contractible  performance  measures  are  more  susceptible 
to  manipulation. 

Similarly,  the  coefficients  on  the  variable  capturing  executive  power  (CPS)  are  insignificant 
across  all  model  specifications,  failing  to  support  the  hypothesis  that  discretionary  bonuses  occur 
more  frequently  in  settings  where  management  has  considerable  power.  It  is  possible  that  the 
variable  that  is  employed  here  is  too  imprecise  a  measure  to  detect  a  significant  relationship. 

Overall,  the  results  from  the  logit  analysis  provide  support  for  the  hypothesis  that  discretion¬ 
ary  bonuses  are  used  more  frequently  when  the  outcome  of  the  contractible  performance  measure 
falls  in  the  tails  of  the  distribution  than  when  it  falls  in  the  medium  range.  The  hypotheses  that  the 
occurrence  of  discretionary  bonuses  is  positively  related  to  the  manipulability  of  the  company’s 
earnings  or  to  the  power  of  the  executives  are  not  supported.^^ 


V.  CONCLUSION 

This  study  examines  discretionary  bonus  payments.  Building  on  the  hypothesis  that  discre¬ 
tionary  bonuses  are  paid  based  on  non-contractible  performance  measures,  I  develop  an  analytical 
model  that  forms  the  basis  for  the  primary  hypothesis.  The  model,  which  builds  on  Rajan  and 
Reichelstein  (2009),  yields  the  prediction  that  discretionary  bonus  payments  occur  when  the  out¬ 
come  of  the  contractible  measure  is  either  low  or  high,  but  not  when  the  outcome  of  the  contract¬ 
ible  measure  falls  in  the  medium  range. 

I  also  test  whether  discretionary  bonus  payments  are  related  to  the  manipulability  of  the 
contractible  measures — a  prediction  that  follows  from  the  model  in  Baker  et  al.  (1994).  I  contrast 
the  predictions  from  agency  theory  against  a  prediction  based  on  managerial  power  theory. 

The  empirical  results  provide  evidence  that  discretionary  bonus  payments  are  related  to  non- 
contractible  measures  that  are  related  to  future  financial  performance.  Moreover,  the  analyses  yield 
support  for  the  prediction  from  the  model  that  discretionary  bonus  payments  occur  significantly 
more  often  when  the  outcome  of  the  contractible  measure  falls  in  the  tails  of  the  distribution.  The 
findings  do  not  provide  support  for  the  prediction  that  there  is  a  positive  relationship  between 
discretionary  bonus  payments  and  the  manipulability  of  the  contractible  performance  measures. 
The  findings  in  this  study  also  do  not  provide  support  for  the  hypothesis  that  discretionary  bonuses 
are  positively  related  to  executive  power. 

This  study  is  subject  to  some  limitations.  The  sample  collected  from  public  sources  may  be 
subject  to  selection  bias.  Although  companies  are  required  to  disclose  discretionary  bonus  pay¬ 
ments  in  a  Form  8-K  or  a  proxy  statement,  the  regulation  was  still  fairly  new  during  the  sample 
period,  such  that  it  is  possible  that  some  companies  did  not  disclose  the  information.  Second,  the 
sample  size  is  relatively  small.  It  is  conceivable  that  the  lack  of  findings  with  respect  to  the 
manipulability  of  earnings  and  executive  power  is,  at  least  partially,  attributable  to  the  low  statis¬ 
tical  power  generated  by  the  small  sample.  Moreover,  it  is  possible  that  there  are  determinants  of 
discretionary  bonus  payments  that  are  not  considered  in  this  study.  Such  omitted  variables  would 
raise  concerns  with  respect  to  the  empirical  findings  of  the  study. 


The  literature  has  documented  that  bonus  contracts  of  loss-making  firms  are  structured  differently  than  bonus  contracts 
of  profitable  firms  (Matejka  et  al.  2009).  The  results  are  robust  to  excluding  loss-making  firms  from  the  sample. 
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APPENDIX  A 

Proof  of  Proposition  (1) 

I  proceed  by  characterizing  the  solution  to  program  PI.  Let  X  and  p  denote  the  Lagrange- 
multipliers  for  constraints  (i)  and  (ii),  respectively.  Denote  by  the  Lagrange-multiplier  for  the 
constraint  vW  -  >  0  (there  are  four  such  inequalities  embedded  in  constraint  (iii)).  The  first- 


order  conditions  for  the  optimal  solution  with  regard  to  and  are: 

-;7^  +  /3''^  +  /3^'  =  0;  (2) 

-(l-p^)  +  yS'''  +  ;8"  =  0.  (3) 

The  first-order  conditions  for  the  optimal  solution  with  regard  to  and  are: 
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■  pi  ■  (1  - «L)  ■  U'(s'")  +  p  ■  [pi  ■  (1  -  ?i»)  -pi  ■  (1  - ql)]  ■  (/'(/')  -0^  =  0-.  (5) 
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It  is  clear  that  =  max{5^* ,  s^‘}  and  w’  =  max{^^^  ,  s“}.  To  show  that  suppose  that 

the  opposite  were  true.  Then,  5^^  <  =>  0^^  =  0  (by  complementary  slackness),  which  in 

turn  would  imply  (from  (2)).  But  from  (8),  0^  =  0  ^  <  0,  which  is  a  contradiction. 

We  must  therefore  have  To  show  that  >  s’-^,  suppose  that  the  opposite  were  true. 

Then,  =  ^  0^  =  Q  (by  complementary  slackness),  which  in  turn  would  imply  0^  =  {\ 

-  p^)  (from  (3)).  But  from  (9),  0^  =  0^  0^  <  0,  which  is  a  contradiction.  We  must  therefore  have 
>  ^//  Using  these  results,  the  principal’s  problem  PI  can  be  re-cast  as  follows: 

Min  [w'^-p^  +  w'- (1 -p^)] 

w^,w^,S,e 


subject  to: 

[f/(wd  ■  qih  +  (t/(wd  -  e)  •  (1  -  qlh)^  •  Ph  +  Ww^)  ■  q^ih  +  iU{w‘)  -  S)  ■  (I  -  q'l^)]  ■  {I  - p^) 

-e{0)-U>O,  (10) 
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[U{w^)  ■  +  (t/(w^)  -  8)  •  (1  -  ql)]  ■  pI  +  [t/(wO  •  ^  •  (1  -  •  (1 
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-?L)]-(1-pD-^(«^)}-0,  (11) 


(5>0, 


(12) 


8  >  0, 


(13) 


where  (10)  is  the  agent’s  (IR)  constraint  and  (11)  is  the  agent’s  (IC)  constraint.  Since  both  (10)  and 
(11)  are  satisfied  as  equalities,  it  must  be  the  case  that: 
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Thus,  the  problem  in  program  PI  can  be  reduced  to  the  following  program: 

Program  P3: 

Min[Z(^,8)]  =  Min[pl  ■  U-\u^)  +  {I  -  pi)  ■  U-\u)] 

S,E  S,e 

subject  to: 

(Al)  S>0, 

(A2)  8  >  0, 

where  f/“^(  • )  is  the  inverse  of  the  agent’s  utility  function,  and  and  are 
given  by  the  expressions  in  (14)  and  (15). 

To  show  that  the  problem  is  well  defined,  we  can  show  that  Z  is  convex  in  (<5,8): 
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(Ph-Ph) 

iPH-pi) 

L  (Ph-Ph) 

Ph-i'^-Ph)-(^-^\h)-Ph-i^-Ph)-(^-^'ih) 

(pl-pi) 


d^Z{S,e)  ^Z{S,e) 


de^ 


^Z{8,e) 

ddde 


_  _  (-  1  +p;;)  •  (p1  •  (-  1  +  qj)  +p\-{{-\+  ql)  •  (-  1  +  q‘i,)  +  p[ 

ipl-pif 

(-  qiH  +  +  q\H  -  A  •  gD)  +  (- 1  +  p')  •  pI  •  (- 1  +  gik)  •  («0 

The  first-order  conditions  for  the  optimal  solution  with  regard  to  S  and  s  are: 
dZ{  S,  s) 


>0. 


dS 

dZ{S,s) 

de 

where: 

dZiS,s) 


+  Pi-O, 


+  A  -  0, 


=  plViu^)-i-\) 


(16) 


(17) 


dS 


+  il-p1).V{u‘) 


dZ{S,e) 


de 


=  pI-V{u^)-{-1) 


+  {l-p^,)-V{u) 


(1  ~Ph^  •  (1  ~Ph)  • 

ipl-pi) 

Ph  •  (^  ~Ph)  •  (^  ~  ^\h)~Ph  •  (^  ~Ph)  •  (^  ~  ^?/i) 

ip1-pi) 

pi  •  (1  -p^)  •  (1  -gL)-p^a-pl)•(l-gL) 


(18) 


(p^pi) 


Ph'  Ph'  i^hh  ~  ^hh) 


(19) 


(pi- pi) 

V{-)  represents  the  derivative  of  U~^{  •)  and  /3i  ,^2  Ihe  Lagrange-multipliers  for  constraint 
(Al)  and  (A2),  respectively. 

(?Z(  S,e)  dZ(  S,e) 

/8j  >  0  and  ^2  —  ^  implies  S:  0  and  ^  —  >  0  (by  (16)  and  (17)),  which  in  turn  implies 
the  following  (by  (18)  and  (19)): 


Ph'(^~pi)'(^ih  ^\h) 


vW)  ^ _ 

V{u^)  pI  ■  (1  -p‘f)  •  (1  -  q\h)  “P1  •  (1  ~Ph) '  (1  ~  9?ft) 


(20) 
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v(n')  -/)■(! -g!.)-p».(i-p').(i-^;) 

v(m‘)  (i-p‘)-p'-(9;-?i) 

It  suffices  to  show  that  under  the  conditions  in  (1),  a*  >  0. 

(21)  and  (1)  yield  the  following  relationship: 

vfaO ^pi-(i-p»)-(i-g',)-p;.(i-p').(i-^;,)  vii) 

which  implies: 

y{u^)  ^  v{D 

V{u^)  V{h)  ’ 
where: 


Vil)  =  v\  u  + 


and 


v(m')  y(/) 

To  show  that  >  0,  suppose  that  =  0.  Then  violating  (22).  Therefore,  s*  0. 

dZ{S,e) 

It  follows  that  y82  =  0  and  therefore,  |^o  =  0: 


dZ{d,s) 

de 


d=o 


Ph  ■  (1  ~Ph)  '  (1  “  ?L)  ~Ph'  ~Ph)  ■  (1  “  ‘ihh) 


ipl-p[) 


Ph'  Ph'  i^hh  ~  9'li 


ipi-p[) 


=  0 


or,  equivalently: 


yjL')  _  pI  •  (1  -  Ph)  '  ~  ^hh)  ~Ph'  ~Ph)  '  ~  ^hh) 

vm~  (^-pt)'pi'iqhh-qL) 

where: 


V{L’)  =  V\  U  + 


eW)  ■  pj- e(a'')  ■  p[  P>  ■  pj  ■  ( A  - gja) 

(pI-p'h)  (pI-A 


,  and 


V{H')  =  V\  U  + 


e{a^)  ■  {I  - pj)  -  e{a‘)  ■  (I  - pI) 
(Ph-Ph) 


pi•(l-p^)•(l-gL)-p^(l-pi)•a-g.\) 

(p^pD 


In  order  to  show  that  there  exists  a  e*  >  0,  it  suffices  to  show  that  (i)  R{s)  =  is  increasing  in 
e  and  that  (ii)  there  exists  a  e  >  0  such  that: 
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^  Ph  •  •  (1  ~  <lhh)  ~Ph  •  ~p1)  '  ~  ^hh) 

(  ^  “  Ph)  ■  Ph  ■  i^hh  “  ^L) 


Part  (i) 


V{L') 


R(e)  =  is  increasing  in  e: 


dR{s) 

ds 


y'{L') 


Ph'  Ph'  (^hh  ~  ‘Ihh) 

(pl-pi)  ^ 


vm 


f\2 


vm 

V{L')  V'{H')  ( (lih)  -P^  (1  -p1)  •  (1  -  qlh) 

_ _v _ ipi-p[) _ ; 

V{H'f 


>0. 


(We  know  that  (pj,  •  ( 1  -  •  ( 1  -  q\^)  -  •  ( 1  -  pj,)  •  ( 1  -  ql^))  is  positive  by  (1)). 


Part  (ii) 

Note  that: 


Ph  '  '~Ph)  '  (1  ~  ^hh)  ~Ph'  ~Ph)  '  ~  ^hh)  ^  j 

( 1  ~  Ph)  '  Ph'  (‘lhh~  ^hf) 

.  V(L') 

Moreover,  there  exists  a  e  >  0  such  that  V  =H'  and  therefore  =  1: 

V{ti  ) 

. _ e{a^)  -  e{a) _ 

Ph'  Ph'  i^^hh  ~  ^hh)  '  (^  Ph)  '  ~  ^hh)  ~  Ph  '  ~  Ph)  '  “  ^hh) 

(L' ,  H'  are  >0  when  e  =  e). 

dZ{d,e*) 

Therefore,  there  exists  a  (unique)  >  0  such  that  — — |^o  =  0-  H 

Proof  of  Proposition  (2) 

It  suffices  to  show  that  the  solution  to  program  P2  is  such  that  w"'  =  s"'^  =  s"'K  Let  X  and  p 
denote  the  Lagrange-multipliers  for  constraints  (i)  and  (ii),  respectively.  Denote  by  /S'*  the 
Lagrange-multiplier  for  the  constraint  w'  -  y*  >  0  (there  are  six  such  inequalities  embedded  in 
constraint  (iii)).  The  first-order  conditions  for  the  optimal  solution  with  regard  to  w*,  w'",  and 
are; 


-pl  +  l3^^  +  fi^‘  =  0,  (23) 

-p* +/8'”*  +  /S^'  =  0,  (24) 

-(l-p{-pi)  +  /3“  +  y8"  =  0.  (25) 

Similarly,  the  first-order  conditions  for  the  optimal  solution  with  regard  to  5*^,  5'”*,  s'"\  and 
are: 

\-plql-  U'{s^^)  +  p,.  [pi  ql-p[.  ql]  •  U'^  -  /S"'  =  0,  (26) 

^  •  (1  -  qlh)  '  U'{s^‘)  +  p.[pl{l-  ql)-p[  •  (1  -  ql)]  •  U'm  -/3^‘  =  0,  (27) 
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^  +  II -{pi-qih-pL-  q'mh\  ■  U'is"^^)  =  (28) 

^  (1  -  Qih)  ■  +  /^  ■  (1  -  qtn) -pL-{^-  qLh)^  •  =  0,  (29) 

^■i'^-pl-pi)-  qi  •  U'{s^^)  +  ^  •  [(1  -pi- pi)  .qi-{\-p[-pi).  qi]  .  U'{s^^)  -  0^  =  0, 

(30) 


^■(^-pi-pt)-(i-qiH)-U'{s‘)  +  p-[il-pi-pi).{l-qi)-il-pi-pi).(l 

I  \-\  TTl(Jl\ 


lit 

-q\h)^■u'{s^^)-0^=Q. 

By  (28)  and  (29),  and  MLRP: 

nmh 


pi-qik-u'is-^) 


X  +  ytx  • 

1  1  ' 
Pm  '  qmh 

^  X  +  ytX  • 

V  Pm- qmh} 

1  - 


p‘m-i^-qih) 

pl0l-qiH)\ 


pml 


pi0^-qih)-U'{s-r 

By  (29)  and  (31),  MLRP,  and  assuming  that  qi  >  qi^  and  qi^  >  q^: 


jjnl 


\ + p,  • 


^_p‘m-(^-qi 


Pm’i^-qmh)- 


X  +  /X  ■ 


1  - 


Pr  (1  ~  q\h) 

Pr(l-4)J 


pii 


P? -(1-4)  •[/'(/)• 


(31) 


(32) 


(33) 


It  is  clear  that  w"^  =  maxl^"*^ ,  s'"^}.  To  show  that  5'”*  >  s"^’-,  suppose  that  the  opposite  were 
true.  Then,  5'”^  <  s'"'-  =  w"^  ^  -  0  (by  complementary  slackness),  which  in  turn  would  imply 

=  pi  (from  (24)).  But  from  (32),  (3^^  =  0^  <  0,  which  is  a  contradiction. 

To  show  that  w'”  =  s’^'^  =  5'"^,  suppose  not,  so  that  w'"  =  5"*^  >  s’^K  This  implies  =  0  (by 
complementary  slackness).  But  from  (33),  =  0  <  0,  which  is  impossible.  | 


APPENDIX  B 

SUMMARY  OF  THE  MONTE  CARLO  SIMULATIONS 
General  Model  Specifications 

In  all  models,  the  probabilities  can  be  randomly  chosen  from  (0,  1),  subject  to 
the  MLRP  constraints,  the  constraints  that  qi^  >  qi^^  >  qi,  qi^  >  qi^  >  q^, 

(qih  ”  qih)  —  (qih  ~  qit)^  (^'L  “  qit)  —  (qih  ~  qi)- 

The  cost  of  taking  the  low  action  is  set  to  zero  in  all  models. 

I  assume  a  power  utility  function.  The  parameter  y  varies  across  the  models. 

[/(5)  = 

1  -  y 

Model  Parameters 


y  e(a^)  U 

0.5  e[l,10]  g[5,35] 

(continued  on  next  page) 


Results 


Number  of 
Datapoints 
Generated 


Number  of 
Cases  Where 
y  is  Used 
for 


Number  of 
Cases  Where 
y  is  Used 
for  x'" 


Number  of 
Cases  Where 
y  is  Used 
for  X* 


716  272  0  71 


p'pq)h 

e{a‘) 
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Model  Parameters  Results 


r 

e(a*) 

U 

Number  of 
Datapoints 
Generated 

Number  of 
Cases  Where 
y  is  Used 
for  x‘ 

Number  of 
Cases  Where 
y  is  Used 
for  x"‘ 

Number  of 
Cases  Where 
y  is  Used 
for  jr* 

0.5 

e[l,5] 

e[5,25] 

615 

194 

0 

50 

e[0. 1,0.9] 

e[l,10] 

e[5,35] 

206 

73 

0 

13 
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ABSTRACT:  We  examine  how  management  quarterly  guidance  strategy  is  affected  by 
various  outcomes  from  previously  issued  guidance.  We  find  that  managers  are  less 
likely  to  provide  quarterly  earnings  guidance  for  a  given  year  when  past  management 
forecasts  have  been  overly  optimistic,  when  past  forecasts  were  unsuccessful  at  influ¬ 
encing  analysts’  expectations,  when  past  forecasts  failed  to  reduce  information  asym¬ 
metry,  and  when  past  forecasts  resulted  in  earnings  disappointments.  For  firms  that 
continue  to  give  guidance,  adverse  prior  outcomes  also  affect  the  precision  of  future 
guidance  and  the  number  of  quarters  within  a  year  for  which  they  give  guidance. 

Keywords:  earnings  guidance;  management  forecasts;  corporate  disclosure. 

Data  Availability:  Data  used  in  this  study  are  available  from  public  sources. 

1.  INTRODUCTION 

This  study  examines  how  management  quarterly  guidance  strategy  is  affected  by  outcomes 
from  previously  issued  guidance.  Forecasting  earnings  is  fundamentally  a  multi-period 
process  in  which  outcomes  from  one  period  influence  managerial,  investor,  and  analyst 
decisions  about  future  forecasts.  However,  a  recent  review  by  Hirst  et  al.  (2008)  emphasizes  that 
most  prior  research  ignores  the  extent  to  which  forecast  outcomes  from  one  period  influence 
forecast  antecedents  and  characteristics  in  subsequent  periods.  Our  study  begins  to  fill  this  void  by 
documenting  the  extent  to  which  a  wide  range  of  adverse  outcomes  affects  future  guidance  be- 
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havior.  This  is  an  important  contribution  because,  as  Hirst  et  al.  (2008)  argue,  exploring  the 
multi-period  nature  of  forecasting  is  necessary  to  extend  our  understanding  of  forecasting  beyond 
immediate  one-time  decisions  and  reactions. 

We  adopt  a  framework  in  which  managers  seek  to  maximize  shareholder  value  by  aligning 
market  expectations  with  managers’  private  information.^  Our  framework  divides  progress  toward 
attaining  this  goal  into  four  outcomes.  First,  managers  want  to  provide  attainable  guidance  that  can 
be  met  or  beaten  in  order  to  minimize  legal  liability  (Kasznik  1999)  and  develop  a  reputation  for 
credible  reporting  (Graham  et  al.  2005).  Second,  guidance  is  only  effective  when  market  partici¬ 
pants  follow  managers’  guidance.  Third,  guidance  is  intended  to  reduce  information  asymmetry 
(Verrecchia  2001).  Fourth,  guidance  is  used  by  managers  to  avoid  disappointing  market  partici¬ 
pants  at  earnings  announcements  (Richardson  et  al.  2004;  Cotter  et  al.  2006). 

We  argue  that  failing  to  achieve  desired  outcomes  affects  guidance  strategy  by  influencing 
managers’  expectations  about  the  benefits  and  costs  of  future  guidance.  Benefits  to  providing 
earnings  guidance  include  reducing  litigation  risk  (e.g.,  Skinner  1994,  1997),  building  a  reputation 
for  transparent  reporting  (e.g.,  Graham  et  al.  2005;  Hutton  and  Stocken  2009),  reducing  informa¬ 
tion  asymmetry  (Diamond  and  Verrecchia  1991),  and  guiding  analysts’  forecasts  toward  beatable 
earnings  targets  (e.g..  Cotter  et  al.  2006),  while  costs  to  guidance  include  legal  liability  for 
optimistic  forecasting  (Kasznik  1999).  Firms  can  respond  to  changes  in  the  expected  costs  and 
benefits  in  a  number  of  ways.  For  example,  a  firm  that  experienced  adverse  outcomes  could 
conclude  that  forecasting  is  no  longer  in  their  interest,  choosing  not  to  forecast  again.  Other  firms 
could  choose  to  continue  providing  guidance,  but  reduce  intra-year  forecast  frequency  or  change 
the  form  of  guidance  (e.g.,  providing  a  range  of  possible  values,  rather  than  a  point  estimate), 
improving  ex  post  “accuracy”  by  reducing  ex  ante  guidance  precision.  Alternatively,  firms  could 
choose  not  to  change  their  forecasting  strategy  based  on  a  short  period  of  adverse  outcomes.  Given 
the  costs  and  benefits  attributed  to  guidance,  it  remains  an  open  empirical  question  as  to  how 
adverse  past  outcomes  influence  managers’  forecasting  behavior. 

We  empirically  investigate  how  adverse  outcomes  are  associated  with  three  dimensions  of 
future  quarterly  forecasting  strategy:  whether  to  guide  at  all  next  period,  the  number  of  fiscal 
quarters  for  which  firms  give  guidance,  and  the  specificity  of  guidance  provided.  We  find  that 
falling  short  of  one’s  own  guidance,  failing  to  align  market  expectations  with  management  guid¬ 
ance,  failing  to  reduce  information  asymmetry,  or  disappointing  market  participants  at  earnings 
announcements  increases  the  probability  of  omitting  guidance  in  the  next  year.  For  firms  that 
continue  to  give  guidance,  adverse  prior  outcomes  are  also  associated  with  the  precision  of  future 
guidance  and  the  number  of  quarters  within  a  year  for  which  they  give  guidance. 

We  find  that  falling  short  of  one’s  own  forecast  has  a  particularly  significant  effect  on  future 
guidance  strategy.  For  example,  the  unconditional  probability  for  our  sample  that  a  firm  guiding 
this  year  will  stop  guiding  for  next  year  is  30  percent.  For  firms  consistently  falling  short  of  their 
own  guidance  this  year,  the  probability  of  omitting  quarterly  forecasts  next  year  increases  by  8 
percent  relative  to  firms  that  consistently  meet  or  beat  their  own  guidance.  Overall,  we  document 
that  the  economic  effect  of  previous  forecasting  outcomes  on  forecasting  strategy  is  comparable  to 
many  factors  previously  identified  to  affect  forecasting  decisions.  By  focusing  on  the  multi-period 
nature  of  forecasting,  we  identify  significant  determinants  of  management  forecast  strategies  that 


’  Evaluating  the  impact  of  past  outcomes  on  forecasting  behavior  is  complicated  by  the  numerous  possible  objectives 
motivating  the  provision  of  guidance.  The  expectation-adjustment  hypothesis  presumes  that  managers  have  incentives  to 
align  market  expectations  with  their  private  information.  However,  conflicts  between  owners  and  managers  could  create 
incentives  to  withhold  information  or  strategically  issue  biased  forecasts  (Nagar  et  al.  2003;  Rogers  and  Stocken  2005). 
We  know  of  no  overarching  theory  that  encompasses  all  of  the  various  motivations  behind  the  provision  of  guidance. 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


Once  Bitten,  Twice  Shy 


1953 


have  been  largely  ignored  in  prior  literature.  Our  results  suggest  that  previous  forecasting  out¬ 
comes  are  important  controls  that  should  be  included  in  future  forecasting  studies. 

The  next  section  develops  predictions  on  how  quarterly  guidance  strategy  is  affected  by 
outcomes  from  previously  issued  guidance.  Sample  selection  criteria  and  descriptive  statistics  are 
discussed  in  Section  III.  Research  design  and  main  results  are  presented  in  Section  IV.  Robustness 
checks  and  additional  evidence  are  presented  in  Section  V.  Section  VI  concludes. 

II.  DEVELOPMENT  OF  EMPIRICAL  PREDICTIONS 

Prior  research  suggests  a  number  of  benefits  to  providing  management  earnings  guidance, 
including  reducing  information  asymmetry  and  cost  of  capital  (e.g.,  Verrecchia  2001),  reducing 
litigation  risk  (e.g.,  Skinner  1994,  1997),  building  a  reputation  for  transparent  reporting  (e.g., 
Graham  et  al.  2005;  Hutton  and  Stocken  2009),  guiding  analysts’  forecasts  toward  attainable 
earnings  targets  (e.g..  Cotter  et  al.  2006),  and  signaling  managerial  quality  (Trueman  1986).  Such 
benefits  are  weighed  against  costs  to  disclosure,  such  as  exposing  firms  to  new  sources  of  legal 
liability  (Kasznik  1999),  proprietary  costs  if  guidance  is  useful  to  competitors  (e.g.,  Verrecchia 
1983),  and  distracting  management  by  consuming  time  that  could  otherwise  be  devoted  to  running 
the  business  (e.g..  Progressive  Insurance  in  Hutton  and  Weber  [2001]).  Prior  research  (e.g..  King 
et  al.  1990;  Core  2001;  Fields  et  al.  2001)  suggests  that  firms  rationally  weigh  expected  costs  and 
benefits  when  making  decisions  about  future  guidance  strategy.  In  this  section,  we  develop  hy¬ 
potheses  on  how  outcomes  from  previous  guidance  influence  managers’  expectations  about  the 
costs  and  benefits  of  future  guidance. 

One  frequently  cited  unifying  framework  for  managerial  objectives  in  providing  guidance  is 
the  expectation-adjustment  hypothesis  (e.g.,  Ajinkya  and  Gift  1984;  King  et  al.  1990),  which  states 
that  management  issues  forecasts  in  order  to  align  market  expectations  with  their  private  informa¬ 
tion  about  expected  future  earnings.  Under  this  view,  managers  issue  disclosures  because  of  the 
negative  consequences  associated  with  gaps  between  expected  earnings  and  announced  earnings, 
including  high  investor  transaction  costs,  large  stock  price  fluctuations,  and  damage  to  relations 
with  analysts.  In  examining  the  relation  between  prior  outcomes  and  future  forecasting  strategy, 
we  adopt  the  expectation-adjustment  framework,  modified  to  include  recent  research  on  asymme¬ 
tries  in  forecasting  incentives.  We  assume  that  a  primary  goal  of  guidance  strategy  is  to  maximize 
shareholder  value  by  aligning  market  expectations  with  managers’  private  information.  Figure  1 
summarizes  our  framework,  dividing  the  effectiveness  of  management  guidance  in  reaching  this 
goal  into  four  specific  outcomes.  First,  managers  have  incentives  to  provide  attainable  guidance 
that  can  be  met  or  beaten.  Second,  market  participants  must  follow  managements’  guidance; 
otherwise,  guidance  is  unlikely  to  be  helpful  in  meeting  managerial  objectives.  Our  third  and 
fourth  outcomes  relate  to  two  specific  objectives  that  accurate  forecasts  and  aligning  expectations 
with  guidance  can  help  managers  to  achieve:  reducing  information  asymmetry  and  avoiding  dis¬ 
appointments  around  earnings  announcements.  Each  of  these  is  discussed  below. 

Attainable  Forecasts 

Issuing  attainable  guidance  is  important  to  managers  seeking  to  align  market  expectations 
with  their  private  information.  Managers  have  incentives  to  forecast  accurately  and  would  rather 
err  on  the  side  of  being  slightly  pessimistic  about  future  prospects.  The  benefits  to  forecasting 
accurately  include  signaling  managerial  quality  (Trueman  1986),  developing  managerial  reputa¬ 
tion  (Hutton  and  Stocken  2009),  and  increasing  the  credibility  of  future  guidance  (Williams  1996). 
Overly  optimistic  management  guidance  is  particularly  costly  for  three  reasons.  First,  managers 
can  be  sued  (under  SEC  Rule  lOb-5)  by  shareholders  who  allege  being  misled  into  buying  stock 
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FIGURE  1 

Framework  for  Analyzing  Forecast  Outcomes 
(outcome  proxies  in  italics) 


2 

at  inflated  prices  by  the  release  of  overly  optimistic  information.  Because  most  lOb-5  cases  are 
brought  after  disclosures  that  lead  to  stock  price  declines  (Francis  et  al.  1994;  Skinner  1994), 
Kasznik  (1999)  suggests  that  asymmetries  in  legal  liability  rules  make  it  more  costly  to  fall  short 
of  one’s  own  forecast  than  to  exceed  one’s  own  forecast.  Second,  overly  optimistic  guidance  has 
negative  stock  price  consequences.  For  example,  Soffer  et  al.  (2000)  examine  the  combined 
market  reaction  to  preannouncement  warnings  and  subsequent  earnings  announcements  and  find 
that  firms  that  made  insufficient  warnings  experience  more  negative  combined  market  reactions 
than  firms  that  preempt  all  bad  news  through  preannouncement  guidance.  Third,  recent  research 
(e.g.,  Matsumoto  2002;  Richardson  et  al.  2004;  Cotter  et  al.  2006)  has  emphasized  the  role  of 
earnings  guidance  in  moving  market  expectations  toward  targets  that  can  be  met  or  beaten,  rather 
than  toward  targets  that  are  exactly  met.  This  encourages  managers  to  issue  slightly  pessimistic 
guidance. 

In  sum,  overly  optimistic  guidance  is  costly  to  firms,  and  does  not  generate  benefits  attributed 
to  guidance  such  as  enhancing  managerial  reputation.  We  therefore  predict  that  failing  to  meet  or 
beat  one’s  own  forecasts  will  reduce  the  provision  of  guidance  in  the  future.  This  could  take  the 


^  While  the  Private  Securities  Litigation  Reform  Act  of  1995  introduced  safe  harbor  provisions  designed  to  protect  the 
disclosure  of  forward-looking  information,  safe  hai'bor  is  contingent  on  the  inclusion  of  appropriate  cautionary  state¬ 
ments  and  requires  that  the  issuer  has  no  actual  knowledge  of  falsity.  Firms  can  still  be  successfully  sued  if  the  plaintiff 
can  establish  that  a  manager  had  knowledge  that  a  forecast  was  false  or  misleading. 
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form  of  omitting  all  forecasts  for  next  year  or  forecasting  less  frequently  within  that  year.^  In 
addition,  firms  that  continue  to  forecast  can  adjust  the  specificity  of  forecasts.  By  issuing  less 
specific  forecasts,  managers  with  a  history  of  failing  to  meet  or  beat  their  own  forecasts  can 
decrease  the  probability  that  they  will  continue  to  fall  short  of  their  own  guidance.  These  predic¬ 
tions  are  tested  in  HI: 

HI:  The  extent  to  which  firms  meet  or  beat  management  earnings  guidance  is  negatively 
related  to  future  guidance  omissions,  positively  related  to  the  intra-year  frequency  of 
future  guidance,  and  positively  related  to  the  precision  of  future  guidance. 

While  we  expect  that  costs  of  inaccurate  disclosures  arise  primarily  from  earnings  that  fall  short  of 
management  guidance,  rather  than  earnings  that  exceed  management  guidance,  we  include  abso¬ 
lute  past  forecast  errors  to  control  for  management  forecast  accuracy  in  all  tests  of  HI. 

Aligning  Expectations  with  Management’s  Prediction 

The  second  guidance  outcome  that  we  consider  is  the  extent  to  which  post-guidance  market 
expectations  are  aligned  with  management’s  predictions.  This  is  a  key  element  in  the  expectations 
adjustment  hypothesis  (e.g.,  Ajinkya  and  Gift  1984;  King  et  al.  1990)  and  management’s  incen¬ 
tives  to  align  expectations  with  their  private  information  has  been  the  focus  of  a  number  of  prior 
studies  (e.g.,  Kasznik  and  Lev  1995;  Cotter  et  al.  2006).  Managers  are  unlikely  to  obtain  benefits 
from  guiding  when  market  participants  do  not  follow  the  guidance  provided.  However,  these 
managers  would  still  bear  the  costs  associated  with  providing  guidance.  Kothari  (2001)  notes  that 
analysts’  forecasts  are  frequently  used  to  proxy  for  market  expectations.  Therefore,  we  argue  that 
cases  in  which  analysts  do  not  closely  follow  prior  guidance  will  discourage  management  from 
future  forecasting.  This  is  the  basis  for  H2: 

H2:  The  extent  to  which  post-guidance  analyst  forecasts  deviate  from  management  earnings 
guidance  is  positively  related  to  future  guidance  omissions  and  negatively  related  to  the 
intra-year  frequency  of  future  guidance. 

The  relation  between  future  specificity  and  the  extent  to  which  post-guidance  analyst  forecasts 
deviate  from  management  guidance  is  less  clear.  When  a  wide  gap  remains  between  analyst 
expectations  and  guidance,  managers  could  be  unwilling  to  continue  incurring  the  costs  of  issuing 
precise  forecasts  that  do  not  generate  the  desired  analyst  reactions.  Alternatively,  such  firms  could 
be  more  likely  to  issue  specific  guidance  if  that  increases  the  effectiveness  in  convincing  analysts 
to  follow  guidance.  We  examine  the  relation  between  future  specificity  and  the  extent  to  which 
post-guidance  analyst  forecasts  deviate  from  management  earnings  guidance  but  do  not  predict  the 
sign  of  the  effect. 

Reducing  Information  Asymmetry 

Theoretical  work  on  the  effects  of  voluntary  disclosure  frequently  focuses  on  the  role  of 
information  asymmetry.  Public  disclosures  reduce  informational  advantages  from  privately  acquir¬ 
ing  information  (e.g..  Diamond  and  Verrecchia  1991).  The  reduction  in  the  value  and  quantity  of 
private  information  acquisition  reduces  the  risk  of  trading  against  someone  who  has  superior 
information.  As  a  result,  theory  predicts  that  bid-ask  spreads  become  narrower,  while  trading 
volume  and  liquidity  increase  (Verrecchia  [2001]  provides  a  literature  review).  Easley  and  O’Hara 
(2004)  argue  that  uninformed  investors  avoid  trading  stocks  with  high  information  asymmetry. 


^  Managers  could  also  respond  to  the  costs  of  failing  to  meet  or  beat  guidance  by  giving  very  pessimistic  guidance.  We 
argue  that  firms  are  unlikely  to  do  so  for  two  reasons.  First,  pessimistic  forecasting  could  depress  stock  prices,  leading 
to  an  increased  risk  of  takeover  or  management  turnover.  Second,  inaccurate  forecasting  would  reduce  the  credibility  of 
future  disclosures. 
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which  reduces  demand  for  these  stocks  and  increases  the  cost  of  capital  for  these  firms.  Easley  and 
O’Hara  (2004)  suggest  that  firms  can  influence  firm  value  and  cost  of  capital  by  altering  the 
precision  and  quantity  of  information  available  to  investors. 

Prior  research  suggests  that  one  benefit  to  management  guidance  is  the  reduction  of  informa¬ 
tion  asymmetry.  For  example,  guidance  has  been  shown  to  improve  the  accuracy  of  analyst 
forecasts  (Waymire  1986),  reduce  the  dispersion  in  analyst  forecast  (Baginski  et  al.  1993;  Clement 
et  al.  2003),  and  reduce  bid-ask  spreads  (Coller  and  Yohn  1997).  However,  earnings  guidance  can 
increase  the  dispersion  in  beliefs  when  investors  adopt  different  interpretations  of  news  or  when 
forecasts  increase  uncertainty  about  payoffs  (e.g.,  Harris  and  Raviv  1993;  Kandel  and  Pearson 
1995). 

The  dispersion  in  analyst  forecasts  is  a  frequently  used  proxy  for  information  asymmetry 
(e.g.,  Lang  and  Lundholm  1996;  Barron  and  Stuerke  1998;  Irani  and  Karamanou  2003).  High 
dispersion  denotes  a  low  consensus  among  analysts  about  a  firm’s  future  earnings,  a  sign  of  high 
information  asymmetry.  Gebhardt  et  al.  (2001)  and  Han  and  Manry  (2000)  find  that  firms  exhib¬ 
iting  high  forecast  dispersion  also  exhibit  high  cost  of  capital.  We  use  the  dispersion  in  analyst 
forecasts  as  a  proxy  for  information  asymmetry  because  cost  of  capital  is  difficult  to  directly 
observe  and  incorporating  bid-ask  spread  data  substantially  reduces  our  sample  size. 

Given  that  reducing  information  asymmetry  is  one  main  benefit  obtained  from  issuing  guid¬ 
ance,  we  argue  that  managers  will  be  less  willing  to  incur  the  costs  of  guidance  when  guidance  has 
been  unsuccessful  at  reducing  information  asymmetry.  This  is  the  basis  for  H3: 

H3:  The  increase  in  the  dispersion  of  analyst  forecasts  following  the  provision  of  manage¬ 
ment  earnings  guidance  is  positively  related  to  future  guidance  omissions  and  negatively 
related  to  the  intra-year  frequency  of  future  guidance. 

The  relation  between  future  specificity  and  changes  in  analyst  forecast  dispersion  is  less  clear. 
An  increase  in  dispersion  following  the  provision  of  guidance  could  discourage  managers  from 
issuing  precise  forecasts  in  the  future,  in  that  some  firms  do  not  want  to  bear  the  cost  of  precise 
disclosures  if  these  are  ineffective  in  reducing  information  asymmetry.  Alternatively,  such  firms 
could  be  more  likely  to  issue  specific  guidance  if  that  increases  the  effectiveness  in  reducing 
analyst  forecast  dispersion.  We  examine  the  relation  between  changes  in  analyst  forecast  disper¬ 
sion  and  future  guidance  specificity  but  do  not  predict  the  sign  of  the  effect. 

Avoiding  Earnings  Disappointments 

The  final  guidance  outcome  that  we  consider  is  the  extent  to  which  managers  avoid  disap¬ 
pointing  market  participants  at  earnings  announcements  following  the  provision  of  guidance.  Prior 
research  suggests  that  analyst  expectations  are  an  important  threshold  to  managers  (e.g..  Brown 
and  Caylor  2005)  and  that  earnings  disappointments  are  typically  accompanied  by  large  stock 
price  declines  (Skinner  and  Sloan  2002).  Using  earnings  guidance  to  preempt  bad  news  can  both 
reduce  the  probability  of  class-action  lawsuits  that  follow  earnings  disappointments  (Field  et  al. 
2005)  and  reduce  the  magnitude  of  settlement  amounts  (Skinner  1997).  Recent  research  suggests 
that  an  important  role  of  earnings  guidance  is  to  move  analyst  expectations  toward  targets  that  can 
be  met  or  beaten  in  order  to  avoid  disappointing  the  market  at  earnings  announcements  (e.g., 
Matsumoto  2002;  Richardson  et  al.  2004;  Cotter  et  al.  2006). 

Prior  research  also  suggests  that  guiding  and  subsequently  disappointing  the  market  is  more 
costly  than  simply  disappointing  the  market.  For  example,  Graham  et  al.  (2005)  report  that  inter¬ 
viewed  CFOs  consider  issuing  earnings  guidance  yet  still  falling  short  of  guided  analysts’  fore¬ 
casts  to  be  a  particularly  bad  outcome,  as  this  is  interpreted  as  evidence  that  the  firm  is  poorly 
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managed  and  cannot  accurately  predict  its  own  future.  This  is  consistent  with  Lennox  and  Park 
(2006),  who  find  that  the  market  penalty  for  missing  consensus  analyst  forecasts  is  greater  for 
firms  that  issued  guidance  than  for  firms  that  did  not  issue  guidance. 

Given  that  avoiding  disappointments  at  earnings  announcements  is  a  key  benefit  to  providing 
earnings  guidance,  we  predict  firms  that  issued  earnings  guidance  in  the  recent  past  but  yet  still 
disappointed  the  market  at  earnings  announcements  will  reduce  the  provision  of  guidance  in  the 
future.  Again,  this  could  take  the  form  of  omitting  all  forecasts  for  a  given  year  or  forecasting  less 
frequently  within  that  year.  In  testing  this  prediction,  we  measure  earnings  disappointments  using 
both  analyst  forecast  errors  (H4a)  and  price  reactions  around  earnings  announcements  (H4b): 

H4a:  The  extent  to  which  firms  meet  or  beat  guided  analyst  expectations  is  negatively  related 
to  future  guidance  omissions  and  positively  related  to  the  intra-year  frequency  of  future 
guidance. 

H4b:  The  market  reaction  around  recent  past  quarterly  earnings  announcements  that  fol¬ 
lowed  the  provision  of  guidance  is  negatively  related  to  future  guidance  omissions  and 
positively  related  to  the  intra-year  frequency  of  future  guidance. 

The  relation  between  future  specificity  and  the  ability  to  avoid  past  earnings  disappointments 
following  the  provision  of  guidance  is  less  clear.  Earnings  disappointments  following  the  provi¬ 
sion  of  guidance  could  discourage  managers  from  issuing  precise  forecasts  in  the  future,  in  that 
some  firms  could  be  unwilling  to  bear  the  cost  of  precise  disclosures  if  they  ultimately  result  in 
earnings  disappointments.  Alternatively,  such  firms  could  be  more  likely  to  issue  specific  guidance 
if  that  increases  the  effectiveness  in  guiding  expectations  toward  numbers  that  could  be  met  or 
beaten.  We  examine  the  relation  between  earnings  disappointments  and  guidance  specificity  but  do 
not  predict  the  sign  of  the  effect. 

III.  SAMPLE  SELECTION  CRITERIA 

We  obtain  earnings  guidance,  actual  earnings,  and  analyst  forecast  information  from  the  First 
Call  Company  Issued  Guidelines  (CIG)  file  as  of  March  2006.  The  advantage  of  using  the  CIG  file 
is  that  it  provides  a  sample  size  much  larger  than  would  be  practical  by  hand-collecting  data.  One 
disadvantage  is  that  there  may  be  a  bias  in  First  Call’s  collection  or  reporting  of  guidance.^  An 
additional  disadvantage  is  that  the  CIG  file  emphasizes  recent  guidance,  with  very  few  observa¬ 
tions  before  1994.  We  obtain  abnormal  return  data  from  CRSP  and  obtain  SIC  code,  market  value, 
and  other  accounting  information  from  Compustat. 

Table  1  summarizes  the  effects  of  imposing  sample  selection  criteria.  The  March  2006  CIG 
file  includes  43,885  quarterly  management  earnings  forecasts.  We  restrict  our  attention  to  forecasts 
of  earnings  per  share  (EPS)  denoted  in  U.S.  dollars.  Duplicate  forecasts  and  observations  in  which 
management  earnings  guidance  and  subsequently  realized  earnings  are  not  reported  on  a  compa¬ 
rable  basis  due  to  a  merger  or  accounting  change  are  also  deleted.  Anilowski  et  al.  (2007)  suggest 
that  sample  sizes  in  the  CIG  file  reflect  progressively  wider  coverage  of  firms  between  1994  and 
1998,  while  they  find  that  post-1998  sample  sizes  are  more  consistent.  As  a  result,  we  delete 
forecasts  for  periods  before  1998.^  Our  research  design  requires  a  full  year  of  data,  so  we  exclude 


This  prediction  is  consistent  with  Houston  et  al.  (2010),  who  find  that  poor  performance  (proxied  by  past  experience  in 
meeting  or  beating  analyst  expectations)  helps  explain  why  firms  regularly  forecasting  in  2003  did  not  make  a  forecast 
in  2004.  Our  study  differs  from  Houston  et  al.  (2010)  in  that  we  examine  the  effects  of  a  wide-range  of  outcomes  related 
to  previous  guidance  and  focus  on  forecasting  decisions  made  over  a  longer,  more  general  sample  period. 

^  A  conversation  with  a  representative  from  First  Call  confirmed  that  they  do  not  intentionally  follow  only  a  subset  of 
firms.  However,  Miller  (2002)  finds  that  the  CIG  file  emphasizes  quantitative  forecasts,  reporting  a  lower  percentage  of 
qualitative  forecasts  than  is  typically  obtained  through  hand  collection  of  disclosures. 

®  Results  using  samples  from  1994  to  2005  are  qualitatively  similar  and  are  not  reported. 
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TABLE  1 
Sample  Selection 


Initial  sample  from  First  Call  CIG  database  (as  of  March  2006) 

Currency  is  not  in  USD,  duplicate  observations,  guidance  unrelated  (1,926) 

to  EPS,  not  comparable  due  to  mergers  or  accounting  changes,  or 
without  CUSIP 

Forecasts  issued  before  1998  or  after  2005  (4,758) 

Forecasts  made  before  or  after  the  fiscal  quarter  (13,233) 

Missing  data  to  determine  accuracy  (2,892) 

Multiple  guidance  per  quarter  (keeping  the  most  recent  forecast)  (2,888) 

Firm  not  covered  by  First  Call  in  the  next  year  (3,030) 


Forecasts  used  in  Table  2 


Number  of  firm-year  corresponding  to  these  quarterly  forecast 

Firm-years  missing  data  for  variables  in  model  (1)  (3,487) 

Firm-years  for  Table  4 

Firm-years  without  guidance  in  the  next  year  (1,098) 

Firm-years  for  Table  5 

Firm-years  missing  for  variables  in  model  (3)  (43) 

Firm-years  for  Table  6 


43,885 


15,158 

7,685 

4,198 

3,100 

3,057 


forecasts  related  to  periods  after  2005.  We  also  delete  forecasts  that  are  issued  before  the  start  of 
the  fiscal  period  or  after  the  end  of  the  fiscal  period  to  which  they  relate.^  In  the  case  of  most  range 
forecasts  and  some  open-ended  and  qualitative  forecasts,  First  Call  reports  a  conversion  to  a  point 
estimate,  which  is  then  used  to  assess  ex  post  accuracy.^  We  exclude  observations  missing  this  data 
or  missing  the  actual  earnings  number  needed  to  calculate  forecast  accuracy.  For  firms  that  issue 
multiple  forecasts  for  a  given  quarter,  we  keep  only  the  last  forecast  made.  To  ensure  that  guidance 
omissions  are  not  due  to  First  Call  dropping  coverage  of  a  firm,  we  require  that  each  firm  is  still 
covered  by  First  Call  (in  the  company  issued  guidelines,  summary,  detailed  analysts’  forecasts,  or 
actual  earnings  files)  in  the  following  year.  These  criteria  yield  an  initial  sample  of  15,158  quar¬ 
terly  forecasts. 

Table  2  presents  information  on  forecast  horizon  and  forecast  form  for  this  initial  sample  of 
15,158  quarterly  forecasts.  We  code  management  earnings  guidance  as  point  estimates  (e.g.,  EPS 


^  Guidance  given  before  the  start  of  the  fiscal  period  is  made  with  significantly  more  uncertainty,  as  it  relates  to  a  longer 
horizon  and  economic  activity  for  the  prior  quarter  has  not  yet  been  completed.  Guidance  given  after  the  fiscal  period 
end  is  made  with  significantly  less  uncertainty,  as  economic  activity  for  the  period  has  already  been  completed.  We 
expect  that  our  proxies  for  adverse  outcomes  would  have  different  effects  on  management  behavior  when  they  relate  to 
very  long  (very  short)  horizon  forecasts  that  were  ex  ante  expected  to  be  less  (more)  accurate. 

*  For  range  forecasts,  First  Call’s  conversion  is  typically  the  mid-point  of  the  range.  For  maximum  forecasts.  First  Call’s 
conversion  is  typically  one  cent  less  than  the  maximum  given.  For  minimum  forecast.  First  Call’s  conversion  is  typically 
equal  to  or  one  cent  more  than  the  minimum  given.  The  robustness  of  our  results  to  alternate  measures  of  accuracy  is 
discussed  in  Section  V. 
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TABLE  2 

Forecast  Form  and  Horizon 


Frequency 

Percent 

Row  Pet 

60-90  Days 

Before  Fiscal 

30-60  Days 

Before  Fiscal 

30  Days 

Before  Fiscal 

Col  Pet 

Period  End 

Period  End 

Period  End 

Total 

Point 

1,185 

775 

1,202 

3,162 

7.82% 

5.11% 

7.93% 

20.86% 

37.48% 

24.51% 

38.01% 

18.25% 

22.76% 

22.86% 

Range 

4,976 

2,409 

3,461 

10,846 

32.83% 

15.89% 

22.83% 

71.55% 

45.88% 

22.21% 

31.91% 

76.62% 

70.75% 

65.81% 

Open-ended 

324 

211 

558 

1,093 

2.14% 

1.39% 

3.68% 

7.21% 

29.64% 

19.30% 

51.05% 

4.99% 

6.20% 

10.61% 

Qualitative 

9 

10 

38 

57 

0.06% 

0.07% 

0.25% 

0.38% 

15.79% 

17.54% 

66.67% 

0.14% 

0.29% 

0.72% 

Total 

6,494 

3,405 

5,259 

15,158 

42.84% 

22.46% 

34.69% 

100% 

The  sample  consists  of  15,158  quarterly  management  forecasts  from  1998  to  2004  in  which  First  Call  covered  the  firm  in 
the  subsequent  year. 


is  forecasted  to  be  $1.25),  range  estimates  (e.g.,  EPS  is  forecasted  to  be  between  $1.20  and  $1.30), 
open-ended  estimates  (including  maximum  estimates  such  as  “EPS  will  be  no  more  than  $1.30” 
and  minimum  estimates  such  as  “EPS  will  be  at  least  $1.20”),  or  qualitative  guidance.  For  this 
sample,  range  estimates  are  the  most  common  form  of  guidance  (71.55  percent),  followed  by  point 
estimates  (20.86  percent),  open-ended  estimates  (7.^1  percent),  and  qualitative  guidance  (0.38 
percent).  To  provide  initial  evidence  on  the  effect  of  failing  to  meet  or  beat  management  guidance, 
we  compare  the  frequency  of  optimism  in  forecasting  for  a  given  year  conditioned  on  whether  a 
firm  continues  to  forecast.  A  forecast  is  coded  as  “optimistic”  {ex  post  actual  earnings  less  than 
management’s  forecast)  or  “neutral  or  pessimistic”  {ex  post  actual  earnings  greater  than  or  equal  to 
management’s  forecast). 

Figure  2  compares  the  sample  firms  that  issue  a  quarterly  forecast  for  the  following  year  to 
those  that  do  not.  The  proportion  of  optimistic  forecasts  is  lower  in  every  year  of  our  sample 
period  for  firms  forecasting  for  the  following  year  than  for  firms  that  did  not  forecast  for  the 
following  year.  For  example,  for  firms  making  a  quarterly  forecast  for  both  2001  and  2002,  the 
proportion  of  optimistic  forecasts  for  2001  is  24.4  percent.  In  contrast,  for  firms  making  a  quar¬ 
terly  forecast  for  2001  but  not  for  2002,  the  proportion  of  optimistic  forecasts  for  2001  is  35.0 
percent.  Chi-squared  tests  (not  reported)  indicate  that  differences  in  the  proportion  of  optimistic 
forecasts  between  these  two  groups  are  significant  at  the  0.05  level  for  each  year  from  1998  to 
2004.  For  all  sample  forecasts,  only  19.7  percent  of  the  forecasts  made  by  firms  also  forecasting 
for  the  next  year  were  optimistic,  while  34.6  percent  of  the  forecasts  made  by  other  firms  were 
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FIGURE  2 

Percentage  of  Optimistic  Forecasts  for  Firms  Issuing  Forecasts  for  the  Next  Year  (A)  and 
Firms  Not  Issuing  Forecasts  for  the  Next  Year  (□)  from  1998  to  2004 


optimistic.  These  results  are  consistent  with  HI,  which  predicts  that  firms  issuing  optimistic 
forecasts  are  more  likely  to  stop  issuing  forecasts  in  the  future  than  firms  issuing  neutral  or 
pessimistic  forecasts. 

We  test  HI  though  H4  on  an  annual  basis,  comparing  outcomes  in  year  t-\  to  forecasting 
decisions  for  year  t.  It  is  unclear  whether  firms  make  forecasting  decisions  primarily  on  an 
individual  quarter-by-quarter  basis,  or  whether  they  adopt  longer-term  guidance  policies.  Our 
hypotheses  should  apply  equally  to  alternate  windows  and  specification  checks  presented  in  Sec¬ 
tion  V  demonstrate  that  results  are  qualitatively  similar  for  both  quarterly  guidance  behavior  and 
guidance  aggregated  over  two-year  windows.  Aggregating  quarterly  guidance  into  year-by-year 
decisions  yields  7,685  firm-years.  Our  7,685  firm-years  include  multiple  observations  for  some 
firms,  so  we  follow  Petersen  (2009)  and  report  all  regressions  results  using  robust  z-statistics  that 
control  for  firm  clustering  effects.  All  reported  p-values  are  two-tailed. 

IV.  RESEARCH  DESIGN  AND  MAIN  RESULTS 
Determinants  of  Guidance  Omissions 

In  this  section  we  empirically  examine  the  relation  between  prior  guidance  outcomes  and 
future  guidance  strategy.  We  restrict  our  attention  to  firms  that  have  guided  in  the  prior  year, 
because  the  focus  of  our  research  question  (the  decision  to  continue  forecasting  based  on  outcomes 
from  prior  year’s  guidance)  is  relevant  only  for  these  firms.  Given  our  population  of  interest  is  the 
subset  of  firms  that  previously  guided,  we  do  not  use  econometric  techniques  to  control  for  the 
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earlier  decision  regarding  whether  to  guide  in  the  prior  year  (e.g.,  the  Heckman  two-step 
procedure).^  To  test  the  relation  between  adverse  outcomes  and  future  guidance  omissions,  we 
estimate  the  following  logistic  regression: 


(1) 


where: 


=  aQ  +  -l-  /32ABSERROR_MGi,_i  +  l3^DIFF_AFMGi,_i 

+  fS^CHGDISPi,^,  +  /35Mo5_AF,,_i  P^CAR_FAi,_,  +  p,STDRFTi,_,  +  I3^D1SP,,_, 

+  P^RFTi,_,  +  P,^OSSu-i  +  /3n/iVCFFA5F,,_i  +  PnNUMANi,_,  + 

+  l3x^SIZFif_i  +  -I-  Pi^BFTAij_i . 

The  dependent  variable  OMITu  is  an  indicator  variable  equal  to  1  if  firm  i  does  not  issue 
guidance  for  any  quarter  in  year  t,  and  0  otherwise. In  examining  the  relation  between  past 
outcomes  and  future  forecasting,  we  aggregate  outcomes  across  quarters  of  the  prior  fiscal  year 
into  one  annual  measure.  To  test  HI,  we  construct  an  indicator  variable  MoB_MGit_\  q  which  takes 
a  value  of  1  if  subsequently  realized  earnings  for  quarter  q  in  year  t-l  is  greater  than  or  equal  to 
the  earnings  guidance  for  that  quarter  (i.e.,  if  realized  earnings  meet  or  beat  management’s  fore¬ 
cast),  and  0  otherwise.  Hypothesis  1  is  tested  using  MoB_MGii_i,  which  is  an  average  of 
MoB_MGii_i  q  over  each  quarter  for  which  management  issued  earnings  guidance  in  the  previous 
year.  To  capture  the  effect  of  absolute  forecast  accuracy,  rather  than  a  propensity  to  meet  or  beat 
management  forecasts,  we  calculate  ABSFRROR_MGii_i  q  as  the  absolute  value  of  the  difference 
between  management  guidance  and  subsequently  realized  earnings  for  firm  i  in  quarter  q  of  year 
t-l,  scaled  by  stock  price  at  the  beginning  of  the  quarter.  Our  empirical  tests  use 
ABSFRROR_MGit^i,  an  average  of  ABSFRROR_MGii_i  q  over  each  quarter  for  which  manage¬ 
ment  issued  guidance  in  the  previous  year.'^ 

Hypothesis  2  predicts  that  firms  whose  guidance  was  not  closely  followed  by  analysts  are  less 
likely  to  continue  forecasting  than  firms  whose  guidance  was  closely  followed  by  analysts.  To 
measure  the  extent  to  which  analysts  fail  to  follow  management  guidance,  we  define 
DIFF_AFMGit_i  q  as  the  absolute  value  of  the  difference  between  management  guidance  for 
quarter  q  in  year  t-l  and  the  analyst  consensus  forecast  for  that  quarter  as  of  ten  days  after  the 
release  of  the  management  guidance,  scaled  by  share  price  of  firm  i  at  the  end  of  quarter  q-l. 


®  The  use  of  single-stage  regressions  on  the  subsample  of  interest  is  consistent  with  Wooldridge’s  (2001,  551)  discussion 
of  self-selection  techniques.  To  test  the  sensitivity  of  our  results  to  this  approach,  we  re-estimate  all  analyses  in  this 
section  using  a  Heckman  two-step  procedure.  In  re-estimating  model  (1),  we  included  an  initial  stage  regarding  whether 
a  firm  issued  any  guidance  in  the  prior  year.  In  re-estimating  models  (2)  and  (3),  we  included  an  initial  stage  that 
examines  whether  a  firm  provides  guidance  for  both  the  current  year  and  the  prior  year.  All  inferences  are  unaffected  by 
using  this  alternate  estimation  method. 

In  cases  where  a  firm  makes  a  forecast  for  both  quarter  4  of  year  t-\  and  quarter  1  of  year  f,  we  exclude  the  quarter  1 
forecast  from  our  proxy  for  forecasting  behavior  in  year  t  if  the  quarter  1  forecast  was  made  prior  to  the  earnings 
announcement  related  to  the  prior  quarter’s  forecast.  This  insures  that  all  forecasts  included  in  our  proxy  for  guidance 
omissions  were  made  after  the  outcomes  from  the  prior  quarters  are  known.  This  affects  only  67  forecasts  in  our  sample 
and  reported  results  are  qualitatively  unaffected  by  this  exclusion. 

We  also  measure  management  forecast  accuracy  using  the  difference  between  absolute  management  forecast  error  and 
absolute  forecast  error  based  on  analyst  forecasts  prior  to  management  forecasts.  Results  using  this  relative  measure  of 
management  forecast  accuracy  are  qualitatively  similar  to  those  based  on  ABSERROR_MG  and  are  not  reported 
separately. 
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Hypothesis  2  is  tested  using  DIFF_AFMGii_i,  an  average  of  DIFF_AFMGi,_i  „  over  each  quarter 
for  which  management  issued  guidance  in  the  previous  year. 

Hypothesis  3  predicts  that  firms  with  increases  in  analyst  dispersion  following  the  release  of 
past  guidance  are  less  likely  to  guide  in  the  next  year.  To  measure  changes  in  dispersion,  we  define 
CHGDISPii_i  q  as  the  standard  deviation  of  analysts’  forecasts  as  of  ten  days  after  the  management 
guidance  for  quarter  q  in  year  t-\  minus  the  standard  deviation  of  analysts’  forecasts  for  that 
quarter  as  of  one  day  before  the  release  of  the  management  guidance,  scaled  by  share  price  of  firm 
i  at  the  end  of  quarter  q—\.  Hypothesis  3  is  tested  using  CHGDISPjj^i,  an  average  of 
CHGDISPit_i  q  over  each  quarter  for  which  management  issued  guidance  in  the  previous  year. 

Hypothesis  4a  predicts  that  firms  that  issued  guidance  and  subsequently  met  or  beat  analysts’ 
expectations  are  less  likely  to  omit  future  guidance  than  firms  that  guided  earnings  and  fell  short 
of  expectations.  This  prediction  is  tested  using  an  indicator  variable  MoB_AFif_x  q  which  takes  a 
value  of  1  if  subsequently  realized  earnings  for  quarter  q  in  year  t- 1  is  greater  than  or  equal  to  the 
analyst  consensus  forecast,  and  0  otherwise.  Analyst  consensus  forecasts  are  measured  as  of  12 
days  before  the  end  of  fiscal  quarter.  Hypothesis  4a  is  tested  using  an  average  of 

MoB_AFn_i  q  over  each  quarter  for  which  management  issued  guidance  in  the  previous  year. 

Hypothesis  4b  predicts  that  firms  that  have  experienced  positive  market  reactions  to  earnings 
announcements  that  followed  the  issuance  of  earnings  guidance  are  less  likely  to  stop  forecasting 
in  the  later  period.  To  test  this  hypothesis,  we  measure  unexpected  returns  over  short  windows 
around  quarterly  earnings  announcements.  We  calculate  abnormal  returns  CAi?_£'A„_i  ^  from  12 
days  before  the  end  of  the  fiscal  quarter  q  in  year  t-\  to  one  day  after  the  earnings  announcement. 
Abnormal  returns  are  calculated  relative  to  CRSP  value-weighted  returns.  Tests  of  H4b  are  based 
on  CAR_EAii_i,  an  average  of  CAR_EAit_i  q  over  each  quarter  for  which  management  issued 
earnings  guidance  in  the  previous  year. 

Model  (1)  includes  two  controls  for  prior  period  uncertainty:  STDRETii_i  (calculated  as  the 
standard  deviation  of  raw  daily  stock  returns  for  firm  i  over  year  t)  and  (calculated  as  the 

average  of  analyst  forecast  dispersion  at  the  start  of  the  quarter  for  all  quarters  of  firm  i  in  year 
t-1).  Miller  (2002)  finds  that  firms  with  sustained  earnings  increases  contemporaneously  increase 
the  level  of  voluntary  disclosure,  while  Chen  et  al.  (2007)  find  that  poorly  performing  firms  are 
more  likely  to  renounce  the  provision  of  earnings  guidance.  To  control  for  prior  year  performance, 
we  include  three  variables  taken  from  Chen  et  al.  (2007).  is  calculated  as  abnormal  returns 

(relative  to  CRSP  value-weighted  returns)  for  firm  i  cumulated  over  year  r-1.  LOSS„_i  is  calcu¬ 
lated  as  the  percentage  of  quarters  for  firm  i  with  losses  in  year  r-1.  INCREASE^,^^  is  calculated 
as  the  percentage  of  quarters  for  firm  i  with  earnings  increases  relative  to  a  seasonal  random  walk 
model  for  year  t-\.  LOSS  and  INCREASE  are  based  on  Compustat  quarterly  income  before 
extraordinary  items.  Based  on  the  results  of  Chen  et  al.  (2007),  LOSS;,_i  is  expected  to  be  posi¬ 
tively  related  to  the  decision  to  stop  forecasting,  while  and  INCREASEi,_i  are  expected  to 

be  negatively  related. 


As  a  specification  check  we  re-estimate  model  (1)  replacing  DIFF_AFMG  with  RESPONSE,  calculated  as  the  change  in 
analysts’  forecasts  around  management  guidance,  scaled  by  the  initial  difference  between  management  guidance  and 
pre-guidance  analysts’  forecasts.  To  minimize  the  impact  of  small  denominators,  we  exclude  forecasts  where  the 
absolute  value  of  the  initial  difference  between  management  guidance  and  pre-guidance  analysts’  forecasts  is  less  than 
or  equal  to  $0.03.  When  replacing  DIEF_AFMBi,_i  with  RESPONSEi,_^  in  model  (1),  the  coefficient  on  RESPONSE  is 
significantly  negative  in  the  restricted  model  (/8= -0.3350,  p  =  0.02)  and  marginally  significant  in  the  full  model  {/3 
=  -0.2314,  p  =  0.10).  Results  for  RESPONSE  are  insignificant  when  observations  with  all  denominators  (other  than 
zero)  are  included. 

We  measure  earnings  surprises  as  of  12  days  before  the  fiscal  quarter  end  to  be  consistent  with  Skinner  and  Sloan 
(2002),  who  find  that  much  of  the  market  reaction  to  earnings  news  occurs  prior  to  a  three-day  window  centered  on 
earnings  announcements.  Our  return  window  for  price  reactions  to  earnings  announcements  also  corresponds  to  Skinner 
and  Sloan  (2002). 
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Model  (1)  also  includes  a  number  of  additional  control  variables  that  prior  research  suggests 
are  associated  with  forecasting  decisions.  To  control  for  litigation  environment,  we  follow  Francis 
et  al.  (1994)  by  using  a  qualitative  variable  LlTij_i  to  indicate  membership  for  firm  i  in  year 
t-l  in  biotechnology  (SIC  codes  2833-2836,  SIC  codes  8731-8734),  computer  (SIC  codes  3570- 
3577,  SIC  codes  7370-7374),  electronics  (SIC  codes  3600-3674),  and  retail  (SIC  codes  5200- 
5961)  industries.  We  define  NUMANn-i  as  the  average  number  of  analysts  following  firm  i  as  of 
the  beginning  of  the  quarter  for  all  quarters  in  year  t-l.  A  number  of  prior  papers  (e.g.,  Lang  and 
Lundholm  1993)  document  a  positive  association  between  analyst  following  and  disclosure  qual¬ 
ity.  SIZEi,_i  is  measured  as  the  natural  logarithm  of  total  assets  for  firm  i  at  the  end  of  year  t-l. 
Based  on  prior  research  (e.g.,  Kasznik  and  Lev  1995),  firm  size  is  expected  to  be  positively  related 
to  disclosure  frequency.  BMu_i  is  the  ratio  of  book  value  to  market  value  of  equity  for  firm  i  at  the 
end  year  t-l.  Market  to  book  (the  inverse  of  our  measure)  is  used  by  Bamber  and  Cheon  (1998) 
to  proxy  for  proprietary  costs.  BETAi^^i  is  calculated  as  the  coefficient  from  estimating  Sharpe’s 
(1964)  market  model  over  year  t-l.  Ajinkya  et  al.  (2005)  report  a  positive  relation  between 
systematic  risk  and  disclosure  frequency.  The  data  requirements  on  control  variables  yield  a 
sample  size  of  4,198  firm-year  observations,  of  which  3,100  issue  at  least  one  forecast  in  the 
subsequent  year  and  1,098  do  not. 

Table  3  reports  Spearman  and  Pearson  correlation  tables  for  OMITit  and  our  treatment  vari¬ 
ables.  These  correlations  are  consistent  with  our  HI  through  H4b.  In  particular,  future  guidance 
omissions  (OM/r,j)  are  negatively  correlated  with  meeting  or  beating  one’s  forecast 
{MoB_MGii_i),  positively  correlated  with  the  extent  to  which  post-guidance  analyst  forecasts 
differ  from  management  guidance  {DlEF_AFMGif_i),  positively  correlated  with  increases  in  ana¬ 
lyst  forecast  dispersion  following  guidance  (CHGDISPit^i),  and  negatively  correlated  with  our 
proxies  for  avoiding  earnings  disappointments  {MoB_AFij_i  and  CAR_EAn_{).  Each  of  these  re¬ 
lations  is  statistically  significant  using  both  Pearson  and  Spearman  correlations  (one  exception  is 
the  Spearman  correlation  between  OMITj,  and  CFIGDISPit_i).  Table  3  also  suggests  that  our 
treatment  variables  are  significantly  correlated  with  each  other.  Therefore,  regression  results  are 
presented  for  both  a  full  model  and  by  including  the  variables  used  to  test  HI  through  H4b 
individually.  We  also  report  specification  checks  using  principal  component  analysis. 

Table  4  reports  results  for  model  (1).  Marginal  effects  for  each  independent  variable  in  the  full 
model  are  calculated  as  the  change  in  the  probability  of  omitting  guidance  for  a  given  year  that 
results  from  moving  from  the  first  to  third  quartile  of  each  independent  variable’s  distribution. 
Hypothesis  1  predicts  a  negative  relation  between  the  decision  to  stop  forecasting  and  the  ability 
to  meet  or  beat  management’s  past  earnings  guidance.  Consistent  with  HI,  we  find  a  significant 
negative  relation  between  the  probability  of  omitting  forecasts  and  MoB_MG  in  both  the  reduced 
model  (/3i  =  -0.4860,  p  <  0.01)  and  the  full  model  (ySj  = -0.4252,  p  <  0.01).  In  contrast,  AB- 
SERROR_MG,  which  proxies  for  the  absolute  forecast  error  of  past  management  guidance  is 
insignificant  in  both  the  reduced  {^2  =  9.2583,  p  =  0.11)  and  full  model  (/32  =  0.6403,  p  >  0.50). 
Results  therefore  suggest  that  future  forecasting  decisions  are  affected  by  the  ability  to  meet  or 
beat  management  guidance,  rather  than  the  unsigned  accuracy  of  previous  guidance. 

Hypothesis  2  predicts  a  positive  relation  between  forecast  omissions  and  the  extent  to  which 
post-guidance  analyst  forecasts  deviate  from  prior  guidance.  Consistent  with  H2,  we  find  a  sig¬ 
nificant  positive  relation  between  the  probability  of  omitting  forecasts  and  DIFF_AFMG  in  both 
the  reduced  (ygj  =  43.8820,  p  <  0.01)  and  full  model  (/Jj  =  33.1874,  p  =  0.01). 

Hypothesis  3  predicts  that  the  probability  of  omitting  forecasts  is  increasing  in  the  change  in 
dispersion  of  analyst  forecasts  following  the  release  of  previous  earnings  guidance.  Consistent 
with  this  predication,  we  find  a  positive  relation  between  the  probability  of  omitting  forecasts  and 
CHGDISP  that  is  significant  in  the  reduced  model  (;S4  =  54.4515,  p  =  0.01)  and  marginally 
significant  in  the  full  model  (^84  =  42.2345,  p  =  0.06). 
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Hypothesis  4a  predicts  a  negative  relation  between  the  decision  to  stop  forecasting  and  past 
ability  to  meet  or  beat  guided  analyst  earnings  forecasts.  Consistent  with  H4a,  we  find  a  significant 
negative  relation  between  the  probability  of  omitting  forecasts  and  MoB_AF  in  the  reduced  model 
(ySs  =  -0.2782,  p  =  0.02).  However,  the  relation  between  the  probability  of  omitting  forecasts  and 
MoB_AF  is  insignificant  (^5  =  0.0202,  p  >  0.50)  in  the  full  model.  H4b  predicts  a  negative 
relation  between  the  probability  of  omitting  forecasts  and  prior  price  responses  to  earnings  news. 
As  predicted,  we  find  a  negative  and  statistically  significant  relation  between  the  probability  of 
omitting  forecasts  and  CAR_EA  in  both  the  reduced  and  full  models  = -0.9103,  p  <  0.01; 
/36  = -0.8559,  p  <  0.01,  respectively).^'^ 

Among  our  control  variables,  STDRET,  RET,  LOSS,  NUMAN,  LIT,  SIZE,  and  BETA  are 
statistically  significant  at  the  0.05  level.  Large  firms  and  firms  that  experienced  a  high  standard 
deviation  of  returns  or  a  high  proportion  of  quarterly  losses  over  the  past  year  are  more  likely  to 
omit  forecasts.  Firms  that  experienced  positive  returns  over  the  past  year,  firms  with  high  analyst 
following,  firms  operating  in  industries  particularly  at  risk  for  security  litigation,  and  firms  with 
high  systematic  risk  are  less  likely  to  omit  forecasts. 

Overall,  results  from  model  (1)  are  consistent  with  our  hypotheses  that  omissions  in  the 
provision  of  guidance  are  influenced  by  the  outcomes  from  previously  issued  guidance.  Among  the 
treatment  variables,  only  MoB_AE  is  not  consistently  related  to  future  guidance  omissions.  Mar¬ 
ginal  effects  for  our  treatment  variables  suggest  that  variation  in  MoB_MG  has  the  largest  overall 
effect.  In  the  full  model,  moving  from  the  first  to  the  third  quartile  of  MoB_MG  reduces  the 
probability  of  omitting  forecasts  by  approximately  4  percent.  The  marginal  effect  when  moving 
from  MoBjMG  of  0  to  MoB_MG  of  1  (not  reported  in  the  tables)  is  8  percent. These  effects 
appear  economically  significant,  particularly  when  compared  to  30  percent,  which  is  the  baseline 
probability  that  a  firm  forecasting  in  year  t-\  omits  forecasts  in  year  t. 

Determinants  of  Intra-Year  Forecasting  Frequency 

The  previous  section  examines  the  extent  to  which  omissions  in  the  provision  of  earnings 
guidance  are  influenced  by  the  outcomes  from  previously  issued  earnings  guidance.  We  next 
examine  the  extent  to  which  firms  issuing  forecasts  for  a  given  year  condition  forecast  frequency 
on  outcomes  from  previously  issued  guidance.  Restricting  our  sample  to  observations  in  which  a 


As  Table  3  shows,  many  of  the  treatment  variables  are  correlated,  which  raises  concerns  about  the  relevance  of 
individual  outcomes.  We  examine  the  robustness  of  results  for  model  (1)  by  performing  principal  component  analysis. 
By  examining  the  scree  plot  and  eigenvalues,  we  identify  two  factors.  The  first  factor  is  primarily  correlated  with 
MoB_MG,  MoB_AF,  and  CAR_EA,  capturing  meeting  or  beating  key  earnings  benchmarks.  The  second  factor  is 
primarily  correlated  with  ABSERROR_MG  and  DIFF_AEMG,  capturing  absolute  management  forecast  errors  and  a  lack 
of  analyst  response  to  guidance.  We  re-estimate  model  (1),  replacing  MoB_MG,  ABSERROR_MG,  MoB_AF,  CAR_EA, 
and  DIFF_AFMG  with  these  two  factors.  We  find  a  significantly  negative  coefficient  on  the  factor  related  to  falling  short 
of  earnings  benchmarks  (yS)  =  -0.2165,  p  <  0.01)  and  a  significantly  positive  coefficient  on  the  factor  related  to  absolute 
management  forecast  errors  and  the  extent  to  which  analysts  deviate  management  guidance  {^2  =  0.1715,  p  <  0.01).  The 
coefficient  on  CHGDISP  remains  significantly  positive  (/Sj  =  43.4858,  p  =  0.03).  Overall,  results  utilizing  principal 
component  analysis  are  consistent  with  the  results  of  model  (1). 

Prior  research  (e.g.,  Kasznik  and  Lev  1995)  argues  that  managers  give  guidance  to  minimize  the  costs  of  large  gaps 
between  expected  and  announced  earnings.  As  a  specification  check,  we  re-estimate  model  (1)  using  two  proxies  for 
such  a  gap.  We  calculate  EXPGAPj,^  as  the  absolute  value  of  the  difference  for  firm  i  in  year  t  between  realized  quarterly 
EPS  for  quarter  q  and  the  consensus  analyst  forecasts  for  quarter  q  as  of  the  beginning  of  the  quarter,  scaled  by 
beginning  of  quarter  stock  price.  Our  empirical  tests  use  an  average  of  EXPGAP^^  calculated  over  each  quarter  for  year 
t.  We  calculate  EXPDUMi,  as  an  indicator  variable  equal  to  1  for  firm  i  in  year  t  if  the  consensus  analyst  forecast  error 
as  of  the  beginning  of  any  quarter  in  year  t  is  greater  than  1  percent  of  firm  value.  Results  including  either  proxy  are 
qualitatively  similar  to  those  of  Table  4. 

We  re-estimate  model  (1)  after  excluding  observations  where  data  from  Execucomp  indicates  that  CEO  turnover  occurs 
between  year  t- 1  and  year  t.  These  results  are  qualitatively  similar  to  results  in  Table  4. 
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firm  makes  at  least  one  quarterly  forecast  for  two  successive  years  reduces  our  sample  size  to 
3,100  observations.*^  We  define  frequency  as  the  number  of  quarters  within  that  year  for  which  the 
firm  makes  a  forecast  and  estimate  the  following  ordered  probit  model; 

F;  *  =  ao  +  (3iMoB_MGij_i  +  l32ABSERROR_MGit_i  +  ^2DIFF_AFMGii_i  +  /B^CHGDISPit-i 

+  l3^MoB_AF,,_^  +  l5(,CAR_FAi,_^  +  I32STDRFT,,_^  +  ySgD/^P,,-!  + 

+  PioLOSSi,^,  +  l3nINCRFASF,,_,  +  ft.^NUMAN^.i  +  +  /3uSIZFi,_, 

+  /3ijBMfi_i  +  /3i(,BFTAif_i  +  s^,  (2) 

under  the  assumption  that  for  an  underlying  construct  k: 

NUMQTRSi,=  1  if  F,  *  <  k^, 

yvf/Merp5,,=2ifki<F,,*<k2, 

NUMQTRSu  =3  if  k2<  F,/  <  k-^. 


NUMQTRSit  =  4  if  kg  <  F,  * , 

where  NUMQTRSn  takes  a  value  of  1  to  4  if  firm  i  forecasts  for  one  to  four  quarters  in  year  t, 
respectively.  All  other  variables  are  as  defined  in  model  (1). 

Table  5  presents  results  from  estimating  the  ordered  probit  model  (2).  As  with  model  (1),  we 
report  results  for  both  the  full  model  and  restricted  models  that  separately  include  our  treatment 
variables.  Consistent  with  HI,  we  find  a  positive  association  between  NUMQTRS  and  MoB_MG 
that  is  significant  in  the  restricted  model  (ySi  =  0.2464,  p  <  0.01)  but  insignificant  in  the  full  model 
(ySj  =  0.1006,  p  =  0.23).  ABSFRROR_MG,  which  proxies  for  the  absolute  forecast  error  of  past 
management  guidance,  is  insignificant  in  both  the  reduced  model  {^2  =  -4.0562,  p  =  0.32)  and  the 
full  model  (/32  =  3.9973,  p  =  0.41).  Consistent  with  H2,  we  find  a  significantly  negative  relation 
between  NUMQTRS  and  DIFF_AFMG  in  both  the  reduced  (ySg  =  -29.0805,  p  <  0.01)  and  full 
model  (ySg  = -28.4273,  p  <  0.01).  Consistent  with  H3,  we  find  a  significantly  negative  relation 
between  NUMQTRS  and  CHGDISP  in  both  the  reduced  (yS4  = -47.3454,  p  <  0.01)  and  the  full 
model  (yS4  = -38.8935,  p  <  0.01).  Consistent  with  H4a,  we  find  a  significantly  positive  relation 
between  forecast  frequency  and  MoB_AF  in  the  reduced  model  (yS5  =  0.3109,  p  <  0.01)  and  the 
full  model  (^85  =  0.2194,  p  =  0.01).  Hypothesis  4b  is  not  supported,  as  we  find  a  positive,  but 
insignificant,  relation  between  NUMQTRS  and  CAR_EA  in  the  reduced  model  (/S6  =  0.1405,  p  = 
0.36)  and  the  full  model  (y86  =  0.1348,  p  =  0.39).** 

Overall,  results  from  model  (2)  are  consistent  with  our  hypotheses  that  forecasting  frequency 
is  influenced  by  certain  outcomes  from  previously  issued  earnings  guidance.  Conditional  on  issu¬ 
ing  a  quarterly  forecast  for  a  given  year,  we  find  that  the  number  of  quarters  for  which  a  firm 
forecasts  is  decreasing  in  the  extent  to  which  analyst  forecasts  differed  from  management  forecasts 


This  specification  suffers  from  a  self-selection  bias  because  a  firm  must  decide  to  forecast  before  forecast  frequency  can 
be  observed.  A  standard  approach  to  self-selection  bias  would  be  to  utilize  a  Heckman  two-step  procedure,  which  in  this 
case  is  complicated  by  the  fact  that  the  second  stage  is  an  ordered  probit  model.  Therefore,  we  report  results  from  a 
one-stage  ordered  probit  model  and  conduct  two  specification  checks.  First,  all  results  using  the  Heckman  two-step 
procedure  are  qualitatively  similar  to  those  for  model  (2).  Second,  allowing  NUMQTRS  to  take  a  value  of  0  if  the  firm 
did  not  forecast  at  all  also  yields  qualitatively  similar  results. 

Re-estimating  model  (2)  using  principal  component  analysis  yields  similar  results.  We  find  a  significantly  positive 
coefficient  on  the  factor  related  to  meeting  or  beating  earnings  benchmarks  (ySi  =  0.1325,  p  <  0.01)  and  a  significantly 
negative  coefficient  on  the  factor  related  to  absolute  management  forecast  errors  and  a  lack  of  analyst  response  (^82  = 
-0.0835,  p  =  0.03).  The  coefficient  on  CHGDISP  remains  significantly  negative  (/Sj  = -50.1520,  p  <  0.01). 
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following  the  provision  of  guidance,  decreasing  in  the  change  in  dispersion  that  followed  the 
issuance  of  earnings  guidance  for  the  previous  year,  and  increasing  in  firm’s  past  ability  to  meet  or 
beat  guided  analysts’  forecasts.  While  reduced  model  results  suggest  that  forecast  frequency  is 
increasing  in  firm’s  past  ability  to  meet  or  beat  their  own  forecasts  for  the  prior  year,  this  effect  is 
insignificant  in  the  full  model. 

Determinants  of  Forecast  Specificity 

We  next  examine  the  extent  to  which  past  adverse  outcomes  from  forecasts  affect  the  speci¬ 
ficity  of  future  forecasts.  To  proxy  for  specificity,  we  construct  a  variable  FORM^  q  which  equals 
3  if  the  last  forecast  made  for  quarter  q  in  year  t  is  a  point  estimate,  2  for  a  range  estimate,  1  for 
an  open-ended  estimate  (maximum  or  minimum),  and  0  for  a  qualitative  statement.  FORMii^  is 
therefore  increasing  in  the  precision  or  specificity  of  guidance.  Empirical  tests  use  FORM^,  an 
average  of  FORM^  q  over  each  quarter  for  which  management  issued  earnings  guidance  during  the 
year. 

To  test  the  effect  of  outcomes  from  previous  guidance  on  forecast  form  we  estimate  the 
following  linear  regression  equation: 


FORM  it  =  ao  +  /3iMoB_MGi,_i  +  /32ABSERROR_MGit_i  +  (3iDIFF_AFMGit_i 

+  /i^CHGDlSPit.i  +  PiMoB^AFit^^  +  p(,CAR_EAit-x  +  PiDAYSu  +  p^STDRETit.^ 

+  PgCARSIGNit  +  PioNUMANit-i  +  PxiSIZEit_^  +  p^FORMit-x  +  e,-,.  (3) 

Following  Baginski  and  Hassell  (1997),  model  (3)  controls  for  the  effect  of  forecast  horizon, 
the  standard  deviation  of  raw  daily  stock  returns,  the  sign  of  the  price  reaction  to  the  forecast, 
analyst  following,  and  firm  size  on  management  forecast  precision.  STDRETit_x,  NUMANn^i,  and 
SIZEit_x  are  each  defined  earlier  for  model  (1).  DAYSj,  is  calculated  as  the  average  number  of  days 
between  the  fiscal  period  end  and  release  of  guidance  for  each  quarter  in  year  t  for  which  man¬ 
agement  issued  guidance.  CARSIGN^  q  is  an  indicator  variable  equal  to  1  if  the  three-day  abnormal 
returns  around  a  guidance  date  for  firm  i  in  quarter  q  of  year  t  is  positive,  and  0  otherwise.  Our 
empirical  tests  use  CARSIGNu,  an  average  of  CARSIGN^  q  over  each  quarter  in  year  t  for  which 
management  issued  guidance.  FORMit-\  is  included  to  control  for  past  behavior  regarding  forecast 
specificity.  Data  requirements  to  calculate  these  additional  control  variables  reduce  our  sample 
size  from  3,100  to  3,057  observations. 

To  control  for  selection  bias,  we  estimate  model  (3)  using  a  Heckman  two-step  procedure.  In 
the  first  step,  we  use  the  independent  variables  of  model  (1)  to  model  the  initial  decision  of 
whether  to  forecast,  using  all  firm-years  in  our  sample.  Estimated  parameters  from  this  probit 
model  are  then  used  to  calculate  the  inverse  Mills  ratio,  which  is  included  as  an  additional 
independent  variable  in  the  estimation  of  model  (3).  Table  6  presents  results  from  estimating 
model  (3)  over  observations  from  1998  to  2005.  While  we  find  a  marginally  significant  positive 
relation  between  FORM  and  MoB_MG  in  the  reduced  model  {Px  =  0.0665,  p  =  0.08),  this  relation 
is  insignificant  in  the  full  model  (ySi  =  0.0732,  p  =  0.12).  The  coefficient  on  DIFF_AFMG  is 
insignificant  in  the  reduced  model  {p^  =  -7.3427,  p  =  0.14)  and  significantly  negative  in  the  full 
model  (Pt,  =  -12.7623,  p  =  0.02).  Full  model  results  therefore  suggest  that  forecast  specificity  is 
decreasing  in  the  deviation  of  post-guidance  analyst  forecasts  from  prior  management  guidance. 
The  coefficient  on  CAR_EA  is  insignificant  in  the  reduced  model  (/S6  =  0.1227,  p  =  0.20),  but 
marginally  significant  in  the  full  model  (/36  =  0.1846,  p  =  0.06).  Full  model  results  therefore 
suggest  that  forecast  specificity  is  increasing  in  the  market  reaction  to  earnings  announcements 
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TABLE  6 

Linear  Regression  on  Average  Forecast  Specificity  for  Year  t 
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following  the  provision  of  guidance.  The  coefficients  on  other  treatment  variables  {ABSER- 
ROR_MG,  CHGDISP,  and  MoB_AF)  are  generally  insignificant  in  both  reduced  and  full  models.'^ 
Overall,  results  from  model  (3)  provide  only  weak  support  for  a  relation  between  forecast  form 
and  prior  adverse  outcomes.  The  next  section  investigates  the  extent  to  which  these  results  are 
influenced  by  prior  forecast  form. 

In  summary,  results  are  consistent  with  our  predictions  that  adverse  outcomes  affect  whether 
a  firm  guides  at  all  next  period  and  the  number  of  fiscal  quarters  for  which  firms  give  guidance. 
Two  patterns  warrant  further  discussion.  First,  we  find  that  managers’  ability  to  meet  or  beat  their 
own  forecasts  (Mo5_MG,y_i)  is  significant  in  model  (1)  but  insignificant  in  model  (2).  This 
suggests  that  firms  falling  short  of  their  own  forecasts  are  less  likely  to  issue  any  guidance  in  the 
next  year.  However,  conditional  on  forecasting  at  least  once  in  the  next  fiscal  year,  the  ability  to 
meet  or  beat  one’s  own  forecasts  is  unrelated  to  the  number  of  quarters  for  which  firms  issue 
guidance  in  that  fiscal  year.  Second,  we  measure  market  disappointments  at  earnings  announce¬ 
ments  using  both  analyst  forecasts  {MoB_AFit_{)  and  price  reactions  {CAR_EAit-i) .  In  models  (1) 
through  (3),  one  of  these  two  is  typically  significant  (or  marginally  significant),  although  the  exact 
proxy  for  disappointment  that  is  significant  varies  across  the  three  models. 

V.  SPECIFICATION  CHECKS  AND  ADDITIONAL  EVIDENCE 
Management  Responses  Conditioned  on  Prior  Forecast  Form 

The  evidence  in  Section  IV  suggests  that  prior  adverse  outcomes  influence  the  frequency  of 
future  guidance,  but  provides  limited  support  for  a  relation  between  prior  adverse  outcomes  and 
future  forecast  form.  This  section  investigates  the  extent  to  which  these  results  vary  with  prior 
forecast  form.  The  relation  between  adverse  outcomes  and  prior  forecast  form  is  complicated  by 
“ceiling  effects”  and  “floor  effects”  related  to  prior  forecast  form.  If  firms  issued  imprecise  fore¬ 
casts  in  the  prior  year,  then  they  have  limited  options  to  become  less  precise  (floor  effects). 
Similarly,  if  firms  issue  precise  forecasts  last  year,  then  they  have  limited  options  to  become  more 
precise  (ceiling  effects).  When  firms  with  adverse  outcomes  cannot  effectively  respond  by  adjust¬ 
ing  their  forecast  form  (due  to  ceiling  and  floor  effects),  they  may  instead  react  to  adverse  out¬ 
comes  by  omitting  future  forecasts. 

We  investigate  the  effects  of  prior  forecast  form  by  partitioning  our  sample  into  firms  with  an 
average  lagged  form  less  than  or  equal  to  2  (firms  imprecise  last  year)  and  firms  with  an  average 
lagged  form  greater  than  2  (firms  precise  last  year).  Recall  that  forecast  form  is  coded  as  3,  2,  1, 
and  0  when  the  forecast  is  a  point,  range,  open-ended,  and  qualitative  forecast,  respectively.  A 
lagged  average  form  greater  than  2  therefore  indicates  that  a  firm  gave  at  least  one  point  estimate 
in  the  prior  year  and  was,  on  average,  more  specific  than  a  firm  with  a  lagged  average  form  equal 
to  or  less  than  2?^  We  then  re-estimate  models  (1)  and  (3)  for  each  subsample,  to  investigate  how 
guidance  omission  and  form  decisions  are  conditioned  on  prior  forecast  form.^^  When  re- 
estimating  model  (3)  we  omit  lagged  form  as  an  independent  variable,  as  the  subsamples  were 
formed  by  partitioning  on  lagged  form  and  thus  lagged  form  has  little  variation  within  each 
subsample. 


When  re-estimating  model  (3)  using  principal  component  analysis  we  find  a  significantly  positive  coefficient  on  the 
factor  related  to  meeting  or  beating  earnings  benchmarks  {B\  =  0.0375,  p  =  0.04)  and  an  insignificant  coefficient  on  the 
factor  related  to  absolute  management  forecast  errors  and  a  lack  of  analyst  response  (ySj  = -0.0291,  p  =  0.20). 

We  consider  the  distinction  between  ranges  and  point  estimates  to  be  the  key  factor  in  selecting  a  cut-off  for  classifying 
prior  forecast  form,  as  point  estimates  cannot  become  more  specific.  In  addition,  the  high  frequency  of  point  estimates 
and  ranges  (as  compared  to  the  relatively  rare  open-ended  and  qualitative  forecasts)  suggests  that  for  most  firms,  the  key 
decision  regarding  forecast  form  is  whether  to  issue  a  point  versus  a  range  forecast. 

For  this  additional  analysis,  we  omit  model  (2)  regarding  the  number  of  quarters  within  a  year  for  which  a  firm  issues 
forecasts  because  we  feel  that  predictions  about  how  within-year  frequency  would  vary  based  on  prior  forecast  form  are 
more  ambiguous. 
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Results  from  re-estimating  models  (1)  and  (3)  for  each  subsample  are  presented  in  Table  7. 
Re-estimating  model  (1)  for  imprecise  prior  forecasts  ^  2)  yields  results  qualitatively 

similar  to  the  full  sample  results.  In  particular,  MoB_MGif_i  and  CAR_EAi,_i  remain  negatively 
related  to  forecast  omissions,  while  DIFF_AFMGit_i  and  CHGDISPn-i  remain  positively  related 
to  forecast  omissions.  These  results  suggest  that  firms  that  were  imprecise  in  the  past  respond  to 
adverse  outcomes  by  omitting  future  guidance.  In  contrast,  when  re-estimating  model  (3)  for  firms 
that  were  imprecise  in  the  prior  year,  we  find  that  (our  only  treatment  variable  with 

a  directional  prediction)  is  not  significantly  related  to  specificity.  In  other  words,  because  these 
firms  issue  imprecise  forecasts  in  prior  years,  they  cannot  further  reduce  the  precision  of  future 
forecasts  in  an  attempt  to  increase  the  likelihood  of  meeting  or  beating  their  own  forecasts.  Taken 
together,  these  results  suggest  that  imprecise  prior  forecasters  respond  primarily  to  adverse  out¬ 
comes  by  omitting  forecasts,  rather  than  further  reducing  forecast  specificity. 

When  re-estimating  model  (1)  for  precise  prior  forecasts  >  2),  the  coefficients  on 

most  treatment  variables  become  insignificant.  Therefore,  when  firms  were  precise  in  the  prior 
year,  past  adverse  outcomes  are  generally  not  significantly  related  to  omissions.  A  notable  excep¬ 
tion  is  the  positive  relation  between  omissions  and  DIFF_AFMGit_\  (/S3  =  68.8437,  p  =  0.04). 
That  is,  when  analysts  do  not  closely  follow  precise  forecasts,  managers  respond  by  omitting 
forecasts.  This  makes  sense,  as  managers’  ability  to  become  more  precise  is  bounded  given  that 
they  have  already  been  precise.  If  analysts  do  not  follow  precise  guidance,  then  they  seem  unlikely 
to  follow  imprecise  guidance,  so  managers  respond  by  omitting  guidance  rather  than  reducing 
specificity.  When  re-estimating  model  (3)  using  firms  that  were  precise  last  year,  we  find  a  mar¬ 
ginally  positive  relation  between  Afo5_MG„_i  and  future  forecast  form.  Therefore,  firms  that  were 
precise  in  the  past  and  fell  short  of  their  own  guidance  respond  by  being  less  precise  in  the  future 
in  order  to  increase  the  likelihood  of  meeting  or  beating  future  guidance. 

In  sum,  results  from  re-estimating  models  (1)  and  (3)  conditioning  on  prior  forecast  form 
suggest  that  management’s  reaction  to  adverse  outcomes  varies  with  prior  specificity.  If  managers 
were  imprecise  in  the  past,  adverse  outcomes  lead  them  to  omit  future  guidance.  If  managers  were 
precise  in  the  past,  they  will  reduce  specificity  if  they  failed  to  meet  or  beat  their  own  guidance, 
or  will  omit  guidance  if  analysts  deviated  from  management’s  precise  guidance. 

Quarterly  and  Two-Year  Rolling  Windows 

The  results  of  Section  IV  are  based  on  annual  horizons  that  compare  outcomes  from  year 
t-\  to  guidance  behavior  in  year  t.  We  next  examine  the  robustness  of  our  full-model  results  to 
two  alternate  windows;  queirterly  analyses  and  two-year  rolling  windows.  We  first  re-estimate 
models  (1)  and  (3)  using  quarterly  data  in  which  outcomes  from  quarters  q—A  to  q—  1  are  used  to 
predict  guidance  behavior  in  quarter  q.  The  quarterly  version  of  OMIT  takes  a  value  of  1  if  the 
firm  does  not  issue  any  quarterly  guidance  in  quarter  q,  and  0  otherwise.  The  quarterly  version  of 
FORM  is  based  on  the  specificity  of  guidance  issued  during  quarter  q.  Variable  definitions  for  all 
independent  variables  are  similar  to  those  given  for  models  (1)  and  (3)  except  that  variables 
previously  calculated  over  year  t-l  are  instead  calculated  over  quarters  q—A  to  q—l.  Quarterly 
results  are  generally  similar  to  the  annual  results  for  all  treatment  variables  in  model  (1).  When 
re-estimating  model  (3)  using  quarterly  results,  MoB_MG  becomes  significantly  positive  {Px 
=  0.0729,  p  =  0.05),  consistent  with  the  predictions  of  HI.  Using  annual  data,  this  relation  is  only 
significant  in  the  reduced  model.  CAR_FA  becomes  significantly  positive  (ySg  =  0.1902,  p  <  0.01), 
while  this  is  only  marginally  significant  using  annual  data. 

We  next  re-estimate  models  (1)  through  (3)  by  aggregating  prior  outcomes  and  forecasting 
decisions  over  two-year  horizons.  For  this  analysis,  we  use  a  rolling  window  in  which  outcomes 
from  years  t-2  and  t- 1  are  used  to  predict  guidance  behavior  for  years  t  and  t+ 1 .  OMIT  takes  a 
value  of  1  if  the  firm  does  not  issue  any  quarterly  guidance  in  years  t  and  r-i-1  and  0  otherwise. 
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,  ,  ^  Indicates  statistical  significance  at  the  0.10,  0.05,  and  0.01  levels,  respectively,  under  two-tailed  tests. 

Robust  z-statistics  are  based  on  standard  error  estimates  that  control  for  firm  clustering  effects. 

All  variables  are  defined  in  Table  3,  Table  4,  and  Table  6. 
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NUMQTRS  can  take  a  value  from  1  to  8,  based  on  the  number  of  quarters  in  years  t  and  t+\  for 
which  the  firm  issues  guidance.  FORM  is  based  on  the  average  of  guidance  specificity  over  each 
quarter  of  years  t  and  t+ 1  for  which  the  firm  issues  guidance.  Variable  definitions  for  all  indepen¬ 
dent  variables  are  similar  to  those  given  for  models  (1)  through  (3)  except  that  variables  previ¬ 
ously  calculated  over  year  t-\  are  instead  calculated  over  t-1  and  t-\. 

When  re-estimating  model  (1)  using  a  two-year  rolling  window,  the  coefficients  on  MoB_MG 
and  CHGDISP  are  insignificant  (/3i  = -0.1290,  p  =  0.38;  /34  =  30.2898,  p  =  0.19),  while  these 
coefficients  were  significant  in  the  annual  results.  The  coefficient  on  MoB_AF  is  significantly 
negative  in  the  two-year  rolling  window  results  (yS5  = -0.4316,  p  =  0.01),  while  this  coefficient 
was  insignificant  in  the  annual  results.  Other  two-year  rolling  window  results  are  qualitatively 
similar  to  the  annual  results  reported  in  Table  4.  Overall,  these  results  are  consistent  with  a  link 
between  prior  outcomes  and  future  guidance  omissions,  although  the  exact  factors  that  matter 
differ  between  two-year  rolling  windows  and  annual  tests. 

When  re-estimating  model  (2)  using  a  two-year  rolling  window,  the  coefficients  on  MoBJMG 
and  CAR_EA  are  positive  and  significant  (/3i  =  0.1896,  p  =  0.01;  ^65  =  0.4703,  p  <  0.01),  while 
these  coefficients  were  insignificant  in  the  annual  results.  All  other  two-year  rolling  window 
results  for  model  (2)  are  qualitatively  similar  to  the  annual  results.  Overall,  two-year  rolling 
window  results  are  consistent  with  a  link  between  prior  outcomes  the  number  of  quarters  for 
which  forecasting  firms  provide  guidance.  Full  model  results  for  re-estimating  model  (3)  using 
two-year  rolling  windows  suggest  that  our  treatment  variables  are  not  significantly  related  to 
forecast  form  over  this  horizon. 

Alternate  Measures  of  Accuracy 

One  important  outcome  examined  in  this  study  is  the  extent  to  which  managers  are  able  to 
meet  or  beat  their  own  forecasts.  Our  main  analysis  uses  First  Call’s  conversion  of  range,  open- 
ended,  and  qualitative  forecasts  into  point  estimates  to  determine  whether  managers  meet  or  beat 
their  own  forecasts.  To  check  the  robustness  of  our  results  to  the  treatment  of  imprecise  guidance, 
we  code  range  observations  as  meeting  or  beating  management’s  forecast  when  actual  earnings 
were  greater  than  or  equal  to  the  lower  bound  of  the  range.  Similarly,  minimum  forecasts  are 
coded  as  meeting  or  beating  management’s  forecast  when  actual  earnings  were  greater  than  or 
equal  to  the  minimum  forecast.^^  Results  using  this  alternate  criterion  for  coding  MoBJAG  are 
qualitatively  similar  to  those  reported  in  Table  4. 

As  an  additional  sensitivity  check,  we  create  an  alternate  proxy  for  management’s  ability  to 
meet  or  beat  their  own  forecast,  in  which  weight  given  to  the  outcome  from  a  quarter  is  increasing 
in  the  precision  of  that  forecast.  We  recalculate  by  multiplying  ^  for 

each  quarter  for  which  management  issued  earnings  guidance  in  the  previous  year  by  4  if  that 
forecast  was  a  point  estimate,  3  for  a  range  estimate,  2  for  an  open-ended  estimate,  or  1  for  a 
qualitative  forecast.  The  summed  values  of  weighted  MoB_MGi,_i  q  are  then  scaled  by  the  sum  of 
the  weights.  Results  using  this  alternate  criterion  for  coding  MoB_MG  are  qualitatively  similar  to 
those  reported  in  Table  4.  In  summary,  all  results  on  the  relation  between  guidance  omissions  and 
past  outcomes  are  robust  to  alternate  specifications  of  MoB_MG. 

Seasoned  Equity  Offerings 

Our  final  set  of  tests  examines  whether  the  sensitivity  of  forecasting  omissions  to  prior 
outcomes  varies  with  future  seasoned  equity  offerings.  Prior  research  (e.g.,  Frankel  et  al.  1995) 


Under  this  alternate  criterion,  maximum  forecasts  are  always  met  or  beaten,  as  they  provide  an  upper  bound  on  predicted 
future  earnings.  Our  results  are  unaffected  by  the  exclusion  of  maximum  forecasts. 
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suggests  that  legal  sanctions  and  other  institutional  penalties  to  optimistic  forecasting  are  particu¬ 
larly  severe  when  such  forecasts  are  made  shortly  prior  to  equity  offerings.  As  a  result,  we  predict 
that  managers  with  a  history  of  falling  short  of  their  own  guidance  or  disappointing  market 
participants  at  earnings  announcements  are  more  likely  to  discontinue  forecasting  in  the  year 
before  a  seasoned  equity  offering.^^ 

To  test  this  conjecture,  we  examine  whether  the  relation  of  OMIT^  with  MoB_MGit_i, 
MoB_AFii_i,  and  CAR_EAn_^  varies  with  whether  a  firm  makes  a  seasoned  equity  offering  in  year 

This  test  presumes  that  managers  are  aware  that  a  seasoned  equity  offering  will  occur  in  year 
t+\  when  making  guidance  decisions  in  year  t.  Using  the  SDC  seasoned  equity  offering  database, 
we  construct  a  new  variable,  that  takes  a  value  of  1  if  the  firm  i  issues  a  seasoned  equity 

offering  in  year  t-i-1,  and  0  otherwise.  We  re-estimate  model  (1),  adding  four  additional  indepen¬ 
dent  variables:  SEOu^i,  SEO^^i  interacted  with  MoB_MGn_i,  SEOn+i  interacted  with  MoB_AEij_i, 
and  SEOif+i  interacted  with  CAR_EAn_i. 

Results  (untabulated)  are  similar  to  those  in  Table  4,  as  the  coefficient  on  MoB_MGif^i 
remains  significantly  negative,  ABSERROR^MG^-i  remains  insignificant,  DIEE_AEMGij_x  re¬ 
mains  positive  and  significant,  CHGOlSPn-i  remains  positive  and  significant,  MoB_AEii_i  re¬ 
mains  insignificant,  and  CAR_EAit_i  remains  negative  and  significant.  We  find  a  marginally  sig¬ 
nificant  negative  coefficient  on  the  interaction  between  SEOn+i  and  MoB_MGit_i  (yS  =  -0.9860,  p 
=  0.09),  which  suggests  that  the  negative  relation  between  forecast  omissions  and  past  ability  to 
meet  or  beat  one’s  own  forecast  is  even  more  pronounced  for  firms  that  will  have  a  seasoned 
equity  offering  in  the  following  year.  Coefficients  on  SEOn+i,  SEO^t+i  interacted  with  MoB_AEit_i, 
and  SEOit+i  interacted  with  CAR_EAit_i  are  all  insignificant.  In  summary,  these  results  are  con¬ 
sistent  with  the  prediction  that  legal  liability  concerns  associated  with  optimistic  forecasting  are 
more  pronounced  prior  to  a  seasoned  equity  offering.  Legal  liability  concerns  associated  with 
market  disappointments  do  not  change  prior  to  a  seasoned  equity  offering. 

VI.  CONCLUSIONS 

This  study  examines  how  outcomes  of  previous  earnings  forecasts  affect  a  firm’s  quarterly 
forecasting  strategy,  including  forecasting  incidence,  frequency,  and  specificity.  Specifically,  we 
examine  four  aspects  of  adverse  outcomes.  First,  we  argue  that  earnings  guidance  generates  costs 
for  the  firm  (in  the  form  of  legal  liability,  reputation  loss,  and  stock  price  declines)  when  realized 
earnings  fall  short  of  management  earnings  predictions.  Second,  we  argue  that  guidance  is  un¬ 
likely  to  be  effective  when  analysts  do  not  closely  follow  management’s  forecast.  Third,  we  argue 
that  managers  are  discouraged  from  future  forecasting  when  guidance  is  unsuccessful  at  reducing 
information  asymmetry.  Fourth,  we  argue  that  firms  face  adverse  consequences  from  issuing 
earnings  guidance  that  results  in  subsequent  disappointments  at  earnings  announcements. 

Our  empirical  work  finds  that  firms  are  more  likely  to  omit  quarterly  forecasts  when  facing 
each  of  the  above  adverse  outcomes.  The  likelihood  a  firm  omits  quarterly  forecasts  for  a  given 
year  is  decreasing  in  the  firm’s  past  ability  to  meet  or  beat  their  own  forecasts,  increasing  in  the 
extent  to  which  post-guidance  analyst  forecasts  deviated  from  prior  management  guidance,  in¬ 
creasing  in  the  extent  to  which  prior  guidance  increased  the  dispersion  in  analyst  forecasts,  and 
decreasing  in  the  past  ability  to  avoid  earnings  disappointments  after  issuing  guidance.  For  firms 
that  continue  to  give  guidance,  adverse  prior  outcomes  also  affect  the  precision  of  future  guidance 


Alternatively,  adverse  outcomes  could  be  less  associated  with  forecast  omissions  prior  to  equity  offerings  if  managers’ 
primary  concerns  at  this  time  are  to  correct  prices  and  reduce  information  asymmetry.  However,  Frankel  et  al.  (1995) 
find  that  firms  are  not  more  likely  to  issue  forecasts  in  the  period  shortly  before  equity  offerings.  They  interpret  this  as 
evidence  that  benefits  to  cost  of  capital  from  reducing  information  asymmetry  through  pre-offering  forecasts  are  offset 
by  the  increased  legal  risk  associated  with  such  forecasts. 
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and  the  number  of  quarters  within  a  year  for  which  they  give  guidance.  Prior  research  has  sug¬ 
gested  a  number  of  firm-specific  factors  associated  with  decisions  about  providing  guidance.  Our 
study  adds  to  existing  literature  by  documenting  that  forecasting  decisions  are  also  influenced  by 
the  outcomes  from  previously  issued  earnings  guidance. 
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ABSTRACT:  We  examine  a  communication  game  between  an  analyst  and  a  decision¬ 
maker  and  investigate  how  the  presence  of  public  information  affects  the  precision  of 
the  information  the  analyst  gathers  and  communicates  to  the  decision-maker.  We  char¬ 
acterize  conditions  under  which  public  information  causes  the  analyst  to  underinvest  or 
overinvest  in  the  information  gathered  relative  to  the  case  where  analyst  credibility  is 
not  an  issue.  We  then  discuss  when  the  presence  of  public  information  causes  the 
analyst  to  reduce  the  depth  of  coverage  of  the  firm,  suggesting  that  the  introduction  of 
public  information  can  make  the  decision-maker  strictly  worse  off. 

Keywords:  information  acquisition;  cheap-taik;  analyst  credibility. 


I.  INTRODUCTION 


Decision-makers  obtain  information  from  various  sources.  Some  sources  have  incentives  to 
gather  information  and  communicate  in  a  self-serving  fashion  and  face  limited  regulation 
regarding  these  activities.  Other  sources  have  little  incentive  to  behave  opportunistically 
or  face  fairly  stringent  regulation.  Investors,  for  instance,  can  obtain  information  from  sell-side 
analysts,  who  are  thought  to  have  incentives  to  communicate  strategically  and  face  relatively  little 
regulation  regarding  their  reporting  behavior,  and  also  from  a  firm’s  audited  financial  statements, 
where  regulation  plays  a  significant  role  in  limiting  strategic  behavior.  This  study  analyzes  how 
changes  in  the  information  disclosed  by  a  non-strategic  source  influence  the  information-gathering 
and  communication  decisions  of  a  strategic  information  intermediary. 

The  model  we  employ  has  four  stages  and  two  players:  an  information  intermediary,  called 
the  analyst,  and  a  decision-maker.  In  the  first  stage,  both  players  observe  an  imperfect  public 
signal  about  the  state  of  nature.  In  the  second  stage,  the  analyst  engages  in  costly  information 
gathering  that  determines  the  precision  of  a  signal  about  the  state  of  nature  he  will  privately 
observe.  In  the  third  stage,  the  analyst  privately  observes  a  signal  and  costlessly  sends  a  message 
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to  the  decision-maker.  In  the  fourth  stage,  the  decision-maker,  given  her  beliefs  about  the  state  of 
nature,  takes  an  action  affecting  both  players’  payoffs.  The  players  have  divergent  preferences 
regarding  the  decision-maker’s  action.  Within  the  context  of  this  model,  we  explore  how  the 
precision  of  the  public  information  the  players  observe  in  the  first  stage  and  how  the  public 
observability  of  the  analyst’s  precision  choice  in  the  second  stage  affects  the  analyst’s  information 
gathering  and  communication. 

While  the  model  can  be  applied  to  various  settings  in  which  decision-makers  obtain  informa¬ 
tion  from  both  strategic  and  non-strategic  sources,  we  find  this  model  useful  for  framing  the  recent 
evolution  in  the  sell-side  equity  analyst  market.  The  amount  of  information  firms  make  available 
directly  to  investors  has  exploded  recently.  Following  the  introduction  of  Reg  FD  in  2000,  for 
instance,  firms  can  no  longer  privately  reveal  information  to  analysts  and,  hence,  are  more  likely 
to  publicly  release  their  forward-looking  information  (see  Heflin  et  al.  2003).  Further,  firms  now 
routinely  hold  open  conference  calls  following  their  earnings  releases  (Bushee  et  al.  2003).  In 
addition,  the  $1.4  billion  settlement  between  the  large  brokerage  firms  and  the  State  of  New  York 
in  2002  required  the  firms  to  provide  information  to  investors  from  unbiased  sources  together  with 
their  own  analysis.  Following  this  settlement,  there  has  been  a  flight  of  equity  analysts  from  the 
research  departments  at  the  large  investment  firms,  and  entire  industry  sectors  have  experienced  a 
reduction  in  analyst  coverage.  As  a  consequence,  several  commentators  have  argued  that  these 
changes  are  likely  to  hurt  individual  investors  and  reduce  stock  price  efficiency  (see  Schack  2003). 
A  contribution  of  this  study  is  to  illustrate  how  alternative  sources  of  information  might  affect 
analyst  information-gathering  activities,  the  integrity  of  their  communication  with  investors,  and 
whether  investors  are  likely  to  be  better  off  as  a  consequence  of  changes  in  the  amount  of  publicly 
available  information. 

Our  analysis  suggests  that  improvements  in  the  quality  of  public  information  have  subtle 
consequences  for  analyst  behavior.  When  investors  know  the  precision  of  the  analyst’s  information 
(but  not  the  signal  realization),  we  find  that  changes  in  publicly  observed  information  affect  the 
analyst’s  behavior  as  follows.  On  one  hand,  when  credibility  issues  are  minimal,  an  increase  in  the 
precision  of  public  information  causes  it  to  substitute  for  the  analyst’s  information.  As  information 
is  costly  to  gather,  the  analyst  reduces  the  precision  of  his  information.  Although  more  precise 
public  information  crowds  out  some  analyst  information,  this  crowding  out  is  insufficient  to  offset 
the  benefit  of  more  precise  public  information.  Consequently,  the  decision-maker  is  better  off.  On 
the  other  hand,  when  the  analyst’s  credibility  is  in  doubt — because  the  incentives  of  the  analyst 
and  decision-maker  are  misaligned — the  analyst’s  behavior  depends  on  the  cost  of  obtaining 
information  that  is  sufficiently  precise  for  it  to  be  credibly  communicated.  Specifically,  when 
information-gathering  costs  are  moderate,  more  precise  public  information  leads  the  analyst  to 
gather  more  precise  information  to  increase  the  decision-maker’s  responsiveness  to  his  report  and 
thereby  facilitate  credible  communication.  The  heightened  precision  of  the  analyst  information 
coupled  with  more  precise  public  information  combine  to  make  the  decision-maker  better  off. 
Alternatively,  when  information-gathering  costs  are  high,  it  becomes  exorbitantly  costly  for  the 
analyst  to  gather  sufficiently  precise  information  to  be  able  to  credibly  communicate.  In  this  case, 
an  increase  in  the  precision  of  public  information  completely  crowds  out  the  analyst’s  information. 
Further,  even  though  the  decision-maker  obtains  more  precise  public  information,  the  crowding 
out  of  the  analyst’s  information  can  be  so  severe  that  it  reduces  the  total  information  the  decision¬ 
maker  obtains  and  leaves  her  worse  off. 

When  the  analyst  privately  chooses  the  precision  of  information  to  gather,  the  analyst’s 
information-gathering  behavior  depends  on  the  realization  of  the  public  signal.  Specifically,  after 
observing  favorable  public  information,  the  analyst  gathers  and  credibly  communicates  more 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


Analyst  Information  Acquisition  and  Communication 


1987 


precise  information  when  the  precision  choice  is  not  publicly  observable  than  when  it  is  publicly 
observable.  Alternatively,  after  observing  unfavorable  public  information,  the  analyst  gathers  and 
credibly  communicates  less  precise  information. 

The  primary  theoretical  antecedent  of  our  analysis  is  Crawford  and  Sobel  (1982).  They  con¬ 
sider  a  communication  game  between  an  informed  expert  and  an  uninformed  decision-maker.  The 
expert  perfectly  observes  the  realization  of  a  payoff-relevant  state  variable  and  sends  a  costless 
message  to  the  decision-maker,  who  then  takes  an  action  that  affects  both  players.  Their  cheap-talk 
model  has  been  extended  in  various  directions.  In  work  most  closely  related  to  ours,  Austen-Smith 
(1994)  considers  a  communication  game  in  which  the  expert  chooses  whether  to  perfectly  observe 
the  state  of  nature  at  some  fixed  cost  and  decides  whether  to  reveal  being  informed,  but  is  unable 
to  prove  being  uninformed.  He  establishes  that  there  exist  circumstances  under  which  the  players 
cannot  communicate  when  the  receiver  knows  the  expert  is  informed,  although  they  can  do  so 
when  the  receiver  is  uncertain  about  whether  the  expert  is  informed.  In  contrast  to  Austen-Smith 
(1994),  we  consider  a  model  in  which  the  expert,  after  observing  a  public  signal,  can  choose  the 
precision  of  his  privately  observed  signal;  further,  we  allow  for  the  possibility  that  the  precision  of 
the  expert’s  signal  is  either  publicly  observed  or  privately  chosen. 

The  extant  literature  also  has  extended  the  model  in  Crawford  and  Sobel  (1982)  by  examining 
the  effect  of  multiple  experts  on  the  communication  game  (e.g.,  Austen-Smith  1993;  Krishna  and 
Morgan  2001;  Wolinsky  2002;  Battaglini  2004;  Morgan  and  Stocken  2008),  by  introducing  mul¬ 
tiple  decision-makers  (e.g.,  Farrell  and  Gibbons  1989;  Newman  and  Sansing  1993;  Gigler  1994), 
or  by  allowing  the  players  to  exchange  messages  (e.g.,  Aumann  and  Hart  2003;  Krishna  and 
Morgan  2004).  In  these  studies,  the  expert  is  exogenously  endowed  with  information.  In  contrast, 
we  examine  a  setting  where  it  is  costly  for  the  expert  to  gather  information  and  consider  how  the 
presence  of  a  non-strategic  information  source  affects  the  information  the  expert  gathers  and 
communicates. 

Our  analysis  is  also  related  to  the  voluntary  disclosure  literature.  Verrecchia  (1990)  examines 
how  a  manager’s  discretionary  disclosure  behavior  is  influenced  by  the  exogenous  quality  of  his 
privately  observed  information.  In  contrast,  Penno  (1996),  Pae  (1999),  and  Hughes  and  Pae  (2004) 
extend  this  model  to  consider  a  setting  in  which  the  manager  makes  a  costly  information- 
acquisition  decision,  which  endogenizes  the  quality  of  the  manager’s  information.  A  common 
feature  of  these  studies,  like  much  of  the  extant  literature  examining  voluntary  disclosure  behavior 
(e.g.,  Verrecchia  1983;  Dye  1985;  Arya  and  Mittendorf  2007;  Einhom  and  Ziv  2008),  is  that 
private  information,  if  it  is  disclosed,  must  be  disclosed  truthfully.  In  the  analyst  reporting  envi¬ 
ronment,  however,  analysts  often  report  in  a  self-serving  fashion  and  face  little  regulation  regard¬ 
ing  how  they  report  (e.g.,  Michaely  and  Womack  1999;  Lin  and  McNichols  1998).  Indeed,  this 
concern  motivated  key  provisions  of  the  Sarbanes-Oxley  Act  as  well  as  the  legal  settlement 
between  the  large  brokerage  firms  and  the  State  of  New  York  in  2002.  We  examine  a  reporting 
environment  in  which  the  analyst’s  disclosure  is  not  constrained  by  particular  disclosure  rules  and 
find  the  desire  to  communicate  credibly  fundamentally  influences  the  analyst’s  information¬ 
gathering  behavior.  Hence,  a  contribution  of  our  analysis  is  to  provide  insights  into  information¬ 
gathering  and  communication  behavior  when  reporting  credibility  is  a  key  feature  of  the 
environment.^ 

Section  II  describes  the  model.  Section  III  characterizes  the  analyst’s  precision  choice  when 
this  choice  is  observable  and  the  analyst  is  constrained  to  report  truthfully.  Section  IV  continues  to 


‘  Some  extant  work  has  studied  analyst  information  gathering.  Hayes  (1998),  for  instance,  examines  an  analyst’s  incen¬ 
tives  to  gather  and  report  information  in  an  environment  in  which  the  analyst’s  report  influences  an  investor’s  trading 
behavior  that,  in  turn,  determines  the  trading  commission  that  the  analyst  receives.  The  analyst  always  reports  truthfully 
so  that  strategic  disclosure,  which  is  the  focus  of  this  paper,  does  not  arise. 
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examine  a  setting  in  which  the  analyst’s  precision  choice  is  observable,  but  relaxes  the  assumption 
that  the  analyst  must  report  truthfully.  It  examines  how  changes  in  the  precision  of  public  infor¬ 
mation  influence  an  analyst’s  information  gathering  and  communication,  and  how  these  changes 
affect  the  quality  of  the  decision-maker’s  information.  Section  V  further  extends  the  model  to 
consider  a  setting  which  the  decision-maker  does  not  observe  the  precision  of  the  analyst’s  infor¬ 
mation.  It  contrasts  the  precision  of  information  the  analyst  gathers  in  this  case  relative  to  the  case 
in  which  the  precision  of  the  analyst’s  information  is  publicly  observable.  Section  VI  concludes. 
All  proofs  are  in  the  Appendix. 


II.  MODEL 

Consider  a  communication  game  in  which  an  analyst  chooses  to  obtain  private  information 

about  an  unknown  state  variable  and  sends  a  report  to  a  decision-maker  who  then  takes  an  action. 

The  unknown  state  variable  is  represented  by  the  random  variable  a,  which  has  support  {sj]  where 

s  >  f.  The  common  prior  beliefs  are  that  the  successful,  s,.  and  failure,  /,  state  realizations  are 

2 

equally  likely  to  occur. 

The  game  has  four  stages.  In  the  first  stage,  information  represented  by  the  realization  x  of  a 
random  variable  x  is  publicly  observed.  The  support  for  J  is  A  =  {h,l}.  The  probability  that  x 
=  h  conditional  on  a  =  s  and  the  probability  that  x  =  l  conditional  on  a  =  f  are  both  q  e  (1/2,1). 
The  variable  q  captures  the  precision  of  x,  where  a  higher  value  for  q  implies  that  x  is  more 
informative.  Setting  q  =  1/2  is  equivalent  to  supposing  that  the  public  information  is  absent.  The 
public  information  can  be  viewed  as  the  filing  of  a  firm’s  audited  financial  statements  or  the 
release  of  governmental  statistics  relevant  to  a  firm’s  future  outlook.  The  analyst  observes  the 
realization  of  the  public  information  x  before  choosing  the  precision  of  his  private  information. 

In  the  second  or  information- gathering  stage,  the  analyst  chooses  how  much  to  spend  gath¬ 
ering  information.  The  analyst’s  information,  which  he  privately  observes,  is  represented  by  the 
realization  y  of  a  random  variable  y,  which  has  support  Y  =  {g,b}-  Conditional  on  the  realization 
of  a,  X  and  y  are  independent.  The  probability  that  y  =  g  conditional  on  5  =  5  and  the  probability 
that  y  =  b  conditional  on  a-f  are  both  p  e  [1/2,1].  The  amount  spent  gathering  information  c(p) 
is  a  function  of  the  precision  p  of  the  information,  where  c{p)  is  a  twice  differentiable  function  that 
satisfies  c'{p)  >  0  and  c”{p)  >  0  for  p  >  1/2;  that  is,  the  analyst’s  cost  of  gathering  information 
increases  in  the  precision  of  the  information  and  at  an  increasing  rate.  The  analyst  choosing  not  to 
gather  private  information  is  analogous  to  the  case  in  which  the  analyst’s  information  has  precision 

p  =  1/2.^ 

In  the  third  or  reporting  stage,  the  analyst  costlessly  sends  a  report  r  to  the  decision-maker 
who  then  takes  an  action  a  e  (— °°).  Without  loss  of  generality,  we  restrict  the  analyst’s  possible 
reports  to  the  set  R  =  {g,b}-  The  analyst’s  reporting  strategy  is  a  function  over  the  public  signal  x 
and  the  analyst’s  information  y  such  that  AX  F  — >  i?.  The  decision-maker’s  strategy  is  a  function 
over  the  publicly  observed  signal  and  the  analyst’s  report  such  that  A  X  /?  — >  A. 

In  the  fourth  stage,  the  state  of  nature  a  is  realized  and  the  analyst’s  and  decision-maker’s 
payoffs  are  determined.  The  decision-maker’s  payoffs  are: 

U,  =  -{a-af.  (1) 

Given  this  payoff  function,  it  is  optimal  in  the  third  stage  for  the  decision-maker  to  choose  an 
action  a  =  E[a\x  ,r\  where  £[ .  |  .  ]  is  the  expectation  operator,  which  is  analogous  to  investors 


^  As  in  this  paper,  many  cheap-talk  models  assume  the  players’  prior  beliefs  are  diffuse  (e.g.,  Austen-Smith  1993;  Krishna 
and  Morgan  2001;  Battaglini  2004). 

^  In  the  context  of  this  binary  model,  the  precision  p  of  the  analyst’s  information  affects  both  the  mean  and  variance  of  a 
conditional  on  the  realization  of  y. 
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valuing  a  firm  at  its  expected  value."^  The  analyst’s  payoffs  are: 

Ui=  cf)a-{a-af-c(p),  (2) 

where  0  >  0.^  When  0  =  0,  the  players’  interests  are  perfectly  aligned  because,  given  the  same 
information,  both  players  prefer  the  same  action  in  the  third  stage.  When  0  >  0,  however,  their 
interests  are  misaligned  because  the  analyst’s  preferred  action  exceeds  the  decision-maker’s  pre¬ 
ferred  action.  These  utility  representations  are  broadly  descriptive  of  environments  in  which  an 
analyst  wants  to  induce  a  higher  action  than  a  decision-maker  prefers,  but  is  constrained  from 
inducing  an  action  that  is  too  high  because  of,  say,  reputation  or  litigation  concerns  (e.g.,  Dugar 
and  Nathan  1995;  Grossman  and  Helpman  2001;  Morgan  and  Stocken  2003).  For  instance,  sell- 
side  equity  analysts  often  have  incentives  to  offer  optimistic  recommendations  on  target  firms  to 
curry  favor  with  the  firms’  management.  They  are  constrained  from  being  too  exuberant,  however, 
because  their  employers  often  consider  their  ranking  in  the  annual  Institutional  Investor  All- 
American  Research  Team  poll  when  setting  their  compensation.  This  poll  of  money  managers  and 
institutions  is  widely  viewed  as  a  measure  of  an  analyst’s  reputation  (Michaely  and  Womack 
1999).^ 

Furthermore,  consistent  with  the  institutional  observation  that  analysts  do  not  update  their 
stock  reports  in  response  to  every  public  information  release  affecting  a  firm,  the  specification  of 
the  analyst’s  objective  function  implies  that,  even  when  the  analyst  does  not  collect  any  private 
information,  the  analyst’s  payoff  continues  to  be  affected  by  the  decision-maker’s  actions.  Hence, 
within  the  context  of  the  analyst  reporting  environment,  p  reflects  the  depth  of  coverage. 

All  aspects  of  the  game  are  common  knowledge  except  the  analyst’s  private  signal  y  and, 
depending  on  the  case,  the  analyst’s  precision  choice.  Figure  1  summarizes  the  time  line  of  events 
and  the  model’s  notation. 

We  study  Perfect  Bayesian  Equilibria,  which  require  that  players’  beliefs  satisfy  Bayes’  rule 


FIGURE  1 
Time  Line  of  Events 


Information  gathering  stage 


Reporting  stage 


The  unknown  state  a  assumes  the 
values  s  oif  with  equal  probability. 
In  stage  one,  the  analyst  and 
decision-maker  observe  the  public 
information  x,  which  is  either  h  or 
I,  with  probability  Pr(hl  s)  = 
PrOlf)  =  q. 


In  stage  two,  the  analyst  chooses  the 
precision  p  of  his  private  information 
y,  which  is  either  ^or  b,  with 
probability  Pr(gl  s)  =  Pr(bl  f)  =p. 
The  cost  function  c(p)  reflects  tlie 
analyst’s  cost  of  gathering 
information  with  precision  p. 


Ip  stage  three,  the  analyst  observes 
his  private  information  y  and  sends  a 
report  r  to  the  decision-maker.  The 
decision-maker  then  chooses  an 
action  a.  The  parameter  (p  denotes  the 
extent  to  which  the  players’  interests 
are  misaligned. 


In  stage  four,  the  state 
of  nature  a  is  realized, 
and  the  analyst’s  payoff 
Vj  and  the  decision¬ 
maker’s  payoff  are 
determined. 


The  presumption  that  investors  value  a  firm  at  its  expected  value  is  standard  in  the  voluntary  disclosure  literature  (e.g., 
Verrecchia  1990;  Penno  1996;  Einhom  2007). 

^  We  can  also  assume  (^  <  0  without  altering  the  flavor  of  the  results. 

®  Several  recent  regulatory  actions  were  aimed  at  eliminating  the  reporting  conflict  between  analysts  and  investors.  The 
Sarbanes-Oxley  Act,  enacted  on  July  25,  2002,  was  drafted  to  strengthen  the  independence  of  security  analysts  (Razaee 
2007).  The  legal  settlement  between  the  large  brokerage  firms  and  the  State  of  New  York  in  2002  has  separated  research 
and  investment  banking  activities  within  investment  firms,  altered  the  way  in  which  analysts  are  compensated,  and 
mandated  disclosure  in  stock  reports  of  conflicts  of  interest  that  might  exist.  Despite  these  actions,  analysts  continue  to 
have  incentives  to  bias  their  reports  to  curry  favor  with  management.  Mayew  (2008)  finds  that  management  favors  those 
analysts  in  conference  calls  who  have  bullish  stock  recommendations  on  the  firm. 
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whenever  possible  and  their  actions  maximize  their  expected  payoffs.  The  decision-maker’s  action 
a  maximizes  her  expected  payoff  U^  =  -E\_{a- a)^\x  The  analyst’s  choice  of  information 
precision  p  maximizes  f/,  =  £'[</)a-(a-a)^-c(/?)  |x],  and  when  the  cost  of  gathering  informa¬ 
tion  is  sunk,  the  analyst’s  choice  of  report  r  maximizes  Ui- E\^(t)a- {a- aY\x  ,y\  It  can  be 
established  that  there  do  not  exist  equilibria  in  mixed  strategies  in  which  some  communication 
occurs  in  the  reporting  stage.  Hence,  the  exposition  disregards  the  possibility  of  mixed  strategies.^ 


III.  BENCHMARK  WITH  TRUTHFUL  REPORTING 

We  consider  a  benchmark  setting  in  which  the  analyst  must  report  truthfully,  r  =  y,  and  his 
precision  choice  p  is  common  knowledge.  In  the  reporting  stage,  the  decision-maker’s  action  is 
a  =  £[5  I  X  ,  r  =  y].  Given  this  action,  the  analyst’s  precision  choice  maximizes: 

E[E[(f)E[a\x,y']  -{a-  E[a\x,y'\f  -  c07)|x:,y]|x]  =  (j)Eia\x~\  -  — - ^ ^  -  c{p),  (3) 

2p  -  1 

where  A(f>)  is  the  difference  between  the  decision-maker’s  actions  when  she  believes  she  knows  y, 
that  is: 


A(p)  =  E{a\x,g]  -  E[a\x,b]  = 


q{l  -  q){2p  -  \){s  -  f) 


{p{\-q)  +  (1  -p)q){{\ -p){l  -q)  +  pq) 


(4) 


The  difference  A(p)  is  increasing  in  p,  which  implies  that  the  decision-maker’s  action  choice  is 
more  responsive  to  the  analyst’s  information  when  the  analyst’s  information  is  more  precise.  It  is 
decreasing  in  q,  which  implies  the  decision-maker’s  action  is  less  responsive  to  the  analyst’s 
information  when  the  public  information  is  more  precise.  The  difference  is  independent  of  the 
realization  of  x,  which  stems  from  the  assumption  that  the  prior  beliefs  regarding  the  unknown 
state  variable  a  are  diffuse.® 

To  ensure  a  unique,  interior  optimal  choice  of  precision,  we  assume  the  analyst’s  cost  function 
c{p)  satisfies  the  following  additional  conditions  for  the  remainder  of  the  analysis:  (i) 

c'{p)  ^  ...s  ,,  ,  ,...,  l-3p(l-p) 


lim„ 


-p^l/2(p_i/2)  (i*)  ^  ^p)  ^  c'(p)  "  p(l-p)(p-l/2) 

(i)  implies  that  when  p  is  close  to  one-half,  the  marginal  benefit  of  increasing  the  precision  of  the 
analyst’s  information  is  strictly  greater  than  its  marginal  cost.  Condition  (ii)  implies  that  when  p  is 
close  to  one,  the  marginal  cost  of  increasing  the  precision  of  information  exceeds  the  marginal 
benefit.  Condition  (iii)  implies  the  analyst’s  objective  function  is  quasi-concave  (i.e.,  single- 
peaked),  which  ensures  the  analyst’s  precision  choice  is  unique.  The  assumption  requires  that  the 
analyst’s  cost  function  for  gathering  information  must  be  sufficiently  convex  that  a  departure  from 
the  optimal  choice  of  precision  would  cause  the  analyst’s  expected  utility  to  strictly  decline.  The 
power  function  c{p)  =  {{p-  \ !  2) !  {  \  -p)Y,  where  n  >  2,  is  an  example  of  a  cost  function  that 
satisfies  these  conditions. 

Given  the  precision  choice  p  must  be  in  the  interior,  that  is  p  g  (1/2,1),  the  analyst’s  optimal 
precision  choice  satisfies  the  first-order  condition  characterized  in  Lemma  1 . 

Lemma  1:  When  the  analyst’s  precision  choice  is  observable  and  the  analyst  must  report 
truthfully,  the  analyst’s  unique  precision  choice  p*  satisfies: 

AV)/(2p*-I)-cV*)  =  0.  (5) 

The  analyst’s  choice  of  precision  for  the  signal  y  is  independent  of  the  realization  of  the  public 


(iii) 


> 


for  all  p  e  (1/2,  1).  Condition 


’  Further  details  available  from  the  authors  on  request. 

*  Details  about  the  consequence  of  varying  the  prior  beliefs  are  available  from  the  authors  on  request. 
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information  x.  To  develop  intuition  for  this  finding,  notice  that  the  analyst’s  objective  function 
prior  to  observing  the  signal  y,  as  given  in  (3),  is  increasing  in  the  decision-maker’s  expected 
action  and  decreasing  in  the  cost  of  inducing  the  decision-maker  to  take  an  inappropriate  action. 
Consider  the  first  term,  £'[a|x],  in  (3).  The  expected  action  is  increasing  in  the  realization  of  the 
public  information  x.  However,  a  change  in  the  precision  of  the  anticipated  signal  y  does  not  affect 
the  decision-maker’s  expected  action — it  remains  Zi[a|x].  Consider  the  second  term,  £'[£'[(5 
-  £[5  I  X  ,  yff-  \  x  ,y]\x],  in  (3).  Here  a  change  in  precision  of  the  anticipated  signal  y  does  affect 
the  decision-maker’s  expected  responsiveness  to  the  analyst’s  message.  However,  the  expected 
variance  term  does  not  vary  in  the  realization  of  x  because,  conditional  on  the  realization  of  a,  x, 
and  y  are  independent.  It  follows,  therefore,  that  the  analyst’s  optimal  precision  choice  does  not 
depend  on  the  public  information. 

Analyst  Behavior  and  Decision-Maker  Information 

The  focus  of  our  analysis  is  on  the  impact  of  the  quality  of  the  public  information,  q,  on  the 
analyst’s  behavior.  Accordingly,  we  consider  the  relation  between  the  analyst’s  precision  choice, 
p*,  and  the  quality  of  the  public  information,  q 

Corollary  2:  When  the  analyst’s  precision  choice  is  observable  and  the  analyst  must  report 
truthfully,  the  chosen  precision  of  the  analyst’s  information,  /?*,  is  decreasing 
in  the  precision  of  the  public  information  q-,  that  is,  dp*ldq  <  0. 

The  intuition  for  the  inverse  relation  between  q  and  p*  is  familiar.  As  the  precision  of  public 
information  increases,  it  substitutes  for  the  analyst’s  information,  causing  the  marginal  benefit  of 
the  analyst’s  information  to  fall.  Since  gathering  information  is  costly,  the  choice  p*  that  maxi¬ 
mizes  the  analyst’s  expected  utility  decreases. 

The  intuitive  crowding-out  finding  in  Corollary  2  suggests  that  the  impact  of  any  increase  in 
public  information  on  decision-maker  information  is  tempered  by  a  loss  of  analyst  information. 
Given  the  countervailing  decline  in  the  analyst  information  associated  with  an  increase  in  the 
public  information,  it  is  unclear  whether  an  increase  in  public  information  will  ultimately  lead  to 
an  increase  in  decision-maker  information.  To  formally  assess  the  effect  of  the  change  in  the 
public  information  on  the  decision-maker’s  information,  however,  requires  a  measure  of  the 
decision-maker’s  information.  We  define  the  quality  of  information  as  the  decision-maker’s  ex¬ 
pected  posterior  variance  after  observing  the  public  signal  and  analyst  report,  -E{var\_a  |  x  ,  r]  |  x]. 
The  statistical  measure,  -E[varia\x  ,y  ;  p*]  |  x],  which  is  consistent  with  Crawford  and  Sobel 
(1982),  is  appealing  because  it  is  equivalent  to  the  decision-maker’s  expected  utility.  The  decision¬ 
maker’s  expected  posterior  variance  after  the  public  signal  is  realized  and  analyst  report  is  ob¬ 
served  is  identical  to  the  decision-maker’s  expected  utility  in  the  information-gathering  stage  after 
the  public  signal  is  realized,  but  before  che  analyst’s  information  is  observed.^ 

The  decision-maker’s  quality  of  information  when  the  analyst  chooses  p  =  p*  h  given  by 
-E[varia\  x  ,  y  ;  /?*]  |  x]  =  -p*(l  -  p*)(5  -  f)Hp*)  /  (2p*  -  1).  Using  this  measure,  we  find  an 
increase  in  the  precision  of  the  public  information  q  leads  to  an  increase  in  the  decision-maker’s 
information;  that  is: 

Corollary  3:  Suppose  the  analyst’s  precision  choice  is  observable  and  the  analyst  must  report 
truthfully.  The  decision-maker’s  quality  of  information  is  increasing  in  the  pre¬ 
cision  of  the  public  information,  q. 


®  The  decision-maker’s  utility  is  identical  to  the  middle  term  in  the  analyst’s  objective  function  in  Equation  (3).  Accord¬ 
ingly,  this  term  serves  to  align,  at  least  partially,  the  interests  of  the  analyst  and  decision-maker. 
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IV.  UNCONSTRAINED  REPORTING 

Analysts  are  fairly  unconstrained  in  how  they  report  their  private  information.  Consistent  with 
this  observation,  we  alter  the  model  and  allow  the  analyst  to  report  without  cost  either  g  or  b, 
regardless  of  the  realization  of  his  private  information  y.  We  continue  to  assume  that  the  analyst’s 
choice  of  p  is  common  knowledge  to  focus  solely  on  how  communication  incentives  affect  analyst 
information-gathering  decisions.  This  setting  is  arguably  consistent  with  the  sell-side  equity  ana¬ 
lyst  environment.  Analyst  stock  reports  vary  in  content  and  often  contain  detailed  analysis  of  a 
company,  its  key  competitors,  and  its  industry.  The  detail  in  the  analyst’s  stock  report,  which  the 
investor  can  readily  observe,  reflects  the  precision  of  the  analyst’s  information  p.  Investors,  how¬ 
ever,  do  not  observe  the  analyst’s  procedures  for  evaluating  and  interpreting  the  information  that 
is  gathered.  It  is  this  interpretation  and  evaluation  that  provides  the  analyst  with  private  informa¬ 
tion  y.  Later,  we  shall  relax  the  assumption  that  p  is  common  knowledge. 

Like  most  cheap-talk  games,  there  are  multiple  equilibria.  There  always  exist  equilibria  in 
which  the  analyst  does  not  gather  any  information.  There  may  or  may  not  also  exist  equilibria  in 
which  the  analyst  gathers  and  communicates  information.  When  there  exist  equilibria  in  which 
information  is  gathered,  we  focus  on  this  class  because  these  equilibria  ex  ante  Pareto  dominate 
any  equilibrium  in  which  information  is  not  gathered.  Within  the  class  of  equilibria  in  which 
information  is  gathered,  we  focus  on  the  truthful  communication  equilibrium — the  equilibrium  in 
which  r  =  y  for  all  y.  This  focus  is  without  loss  of  generality  because  when  there  exists  an 
equilibrium  with  full  revelation  that  involves  the  analyst  reporting  something  other  than  y  (e.g.,  r 
G  {b,g}  and  r  y  for  all  y),  it  is  economically  equivalent  to  the  truthful  communication  equilib¬ 
rium. 

When  the  players’  interests  are  perfectly  aligned,  communication  is  credible  and  the  analyst 
can  always  fully  reveal  his  privately  observed  information.  When  the  players’  interests  are  mis¬ 
aligned,  however,  an  analyst  cannot  always  fully  reveal  his  privately  observed  information.  To 
assess  whether  the  analyst  can  credibly  communicate  y,  note  that  the  analyst  prefers  a  higher 
action  than  the  decision-maker,  given  y,  and  that  the  decision-maker  would  take  a  higher  action  if 
she  believes  the  analyst  has  observed  y  =  g.  It  follows  that  the  analyst  is  always  willing  to  reveal 
when  y  =  g,  but  may  want  to  mislead  the  decision-maker  when  y=zb  by  nefariously  claiming  that 
y  =  g.  Recalling  that  the  value  of  the  analyst’s  objective  is  increasing  and  then  decreasing  in  the 
decision-maker’s  action  implies  the  analyst  will  not  try  to  dissemble  when  y  =  b  if  the  action 
induced  from  claiming  that  y  =  g  is  much  higher  than  the  action  preferred  by  the  analyst  when 
y-b.  Hence,  the  analyst  can  communicate  y  if  the  action  he  would  induce  by  misrepresenting  his 
information  is  sufficiently  high  relative  to  the  one  he  induces  if  he  truthfully  communicates  his 
information.  Formally,  the  analyst  can  credibly  communicate  if  and  only  if: 


A(»  =  E{a\x,g'\  -  E[a\x,b]  = 


_ ^(1  1)(^-/) _ 

(p(l  -^)  +  (l  -p)^)((l  -p){\-q)+pq) 


(6) 


Observe  that  A(p)  is  an  increasing  function  of  p,  which  implies  that  the  decision-maker’s 
action  choice  is  more  responsive  to  the  analyst’s  information  when  the  analyst’s  information  is 
more  precise.  As  discussed  above,  the  analyst  can  communicate  y  if  the  action  he  induces  when  he 
claims  to  have  observed  realization  g  is  sufficiently  large  relative  to  the  action  he  induces  when  he 
claims  to  have  observed  realization  b.  Hence,  if  p  is  sufficiently  high,  the  analyst  can  communicate 
y.  The  observation  that  the  analyst  can  fully  reveal  his  privately  observed  information  provided  the 


It  is  standard  in  the  cheap-talk  literature  to  focus  on  an  ex  ante  Pareto  dominant  equilibrium;  see,  for  instance,  Crawford 
and  Sobel  (1982)  and  Alonso  et  al.  (2008). 
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players’  interests  are  not  too  misaligned  and  the  analyst’s  information  is  sufficiently  precise  is 
formalized  in  the  next  lemma/ ^ 

Lemma  4:  Suppose  the  analyst’s  precision  choice  is  observable  and  the  analyst  need  not 
report  truthfully.  The  analyst  can  communicate  y  in  the  presence  of  the  other 
information  x  if  and  only  if  the  precision  of  the  analyst’s  information  is  suffi¬ 
ciently  high;  that  is,  there  exists  a  minimum  threshold  p  such  that,  for  any  x,  the 
analyst  can  credibly  communicate  y  if  and  only  if  p^p. 

When  the  analyst  must  truthfully  report  y.  Lemma  1  established  that  the  analyst’s  precision 
choice  is  maximized  at/?*.  When  the  analyst  is  not  restricted  to  report  truthfully.  Lemma  4  implies 
that  p  must  be  at  least  p  for  credible  communication  to  occur.  It  follows  from  these  two  lemmas 
that,  when  the  analyst  is  not  restricted  to  report  truthfully,  the  analyst  chooses  information  quality 
/?*  if  credible  communication  is  possible,  specifically  /?=/?*>  p.  Otherwise,  the  analyst  chooses 
a  higher  level  of  information  quality  to  allow  communication,  p=p>  p*,  or  he  abstains  from 
gathering  information  if  information  of  higher  quality  is  too  costly,  that  is,  p  =  1/2.  To  formalize 
when  the  analyst  decides  to  forgo  gathering  information  as  opposed  to  gathering  information  that 
is  sufficiently  precise  to  facilitate  communication,  we  characterize  a  threshold  p*.  This  threshold  is 
the  level  of  precision  such  that  the  analyst  is  indifferent  between  gathering  information  with 
precision  p*  that  is  truthfully  revealed  and  not  gathering  information,  p  =  1/2;  formally,  p*  solves; 


c{p’)  =  S(p*)q{l-q){2p‘-\)(s-f}.  (7) 

Note  that  the  threshold  p*  strictly  exceeds  the  interior  choice  /?*. 

We  characterize  the  analyst’s  equilibrium  information-gathering  and  communication  decisions 
as  a  function  of  p*,  p,  and  p*  in  Proposition  5. 

Proposition  5:  Suppose  the  analyst’s  precision  choice  is  observable  and  the  analyst  need  not 
report  truthfully.  There  always  exists  an  equilibrium  in  which  the  analyst  does 
not  gather  any  information,  p  =  1/2.  Furthermore: 

(i)  If  p  <  p^  <  p*,  then  there  exists  a  unique  equilibrium  with  an  interior  level  of  preci¬ 
sion,  which  equals  /?*.  The  analyst  truthfully  communicates  the  information  gathered. 

(ii)  Ifp*  <p<p*,  then  there  exists  a  unique  equilibrium  with  an  interior  level  of  preci¬ 
sion,  which  equals  p.  The  analyst  truthfully  communicates  the  information  gathered. 

(iii)  If  /?*  <p*  <p,  then  there  does  not  exist  an  equilibrium  with  an  interior  level  of 
precision. 

A  key  feature  of  Proposition  5  is  that  the  analyst’s  credibility  problem  alters  the  structure  of 
the  costs  the  analyst  faces.  When  the  analyst  is  unable  to  communicate,  he  cannot  coordinate  the 
decision-maker’s  actions.  This  inability  is  costly.  By  gathering  more  precise  information,  however, 
the  analyst  can  credibly  communicate  information  that  causes  the  decision-maker  to  take  more 
informed  actions. Since  the  players’  interests  are  partially  aligned,  this  communication  not  only 


The  claim  that  the  analyst  can  credibly  communicate  when  p  is  sufficiently  high  can  be  established  for  a  more  general 
specification  of  the  analyst’s  objective  function  U{a,a,(f>)  that  has  the  properties:  (i)  dU{a,a,f)/da  =  0  for  some  action 
a;  (ii)  d^U{a ,  a ,  (f>)  /  da^  <  0  so  that  the  analyst’s  utility  function  [/(.)  has  a  unique  maximum  action  a  for  each  pair 
and  (iii)  fU(a ,  a ,  f)  I  dada  >  0  so  that  the  best  decision-maker’s  action  from  the  analyst’s  perspective  is 
increasing  in  the  state  a.  The  more  general  utility  function  does  not  lend  itself  to  comparative  static  exercises  in  a 
straight-forward  manner,  motivating  the  tractable  utility  function  specified  in  Section  II. 

In  contrast,  if  the  players’  interests  are  sufficiently  misaligned  that  (p  >  {s  -  f),  then  the  analyst  is  unable  to  gather 
sufficiently  precise  information  to  credibly  communicate.  Hence,  only  the  babbling  equilibrium  prevails.  In  this  equi¬ 
librium,  the  analyst  does  not  collect  any  private  information. 
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heightens  the  decision-maker’s  expected  utility,  but  also  raises  the  analyst’s  expected  utility.  Con¬ 
sequently,  the  analyst’s  desire  to  influence  the  decision-maker  motivates  the  analyst  to  gather 
information. 

Analyst  Behavior 

When  the  analyst  is  constrained  to  report  truthfully,  increasing  the  public  information,  q, 
causes  the  analyst  to  reduce  his  information  gathering.  In  contrast,  when  the  analyst  is  not  con¬ 
strained  to  report  truthfully,  the  effect  of  public  information  on  the  analyst’s  information-gathering 
activities  is  more  subtle.  Examining  how  the  public  information,  q,  influences  the  analyst’s 
information-gathering  behavior  entails  determining  how  q  affects  the  critical  values  p*,  p,  and  p^. 
The  inverse  relation  between  p*  and  q  was  established  earlier. 

For  the  analyst  to  credibly  communicate,  he  must  gather  information  with  precision  of  at  least 
p.  Corollary  6  shows  p  is  increasing  in  q. 

Corollary  6:  The  minimum  precision  of  the  analyst’s  private  information  necessary  for  the 
analyst  to  credibly  communicate,  p,  is  increasing  in  the  precision  of  public 
information  q\  that  is,  dp !  dq>  0. 

An  increase  in  the  precision  of  the  public  information  must  be  accompanied  by  an  increase  in  the 
precision  of  the  analyst’s  private  information  for  the  decision-maker  to  remain  responsive  to  the 
analyst’s  report  and  thereby  encourage  the  analyst  to  continue  to  report  truthfully.  Conversely,  an 
increase  in  the  precision  of  the  public  information  without  a  corresponding  increase  in  the  preci¬ 
sion  of  the  analyst’s  report  causes  the  decision-maker  to  place  relatively  more  weight  on  the  public 
information  and  relatively  less  weight  on  the  analyst’s  report.  The  analyst  is  then  less  capable  of 
influencing  the  decision-maker’s  action  and,  accordingly,  is  more  inclined  to  lie. 

The  analyst  might  decide  not  to  gather  information  that  is  sufficiently  precise  to  credibly 
communicate  because  it  is  too  costly  to  do  so.  This  decision  hinges  on  the  level  of  precision 
p^ — the  unique  level  of  precision  for  which  the  analyst  is  indifferent  between  gathering  informa¬ 
tion  that  can  be  communicated  and  not  gathering  information.  This  level  of  precision  p*  also 
depends  on  the  precision  of  the  public  information,  q.  As  the  precision  of  public  information 
increases,  the  benefit  to  the  analyst  of  providing  the  decision-maker  with  more  precise  information 
decreases.  Hence,  p^  declines.  Corollary  7  formalizes  this  discussion. 

Corollary  7:  The  level  of  precision  of  the  analyst’s  private  information  for  which  the  analyst 
is  indifferent  between  gathering  and  not  gathering  information,  p*,  is  decreas¬ 
ing  in  the  precision  of  public  information  q\  that  is,  dp^  I  dq  <0. 

Having  established  the  relations  between  precision  of  the  public  information,  q,  and  analyst 
information  precision  levels  p*,  p,  and  p*,  we  now  assess  how  the  analyst’s  equilibrium  precision 
choice  varies  in  the  precision  of  the  public  information.  Consider  the  case  for  which  the  precision 
of  the  public  information  is  relatively  low,  so  that  p*  >p.  As  q  increases,  p  rises  and  p*  falls. 
Because  the  analyst  chooses  p*  when  p*  >  p,  the  analyst’s  choice  of  p  falls  in  q  because  p*  falls 
in  q.  Eventually,  q  rises  to  a  level  where  p"^  =p.  As  q  continues  to  rise,  communication  is  no 
longer  credible  if  the  analyst  chooses  p*.  Hence,  the  analyst  must  choose  between  the  level  of 
precision  necessary  for  credible  communication  p  or  not  collect  any  information  by  choosing  p  = 
1/2.  Initially,  the  analyst  responds  by  choosing  p  and  overinvesting  in  the  quality  of  information  he 
gathers  to  allow  credible  communication.  Given  that  p  is  increasing  in  q,  it  follows  that  the 
precision  of  the  analyst  information  increases  in  q.  When  p  >  p*,  however,  implementing  p  be¬ 
comes  so  costly  that  the  analyst  decides  not  to  collect  any  information.  In  these  circumstances,  the 
improvement  in  the  public  information  causes  the  analyst  to  underinvest  in  the  quality  of  infor- 
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mation  he  gathers  relative  to  that  which  he  would  gather  if  he  could  commit  to  truthfully  reveal  his 
private  information.  In  summary,  we  have  the  following  observation: 

Remark  1:  The  precision  of  the  information  the  analyst  collects  is  decreasing  in  the  precision 
of  the  public  information,  then  increasing  in  the  precision  of  the  public  informa¬ 
tion,  and,  at  some  sufficiently  high  level  of  precision  of  the  public  information, 
the  analyst  stops  collecting  private  information;  that  is,  there  exists  values 
and  q>  q*  such  that  the  equilibrium  precision  of  the  analyst’s  private  informa¬ 
tion  is  decreasing  in  q  over  the  range  (1/2,^*],  increasing  in  q  over  the  range 
(g*  ,q},  and  is  uninformative  for  q>  q. 

This  characterization  of  the  analyst’s  information-gathering  behavior  is  illustrated  in  Figure  2.  The 
range  (l/2,g*]  can  be  empty  because  g*  =  1/2  for  some  set  of  parameters. 

Our  model,  which  emphasizes  the  role  of  reporting  credibility  for  understanding  the  informa¬ 
tion  intermediary’s  behavior,  contrasts  with  the  voluntary  disclosure  models  commonly  examined 
in  the  accounting  literature,  which  assume  all  disclosure  must  be  truthful,  but  allow  the  informed 
party  to  decide  whether  to  make  such  disclosure.  These  models  offer  reasons  for  why  managers 


FIGURE  2 

Characterization  of  the  Analyst’s  Information-Gathering  Behavior 


Analyst  information 


Analyst  gathers  p*  Analyst  gathers  p  Analyst  does  not 

and  over  invests  gather  information 
gathering  information 


This  figure  characterizes  the  relation  between  the  precision  of  the  public  information  q  and  the  analyst’s  equi¬ 
librium  choice  of  precision  p.  The  solid  line  reflects  the  analyst’s  equilibrium  precision  choice,  the  dashed  line 
reflects  the  level  of  precision  necessary  to  credibly  communicate  p,  and  the  dotted  line  represents  the  level  of 
precision  chosen  when  credibility  issues  are  absent,  p*. 
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might  not  voluntarily  release  private  information.  Our  model  suggests  that  analysts  might  not 
disclose  non-verifiable  information  because,  in  anticipation  of  being  unable  to  credibly  reveal  it, 
they  abstain  from  gathering  any  information.  Further,  we  find  that  analyst  credibility  is  an  impor¬ 
tant  ingredient  affecting  the  precision  of  information  the  analyst  gathers.  It  can  cause  an  analyst  to 
underinvest  or  overinvest  in  information  collection  relative  to  the  model  in  which  any  disclosure 
is  always  truthful.*^ 

Remark  1  provides  some  insight  into  the  recent  evolution  of  the  sell-side  analyst  industry. 
Recent  reporting  requirement  changes  have  greatly  increased  the  amount  of  information  firms 
make  available  directly  to  investors.  These  changes  have  eroded  stock  analysts’  information  ad¬ 
vantage.  Remark  1  suggests  that,  in  response  to  these  changes  in  the  information  environment, 
some  intermediaries  have  reduced  the  information  they  collect,  others  have  increased  the  quality 
of  their  analysis  in  response  to  the  heightened  competition  from  other  sources,  and  yet  others  have 
elected  to  no  longer  gather  information  on  particular  firms. 

Remark  1  is  not  empirically  helpful  because  it  suggests  “anything  can  happen.”  To  offer  more 
definitive  guidance  as  to  how  intermediaries  alter  the  quality  of  their  analysis  in  response  to 
changes  in  public  information,  we  consider  how  some  environmental  characteristics  might  influ¬ 
ence  analyst  behavior.  In  particular,  we  assess  how  the  extent  of  the  incentive  conflict,  captured  by 
(f),  and  the  degree  of  prior  uncertainty  about  the  firm’s  payoffs,  as  captured  by  (5  —  f),  alter  analyst 
behavior.  Doing  so  requires  that  we  first  examine  how  changes  in  these  characteristics  affect  the 
critical  values  of  precision. 

Corollary  8:  The  precision  of  the  analyst’s  information  chosen  when  the  analyst’s  informa¬ 
tion  is  public,  p*,  is  unaffected  by  the  extent  of  incentive  misalignment  0  and 
is  increasing  in  the  difference  between  the  state  payoffs  (s  —  f);  that  is,  dp^/8(f) 
=  0  and  dp^ld{s  —  f)  >  0.  The  minimum  precision  of  the  analyst’s  private 
information  necessary  for  the  analyst  to  credibly  communicate,  p,  is  increasing 
in  the  extent  of  incentive  misalignment  </>  and  decreasing  in  the  difference 
between  the  state  payoffs  (5  —  f);  that  is,  dpld(f)>0  and  dp  /  d{s  -f)  <0. 
Finally,  the  level  of  precision  of  the  analyst’s  private  information  for  which  the 
analyst  is  indifferent  between  gathering  and  not  gathering  information,  p^,  is 
unaffected  by  the  extent  of  incentive  misalignment  0  and  is  increasing  in  the 
difference  between  the  state  payoffs  {s  —  f);  that  is,  dp^ !  d(/)  =  0  and  dp^ !  d{s 
-f)>0. 

We  develop  intuition  for  how  p*  behaves  in  response  to  these  parameters.  The  value  p*  is 
derived  assuming  the  analyst  must  report  truthfully.  Accordingly,  the  extent  of  the  incentive 
misalignment  between  the  players,  (f),  does  not  influence  the  choice  of  p*.  In  contrast,  as  the  prior 
variance  of  the  firm’s  payoffs,  which  is  proportional  to  (5  —  f),  increases,  both  players  prefer  more 
precise  information,  implying  an  increase  in  p*.  Because  the  value  p^  presumes  the  analyst  can 
credibly  communicate  the  information  gathered,  p^  varies  in  a  similar  fashion  as  p*  to  changes  in 
(j)  and  (5  -  f). 

We  now  turn  to  how  p  behaves.  The  analyst  is  more  inclined  to  truthfully  reveal  his  privately 
observed  signal  when  the  decision-maker’s  actions  are  highly  responsive  to  his  disclosure.  The 
decision-maker  is  more  responsive  when  the  analyst’s  information  is  relatively  more  precise  or  the 


Examining  the  effect  on  the  analyst  behavior  of  changes  in  the  precision  of  the  other  information  is  reminiscent  of  the 
voluntary  disclosure  literature  that  considers  the  credibility  of  a  firm  manager’s  communication  when  investors  obtain 
other  information  that  allows  them  to  assess  the  veracity  of  the  manager’s  disclosure.  For  instance,  Sansing  (1992)  and 
Stocken  (2000)  examine  managers’  earnings  forecasting  behavior  when  investors  can  use  an  audited  earnings  report  to 
assess  a  forecast’s  credibility. 
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state  payoffs  are  more  divergent.  When  the  misalignment  of  incentives  is  large,  the  minimal 
precision  of  analyst  information  necessary  for  credible  communication  must  be  high.  Alterna¬ 
tively,  when  the  state  payoffs  are  far  apart  and  prior  uncertainty  is  large,  the  decision-maker  will 
be  more  responsive  to  the  analyst’s  information.  Therefore,  the  minimal  precision  of  the  analyst’s 
information  necessary  for  credible  communication  need  not  be  high. 

Given  Corollary  8,  we  now  specify  how  the  extent  of  incentive  misalignment,  0,  influences 
the  analyst’s  response  to  increases  in  the  precision  of  the  public  information. 

Corollary  9:  If  incentive  misalignment  is  low,  then  the  analyst  will  choose  to  gather  infor¬ 
mation  with  precision  p*,  implying  the  equilibrium  choice  of  precision  is  de¬ 
creasing  in  the  precision  of  public  information.  Alternatively,  if  incentive  mis¬ 
alignment  is  moderate,  then  the  analyst  will  choose  to  gather  information  with 
precision  p,  implying  equilibrium  choice  of  precision  is  increasing  in  the  pre¬ 
cision  of  public  information.  Finally,  if  incentive  misalignment  is  high,  then  the 
analyst  will  abstain  from  collecting  information. 

Within  the  sell-side  analyst  context,  these  observations  suggest  that  intermediaries  with  large 
incentive  conflicts,  such  as  those  whose  employers  do  a  great  deal  of  banking  business  (see  Lin 
and  McNichols  1998)  or  those  who  have  substantial  equity  positions  in  the  firm’s  stock,  may 
decide  to  not  gather  private  information  in  response  to  an  increase  in  public  information.  In 
contrast,  analysts  that  face  few  conflicts  of  interest  will  respond  more  modestly  and  simply  reduce 
their  analysis.  Finally,  analysts  with  more  moderate  conflicts  of  interest  will  be  more  inclined  to 
increase  their  analysis  in  response  to  increases  in  public  information. 

Prior  payoff  uncertainty,  which  is  proportional  to  (5  —  f),  also  influences  the  response  of 
intermediaries  to  the  increase  in  public  information,  as  formalized  in  Corollary  10. 

Corollary  10:  If  the  difference  between  the  state  payoffs  is  low,  then  the  analyst  will  abstain 
from  collecting  information.  Alternatively,  if  the  difference  between  the  state 
payoffs  is  moderate,  then  the  analyst  will  choose  to  gather  information  with  a 
level  of  precision  p,  implying  the  equilibrium  choice  of  precision  is  increasing 
in  the  precision  of  public  information.  Finally,  if  the  difference  between  the 
state  payoffs  is  high,  then  the  analyst  will  choose  to  gather  information  with  a 
level  of  precision  p*,  implying  the  equilibrium  choice  of  precision  is  decreas¬ 
ing  in  the  precision  of  public  information. 

Decision-makers  might  be  highly  uncertain  abouf  a  firm’s  payoffs,  modeled  {s  -  f)  being 
large,  when  the  firm  is  in  an  industry  that  uses  an  innovative  and  unproven  technology.  In  response 
to  an  increase  in  public  information.  Corollary  10  predicts  that  firms  with  highly  risky  payoffs  will 
face  relatively  modest  declines  in  analyst  attention,  those  with  low  risk  payoffs  will  have  analysts 
not  gather  any  information,  and  those  with  moderately  risky  payoffs  will  receive  greater  analyst 
attention. 

Decision-Maker  Information 

In  the  benchmark  setting  in  which  the  analyst  is  constrained  to  report  truthfully,  the  quality  of 
the  decision-maker’s  information  is  increasing  in  the  quality  of  the  public  information.  In  the 
setting  in  which  the  analyst’s  communication  is  unconstrained,  however,  the  relation  between  the 
quality  of  the  public  information  and  the  decision-maker’s  information  is  altered  by  changes  in  the 
analyst’s  information-gathering  incentives.  Hence,  we  revisit  the  relation  between  q  and  the 
decision-maker’s  information. 

Figure  2  shows  that  when  the  quality  of  public  information  is  low,  that  is,  q  e  (1/2,^*],  the 
analyst  chooses  the  same  level  of  precision  p*  as  in  the  benchmark  setting  with  truthful  reporting. 
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As  established  in  the  benchmark  setting,  p*  declines  in  q  because  the  public  information  substi¬ 
tutes  for  the  analyst’s  information.  Nevertheless,  the  quality  of  the  decision-maker’s  information  is 
increasing  in  q  because  this  increase  offsets  the  decrease  in  the  precision  of  the  analyst’s  infor¬ 
mation. 

As  the  quality  of  the  public  information  increases  above  q^,  however,  credibility  issues  arise 
in  the  unconstrained  reporting  setting  and  the  analyst  chooses  p,  the  lowest  level  of  precision  for 
which  credible  communication  is  possible.  In  this  case,  the  decision-maker’s  quality  of  informa¬ 
tion  is  given  by: 

-E[var[a\x,y,p]\x]  =  -p{\  - p){s  -  f)Mp)l{lp  -  1). 

Because  p  is  increasing  in  the  quality  of  the  public  information,  it  follows  that  the  analyst  in¬ 
creases  his  precision  choice  in  response  to  improvements  in  public  information.  Consequently,  the 
public  information  has  a  complementary  effect  on  the  analyst’s  information-gathering  activities. 

Finally,  as  the  precision  of  the  public  information  becomes  so  large  that  the  analyst’s  expected 
utility  when  p  is  chosen  is  less  than  the  analyst’s  expected  utility  when  no  information  is  collected, 
that  is,  p  =  1/2,  the  analyst  decides  not  to  gather  any  information.  Since  the  decision-maker  only 
observes  the  public  information  x,  the  decision-maker’s  quality  of  information  when  p  =  1/2  is 
given  by  -Var\^a\x^- -q{\  -  q){s  -  fp-.  This  observation  implies  that  an  increase  in  q  above 
some  threshold  value  q  causes  the  decision-maker’s  information  quality  to  fall  discontinuously  at 
q.  Thus,  the  increase  in  public  information  squashes  the  analyst’s  willingness  to  collect  private 
information,  which  makes  the  decision-maker  worse  off.  Of  course,  as  the  precision  of  the  public 
information  continues  to  increase,  the  quality  of  the  decision-maker’s  information  increases  and 
attains  a  maximum  value  aX.  q  =  1 .  This  observation  is  formalized  in  Corollary  1 1 . 

Corollary  11:  Suppose  the  analyst’s  precision  choice  is  observable  and  the  analyst  need  not 
report  truthfully.  As  the  precision  of  the  public  information  increases  from  q  = 
1/2,  the  decision-maker’s  quality  of  information  initially  increases,  falls  dis¬ 
continuously  at  q  when  the  analyst  no  longer  gathers  private  information,  and 
then  increases  as  q  approaches  1. 

Corollary  1 1  shows  that  increases  in  the  quality  of  public  information  typically  enhance  the 
decision-maker’s  welfare  except  at  a  threshold  level  of  precision  q,  at  which  the  public  informa¬ 
tion  crowds  out  the  analyst’s  information.  The  intuition  for  why  the  quality  of  the  decision¬ 
maker’s  information  drops  discontinuously  at  q  is  as  follows.  The  analyst  can  credibly  communi¬ 
cate  only  when  the  public  information  is  not  too  precise.  When  the  precision  of  the  public 
information  increases  above  a  threshold  q,  it  crowds  out  the  analyst’s  information.  For  public 
information  with  precision  q,  the  analyst  chooses  information  with  precision  p  >  p*  >  1 1  2,  as 
illustrated  in  Figure  2.  However,  when  q  increases  above  q,  the  decision-maker  loses  the  analyst’s 
information  that  has  a  level  of  precision  p>  1/2.  This  loss  causes  a  discontinuous  decrease  in  the 
quality  of  the  decision-maker’s  information.  While  it  is  displaced  by  an  increase  in  quality  of  the 
public  information,  the  increase  in  the  public  information  yields  only  a  continuous  increase  in  the 
quality  of  the  decision-maker’s  information.  The  combination  of  these  two  effects  is  a  discontinu¬ 
ous  fall  in  the  decision-maker’s  utility  as  q  increases  above  q.  Further,  we  can  show  that  the 
magnitude  of  the  drop  in  the  decision-maker’s  utility  increases  in  the  interest  misalignment  pa¬ 
rameter  (f). 

Corollary  1 1  suggests  that,  at  times,  improving  the  quality  of  public  information  can  crowd 
out  the  analyst’s  ability  to  credibly  communicate  private  information.  In  the  financial  reporting 
environment,  policymakers  often  decide  whether  information  of  a  given  precision  should  be  gath¬ 
ered  and  disclosed  rather  than  choosing  the  precision  of  the  public  information.  In  essence,  a 
policymaker  chooses  between  disclosure  and  no  disclosure.  To  assess  whether  the  crowding  out 
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phenomenon  can  make  no  disclosure  the  policymaker’s  preferred  choice,  we  analyze  the  differ¬ 
ence  between  the  decision-maker’s  information  quality  in  the  absence  of  disclosure,  which  is 
equivalent  \.o  q  =  1/2,  and  the  decision-maker’s  information  quality  when  other  information  of 
precision  q  is  publicly  disclosed. 

We  show  that  the  introduction  of  public  information  can  make  the  decision-maker  worse  off 
within  the  context  of  an  example  with  a  cost  function  c{p)  =  ((p  -  1  /  2)  /  (1  -p)Y,  where  n  >  2. 
Set  s  =  2,  f  =  3/20,  0  =  1/2,  and  n  =  3.  On  one  hand,  consider  the  setting  in  which  public 
information  is  absent  (i.e.,  q  =  1/2).  The  analyst  chooses  p  =  1 1  2  +  c^l  (2(5'  -/))  =  0.635  because 
p*  =0.608  <p<  =  0.649.  Thus,  the  analyst  gathers  more  precise  information  than  he  would 

gather  if  he  could  commit  to  truthfully  reveal  his  privately  observed  information.  Given  the 
analyst  chooses  a  level  of  precision  p  and  communicates  truthfully  in  the  reporting  game,  the 
decision-maker’s  quality  of  information  when  q  =  1/2  is  given  by:  -E[varia\y; p]]  = -p{l 
-  p){s  -  fY  =  -0.793.  On  the  other  hand,  consider  an  environment  in  which  the  public  disclosure 
of  information  is  mandatory  and  it  has  precision  q  e  (0.614,0.635).  For  instance,  suppose  q  = 
0.625;  we  then  observe  p*  =  0.602  =  0.640  <p  =  0.643.  In  this  setting,  it  is  so  costly  to 

gather  information  that  the  analyst  prefers  to  not  gather  any  private  information  at  all;  that  is,  the 
analyst  chooses  p  =  1/2.  Although  the  presence  of  any  public  information  crowds  out  the  analyst’s 
information,  the  decision-maker  benefits  from  observing  the  public  information.  The  decision¬ 
maker’s  quality  of  information  when  q  =  0.625  is  given  by  -Vaifa\x\  =  -q{\-q){s-ff'  = 
-0.802.  When  q  e  (0.614,0.635),  the  benefit  to  the  decision-maker  of  observing  the  public 
information  is  not  sufficient  to  offset  the  analyst’s  information  that  it  displaces.  Hence,  the  intro¬ 
duction  of  public  information  reduces  the  total  quality  of  the  decision-maker’s  information  and 
makes  her  worse  off.  Alternatively,  when  the  precision  of  the  public  information  is  such  that  q  e 
(0.5,0.614),  the  introduction  of  public  information  encourages  information  gathering  and  leaves 
the  decision-maker  better  off  than  when  public  information  is  absent.  Last,  when  q  e  (0.635,1), 
the  public  information  is  sufficiently  precise  that  its  introduction  makes  the  decision-maker  strictly 
better  off,  even  though  it  eliminates  the  analyst’s  willingness  to  gather  private  information. 

We  have  raised  the  possibility  that  introducing  mandatory  disclosures  of  other  information  can 
reduce  the  total  information  available  to  the  decision-maker  when  the  state  space  is  binary.  One 
might  be  concerned  that  this  result  can  arise  only  when  the  state  space  is  discrete,  thereby  allowing 
the  analyst  either  to  communicate  truthfully  or  not  to  communicate  at  all.  This  possibility  result, 
however,  can  also  be  established  when  the  state  space  is  continuous  and  the  analyst  can  only 
partially  reveal  his  privately  observed  information.^'^ 

The  observation  that  additional  public  information  may  reduce  the  players’  welfare  is  remi¬ 
niscent  of  a  result  in  Christensen’s  (1982)  study  of  performance  standards  in  an  agency  setting.  In 
that  study,  Christensen  (1982)  shows  that  the  principal  may  be  worse-off  when  the  players  observe 
additional  contractual  information  before  the  agent  takes  an  action.  Our  setting  differs  from  an 
agency  setting  because,  in  our  setting,  a  receiver  cannot  commit  (i.e.,  contract)  to  use  a  report  in 
a  particular  way.  Consequently,  the  character  of  our  equilibrium  and  the  economic  forces  deliver¬ 
ing  our  results  differ  from  those  in  Christensen  (1982). 

Finally,  we  examine  the  effect  of  the  incentive  misalignment  </>  and  the  difference  between  the 
state  payoffs  {s  -  f)  on  the  decision-maker’s  expected  utility.  Incentive  misalignment  affects  the 
decision-maker’s  expected  utility  when,  in  equilibrium,  the  analyst  chooses  the  level  of  precision 
p  only.  In  this  case,  as  the  level  of  incentive  misalignment  increases,  the  precision  of  information 
the  analyst  must  gather  to  credibly  communicate  increases,  which  heightens  the  decision-maker’s 
expected  utility.  Similar  to  the  effect  of  incentive  misalignment,  the  effect  of  the  difference  be- 
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tween  the  state  payoffs  {s  —  f)  on  the  decision-maker’s  expected  utility  depends  on  the  precision 
of  information  that  the  analyst  gathers  in  equilibrium.  When  the  analyst  chooses  a  level  of  preci¬ 
sion  p*,  then  the  effect  of  (5  —  /)  on  -E[L>ar[5 1  x  ,y,  /?*]  |  x]  is  ambiguous.  An  increase  in  — 
j)  has  a  negative  direct  effect  on  -E[var\^a\  x ,  y  ;p*]  |  x]  and  an  indirect  effect  through  p*  on 
-E[var\^a\  x ,  y  ;  p*]  |  x].  Whether  the  negative  direct  effect  dominates  the  indirect  effect  depends 
on  the  parameter  values  and  the  cost  of  gathering  information  c{p).  On  the  other  hand,  when  the 
analyst  gathers  information  with  precision  p  or  does  not  gather  any  information,  then  an  increase 
in  {s  —  f)  lowers  the  decision-maker’s  expected  utility. 


V.  UNOBSERVABLE  PRECISION 

At  times  a  decision-maker  might  be  unable  to  determine  the  analyst’s  expertise  from  the  stock 
report.  Hence,  we  extend  the  unconstrained  reporting  setting  and  assume  the  analyst’s  choice  of 
precision  p  is  unobservable.  In  contrast  to  the  analysis  when  p  is  public  information,  analyzing  the 
setting  in  which  the  analyst’s  precision  choice  p  is  not  public  is  more  complex  because  the 
decision-maker  must  conjecture  the  analyst’s  choice  of  p  and  the  analyst  must  conjecture  the 
decision-maker’s  response  to  the  analyst’s  report.  In  the  equilibrium  to  this  game,  the  choices  of 
both  players  must  maximize  their  objective  functions  conditional  on  each  having  conjectures  that 
are  consistent  with  the  other  player’s  equilibrium  choices.  As  in  the  earlier  setting  in  which  the 
precision  choice  is  observable,  we  begin  by  considering  the  analyst’s  precision  choice  when  the 
analyst  must  report  truthfully.  We  then  address  the  analyst’s  credibility  problem. 

Assume  the  analyst  must  truthfully  report  his  information  y,  but  privately  chooses  its  preci¬ 
sion.  Denote  the  decision-maker’s  conjecture  regarding  the  analyst’s  precision  choice  as  p.  When 
the  public  information  x  and  analyst  information  y  is  realized,  the  decision-maker’s  action  choice 
must  satisfy: 

P^x  .  (l-p)(l-0  , 

= - 5-1- - f  (8) 

pt^+{l-p){l-t^)  pt^  +  {l-p)il-t^) 

and: 


^xb- 


(1  -P)tx 


-s  -I- 


p(i  -  o 


{l-p)tx  +  pil-0  -p)tx  +  pi^  -tx) 


■f. 


(9) 


where  4  =  ^  when  x  =  h  and  4  =  1  -  ^  when  x  =  /.  Equations  (8)  and  (9)  imply  s>a^g>  >  /  in 
any  equilibrium.  Denote  the  analyst’s  conjecture  regarding  the  decision-maker’s  response  to  real¬ 
ization  (x,y)  as  d^y.  Consider  the  analyst’s  precision  choice  given  x  and  conjectured  actions  5 
>  d^g  >  djci,  >  /.  Given  the  particular  realization  of  x,  the  analyst’s  choice  of  p  maximizes: 

txPi(f>dxg  -  (d^g  -  s)^)  -I-  (1  -  4)(1  -p){(f)d^g  -  {d^g  -/)2)  -t-  4(1  -  p)i(f>d^h  -  i^xb  -  sf) 

+  (1  -  tx)p{(l>d^b  -  i^xb-ff)  -  c{p). 

Because  c(p)  is  sufficiently  convex,  the  analyst’s  interior  choice  of  p  satisfies  the  following 
first-order  condition: 


(4g  -  -  d^g  -  d^h)  +  (1  -  tx){d^g  +  d^f,  -2f)  +  0(24  -  1))  -  c'(p)  =  0.  (10) 

Substituting  in  p  for  p  in  (8)  and  (9)  and  then  taking  those  actions  and  substituting  them  in  for  d^g 
and  d^i,  in  (10)  yields  the  following  condition  that  any  equilibrium  choice  for  p,  namely  p^^,  must 
satisfy: 

l)-c'(p*„)-hA(p*^)0(24- 1)  =  0.  (11) 

Lemma  12  characterizes  the  analyst’s  precision  choice. 
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Lemma  12:  Suppose  the  analyst’s  precision  choice  is  unobservable  and  the  analyst  must 
report  truthfully.  There  always  exists  an  equilibrium  precision  choice,  in 
which  the  analyst  does  not  gather  any  information,  p  =  1/2.  In  addition,  if  the 
public  signal  is  favorable,  x  =  h,  there  always  exists  a  unique  equilibrium  preci¬ 
sion  choice  p^^  e  (p*  ,  1);  alternatively,  if  the  public  signal  is  unfavorable,  x 
=  /,  there  may  exist  one  or  more  equilibrium  precision  choices,  all  of  which  have 
the  property  that  e  ( 1  /  2  ,  p*). 

The  first  point  worth  noting  in  Lemma  12  is  that  there  always  exists  an  equilibrium  in  which 
the  analyst  does  not  provide  any  information.  This  equilibrium,  which  is  analogous  to  the  babbling 
equilibrium  that  always  exists  in  a  standard  cheap-talk  game,  arises  because:  (1)  the  analyst  has  no 
incentive  to  gather  information  if  the  decision-maker  does  not  respond  to  the  resulting  signal,  and 
(2)  the  decision-maker  has  no  reason  to  respond  to  a  signal  that  is  believed  to  be  uninformative. 
Thus,  in  contrast  to  cheap-talk  games  without  endogenous  information  collection  that  always 
feature  a  babbling  equilibrium,  in  this  setting  in  which  information  collection  is  endogenous,  we 
find  that  there  is  always  an  equilibrium  in  which  the  analyst  does  not  provide  the  decision-maker 
with  any  information  even  though  the  analyst’s  report  is  constrained  to  be  truthful.  The  second 
point  worth  noting  is  that  there  always  exists  one,  and  perhaps  more  than  one,  equilibrium  in 
which  the  analyst  gathers  information  for  at  least  one  of  the  two  realizations  of  the  public  signal. 

To  develop  intuition  for  the  equilibrium  with  an  interior  choice  for  p^^,  we  consider  the 
analyst’s  reporting  problem  for  each  realization  x.  When  x  =  h  and  the  analyst’s  conjecture  of  the 
decision-maker  actions  are  d^g  >  d^b,  the  decision-maker  can  increase  the  probability  of  inducing 
dbg  by  increasing  precision  p  because  it  increases  the  likelihood  that  the  analyst’s  signal  y  will  be 
favorable.  As  a  consequence,  the  analyst,  who  prefers  a  higher  action  than  the  decision-maker, 
chooses  more  precise  information.  The  analyst  does  not  choose  the  most  precise  information, 
however,  because  of  the  increasing  marginal  cost  of  precision.  On  the  other  hand,  when  x  =  I  and 
the  analyst’s  conjecture  of  the  decision-maker  actions  are  dig  >  dib,  the  analyst  can  increase  the 
probability  of  inducing  the  high  action,  dig,  by  reducing  the  precision  p  because  the  likelihood  that 
the  analyst’s  signal  will  be  favorable  is  decreasing  in  p.  Accordingly,  the  analyst  has  an  incentive 
to  choose  a  lower  precision  p,  which  is  mitigated  by  the  concern  that  a  decrease  in  the  precision 
of  the  report  increases  the  probability  of  a  mismatch  between  the  decision-maker’s  action  and  the 
state. 

As  indicated  in  Lemma  12,  the  relation  between  the  equilibrium  precision  choice  when  the 
precision  choice  is  not  observable,  p^^,  and  the  precisfon  choice  when  precision  is  observable,  p*, 
varies  depending  upon  whether  the  public  information  x  is  favorable,  x  =  h,  or  unfavorable,  x  =  l. 
To  develop  intuition  for  the  difference  between  p*  and  we  recognize  that,  when  the  precision 
choice  p  is  not  observable,  the  analyst  chooses  a  level  of  precision  without  considering  its  impact 
on  the  decision-makers’  actions.  However,  when  p  is  observable,  the  analyst  considers  the  impact 
of  his  precision  choice  on  the  decision-makers’  actions,  and  the  impact  on  those  actions  depends 
on  whether  x^h  or  x  =  L  Regardless  of  whether  x  =  h  or  x  =  I,  a  marginal  increase  in  precision  p 
increases  a^g  and  decreases  a^b-  When  the  public  information  is  favorable,  x  =  h,  the  magnitude  of 
the  marginal  decrease  in  abb  associated  with  an  increase  in  p  dominates  the  marginal  increase  in 
Ubg  because  the  contradictory  signal,  y  =  b,  has  a  larger  influence  on  beliefs  than  the  confirming 
signal  y  =  g.  As  a  consequence,  the  analyst  is  less  willing  to  increase  p  when  p  is  observable. 
Accordingly,  when  x  =  h,  we  find  p*  <  /?*„.  On  the  other  hand,  when  x  =  l,  the  magnitude  of  the 
marginal  increase  in  aig  associated  with  an  increase  in  p  dominates  the  marginal  decrease  in  aib 
because  the  contradictory  signal,  y=-g,  has  a  larger  influence  on  beliefs.  Therefore,  the  analyst  has 
a  greater  incentive  to  increase  p  when  p  is  observable.  Accordingly,  when  x  =  l,  p^^<  p*. 

Having  considered  the  game  when  the  analyst  must  report  y  truthfully,  we  now  consider  how 
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credibility  issues  affect  the  equilibrium  outcomes  by  assuming  the  analyst  is  not  constrained  to 
report  truthfully.  Suppose  the  favorable  public  signal  x  =  h  is  realized.  In  this  case.  Lemma  12 
shows  there  exists  one  interior  value  of  precision  that  can  be  an  equilibrium  outcome  in  the  game 
in  which  credibility  is  not  an  issue.  When  credibility  is  also  an  issue,  Lemma  4  implies  that  the 

:(«  >{t  _ 

value  of  precision  can  be  sustained  as  an  equilibrium  outcome  only  if  ^  jp.  In  contrast,  if 
Pno  <  A  then  the  analyst  will  always  choose  p  =  1/2  and  not  collect  any  information  in  equilib¬ 
rium.  Notice  that,  unlike  the  case  in  which  the  analyst’s  choice  is  observable,  there  does  not  exist 
an  equilibrium  in  which  the  analyst  chooses  p.  An  equilibrium  where  the  analyst  choose  precision 
p  (where  p  >  p^J  cannot  be  sustained  because  the  decision-maker  cannot  observe  the  analyst’s 
actual  precision  choice;  therefore,  the  analyst  will  choose  to  deviate  from  gathering  information 

with  precision  p.  Alternatively,  suppose  the  unfavorable  signal  x  =  /  is  realized.  If  there  exists  an 

_ 

interior  equilibrium  in  this  case,  then  it  must  again  be  true  that  p^^  exceeds  p.  Note,  however,  that 
the  likelihood  that  this  condition  is  satisfied  when  x  =  /  is  lower  because  must  be  lower  in  the 
case  when  x  =  I  than  in  the  case  when  x  =  h.  Hence,  the  analyst  is  more  likely  to  not  collect  any 
information  when  the  public  information  is  unfavorable.  The  analyst’s  equilibrium  behavior  is 
summarized  in  Proposition  13. 

Proposition  13:  Suppose  the  analyst’s  precision  choice  is  unobservable  and  the  analyst  need 

_  ifi. 

not  report  truthfully.  If  jp  <  p^^,  then  the  analyst  chooses  to  gather  informa¬ 
tion  with  a  level  of  precision  and  communicates  truthfully  in  the  report¬ 
ing  stage.  Alternatively,  if  p^^  <  p,  then  the  analyst  does  not  collect  any 
private  information. 

The  observation  that  an  analyst’s  precision  choice  varies  with  firm  performance  provides  an 
explanation  of  empirical  evidence  regarding  analyst  coverage  documented  by  McNichols  and 
O’Brien  (1997).  They  find  analysts  are  less  likely  to  opine  on  poorly  performing  firms.  The 
explanation  they  offer  for  this  finding  is  that  analysts  avoid  issuing  sell  recommendations  so  as  not 
to  jeopardize  their  prospects  of  winning  investment  banking  business.  Our  model  provides  another 
explanation  for  the  correlation  they  observe  in  the  data:  gathering  sufficiently  precise  information 
to  be  able  to  credibly  communicate  is  too  costly  for  a  poorly  performing  firm.  Consequently, 
analysts  do  not  gather  private  information  about  the  firm. 

VI.  CONCLUSION 

We  analyze  analyst  information-gathering  and  reporting  activities  in  a  setting  in  which  the 
analyst  chooses  the  precision  of  information  to  collect  in  response  to  changes  in  the  precision  of 
public  information  about  a  firm.  A  key  feature  that  distinguishes  our  model  from  the  extant 
reporting  literature  is  that  the  analyst  is  not  restricted  to  report  truthfully.  Rather,  recognizing 
credibility  is  an  important  feature  of  the  analyst  environment,  we  assume  the  analyst  reports  in  a 
self-serving  fashion. 

We  consider  how  the  analyst’s  information-gathering  behavior  varies  depending  on  whether 
the  decision-maker  observes  the  analyst’s  precision  choice.  First,  when  the  analyst’s  precision 
choice  is  publicly  observed,  we  obtain  the  following  key  results.  When  the  analyst’s  reporting 
credibility  is  not  an  issue,  an  increase  in  the  precision  of  public  information  causes  the  analyst  to 
reduce  the  precision  of  information  privately  collected  and  truthfully  communicated.  Although  the 
public  information  substitutes  for  the  private  information,  an  increase  in  the  precision  of  the  public 
information  nevertheless  makes  the  decision-maker  better  off.  On  the  other  hand,  when  an  ana¬ 
lyst’s  reporting  credibility  becomes  salient,  an  increase  in  the  precision  of  public  information 
causes  the  analyst  to  collect  more  precise  information.  In  this  case,  the  public  information  has  a 
complementary  effect  on  the  analyst’s  information  gathering,  which  makes  the  decision-maker 
better  off.  If  the  precision  of  the  public  information  continues  to  increase,  however,  then  a  thresh- 
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old  is  reached  beyond  which  an  analyst  can  no  longer  economically  gather  sufficiently  precise 
information  necessary  to  credibly  communicate.  As  a  consequence,  the  analyst  discontinues  gath¬ 
ering  any  private  information.  The  analyst’s  decision  not  to  gather  any  information  makes  the 
decision-maker  worse  off — which  aligns  with  the  concern  that  some  recent  regulations  might 
make  investors  worse  off.  Indeed,  we  find,  in  some  circumstances,  that  a  decision-maker  would  be 
better  off  if  public  information  was  never  introduced  than  to  have  public  information  that  under¬ 
mines  an  analyst’s  willingness  to  gather  information. 

Second,  when  investors  do  not  observe  the  analyst’s  precision  choice,  we  find  that  the  preci¬ 
sion  of  the  analyst’s  information  depends  on  the  realization  of  the  public  signal.  After  observing 
favorable  public  information,  the  analyst  gathers  more  precise  information  when  the  precision 
choice  is  not  publicly  observable  than  when  it  is  publicly  observable.  Conversely,  after  observing 
unfavorable  public  information,  the  analyst  chooses  less  precise  information  when  the  precision 
choice  is  not  publicly  observable  relative  to  when  it  is  publicly  observable. 

In  light  of  recent  changes  in  the  analyst  environment,  our  model  focuses  on  the  behavior  of 
analysts.  The  model,  however,  is  quite  general.  The  insights  might  be  applied  to  the  voluntary 
information-gathering  activities  of  a  firm  manager  in  the  presence  of  changes  in  the  mandatory 
reporting  environment,  particularly  when  the  manager’s  and  investors’  interests  are  imperfectly 
aligned  and  the  manager’s  voluntary  disclosure  is  not  verifiable — such  as  in  the  case  of  manage¬ 
ment  earnings  forecasts.  The  insights  also  might  be  applied  to  the  information-gathering  and 
communicating  behavior  of  a  seller  of  an  experience  good  when  the  potential  buyer  has  access  to 
another  source  of  unbiased  information — such  as  Consumer  Reports — that  reveals  the  properties 
of  the  seller’s  product.  Alternatively,  the  insights  might  be  applied  to  the  interaction  between  a 
venture  capitalist  and  a  firm  that  has  private  information  about  an  innovative  technology  for  which 
it  is  seeking  venture  financing. 

APPENDIX 

This  appendix  contains  the  proofs  of  the  lemmas,  propositions,  and  corollaries. 

Proof  of  Lemma  1 

After  differentiating  {/;(  . )  in  (3)  with  respect  to  p  and  using  the  first-order  condition,  the 
optimal  choice  of  p*  is  such  that  /  (2p*  -  1)  -  c'(/7*)  =  0.  Given  f/j  is  a  twice  differen¬ 

tiable  function,  calculating  dp^  and  evaluating  the  expression  ^Uil  dp^  at  p*  where  p*  is 
such  that  c'(/7*)  =  /  (2p*  -  1)  yields  2c'(/7*)A'(p*)  /  A(/;*)  -  /(/?*- 1  /  2) 

-c'fp*).  For  a  unique  maximum  it  must  necessarily  be  the  case  that  c”(p*)  I  c' (p*) 
>  2A'(p*)  /  A(/?*)  -  1  /  (/?*  -  1  /  2).  Since  the  right-hand  side  of  the  inequality  is  increasing  in  q, 
the  condition  c"(p*)  /  >  (1  -  3/7*(l  -;?*))  /  (/7*(1  -/?*)(/?*  -1/2))  is  sufficient  for  the 

objective  function  to  be  strictly  quasi-concave,  which  the  cost  function  c{p)  is  assumed  to  sat¬ 
isfy.  I 

Proof  of  Corollary  2 

Applying  the  implicit  function  theorem  to  (5)  and  using  the  fact  that  !/,(/?)  is  a  strictly 
quasi-concave  and  twice  differentiable  function  that  yields  the  observation  dp^/dq  <0.  ■ 

Proof  of  Corollary  3 

To  establish  that  d{~E[var[a\  x,y;  p*]  |  x])  I  dq>0,  observe  that: 

dj-  E[vatia\x,y;p*]\x])  dj-  E[varia\x,y,p^]\x])  dj-  E[var[a\x,y,p*]\x])  dp* 
dq  dq  dp"^  dq 

ccp*\\  -  p'^filq  - \)  +  q^{\  -  qf{lp  -  .  (12) 

dq 
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Consider  dp^ldq.  Applying  the  implicit  function  theorem  to  (5)  yields: 


dp* 

dq 


djdUjJdplp*) 

dq 


d{dUildp\p*) 


dp 


* 


2q{\-q){2p*-l)p*{\-p*){2q-\){s-jf 
((1  -p*)q  +  (1  -  q)p?{qp*  +  (1  -  g)(l  - P^)? 

2q\\  -  q)\s  -ff0p*i2q  - \f{p  -  \)  +  \ -3q  +  3q^)  ‘ 

(il-p*)q  +  il-q)p*)\qp^  +  {l-q){l-p*)y 

(13) 


Observe  that  d{dUi/  dp\p*)  /  dp*  <0  because  C,-  is  a  strictly  quasi-concave  and  twice  differen¬ 
tiable  function.  Substituting  (13)  into  (12)  and  using  d^dU-J  dp\p*)  t  dp*  <  0  yields: 

d{,- E\variQ\x,y,p*'\\x\)  2A^(>*)(1  -  3p*(l -/?*)) 

- - - ocp*(l-p*)c  (p*)- - - . 

Using  the  assumption  that  c"{p) !  c' {p)  >  (1  -3/7(1  -p))l  {p{l  -p)ip-  1  /  2))  and  the  relation 
c'(p*)  =  A^(/7*)  /  {2p*  -  1)  given  in  (5)  yields: 

c'V*)  >  2A^ip*)il  -  3p*  +  3{p*y)/ip*{l -p*){2p*  -  1)2). 


Using  this  relation,  which  is  both  necessary  and  sufficient,  yields: 

a|; 

dq 


d{- E[vat{a\x,y,p*^\x\)  2A'^ip*)(l -3p*{l  - p*)) 

- (p*)  - - - 


>0. 


Proof  of  Lemma  4 

The  analyst’s  cost  of  gathering  information  is  sunk  and  p,  which  is  commonly  observed,  is 
fixed.  If  the  analyst  communicates  truthfully,  then  the  decision-maker’s  action  is  a  =  E[a\x  ,y]. 
Given  the  decision-maker’s  action,  the  analyst  reports  observing  g  if  and  only  if: 

E[(t)E[a\x,g]  -  {a  -  E\_a\x,g'\)'^\g,x'\  >  E{4>E[a\x,b\  -  {a- E[a\x,b])\,x'\.  (14) 


This  truth-telling  condition  is  trivially  satisfied  because  the  analyst  never  wants  to  dissemble  and 
induce  a  lower  action  than  the  one  the  decision-maker  would  take  given  the  analyst’s  information. 
The  analyst  reveals  observing  b  if  and  only  if: 

£[</»£'[ 5|x:,Z7]  -  (5-  £'[a|x,Z7])2|Z7,x]  ^  E{(f)E{a\x,g^  -  (5-  £'[5|x,g])2|fi,x].  (15) 

This  condition  can  be  shown  to  be  satisfied  if  and  only  if  ]  x ,  g]  -  ^[a  |  x ,  /?]  -  0  >  0.  Con¬ 
sequently,  a  necessary  and  sufficient  condition  for  the  analyst  to  reveal  y  when  x  =  /  or  for  the 
analyst  to  reveal  y  when  x  =  h  is: 

£’[5|x,g]  -  £[5|x,Z7]  -  (f)  =  A{p)  -  </)  >  0.  (16) 


As  the  incentive  compatibility  conditions  are  identical,  without  loss  of  generality,  set  x  =  /  in  (16). 
When  q  =  1/2,  (16)  can  be  expressed  as  (2/7  —  1)(5  —  f)  —  0  >  0.  Solving  for  p  yields  p  >  1/2 
-I-  (])l{2{s  —  j)).  On  the  other  hand,  when  q  e  (1/2,1),  solving  for  p  in  (16)  yields  two  roots:  the 
negative  root  has  the  feature  that  p  <  112  and  is  discarded;  the  positive  root  has  the  feature  that  p 
>  1/2.  Hence,  the  truth-telling  constraint  requires: 

1 


V4^2(i  _  -/)2  -I-  4,\2q  -  \f  -  2q{\  -  q){s  -/) 

20(29-1)2 


(17) 
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The  following  lemma  is  used  in  the  proof  of  Proposition  5: 

Lemma  14:  Let  p*  e  (1  /  2  ,p).  The  analyst  will  gather  information  with  precision  p  >  1/2 
only  if  the  analyst  can  credibly  communicate  it  in  the  reporting  game. 


Proof  of  Lemma  14 

The  proof  proceeds  by  supposing  to  the  contrary  that  the  analyst  chooses  to  gather  informa¬ 
tion  with  precision  p  >  1/2  even  though  he  cannot  credibly  communicate  it  and  then  establishing 
a  contradiction.  The  proof  is  straightforward  and  is  omitted.  ■ 

Proof  of  Proposition  5 

For  the  analyst  to  credibly  communicate  his  private  information  y  in  the  reporting  game,  it 
follows  from  (16)  that  p  must  be  such  that  A(p)  -  0  >  0.  Further,  it  follows  from  (5)  that  p^  is 
such  that  A^(p*)  /  (2/?*  -  1)  -  c'{p^)  =  0.  It  follows  that  p*  >  p  if  and  only  if  (f)A(p^)  /  (2/?* 
-  1)  <  c' (/?*).  When  p^  >  p,  the  analyst  chooses  p*  and  communicates  credibly. 

When  /?*  e  (1/2,  p).  Lemma  14  establishes  that  the  analyst  will  only  gather  information  if 
he  can  credibly  communicate  it  in  the  reporting  game.  Thus,  to  determine  the  precision  of  infor¬ 
mation  the  analyst  will  choose  to  gather,  we  compare  the  analyst’s  expected  utility  from  gathering 
the  minimum  precision  of  information  that  will  allow  him  to  credibly  communicate  in  the  report¬ 
ing  game,  i.e.,  p=p,  denoted  Uiip),  with  the  expected  payoff  from  not  gathering  any  private 
information,  i.e.,  p  =  1/2,  denoted  t/,(l  /  2).  When  p*  e  (1  /  2  ,^),  the  analyst  prefers  p  if  and 
only  if  Uf  1  /  2)  -  Ufp)  <  0,  or,  equivalently: 


c{p)  < 


{p{\-q)  •+  (1  -p)q)iil  -p){l-q)+pq) 


(18) 


It  can  be  established  that  there  exists  a  unique  root,  denoted  p*,  that  solves  the  equality  c{p^) 
=  Aip^)q{  1  -  q){2p*  -  1)(5  -/).  Hence,  if  e  ( 1  /  2  ,  p)  and  ii  p  <  p*  (or  equivalently,  the  in¬ 
equality  in  (18)  is  satisfied),  then  the  analyst  gathers  more  information  than  he  would  if  his 
incentives  were  not  misaligned  with  those  of  the  decision-maker — the  analyst  overinvests  in  the 
quality  of  information  gathered.  Conversely,  if  p*  e  (1  /  2  ,p)  and  if  p*<  p  (or  equivalently,  the 
inequality  in  (18)  is  not  satisfied),  then  the  analyst  gathers  less  information  than  he  would  if  his 
incentives  were  not  misaligned  with  those  of  the  decision-maker — the  analyst  underinvests  in  the 
quality  of  information  gathered. 

Last,  it  is  straightforward  to  show  a  Nash  equilibrium  exists  in  which  no  information  is 
credibly  communicated  and  the  analyst  sets  p  —  \I2.  I 


Proof  of  Corollary  6 

dp!  dq>Q  is  established  by  partially  differentiating  p  given  in  (17)  with  respect  to  q  and 
observing  that  4>  <  {s  -  f)  for  the  truth-telling  constraint  to  be  satisfied.  ■ 


Proof  of  Corollary  7 

Recall  (7);  applying  the  implicit  function  theorem  to  expression  Ui{p^  -  t/,(l  /  2)  yields: 

dp*ldq  =  -  diUiip^  -  Uf\l2))ldqld{Ui{j>^  -  U,{ll2))ldp^ . 

It  is  straightforward  to  establish  that  the  numerator  is  negative  for  all  q  e  (1/2,1)  and  0  otherwise. 
To  show  that  the  denominator  is  negative,  we  use  the  following  arguments:  First,  we  show  that 
p*>  p"^.  Recall  that  p*  is  such  that  Ufp*)-  UfX  /2)  =  0.  It  then  follows  from  Lemma  1  that 
Ufp)  >  Ui{l  12)  for  all  p  e  ( 1  /  2  , p*]  and  dUfp) !  dp  <  0  when  evaluated  dX  p  >  p^.  Conse¬ 
quently,  it  must  be  the  case  that  p^  <p^.  Second,  it  follows  from  Lemma  1  that  dUiip)  /  dp  >0  if 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


2006 


Fischer  and  Stocken 


and  only  \f  p  <  p*.  Because  p*  <p^,  we  conclude  that  dUiip)  I  dp  <0,  and  hence,  d{Ui{p*) 
-  Uii  1  /  2))  /  dp*  <  0.  Combining  the  above  arguments  yields  the  claim  that  dp* !  dq<Q.  ■ 

Proof  of  Corollary  8 

The  proof  uses  arguments  offered  earlier  and,  hence,  is  omitted.  ■ 

Proof  of  Corollary  9 

If  (/>  — >  0,  then  the  incentive  compatibility  conditions  (see  (16))  are  always  satisfied.  Conse¬ 
quently,  the  analyst  will  always  truthfully  reveal  y  for  any  x,  that  is,  p  =  1  /  2.  On  the  other  hand, 
if  ^  >  00,  then  the  incentive  compatibility  conditions  (see  (16))  are  never  satisfied  and  the  analyst 
cannot  credibly  reveal  any  of  his  private  information;  that  is,  p=  1.  Since  p*  e  (1/2,1)  from 
Lemma  1,  and  dp  I  d(f)>0  and  dp^  I  d(f)  =  0  from  Corollary  8,  it  follows  from  the  intermediate 
value  theorem  that  there  exists  a  threshold  0  such  that  for  all  4>^^,  we  have  p  <  p*.  In  this 
circumstance,  it  follows  from  Proposition  5  that  the  analyst  will  choose  precision  p*.  Using 
Corollary  2,  observe  that  dp^ldq  <  0.  Again  using  the  intermediate  value  theorem,  there  exist 
thresholds  ^  and  ^  such  that  for  all  </»  e  (^,  0],  we  have  p*  <p  < p*.  In  this  case,  it  follows 
from  Proposition  5  that  the  analyst  will  choose  precision  p.  Using  Corollary  6,  observe  that 
dpi  dq>  0.  Finally,  for  all  (p>  p*  <  p,  and  it  follows  from  Proposition  5  that  the  analyst 
chooses  to  not  collect  any  private  information.  | 

Proof  of  Corollary  10 

If  cr  =  (5  —  /)  — >  0,  then  A{p)  0  and  the  incentive  compatibility  conditions  (see  (16))  are 
never  satisfied;  equivalently,  p  =  1.  Since  dp  /  d{s  -/)  <  0  from  Corollary  8,  the  intermediate  value 
theorem  implies  there  exists  a  threshold  ^  such  that  for  all  cr  <  0,  the  analyst  will  choose  to  not 
collect  any  private  information;  that  is,  he  will  choose  p  =  1/2.  Alternatively,  if  cr  00^  then  A(p) 
^  00  and  the  incentive  compatibility  conditions  (see  (16))  are  always  satisfied;  equivalently,  p 
=  1/  2.  Since  dpi  d(s-f)  <0,  there  exists  a  threshold  a  such  that  for  all  cr>  o',  we  have  p 
<  e  ( 1  /  2 , 1).  Proposition  5  implies  the  analyst  will  choose  precision  p*.  Using  Corollary  2, 
observe  that  6*/?*  I  dq  <0.  Finally,  combining  these  arguments,  it  follows  that  for  all  </>  e  [d-,  d), 
we  have  p*  <  p.  Proposition  5  implies  that  the  analyst  will  choose  precision  p.  Using  Corollary  6, 
observe  that  dp  I  dq  >  0.  | 

Proof  of  Corollary  11 

Lemma  3  established  that  d{-E[var[a\x  ,y,p^]\xj)  I  dq>  0.  An  analogous  argument 
proves  di-E[var[a  \  x  ,y,p\\x]) !  dq>  Q.  It  is  straightforward  to  show  d{-var\^a  \  x])  I  dq>  0. 
Hence,  it  remains  to  establish  that  the  decision-maker’s  quality  of  information  falls  discontinu- 
ously  at  q.  Observe  that  when  c{p)  =  l^s{p)q{\  -  q){2p  -  \){s  -  f)  and  q  =  q,  then  {7,(1/ 2) 
=  Ui{p),  or  equivalently: 


cl)E[a\x'\-  q{l  -  q){s  -  f)^  =  (f)E[E[a\x,y']\x'\-— - -  c{p) . 


(2p-l) 

The  law  of  iterated  expectations,  £[5 1  x:]  =  E\^E\^a  1  x: ,  y]  |  x],  implies  (19)  may  be  written  as: 


(19) 


-Var{a\x'\  =  -q{\-q){s-f)^  =  - 

(2p-  1) 

^  -  E[var{a\x,y,p\\x'\. 

Hence,  there  exists  dL  q  =  qF  s  >  q^  for  £  >  0  such  that: 


2  P(1 -p)(‘S-/)A(p)  ^  p{\ -p){s-f)Mqp) 


-c{p)<- 


(2p-  1) 
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-  Var[a|ji:]  <  -  E[var[a\x,y;p]\x]. 


Proof  of  Lemma  12 

Because  A(l/2)  =  0,  it  follows  from  Equation  (11)  that  p  =  1/2  is  an  equilibrium  level  of 
precision  for  1^^  =  q  or  t^  =  \  -  q.  Now  consider  the  existence  of  an  equilibrium  when  p  >  1/2.  First, 
consider  the  case  in  which  t^  =  q;  that  is,  x  =  h.  Rewrite  Equation  (11)  as: 

(1/2)A(p)2  -  c'ip)ip  -  1/2)  +  Mp)<f^{2f  -  Dip  -  1/2)  =  0. 


The  derivative  of  the  left-hand  side  of  Equation  (11)  with  respect  to  p  is: 


2A'(p)/A(p)2  (  D 


^AW0(2,,-l)(p-- 


2 
l\ 


^{p)(f)i2t^-  D\p 


If  the  equation  is  satisfied  at  p,  this  expression  can  be  rewritten  as: 


-c"ip)\p- 


l\  A(p)2  A» 


2  p- 


1 


A(p) 


^(p)(f)i2f-  D\p-- 


where: 

A'(p)_  1  l-2p(l-p)(2r,-l)^-2r,(l-0 

A(p)  2(p--)  + 


Hence,  the  above  expression  can  be  written  as: 


l-2p(l-p)(2r,-l)2-2ai-g  ,,  , 

- 2  .  ,  — 77 (p)-(^  (P)iP  -  1/2)  - 


p(l -p){2t^- D  +t^i\-tD 
_  1  l-2p(l-p)(2r,-l)"-2ai-0 
2  p(l-p)(2r,-l)2  +  ai-0 


AO)^(2t,,-  1). 


(l/2)A(p)^ 

(p-1/2) 


Because  (11)  holds  in  equilibrium,  this  expression  can  be  rewritten  as: 


( _ i _ 

\pi\-p)i2t,-D^  +  hD-tD 


c'{p)-c''{p){p-ll2) 


( _ ^ _ 

\2pil-p)i2f-D'^  +  2fi\-tD 


A(p)0(24-  1). 


Noting  that  p(l  - p){2t^-  1)^-1- t^(l  -tj  is  decreasing  in  and  that  2?;,  -  1  >  0  because  t^-q 
>1/2  implies  that  the  above  expression  is  strictly  less  than  [(1  -  3p(l  -p))  /  (p(l  -p))]c'(p) 
-  c"ip)ip  -1/2),  which  is  proportional  to  (1  -  3p(l  - p))c' ip)  - pi  \  -p)ip  -  1  /  2)c"(p).  Hence, 
there  exists  a  unique  equilibrium  in  which  p  e  (1/2,1)  if  (1  -  3p(l  - p))  /  (p(l  -p)ip  -1/2)) 
<  c"ip)  /  c'ip)  for  all  p,  which  is  true  by  assumption. 

Second,  consider  the  case  of  4  =  1  -  <?;  that  is,  x  =  l.  There  may  or  may  not  exist  an  equilib- 
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rium  in  which  p  e  (1/2,1)  and,  sometimes  there  may  exist  more  than  one  equilibrium  in  which  p 
e  (1/2,1).  The  proof  follows  from  an  example  in  which  there  is  no  equilibrium  when  p  e  (1/2,1), 
one  equilibrium  when  p  e  (1/2,1),  and  two  equilibria  when  p  e  (1/2,1).  For  the  examples,  let 
c{p)  =  ((p  -  1  /  2)  /  ( 1  -  p))^  and  q  =  .85.  If  0  =  2  and  s  -  f  =  2,  then  there  does  not  exist  ape 
(1/2,1)  that  satisfies  the  equilibrium  condition.  If  0  =  1  and  s  —  f=  2,  then  p  =  .5227  is  the  only 
p  e.  (1/2,1)  that  satisfies  the  equilibrium  condition.  If  0  =  110,  5  — /—  150,  then p  =  .9072  and 
p  =  .5584  both  satisfy  the  equilibrium  condition.  ■ 

Proof  of  Proposition  13 

Consider  the  case  in  which  p  <  p^^.  When  the  analyst’s  information  y  is  privately  observed,  it 
follows  from  Lemma  4  that  the  analyst  can  only  credibly  reveal  it  when  it  has  a  level  of  precision 
p  such  that  p^p.  Therefore,  if  p  <  p^g,  then  the  analyst  can  credibly  reveal  his  privately  observed 
information  with  precision  p^g.  Alternatively,  consider  the  case  in  which  p^g  <  p.  It  follows  from 
Lemma  4  that  the  analyst  cannot  credibly  reveal  information  with  precision  p^g.  Thus,  the  analyst 
either  will  choose p  =  1/2  and  not  gather  any  privately  observed  information,  or  choose p=p  and 
credibly  reveal  his  privately  observed  information.  When  the  analyst’s  precision  choice  is  unob¬ 
servable,  Proposition  12  implies  that  p^g  is  the  only  unobservable  choice  of  precision  that  can  be 
supported  in  equilibrium.  Thus,  the  analyst  would  prefer  to  deviate  from  any  unobservable  preci¬ 
sion  choice  p  pl^g.  Since  Pno<P  and  the  decision-maker  cannot  observe  the  precision  choice  p, 
it  follows  that  the  analyst  will  not  choose  to  gather  information  with  precision  p.  Since  gathering 
information  y  is  costly.  Lemma  14  implies  the  analyst  will  choose  p  =  1/2.  ■ 
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ABSTRACT:  We  examine  auditor  independence  in  the  banking  industry  by  analyzing 
the  relation  between  fees  paid  to  auditors  and  the  extent  of  earnings  management 
through  loan  loss  provisions  (LLP).  We  also  examine  whether  this  relation  differs  across 
large  banks  whose  managements  are  required  under  the  Federal  Deposit  Insurance 
Corporation  Improvement  Act  to  evaluate  internal  control  over  financial  reporting  and 
whose  auditors  must  attest  to  the  effectiveness  of  such  internal  controls,  and  small 
banks  that  are  not  subject  to  those  requirements.  We  find  that  unexpected  auditor  fees 
are  unrelated  to  earnings  management  for  large  banks.  For  small  banks,  we  find 
greater  earnings  management  via  under-provisioning  of  LLP  by  banks  that  pay  higher 
unexpected  total  and  nonaudit  fees  to  the  auditor.  These  results  suggest  that  auditor 
fee  dependence  on  the  audit  client  is  associated  with  earnings  management  via  abnor¬ 
mal  LLP  and  is  a  potential  threat  to  auditor  independence  for  small  banks.  Our  findings 
are  relevant  to  policymakers  contemplating  new  regulations  in  light  of  the  recent  bank¬ 
ing  crisis. 

Keywords:  auditor  independence;  earnings  management;  auditor  fees;  bank  loan 
loss  provision;  FDICIA. 
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I.  INTRODUCTION 


uditor  independence  is  vital  to  maintaining  public  confidence  in  capital  markets  and  to  the 


integrity  of  corporate  financial  statements.  The  objective  of  this  study  is  to  examine 


^  A.  auditor  independence  in  the  banking  industry.  Banks  represent  a  large  percentage  of  the 
total  public  equity  market  and  are  critical  to  the  functioning  of  the  economy  as  a  whole.  Never¬ 
theless,  as  noted  by  Fields  et  al.  (2004),  accounting  researchers  have  done  little  to  investigate  the 
various  relationships  that  exist  between  banks  and  their  auditors  despite  the  economic  importance 
of  the  banking  industry.  Specifically,  we  provide  empirical  evidence  on  the  relation  between  fees 
paid  to  auditors  of  banks  and  the  extent  of  earnings  management  via  loan  loss  provisions  (LLP). 
Our  study  is  timely  and  relevant  given  the  recent  banking  crisis  and  that  governments  around  the 
world  are  contemplating  new  banking  regulations.  In  particular,  our  research  informs  policymak¬ 
ers  on  the  relationship  that  existed  between  fees  paid  to  auditors  and  the  extent  of  earnings 
management  in  banks  prior  to  the  current  banking  crisis. 

The  banking  industry  offers  a  unique  context  to  study  auditor  independence  for  a  number  of 
reasons.  First,  banks  are  subject  to  the  scrutiny  of  the  Federal  Deposit  Insurance  Corporation 
(FDIC),  the  Federal  Reserve  Board,  and  other  government  agencies.  Whether  this  intense  regula¬ 
tory  oversight  enhances  auditor  independence  is  clearly  an  important  and  relevant  question.  Sec¬ 
ond,  the  Federal  Deposit  Insurance  Corporation  Improvement  Act  of  1991  (FDICIA)  imposed  new 
auditing,  corporate  reporting,  and  governance  reforms  on  depository  institutions  with  assets  ex¬ 
ceeding  $500  million  (increased  to  $1  billion  in  2005)  and  on  their  auditors  (Murphy  2004). 
Section  36  of  the  Federal  Deposit  Insurance  Act  (FDI  Act)  and  Part  363  of  the  FDIC’s  regulations 
require  the  management  of  these  institutions  to  evaluate  internal  control  over  financial  reporting 
and  the  auditor  to  attest  to  the  effectiveness  of  internal  controls  over  financial  reporting.  Given 
these  requirements,  whether  auditor  independence  is  greater  for  larger  banks  that  are  subject  to 
greater  regulatory  oversight  relative  to  smaller  banks  is  an  interesting  and  important  question. 

Third,  bank  LLP  is  well-suited  to  studying  earnings  management  for  the  following  reasons. 
Bank  managers  have  considerable  discretion  in  estimating  LLP.  This  discretion  allows  them  flex¬ 
ibility  in  using  LLP  for  income-increasing  or  income-decreasing  earnings  management,  or  for 
smoothing  earnings.^  LLP  is  also  by  far  the  largest  and  most  important  accrual  for  banks,  thus 
affording  bank  managers  wide  latitude  in  its  use."^  Prior  research  indicates  that  banks  use  LLP  to 
manage  earnings  (Wahlen  1994;  Kanagaretnam  et  al.  2003;  Kanagaretnam  et  al.  2004).  To  the 
extent  that  banks  can  leverage  “fee  dependence”  to  influence  their  auditors  to  accept  abnormal 
LLP,  examining  the  relation  between  abnormal  LLP  and  auditor  fees  is  likely  to  reveal  such  a 
linkage.  We  reason  that  abnormal  LLP  is  a  better  proxy  for  earnings  management  than  the  abnor- 


'  There  is  some  prior  research  on  FDICIA.  For  example,  Altamuro  and  Beatty  (2010)  examine  several  earnings  quality 
measures  prior  to  and  following  FDICIA  and  find  that  the  mandated  internal  control  requirements  increased  the  validity 
of  LLP,  earnings  persistence,  and  cash  flow  predictability,  and  reduced  benchmark-beating  and  accounting  conservatism 
for  affected  versus  unaffected  banks. 

^  In  addition  to  bank  asset  size,  we  also  take  into  consideration  the  banks  that  have  to  comply  with  Section  404  of  the 
Sarbanes-Oxley  Act  (SOX),  which  became  effective  in  2004,  when  classifying  banks  as  large  and  small.  We  consider 
small  banks  as  banks  that  are  exempted  from  both  FDICIA  and  Section  404  of  SOX.  We  use  the  terms  small  banks  and 
exempted  banks  interchangeably  in  our  paper.  Section  II  provides  a  more  detailed  description  of  the  institutional  details. 

^  Prior  research  suggests  several  motives  for  bank  managers’  discretionary  behavior  with  respect  to  LLP  including, 
signaling,  capital  management,  management  compensation,  and  income  smoothing  (Wahlen  1994;  Collins  et  al.  1995; 
Kanagaretnam  et  al.  2003;  Kanagaretnam  et  al.  2004).  Income-increasing  abnormal  LLP  may  be  related  to  incentives 
associated  with  meeting  or  just  beating  earnings  benchmarks,  management  compensation,  and  income  smoothing. 
Income-decreasing  abnormal  LLP  may  be  related  to  incentives  associated  with  signaling,  capital  management,  and 
income  smoothing. 

""  The  mean  (median)  ratio  of  LLP  to  earnings  before  LLP  is  17  percent  (14.6  percent)  for  our  sample  banks. 
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mal  accrual  measures  used  in  prior  research.^  In  particular,  our  study  mitigates  error  in  measuring 
managerial  discretion  by  focusing  on  a  single  accrual  and  a  single  industry.  Focusing  on  a  single 
accrual  facilitates  a  sharper  separation  into  its  normal  (nondiscretionary)  and  abnormal  (discre¬ 
tionary)  components.  We  use  a  number  of  industry-specific  variables  to  better  isolate  the  normal 
LLP  from  the  abnormal  LLP.  Also,  focusing  on  a  single,  relatively  homogeneous  industry  provides 
control  over  other  determinants  of  cross-sectional  differences  in  accruals,  thus  increasing  the 
reliability  of  the  inferences  from  our  empirical  analysis. 

Our  main  prediction  is  that,  relative  to  large  banks,  small  banks  that  pay  higher  fees  to 
auditors  are  likely  to  engage  in  higher  levels  of  income-increasing  earnings  management — i.e., 
higher  fees  are  likely  to  facilitate  income-increasing  LLP  (under-provisioning  of  LLP).  Income- 
increasing  LLP  is  of  particular  importance  in  the  context  of  auditor  independence  because  auditors 
are  unlikely  to  be  sued  for  income-decreasing  accruals  (St.  Pierre  and  Anderson  1984).  Addition¬ 
ally,  we  expect  the  magnitude  of  income-decreasing  LLP  to  decrease  for  small  banks  as  banks  pay 
higher  fees. 

A  possible  alternative  hypothesis  to  the  impairment  of  auditor  independence  through  fees  is 
that  audit  fees  may  also  reflect  audit  risk  that  is  captured  by  the  abnormal  LLP.  Auditors  are  likely 
to  charge  higher  audit  fees  to  firms  that  are  more  difficult  to  audit  and,  if  firms  that  are  more 
difficult  to  audit  have  higher  incentives  to  manage  earnings,  this  will  be  manifested  in  a  positive 
relationship  between  audit  fees  and  abnormal  LLP.  We  attempt  to  address  this  concern  by  includ¬ 
ing  several  variables  that  reflect  differences  in  audit  risk  across  banks  as  well  as  examining  the 
relationship  using  abnormal  fees  after  explicitly  taking  account  of  possible  factors  driving  the 
normal  fees. 

Our  sample  consists  of  1,740  bank-year  observations  over  the  years  2000-2006.  Following 
prior  research,  we  estimate  abnormal  LLP  as  residuals  from  a  regression  of  LLP  on  beginning  loan 
loss  allowance,  total  loans  outstanding,  changes  in  total  loans  outstanding,  net  loan  charge-offs, 
beginning  balance  of  non-performing  loans,  change  in  non-performing  loans,  loan  mix,  and  con¬ 
trols  for  years  (Wahlen  1994;  Kanagaretnam  et  al.  2004).  Next,  we  estimate  a  regression  of 
abnormal  LLP  on  fee  measures,  an  indicator  variable  for  small  banks  (with  total  assets  of  less  than 
$500  million  or  less  than  $1  billion  effective  2005  or  non-accelerated  fliers  under  Section  404  of 
Sarbanes-Oxley  Act  [SOX]  effective  2004)  that  are  exempt  from  internal  control  regulation,  the 
interaction  of  the  fee  measure  and  the  indicator  variable  for  small  banks,  auditor  type  (Big  5 
versus  non-Big  5  auditor),  and  several  control  variables.  We  use  various  measures  of  fees  to 
capture  the  auditor’s  economic  dependence  on  the  client,  including  total  fees,  nonaudit  fees,  audit 
fees,  ratio  of  nonaudit  fees  to  total  fees  (fee  ratio),  abnormal  (unexpected)  total  fees,  abnormal 
nonaudit  fees,  abnormal  audit  fees,  and  abnormal  fee  ratio.  However,  our  main  focus  is  on  unex¬ 
pected  total  fees  and  unexpected  nonaudit  fees  because  these  fee  measures  are  based  on  control¬ 
ling  for  bank-level  determinants  of  normal  fees  and  prior  research  suggests  that  they  are  better 
measures  for  examining  fee  dependence  (Kinney  and  Libby  2002;  Srinidhi  and  Gul  2007). 

We  report  several  key  findings.  First,  for  large  banks,  we  document  that  both  unexpected  total 
fees  and  unexpected  nonaudit  fees  have  no  association  with  either  income-increasing  or  income- 
decreasing  abnormal  LLP.  Second,  in  contrast  to  the  results  for  large  banks,  we  find  a  strong 
negative  association  between  income-increasing  (negative)  abnormal  LLP  and  both  unexpected 
total  fees  and  unexpected  nonaudit  fees  for  small  banks.  In  other  words,  higher  auditor  fees  are 
associated  with  higher  income-increasing  earnings  management  for  small  banks.  We  also  docu- 


^  Prior  research  argues  that  proxies  of  abnormal  accruals  commonly  used  to  detect  earnings  management  are  subject  to 
serious  measurement  error  (Guay  et  al.  1996;  McNichols  2000,  2002;  Jones  et  al.  2008).  For  example,  McNichols 
(2002)  questions  the  construct  validity  of  a  proxy  based  on  aggregate  accruals  because  of  the  complexity  associated  with 
modeling  the  estimation  errors  in  aggregate  accruals. 
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ment  a  negative  association  between  unexpected  nonaudit  fees  and  income-decreasing  (positive) 
abnormal  LLP  for  small  banks.  These  results  are  consistent  with  greater  earnings  management  by 
small  banks  that  pay  higher  abnormal  fees  to  the  auditor.  Our  results  are  robust  to  a  variety  of 
sensitivity  tests,  including  separate  subsample  analyses  for  small  and  large  banks,  deletion  of 
influential  observations,  and  use  of  alternative  models  for  estimating  abnormal  LLP. 

Our  findings  are  consistent  with  economic  bonding  between  the  auditor  and  the  bank  being 
associated  with  income-increasing  earnings  management  through  LLP  for  small  banks.  In  particu¬ 
lar,  our  results  suggest  that  auditor  fee  dependence  on  the  audit  client  can  potentially  threaten 
auditor  independence,  particularly  among  banks  that  are  not  subject  to  the  same  level  of  regulatory 
scrutiny  as  large  banks.® 

The  next  section  describes  the  institutional  background  on  FDICIA  and  SOX,  particularly 
how  these  regulations  affect  small  and  large  banks.  Section  III  describes  our  research  design, 
including  the  empirical  models  used  to  estimate  abnormal  LLP,  unexpected  fees  paid  to  auditors, 
and  tests  of  the  relation  between  these  variables.  We  also  discuss  how  the  differences  in  regulation 
of  small  and  large  banks  could  impact  auditor  independence.  Section  IV  details  the  sample  selec¬ 
tion,  Section  V  discusses  the  results,  and  Section  VI  provides  the  conclusions  of  the  study. 

II.  INSTITUTIONAL  BACKGROUND 

In  response  to  the  breakdown  of  thrifts  and  banks  during  the  late  1980s  and  early  1990s,  the 
U.S.  Congress  enacted  FDICIA  in  1991  to  strengthen  the  financial  condition  of  the  banking  and 
thrift  industries.  FDICIA,  which  created  deposit  insurance  and  the  FDIC,  is  the  most  important 
banking  legislation  in  the  U.S.  since  the  Banking  (Glass-Steagall)  Act  of  1933  (Benston  and 
Kaufman  1997).  Further,  a  decade  later.  Congress  passed  another  landmark  legislation,  the 
Sarbanes-Oxley  Act  of  2002  (SOX)  which,  although  modeled  after  FDICIA,  is  much  broader  in 
scope.^  While  FDICIA  contains  much  more  than  deposit  insurance  reform,  of  particular  interest 
are  the  requirements  for  annual  audit  and  reporting  of  management’s  assessment  of  the  effective¬ 
ness  of  internal  control  that  are  intended  to  aid  early  detection  of  problems  in  the  financial 
management  of  insured  banks.  In  June  1993,  the  FDIC  enacted  Part  363  titled,  “Annual  Indepen¬ 
dent  Audits  and  Reporting  Requirements,”  which  requires  each  insured  depository  institution  with 
$500  million  or  more  in  total  assets  at  the  beginning  of  the  year  to  prepare  (see  Sections  363.2  and 
363.3  of  FDIC  1993):^’® 

1.  Audited  annual  financial  statements; 

2.  A  management  report  signed  by  the  CEO  and  the  CFO  that  contains: 

(a)  a  statement  of  management’s  responsibilities  for: 

i.  preparing  financial  statements, 

ii.  establishing  and  maintaining  an  adequate  internal  control  structure  over  finan¬ 
cial  reporting,  and 

iii.  complying  with  the  laws  and  regulations  that  are  designated  by  the  FDIC  and 
other  federal  banking  agencies; 


^  Interestingly,  over  85  percent  of  these  small  banks  are  audited  by  non-Big  5  auditors,  suggesting  that  fee  dependence 
may  be  greater  for  the  non-Big  5  auditors  relative  to  the  Big  5  auditors. 

’’  See  Altamuro  and  Beatty  (2010)  for  a  mapping  of  sections  of  FDICIA  with  SOX. 

®  Section  112  of  FDICIA  added  Section  36,  “Early  Identification  of  Needed  Improvements  in  Financial  Management,”  to 
the  FDI  Act.  Section  36  gives  the  FDIC  discretion  to  select  an  asset-size  threshold  of  $150  million  or  more  to  exempt 
smaller  depository  institutions  from  the  above  reporting  requirements.  This  exemption  was  intended  to  ease  compliance 
cost  for  small  and  community  banks. 

®  In  addition.  Section  363.5  requires  each  depository  institution  to  establish  an  independent  audit  committee,  the  members 
of  which  shall  be  outside  directors  who  are  independent  of  the  management. 
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(b)  assessments  by  management  of: 

i.  the  effectiveness  of  the  internal  control  structure  over  financial  reporting,  and 

ii.  the  institution’s  compliance  with  the  designated  laws  and  regulations  during 
the  year; 

3.  The  independent  public  accountant’s  report  on  the  audited  financial  statements;  and 

4.  The  independent  public  accountant’s  attestation  report  on  the  assertion  of  management 
concerning  the  institution’s  internal  control  structure  and  procedures  for  financial  report¬ 
ing. 

Effective  2005,  the  threshold  for  requirements  2(b)i  and  4  was  increased  from  $500  million  to 
$1  billion  in  total  assets  (FDIC  2005b). Although  depository  institutions  with  $500  million  or 
more  but  less  than  $1  billion  in  total  assets  are  currently  exempt  from  internal  control  assessments 
by  management  and  external  auditors  per  Section  36,  Section  404  of  SOX  mandates  that  all  public 
companies  or  subsidiaries  of  public  companies  must  file  a  report  on  the  auditor’s  attestation  of  the 
effectiveness  of  internal  control  over  financial  reporting.  Currently,  Section  404  exempts  non¬ 
accelerated  filers — i.e.,  firms  with  public  float  less  than  $75  million. 

In  summary,  publicly  held  banks  with  less  than  $500  million  in  total  assets  during  the  years 
2000  through  2004  are  exempt  from  Section  36  of  the  EDI  Act.  Effective  2004,  publicly  held 
banks  that  are  non-accelerated  filers  are  exempf  from  Section  404  of  SOX.  Currently,  publicly  held 
banks  with  less  than  $1  billion  in  total  assets  and  with  a  public  float  less  than  $75  million  are 
exempt  from  both  Section  36  as  well  as  Section  404.  Privately  held  banks  are  exempt  from  Section 
404;  further,  effective  2005,  privately  held  banks  with  $500  million  or  more  but  less  than  $1 
billion  in  total  assets  (or  less  than  $500  million  in  total  assets  during  the  years  2000  through  2004) 
are  exempt  from  Section  36  pertaining  to  management’s  and  auditor’s  assessments  of  the  effec¬ 
tiveness  of  internal  control  over  financial  reporting.  We  do  not  focus  on  privately  held  banks,  as 
information  on  fees  paid  to  the  auditor  is  not  disclosed  for  these  banks. 

In  addition  to  FDICIA  and  SOX,  banks  are  expected  to  comply  with  guidance  issued  by  the 
SEC  and  the  Federal  Financial  Institutions  Examination  Council  (FFIEC)  on  LLR^^  Both  SAB  No. 
102  and  the  policy  statement  emphasize  the  methodology  used  to  measure  LLP  and  the  documen¬ 
tation,  rather  than  external  financial  reporting  (Phillips  and  Lierley  2001/2002).^^ 

III.  RESEARCH  DESIGN 

In  this  section,  we  first  discuss  our  conceptual  model,  depicted  in  Figure  1,  on  the  relation 
between  economic  bonding  and  earnings  management.  Second,  we  describe  the  empirical  model 
used  to  estimate  abnormal  LLP.  Third,  we  describe  the  empirical  model  used  to  estimate  unex¬ 
pected  (abnormal)  measures  of  fees  paid  to  the  auditor.  Fourth,  we  discuss  the  empirical  model 
used  to  test  possible  linkages  between  abnormal  LLP  and  unexpected  fees  paid  to  auditors. 

The  model  in  Figure  1  is  based  on  the  predictive  validity  model  of  Kinney  and  Libby  (2002) 
that  was  developed  by  Runkel  and  McGrath  (1972)  and  Libby  (1981).  As  in  prior  research,  the 
economic  bond  to  the  audit  client  and  earnings  management  by  the  client  (link  1)  are  the  two 
theoretical  constructs  we  examine.  We  identify  proxies  to  measure  the  economic  bond  and  eam- 


Note  that  the  FDIC  increased  the  asset  size  threshold  for  internal  control  assessments  by  management  and  external 
auditors  but  retained  the  financial  statement  audit  and  other  repotting  requirements  for  all  institutions  with  $500  million 
or  more  in  total  assets  to  preserve  safety  and  soundness  in  financial  management  (FDIC  2005a). 

In  July  2001,  the  SEC  issued  SAB  No.  102,  Selected  Loan  Loss  Allowance  Methodology  and  Documentation  Issues,  and 
the  FFIEC  issued.  Policy  Statement  on  Allowance  for  Loan  and  Lease  Losses  Methodologies  and  Documentation  for 
Banks  and  Savings  Institutions. 

Although  this  guidance  applies  equally  to  all  institutions,  the  policy  statement  suggests  that  small  banks  may  produce 
fewer  supporting  documents  in  support  of  their  loan  loss  methodologies  than  large  banks  (FDIC  2001). 
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FIGURE  1 
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ings  management  because  these  two  constructs  are  unobservable.  We  use  multiple  fee  measures  to 
capture  the  economic  bond  (link  2).  Kinney  and  Libby  (2002)  reason  that  the  economic  bond  is 
likely  to  strengthen  as  unexpected  nonaudit  and  audit  fees  increase  because  the  abnormal  rents 
earned  by  the  auditor  increase.  We  use  unexpected  total  fees  and  unexpected  nonaudit  fees  as 
primary  measures  of  the  economic  bond  (link  2)  and  the  abnormal  component  of  a  bank’s  LLP  as 
the  measure  of  earnings  management  (link  3).  We  examine  banking  firms  because  they  provide  a 
setting  with  a  stronger  link  3  than  that  characterizing  prior  research.  By  using  an  industry-specific 
measure  of  earnings  management,  we  can  also  improve  upon  prior  research  in  terms  of  a  sharper 
separation  between  abnormal  and  normal  accruals.  We  control  for  several  bank  characteristics  that 
can  also  affect  LLP  (link  4).  We  then  examine  the  relation  between  the  various  fee  measures  and 
abnormal  LLP  (link  5)  to  draw  inferences  about  link  1 . 

We  expect  banks  that  are  exempt  from  assessments  of  the  effectiveness  of  internal  control 
over  financial  reporting  by  managers  and  auditors  to  pose  a  greater  risk  to  auditor  independence 
than  banks  that  are  not  exempt.  When  bank  managers  and  the  auditor  know  that  the  bank  will  not 
be  subject  to  regulatory  scrutiny,  such  a  setting  likely  encourages  earnings  management  by  man¬ 
agers  and  could  increase  auditors’  tolerance  for  earnings  management  relative  to  banks  that  are 
expected  to  comply  with  the  requirements  of  FDICIA  and  Section  404  of  SOX.^^ 

Estimation  of  Abnormal  Loan  Loss  Provisions 

In  this  section  we  describe  our  two-stage  approach  to  examine  link  5  (see  Figure  1).  We 
estimate  the  normal  or  nondiscretionary  component  of  LLP  by  regressing  LLP  on  beginning  loan 


Prior  research  suggests  that  increased  legal  liability  increases  the  auditor’s  effort  and  audit  accuracy  (Boritz  and  Zhang 
1997;  De  and  Sen  2002).  Similarly,  Kim  et  al.  (2008)  argue  that  legal  environment  plays  an  important  role  in  deter¬ 
mining  auditor  effort. 
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loss  allowance,  beginning  non-performing  loans,  change  in  non-performing  loans,  net  loan  charge- 
offs,  change  in  total  loans  outstanding,  total  loans  outstanding,  loan  mix,  and  controls  for  period 
effects  using  the  following  model  (all  variables  are  scaled  by  beginning  total  assets) 

LLP  =  ao  +  a^BEGLLA  +  a2BEGNPL  +  a^CHNPL  +  a^LCO  +  a^CHLOANS  +  a^^LOANS 

+  LOANCATEGORIES  +  YEARCONTROLS  +e  ( 1 ) 

where: 


LLP  = 
BEGLLA  = 
BEGNPL  = 
CHNPL  = 
LCO  = 
CHLOANS  = 
LOANS = 
LOANCATEGORIES  = 


provision  for  loan  losses; 
beginning  loan  loss  allowance; 
beginning  non-performing  loans; 
change  in  non-performing  loans; 
net  loan  charge-offs; 
change  in  total  loans  outstanding; 
total  loans  outstanding;  and 

amount  of  commercial  loans  {COMM),  consumer  loans  {CON),  real 
estate  loans  {RESTATE),  agriculture  loans  {AGRI),  loans  to  foreign 
banks  and  governments  {FBG),  and  loans  to  other  depository  institutions 
{DEPINS). 


The  residuals  from  model  (1)  are  the  abnormal  component  of  LLP,  referred  to  as  Al.T  P  We 
expect  a  negative  coefficient  on  BEGLLA  (i.e.,  accumulated  LLP  less  write-offs  at  the  beginning 
of  the  year),  as  a  higher  initial  loan  loss  allowance  will  require  a  lower  LLP  in  the  current  period. 
Consistent  with  prior  research,  we  expect  a2,  0:3,  a^,  and  to  be  positive.  Higher  levels  of 
non-performing  loans  indicate  that  problems  with  the  loan  portfolio  will  require  higher  loss  pro¬ 
visions.  Therefore,  we  expect  the  beginning  balance  of  non-performing  loans  {BEGNPL)  to  be 
positively  related  to  LLP.  We  expect  change  in  non-performing  loans  {CHNPL)  to  be  positively 
related  to  LLP  because  an  increase  in  non-performing  loans  will  require  a  higher  loan  loss  provi¬ 
sion  in  the  current  period.  We  expect  net  loan  charge-offs  {LCO)  to  be  positively  related  to  LLP. 
As  explained  by  Beaver  and  Engel  (1996),  because  current  loan  charge-offs  can  provide  informa¬ 
tion  about  future  loan  charge-offs,  they  can  influence  expectations  of  the  collectability  of  current 
loans  and  hence  current  LLP.  We  expect  the  level  of  loans  {LOANS)  to  be  positively  related  to  LLP 
because  a  higher  level  of  loans  will  also  require  higher  provisions.  We  do  not  offer  a  prediction  for 
0:5  because  the  effect  of  change  in  total  loans  {CHLOANS)  on  LLP  depends  on  the  quality  of 
incremental  loans. 

Although  non-performing  loans  and  loan  charge-offs  serve  as  measures  of  risk,  we  include  six 
additional  variables  to  control  for  differences  in  loan  composition  that  also  likely  contribute  to 
differences  in  risk.  For  example,  banks  with  a  higher  proportion  of  commercial  and  real  estate 
loans  are  likely  to  have  higher  loan  loss  provisions  than  banks  with  a  higher  proportion  of 
consumer  loans.  Failure  to  account  for  these  differences  makes  the  residuals  from  the  loan  loss 
provision  model  a  function  of  the  hank  type.  This,  in  turn,  could  affect  the  inferences  of  our 
auditor  independence  tests  because  both  audit  fees  and  ALLP  may  systematically  vary  across 
banks  based  on  their  loan  portfolio  mix. 


These  variables  have  also  been  used  in  several  prior  studies  (e.g.,  Wahlen  1994;  Kanagaretnam  et  al.  2004)  to  estimate 
the  normal  component  of  LLP. 

We  also  use  beginning  total  loans  as  an  alternate  deflator.  Our  results  are  not  sensitive  to  this  choice  of  deflator. 

For  convenience,  we  summarize  the  variable  definitions  in  Table  1. 
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LAFEE  = 
LTOTFEE  = 
LNAFEE  = 
EEERATIO  = 
UTOTFEE  = 
UNAFEE  = 
UAEEE  = 
UEEERATIO  = 
LLP  = 
ALLP  = 
LOANS  = 
BEGLLA  = 
CHLOANS = 
BEGNPL  = 
CHNPL  = 
LCO  = 
COMM  = 
CON  = 
RESTATE  = 
AGRI  = 
FBG  = 
DEPINS  = 
BIGS  = 
MB  = 
LMVE  = 
LOSS  = 
PASTLLP  = 
EBP  = 
TIERI  = 
TCAP  = 
LASSETS  = 
SECURITIES  = 
SNPL  = 
INTANG  = 
EEFICIENCY  = 

SLCO  = 
SCOMM  = 
SCON  = 
SRESTATE  = 
EXEMPT  = 


SMALL  = 
NT  = 


TABLE  1 

Definitions  of  Variables 

natural  log  of  audit  fees; 

natural  log  of  total  fees; 

natural  log  of  nonaudit  fees; 

nonaudit  fees  divided  by  total  fees; 

unexpected  total  fees  from  model  (2); 

unexpected  nonaudit  fees  from  model  (2); 

unexpected  audit  fees  from  model  (2); 

unexpected  fee  ratio  from  model  (2); 

provision  for  loan  losses  divided  by  beginning  total  assets; 

abnormal  LLP  (residual  from  model  (1)); 

total  loans  outstanding  divided  by  beginning  total  assets; 

beginning  loan  loss  allowance  divided  by  beginning  total  assets; 

change  in  total  loans  outstanding  divided  by  beginning  total  assets; 

beginning  non-performing  loans  divided  by  beginning  total  assets; 

change  in  non-performing  loans  divided  by  beginning  total  assets; 

net  loan  charge-offs  divided  by  beginning  total  assets; 

conunercial  loans  divided  by  beginning  total  assets; 

consumer  loans  divided  by  beginning  total  assets; 

real  estate  loans  divided  by  beginning  total  assets; 

agriculture  loans  divided  by  beginning  total  assets; 

loans  to  foreign  banks  and  governments  divided  by  beginning  total  assets; 
loans  to  other  depository  institutions  divided  by  beginning  total  assets; 
indicator  variable  that  equals  1  if  audited  by  a  Big  5  firm,  and  0  otherwise; 
market-to-book  ratio  at  the  end  of  the  year; 

natural  log  of  market  value  of  common  equity  at  the  end  of  the  year; 
indicator  variable  that  equals  1  if  net  income  <0,  and  0  otherwise; 
prior  year’s  LLP  divided  by  beginning  total  assets; 

net  income  before  extraordinary  items  and  LLP  divided  by  beginning  total  assets; 
tier  1  risk-adjusted  capital  ratio  at  the  beginning  of  the  year; 
total  risk-adjusted  capital  ratio  at  the  beginning  of  the  year; 
natural  log  of  total  assets; 

[1  —  (total  securities/total  assets)]; 
non-performing  loans  divided  by  lagged  total  loans; 
intangible  assets  divided  by  total  assets; 

total  operating  expenses  divided  by  total  revenues  (measured  as  net  interest  income  plus 
non-interest  income); 

net  loan  charge-offs  divided  by  loan  loss  allowance; 
total  commercial  and  agricultural  loans  divided  by  total  loans; 
total  consumer  loans  divided  by  total  loans; 
total  real  estate  loans  divided  by  total  loans; 

banks  exempted  from  internal  control  regulations.  An  indicator  variable  that  equals  1  if 
a  bank  has  less  than  $500  million  in  total  assets  during  the  years  2000  through  2003  or 
less  than  $500  million  in  total  assets  and  public  float  less  than  $75  million  for  the  year 
2004  or  less  than  $1  billion  in  total  assets  and  public  float  less  than  $75  million  for  the 
years  2005  and  2006,  and  0  otherwise; 

indicator  variable  that  equals  1  if  a  bank  has  less  than  $500  million  in  total  assets  during 
the  years  2000  through  2003;  and 

indicator  variable  that  equals  1  if  a  bank  has  less  than  $500  million  in  total  assets  and 
public  float  less  than  $75  million  for  the  year  2004  or  less  than  $1  billion  in  total  assets 
and  public  float  less  than  $75  million  for  the  years  2005  and  2006,  and  0  otherwise. 
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The  six  loan  portfolio  composition  variables  included  in  the  model  are  commercial  loans 
{COMM),  consumer  loans  {CON),  real  estate  loans  {RESTATE),  agriculture  loans  {AGRI),  loans  to 
foreign  banks  and  governments  {FBG),  and  loans  to  other  depository  institutions  {DEPINS).  We 
also  include  six  year- indicator  variables  representing  years  2000  through  2005  in  model  (1)  to 
control  for  period-specific  effects. 

Next,  we  describe  the  model  for  computing  unexpected  (abnormal)  fees.  Auditor  fees  are  also 
influenced  by  various  economic  determinants  including  size  and  complexity  of  audit  task.  There¬ 
fore,  controlling  for  the  economic  determinants  of  auditor  fees  and  using  unexpected  (abnormal) 
auditor  fees  is  likely  a  better  measure  of  auditor  fee  dependence  than  using  actual  fees. 

Estimation  of  Unexpected  Fees 

Kinney  and  Libby  (2002)  argue  that  unexpected  fees  are  a  better  measure  of  the  auditor-client 
economic  bond  because  they  reflect  the  “excess”  profit  derived  from  an  audit  client.  We  estimate 
unexpected  fees  as  the  difference  between  actual  and  expected  (normal)  fees,  where  expected  fees 
are  the  predicted  values  from  a  regression  of  auditor  fees  on  a  set  of  firm  characteristics. 

Prior  research  models  auditor  fees  as  a  function  of  a  firm’s  auditor  choice,  audit  complexity, 
and  audit  risk,  in  addition  to  other  variables  (Firth  1997;  Ashbaugh  et  al.  2003).  Fields  et  al. 
(2004)  examine  the  determinants  of  audit  fees  in  the  banking  industry.  We  use  the  variables 
identified  in  the  Fields  et  al.  (2004)  study  to  estimate  unexpected  fees.  Audit  fees  and  other  fees 
are  likely  to  be  higher  when  the  auditor  is  a  Big  5  auditor.  Auditor  size  also  proxies  for  client  size. 
We  measure  firm  size  as  the  natural  log  of  total  assets.  The  normal  audit  fee  is  directly  related  to 
a  bank’s  credit  risk,  operating  risk,  liquidity  risk,  and  capital  risk.  We  include  SNPL,  SLCO, 
SCOMM,  SCON,  and  SRESTATE  as  proxies  for  a  bank’s  credit  risk  and  the  efficiency  ratio 
{EEEICIENCY)  as  a  proxy  for  operating  risk.  We  measure  the  efficiency  ratio  as  the  ratio  of  total 
operating  expenses  to  total  revenues.  As  in  Fields  et  al.  (2004),  we  use  SECURITIES  as  a  proxy  for 
liquidity  risk,  and  intangible  assets  {INTANG)  and  total  capital  ratio  {TCAP)  to  account  for  capital 
risk.  In  addition,  we  include  a  control  for  exempted  banks  {EXEMPT),  which  are  expected  to  pay 

17 

lower  fees  to  auditors.  We  estimate  the  following  model: 

FEE  =  4  +  S^BIGS  +  S2LASSETS  +  S^SECURITIES  +  S4SNPL  +  S5LOSS  +  8(,INTANG 

y  8^EFFICIENCY  +  S^SLCO  +  SgSCOMM  +  S^oSCON+  SnSCESTATE+  S^fTCAP 
+  S^EXEMPT  +  YEARCONTROLS  +  e  (2) 

where: 

FEE  =  fees  paid  to  the  auditor  {LTOTFEE  =  In  (audit  fees  +  nonaudit  fees);  or 

LNAFEE  =  In  (nonaudit  fees);  or  LAFEE  =  In  (audit  fees));  or  FEERATIO  = 
nonaudit  fees/total  fees); 

BIGS  =  indicator  variable  that  equals  1  if  audited  by  a  Big  5  firm,  and  0  otherwise; 
LASSETS  =  natural  log  of  total  assets; 

SECURITIES  =  [l-(total  securities/total  assets)]; 

SNPL  =  non-performing  loans  divided  by  lagged  total  loans; 

LOSS  =  indicator  variable  that  equals  1  if  net  income  <  0,  and  0  otherwise; 

INTANG  =  intangible  assets  divided  by  total  assets; 


As  in  Fields  et  al.  (2004),  this  model  includes  three  loan  categories.  We  also  estimate  unexpected  fees  after  including  all 
the  six  loan  categories  used  in  model  (1).  The  relations  between  ALLP  and  UFEE  are  qualitatively  the  same  when  we 
use  all  six  loan  categories  to  estimate  abnormal  fees. 
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EFFICIENCY  =  total  operating  expenses  divided  by  total  revenues  (measured  as  net  interest 
income  plus  non-interest  income); 

SLCO  =  net  loan  charge-offs  divided  by  loan  loss  allowance; 

SCOMM  =  total  commercial  and  agricultural  loans  divided  by  total  loans; 

SCON  =  total  consumer  loans  divided  by  total  loans; 

SRESTATE  =  total  real  estate  loans  divided  by  total  loans; 

TCAP  =  total  risk-adjusted  capital  ratio;  and 
EXEMPT  =  indicator  variable  that  equals  1  if  a  bank  has  less  than  $500  million  in  total 
assets  during  the  years  2000  through  2003  or  less  than  $500  million  in  total 
assets  and  public  float  less  than  $75  million  for  the  year  2004  or  less  than  $1 
billion  in  total  assets  and  public  float  less  than  $75  million  for  the  years 
2005  and  2006,  and  0  otherwise. 

We  use  the  residuals  from  model  (2)  as  the  unexpected  (abnormal)  fees  for  our  four  fee 
measures. 

Main  Tests 

Our  objective  is  to  test  whether  the  association  between  abnormal  LLP  {ALEP)  and  fees  paid 
to  the  auditor  is  stronger  for  banks  that  are  exempt  from  the  annual  auditing  and  reporting  of 
internal  control  assessments  under  FDICIA  and  Section  404  of  SOX.  We  create  EXEMPT,  an 
indicator  variable  that  equals  1  for  banks  that  are  exempt  from  FDICIA  and  Section  404  of  SOX 
and  0  otherwise,  and  interact  EXEMPT  with  fees  paid  to  the  auditor.  This  specification  allows  us 
to  test  whether  the  effect  of  fee  dependence  on  earnings  management  is  more  pronounced  for 
small  banks  that  are  subject  to  less  scrutiny  than  large  banks. 

We  estimate  the  model  separately  for  negative  (income-increasing)  ALEP  and  positive 
(income-decreasing)  A  LIP.  Negative  A  LLP  are  of  particular  interest  because  of  their  positive 
impact  on  reported  earnings.  We  control  for  the  following  factors  that  prior  research  has  docu¬ 
mented  are  associated  with  abnormal  accruals  (Ashbaugh  et  al.  2003):  firm  size,  auditor  type, 
market-to-book  ratio,  level  of  accruals,  and  performance.  We  use  log  of  market  value  of  equity  to 
measure  size,  and  represent  performance  by  two  variables,  existence  of  loss  and  earnings  before 
LLP,  and  growth  by  market-to-book  ratio.  We  control  for  the  reversal  of  accruals  over  time  by 
including  past  LLP,  and  for  capital  management  incentives  by  including  beginning  of  year  tier  1 
capital  ratio  and  total  capital  ratio. We  estimate  the  following  model: 


AEEP  =  7o  +  JiEXEMPT  +  y2UFEE  +  y^EXEMPT  X  UFEE  +  y^^BIGS  -f-  y^MB  +  y(,EMVE 
-I-  yjLOSS+  y^PASTEEP+  ygEBP  +  yioTIERl +  yuTCAP,_^  +  YEARCONTROES 
+  e  (3) 

where: 

A  LLP  =  abnormal  loan  loss  provision  (the  residual  from  model  (1)); 


We  report  the  results  of  estimating  model  (3)  using  both  positive  and  negative  ALLP  in  Table  6. 

Banks  had  incentives  to  influence  regulatory  capital  through  LLP  prior  to  the  change  in  bank  capital  adequacy  require¬ 
ments  in  1990,  This  change  altered  banks’  incentives  to  manage  capital  through  LLP  as  loan  loss  allowance  is  no  longer 
considered  part  of  tier  I  or  core  capital.  Therefore,  we  do  not  expect  7,0  to  be  significantly  different  from  0.  Furthermore, 
loan  loss  allowance  is  included  in  tier  II  or  supplementary  capital  only  up  to  1.25  percent  of  risk-adjusted  assets. 
Therefore,  we  predict  a  negative  coefficient  on  y^Q  as  higher  capital  ratios  would  require  lower  LLP,  ceteris  paribus. 
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EXEMPT= 


UEEE  = 
BIGS  = 

MB  = 
LMVE  = 
LOSS  = 
PASTLLP-- 
EBP- 

TIERl  = 
TCAP  = 


indicator  variable  that  equals  1  if  a  bank  has  less  than  $500  million  in  total  assets 
during  the  years  2000  through  2003  or  less  than  $500  million  in  total  assets  and 
public  float  less  than  $75  million  for  the  year  2004  or  less  than  $1  billion  in  total  assets 
and  public  float  less  than  $75  million  for  the  years  2005  and  2006,  and  0  otherwise; 
unexpected  total  fees  or  nonaudit  fees  or  audit  fees  or  fee  ratio  from  model  (2); 
indicator  variable  that  equals  1  if  the  bank  is  audited  by  a  Big  5  firm,  and  0 
otherwise; 

market-to-book  ratio  at  the  end  of  the  year; 

natural  log  of  market  value  of  common  equity  at  the  end  of  the  year; 
indicator  variable  that  equals  1  if  net  income  <  0,  and  0  otherwise; 
prior  year’s  LLP  divided  by  total  assets  at  the  beginning  of  the  year; 
net  income  before  extraordinary  items  and  loan  loss  provisions  divided  by  total 
assets  at  the  beginning  of  the  year; 

tier  1  risk-adjusted  capital  ratio  at  the  beginning  of  the  year;  and 
total  risk-adjusted  capital  ratio  at  the  beginning  of  the  year. 


The  coefficients  of  interest  in  model  (3)  are  72  73.  For  negative  (income-increasing) 

ALEP,  a  negative  72  is  consistent  with  auditor  fee  dependence  on  the  client,  and  indicates  that 
higher  fees  are  associated  with  greater  (more  negative)  income-increasing  ALEP  for  large  banks 
that  are  not  exempt  from  regulation.  A  negative  73  indicates  greater  fee  dependence  on  clients  that 
are  exempt  from  regulation  relative  to  large  banks  that  are  not.  For  positive  (income-decreasing) 
AEEP,  a  negative  72  implies  that  higher  auditor  fees  are  associated  with  less  conservative  earnings 
management  (i.e.,  income-decreasing  AEEP  are  less  positive)  for  large  banks  that  are  not  exempt 
from  regulation.  Similarly,  a  negative  73  indicates  greater  fee  dependence  on  clients  that  are 
exempt  from  regulation  relative  to  large  banks  that  are  not. 

Kinney  and  Libby  (2002)  indicate  that  both  audit  fees  and  nonaudit  fees  are  capable  of 
increasing  the  economic  bond.  They  reason  that  the  auditor  might  be  willing  to  decrease  audit  fees 
or  nonaudit  fees  to  maximize  potential  total  fees.  Therefore,  it  is  the  total  fees  derived  from  a 
single  client,  rather  than  just  audit  fees,  that  best  reflect  the  economic  bond.  Prior  research  also 
employs  nonaudit  fees  to  measure  auditor  independence  because  lower  regulation  and  the  wide 
variety  of  nonaudit  services  (from  information  technology  to  tax  preparation)  allow  for  inclusion 
of  rents  in  nonaudit  fees  (Srinidhi  and  Gul  2007).  BCinney  and  Libby  (2002,  109)  state,  “more 
insidious  effects  on  economic  bond  may  result  from  unexpected  nonaudit  and  audit  fees  that  may 
more  accurately  be  likened  to  attempted  bribes”  (emphasis  in  the  original).  This  suggests  that 
unexpected  fees  are  a  better  measure  of  the  auditor-client  economic  bond  because  they  reflect  the 
“excess”  profit  derived  from  the  audit  client.  Another  benefit  of  using  unexpected  fees  is  that  we 
explicitly  control  for  several  risk-related  variables  that  we  include  in  the  estimation  of  expected 
fees.  Therefore,  we  use  unexpected  total  fees  and  unexpected  nonaudit  fees  as  our  primary  mea¬ 
sures  of  the  economic  bond.  However,  for  consistency  with  prior  research  (e.g.,  Ashbaugh  et  al. 
2003;  Srinidhi  and  Gul  2007)  we  also  conduct  our  analysis  with  unexpected  audit  fees  and 
unexpected  fee  ratio  (nonaudit  fees/total  fees)  as  alternate  measures  of  the  economic  bond.  Al¬ 
though  we  use  unexpected  fees  as  our  primary  measure  of  economic  bonding,  we  also  estimate  a 
variation  of  model  (3)  where  we  replace  UEEE  with  actual  fees  paid  to  the  auditor.  We  discuss 
sample  selection  in  the  next  section. 

IV.  SAMPLE  SELECTION 

We  select  sample  banks  from  banks  listed  in  the  2007  Bank  Compustat  annual  data  files  and 
obtain  auditor  fees  for  the  years  2000-2006  from  Audit  Analytics.  The  intersection  of  the  Audit 
Analytics  with  Bank  Compustat  data  results  in  an  initial  sample  of  2,044  bank-year  observations. 
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We  hand-collect  non-performing  loans  data  for  the  period  1999-2006  from  annual  reports  and 
obtain  data  on  loan  portfolio  composition  from  the  Federal  Reserve  Bank  Holding  Company 
Database  (call  reports).  Our  final  sample  with  available  data  for  all  variables  and  without  outliers 
comprises  1,740  bank-year  observations  for  301  banks.^°  We  have  1,618  observations  with  avail¬ 
able  data  for  the  models  that  use  abnormal  fee  measures.  The  sample  does  not  include  any 
privately  held  banks. 

Table  2,  Panel  A  reports  descriptive  statistics  for  the  scaled  variables  used  in  the  regressions. 
On  average,  nonaudit  fees  as  a  proportion  of  total  fees  (FEERATIO)  is  about  0.32.  The  mean 
values  of  UTOTEEE,  UNAFEE,  and  ALEP  are  zero  by  construction.  Note  that  UTOTEEE  and 
UNAFEE  are  estimated  from  1,618  observations.  More  than  52  percent  of  the  sample  observations 
are  audited  by  Big  5  auditors.  Turning  to  the  bank  loan  variables,  the  ratios  of  average  LLP,  loan 
charge-offs,  and  beginning  non-performing  loans  to  beginning  total  assets  are  0.003,  0.002,  and 
0.004,  respectively.^^  About  19.2  percent  of  total  observations  are  exempt  from  Section  36  of  the 
FDI  Act  and  Section  404  of  SOX.^^ 

Table  2,  Panel  B  reports  correlations  for  the  scaled  dependent  and  independent  variables.  As 
expected,  LLP  is  positively  correlated  with  non-performing  loans  (BEGNPL),  change  in  non¬ 
performing  loans  (CHNPL),  and  loan  charge-offs  (LCO).  LLP  is  also  positively  correlated  with 
total  loans  (LOANS)  and  change  in  loans  outstanding  (CHLOANS).  LLP  is  significantly  (p  <  0.05) 
positively  related  to  log  of  total  fees  and  log  of  nonaudit  fees,  and  ALLP  is  weakly  negatively 
correlated  with  nonaudit  fees. 


V.  RESULTS 

Estimation  of  Abnormal  LLP 

We  report  the  estimation  results  of  model  (1)  in  Table  3,  Panel  A.  The  t-statistics  reported  in 
Table  3  and  in  the  other  tables  are  based  on  standard  errors  adjusted  for  firm-level  clustering.  The 
results  in  Table  3  show  that  the  coefficients  on  the  determinants  of  LLP  have  the  expected  signs 
except  for  the  coefficient  on  BEGNPL,  which  is  not  significant.  The  coefficients  on  BEGLLA, 
CHNPL,  LCO,  and  LOANS  are  significant  at  the  0.01  level.  Of  the  variables  that  reflect  loan  type, 
only  the  coefficients  on  commercial  and  consumer  loans  are  statistically  significant.  The  explana¬ 
tory  power  of  the  model  is  high  (adjusted  =  66.32  percent),  indicating  that  the  model  describes 
the  variation  in  LLP  quite  well.^^ 

Estimation  of  Abnormal  Fees 

Table  3,  Panel  B  reports  the  results  from  estimating  model  (2).  While  we  report  results  of 
model  (2)  for  four  fee  measures — total  fees,  nonaudit  fees,  audit  fees,  and  fee  ratio — our  primary 
focus  is  on  total  fees  and  nonaudit  fees.  As  expected,  we  find  a  positive  relation  between  the  fee 


We  delete  observations  in  the  top  1  percent  of  scaled  loan  loss  provisions,  loan  charge-offs,  beginning  non-performing 
loans,  total  loans,  and  market-to-book  ratios.  The  values  of  all  variables  in  the  reduced  sample  are  generally  within  three 
standard  deviations  from  the  mean. 

These  ratios  are  lower  than  the  values  reported  in  Wahlen  (1994)  and  Kanagaretnam  et  al.  (2004)  due  to  increase  in 
overall  quality  of  loans  (Edwards  and  Mishkin  1995).  Similarly,  our  sample  data  indicate  that  mean  LLP  equals  17 
percent  of  earnings  before  provisions,  considerably  lower  than  the  148  percent  for  Wahlen’s  (1994)  sample  and  lower 
than  the  31  percent  for  Kanagaretnam  et  al.’s  (2004)  sample. 

We  closely  follow  the  approach  of  Gao  et  al.  (2009)  for  identifying  accelerated/non-accelerated  filers  under  Section  404 
of  SOX.  Our  sample  includes  14  bank-year  observations  that  have  assets  less  than  $500  million  in  2004  but  were 
accelerated  filers.  For  2005  and  2006,  it  includes  34  bank-year  observations  that  had  assets  between  $500  million  and  $1 
billion  that  were  non-accelerated  filers. 

Even  without  the  year  controls,  the  adjusted  is  56.55  percent,  indicating  that  our  model  fit  compares  favorably  with 
the  results  of  prior  research.  For  example,  McNichols  (2002)  reports  that  when  change  in  working  capital  accruals  is 
regressed  on  changes  in  sales  and  property,  plant,  and  equipment,  the  adjusted  for  the  pooled  sample  is  7.3  percent. 
Ashbaugh  et  al.  (2003)  report  mean  R^  values  of  11  percent  and  18  percent  for  their  models  of  abnormal  accruals. 
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TABLE  3 

Regressions  for  Computing  ALLP  and  Abnormal  Fees 

Panel  A:  Results  of  Regression  of  LLP  on  Determinants  of  Normal  LLP^ 

Expected 


Variable 

Sign 

Coefficient 

t-statistic 

Intercept 

? 

-0.0012*** 

-5.87 

BEGLLA 

- 

-0.1148*** 

-4.66 

BEGNPL 

+ 

-0.0072 

-0.43 

CHNPL 

+ 

0.0406*** 

2.69 

ECO 

+ 

0.9559*** 

25.88 

CHLOANS 

? 

0.0006 

0.69 

LOANS 

+ 

0.0034*** 

7.66 

COMM 

? 

0.0004*** 

2.89 

CON 

? 

-0.0014*** 

-2.76 

RESTATE 

7 

0.0000 

0.10 

AGRI 

7 

0.0007 

0.34 

EBG 

7 

-0.0209 

-1.39 

DEPINS 

7 

0.0072 

0.73 

Year  Controls 

Yes 

n 

1740 

Adjusted 

66.32% 

Panel  B:  Results  of  Regressions  of  Fee  Measures  on  Determinants  of  Fees'* 

Variable 

Total  Fee 

Nonaudit  Fee 

Audit  Fee 

Fee  Ratio 

(1) 

(2) 

(3) 

(4) 

(5) 

Intercept 

5.4366*** 

2.7859*** 

5.7552*** 

-0.0626 

(12.58) 

(3.34) 

(12.47) 

(-0.44) 

BIGS 

0.2908*** 

0.2693*** 

0.2508*** 

0.0193 

(5.63) 

(2.77) 

(4.87) 

(1.13) 

LASSETS 

0.6714*** 

0.7136*** 

0.6292*** 

0.0188** 

(22.53) 

(13.71) 

(21.50) 

(2.44) 

SECURITIES 

0.8548*** 

1.1918*** 

0.7794*** 

0.0678 

(4.00) 

(3.49) 

(3.10) 

(1.11) 

SNPL 

13.9441*** 

41.8254*** 

1.5396 

6.6917*** 

(2.61) 

(4.56) 

(0.35) 

(4.76) 

LOSS 

0.2522 

-0.3621 

0.4643* 

-0.0950* 

(1.08) 

(-0.79) 

(1.90) 

(-1.35) 

INTANG 

5.4330*** 

-1.0942 

8.4852*** 

-1.5201*** 

(3.92) 

(-0.38) 

(4.83) 

(-3.43) 

EEEICIENCY 

1.1689*** 

1.7519*** 

0.7808*** 

0.1665* 

(3.92) 

(2.94) 

(2.76) 

(1.80) 

SLCO 

-0.0863 

-0.3179 

0.0303 

-0.0718* 

(-0.85) 

(-1.46) 

(0.24) 

(-1.68) 

SCOMM 

0.2634*** 

0.2556 

0.2497** 

0.0021 

(2.68) 

(1.55) 

(2.22) 

(0.06) 

SCON 

-0.4945** 

-0.2637 

-0.5205** 

0.0339 

(-2.15) 

(-0.60) 

(-2.06) 

(0.39) 

(continued  on  next  page) 
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Panel  B:  Results  of  Regressions  of  Fee  Measures  on  Determinants 

of  Fees'* 

Variable 

Total  Fee 

Nonaudit  Fee 

Audit  Fee 

Fee  Ratio 

(1) 

(2) 

(3) 

(4) 

(5) 

SRESTATE 

-0.0812 

-0.1646 

-0.0511 

-0.0181 

(-1.20) 

(-1.20) 

(-0.80) 

(-0.68) 

TCAP 

0.0159*** 

0.0061 

0.0223*** 

-0.0027 

(2.77) 

(0.35) 

(3.19) 

(-0.74) 

EXEMPT 

-0.0167 

-0.1237 

-0.0589 

0.0281 

(-0.28) 

(-1.26) 

(-0.94) 

(1.57) 

Year  controls 

Yes 

Yes 

Yes 

Yes 

n 

1618 

1618 

1618 

1618 

Adj.  R2 

80.38% 

51.11% 

74.44% 

6.53% 

*,  **,  ***  Indicate  0.10,  0.05,  and  0.01  significance  levels,  respectively,  for  ; 

a  two-tailed  test. 

“  Panel  A:  The 

regression  model  is: 

LLP  =  ao  +  aiBEGLLA  +  a2BEGNPL  +  a^CHNPL  +  a^LCO  +  a^CHLOANS  +  a(,LOANS  +  LOAN CATEGORIES 
+  YEARCONTROLS  +  s . 

t-statistics  are  based  on  standard  errors  adjusted  for  firm-level  clustering. 

’’  Panel  B:  The  regression  model  is: 

FEE  =Sq+  SyBlGS  +  S2LASSETS  +  &,SECURIT1ES  -1-  S^SNPL  +  S^LOSS  +  SfNTANG  +  BjEFFICIENCY  +  S^SLCO 
+  SgSCOMM+  SiqSCON+  SuSCESTATE+  S12TCAP+  S^2EXEMPT+  YEARCONTROLS  +  e . 

t-statistics,  in  parentheses,  are  based  on  standard  errors  adjusted  for  firm-level  clustering. 

Please  see  Table  1  for  variable  definitions. 


measures,  total  fees  and  nonaudit  fees,  and  BIGS,  LASSETS,  SECURITIES,  NPL,  EFFICIENCY, 
and  SCOMM.  The  signs  of  the  coefficients  are  generally  consistent  with  Fields  et  al.  (2004).  For 
the  total  fees  model,  the  adjusted  is  80.38  percent,  indicating  a  good  fit.  For  the  nonaudit  fees 
model,  the  adjusted  is  51.11  percent.  These  values  are  higher  than  the  adjusted  R^  values 
reported  in  Ashbaugh  et  al.  (2003)  for  industrial  firms  and  consistent  with  the  R^  values  reported 
in  Fields  et  al.  (2004).  We  use  the  residuals  from  model  (2)  as  the  unexpected  (abnormal)  fees  for 
our  fee  measures. 

Association  between  Income-increasing  (Negative)  Abnormal  LLP  and  Abnormal  and  Actual 
Fee  Measures 

We  present  the  results  of  model  (3)  relating  abnormal  LLP  to  fee  measures  (total  fees,  non¬ 
audit  fees,  audit  fees,  and  fee  ratio)  in  Tables  4  and  5.  Although  we  report  estimation  results  for  all 
four  fee  measures  for  completeness,  we  only  discuss  the  results  for  unexpected  total  and  nonaudit 
fees,  the  measures  of  particular  interest.^'^  Table  4  reports  the  results  for  negative  (income- 
increasing)  ALEP,  and  Table  5  reports  the  results  for  positive  (income-decreasing)  ALEP.  In  each 
table.  Panel  A  presents  the  results  using  abnormal  (unexpected)  fee  measures,  and  Panel  B  presents 
the  results  using  actual  fee  measures. 


The  correlations  between  our  proxies  for  abnormal  LLP  and  abnormal  fee  measures  and  size  are  very  low,  confirming 
that  our  subsequent  analysis  for  large  and  small  banks  does  not  suffer  from  latent  size  issues. 

The  results  for  audit  fees  and  fee  ratio  are  discussed  in  the  section  on  sensitivity  analysis. 
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*,  **,  ***  Indicate  0.10,  0.05,  and  0.01  significance  levels,  respectively,  for  a  two-tailed  test, 
t-statistics,  in  parentheses,  are  based  on  standard  errors  adjusted  for  firm-level  clustering. 

“  Panel  A:  The  regression  model  is: 

ALLP  =  7o  +  yiEXEMPT+  y2UFEE  +  y^EXEMPT  X  UFEE  +  y^BlGS  +  y^MB  +  y^^LMVE  +  y-jLOSS  +  y^PASTLLP  +  y^EBP  +  y^JIERl,_i  +  ynTCAP,_i  +  YEARCONTROLS 
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For  each  fee  measure,  we  present  results  for  two  specifications.  The  first  specification  esti¬ 
mates  the  relation  between  ALLP  and  each  abnormal  and  actual  fee  measure  for  all  banks.  The 
second  specification  (column  3)  distinguishes  between  small  banks  that  are  exempt  from  FDICIA 
and  Section  404  of  SOX  and  large  banks,  thereby  allowing  the  relation  between  ALLP  and  fee 
measures  to  vary  across  these  two  classes  of  banks. 

The  results  in  column  2  of  Table  4,  Panel  A  indicate  that  the  coefficient  on  UTOTFEE  is  not 
significant,  but  the  results  in  column  4  indicate  that  the  coefficient  on  UNAFEE  is  significant  at  the 
0.10  level.  These  results  suggest  that  income-increasing  earnings  management  via  ALLP  is  in¬ 
creasing  in  unexpected  nonaudit  fees.  In  terms  of  the  control  variables,  LOSS  and  PASTLLP  (in 
column  4)  are  significantly  related  to  ALLP.  Interestingly,  there  is  no  difference  in  abnormal  LLP 
between  clients  of  BIGS  and  non-BIG5  auditors.  As  expected,  the  capital  ratios  are  not  signifi¬ 
cantly  associated  with  ALLP,  confirming  the  reduced  capital  management  incentives  through 
ALLP  in  the  post- 1990  period. 

Turning  to  the  second  specification  (columns  3  and  5  of  Table  4),  the  coefficients  on  UTOT¬ 
FEE  and  UNAEEE  are  not  significant,  indicating  no  relation  between  income-increasing  earnings 
management  and  unexpected  fees  for  large  banks.^^  In  contrast,  the  coefficients  on  EXEMPT  X 
UTOTEEE  and  EXEMPT  X  UNAEEE  are  negative  and  significant  at  the  0.05  level.  In  addition, 
both  the  sum  of  the  coefficients  on  UTOTFEE  and  EXEMPT  X  UTOTFEE  and  on  UNAEEE  and 
EXEMPT  X  UNAEEE  are  significantly  negative  at  the  0.01  level.  These  results  indicate  that 
income-increasing  ALLP  is  higher  for  audit  clients  that  are  small  banks  (and,  therefore,  are  exempt 
from  the  regulatory  oversight)  that  pay  higher  abnormal  total  fees  or  abnormal  nonaudit  fees  to 
their  auditors. 

The  results  in  Table  4,  Panel  B  indicate  that  the  coefficients  on  EXEMPT  X  LTOTEEE  and 
EXEMPT  X  LNAEEE  are  negative  and  significant  at  the  0.05  level.^*  Overall,  these  results  are 
consistent  with  the  results  based  on  unexpected  fee  measures  reported  in  Panel  A,  indicating  that 
our  findings  are  not  sensitive  to  the  use  of  actual  or  unexpected  fees.  In  addition,  the  results 
document  that  only  small  banks  that  are  exempt  from  regulatory  compliance  exhibit  a  significant 
relation  between  earnings  management  via  income-increasing  LLP  and  fees  paid  to  the  auditor.  In 
short,  our  findings  suggest  that  the  potential  impairment  of  auditor  independence  may  be  serious 
only  for  small  banks  and  not  for  large  banks. 

Association  between  Income-Decreasing  (Positive)  Abnormal  LLP  and  Abnormal  and  Actual 
Fee  Measures 

The  results  reported  in  Table  5,  Panel  A  show  that  UTOTEEE  is  not  significantly  associated 
with  income-decreasing  ALLP.  However,  the  coefficients  on  EXEMPT  X  UTOTEEE  and  EX¬ 
EMPT  X  UNAFEE  are  negative  and  significant  at  the  0.05  level.^^  These  results  indicate  that 
higher  unexpected  total  fees  and  unexpected  nonaudit  fees  paid  to  auditors  of  exempt  small  banks 
are  associated  with  a  lower  level  of  income-decreasing  ALLP.  Turning  to  columns  2  and  3  of  Panel 
B,  LTOTEEE  is  positively  and  significantly  associated  with  ALLP  at  the  0.01  level  in  each  of  those 
regressions.  One  interpretation  of  this  result  is  that  clients  of  auditors  who  pay  higher  total  fees 


For  the  abnormal  fee  regressions,  we  have  170  (70)  bank-year  (banks)  observations  for  which  EXEMPT  =  1.  For  the 
actual  fee  regressions,  we  have  177  (72)  bank-year  (banks)  observations  for  which  EXEMPT  =  1. 

We  also  estimate  model  (3)  using  only  observations  for  which  EXEMPT  =  0  (i.e.,  large  banks).  We  find  that  the 
coefficients  on  UTOTFEE  and  UNAFEE  are  insignificant  at  the  0.20  level  for  both  income-increasing  and  income- 
decreasing  ALLP,  indicating  no  evidence  of  fee  dependence  for  large  banks. 

We  mean-center  the  fee  variables  to  reduce  problems  with  multicollinearity  among  the  interaction  terms  (Neter  et 
al.l989;  Aiken  and  West  1991). 

For  the  abnormal  fee  regressions,  we  have  142  (70)  bank-year  (banks)  observations  for  which  EXEMPT  =  1.  For  the 
actual  fee  regressions,  we  have  157  (74)  bank-year  (banks)  observations  for  which  EXEMPT  =  1. 
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TABLE  5 

Relation  Between  Income-Decreasing  (Positive)  ALLP  and  Fee  Measures 
Panel  A;  Results  of  Regressions  of  Income-Decreasing  (Positive)  ALLP  on  Abnormal  Fee  Measures^ 
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*,  **,  ***  Indicate  0.10,  0.05,  and  0.01  significance  levels,  respectively,  for  a  two-tailed  test, 
t-statistics,  in  parentheses,  are  based  on  standard  errors  adjusted  for  firm-level  clustering. 

“  Panel  A:  The  regression  model  is: 

ALLP=  yQ+y^EXEMPT+  y2UPEE+  y^EXEMPTX  UFEE+  y^BIGS  +  yiMB+  y(JLMVE+  77LO55+  y^PASTLLP  +  y^EBP  +  y^oTIERl +  y^{rCAP,_^  +  YEARCONTROLS 
+  e. 
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engage  in  a  higher  level  of  income-decreasing  earnings  management.^^  Although  positive,  the 
insignificant  coefficient  on  LNAFEE  does  not  support  the  total  fee  results.  Further,  the  coefficients 
on  EXEMPT  X  LTOTEEE  and  EXEMPT  X  LNAEEE  are  insignificant.  These  results  differ  from 
the  results  in  Panel  A  due  to  the  use  of  different  fee  measures.  If  we  do  not  control  for  factors 
determining  normal  fees,  we  do  not  observe  a  significant  relation  between  income-decreasing 
ALEP  and  either  unexpected  total  fees  or  unexpected  nonaudit  fees  (see  Panel  A),  indicating  that 
the  results  for  income-decreasing  ALEP  are  sensitive  to  the  use  of  fees  or  abnormal  fees. 

Summary  of  Results 

We  briefly  summarize  our  main  results  on  the  relation  between  income-increasing  and 
income-decreasing  ALEP  and  unexpected  total  fees  and  unexpected  nonaudit  fees.  We  find  that  for 
large  banks,  both  unexpected  total  fees  and  unexpected  nonaudit  fees  have  no  relationship  with 
either  income-increasing  or  income-decreasing  AEEP — i.e.,  unexpected  total  or  nonaudit  fees  are 
unrelated  to  earnings  management  behavior  in  large  banks.  In  contrast,  we  find  a  significant 
negative  association  between  income-increasing  AEEP  and  both  unexpected  total  fees  and  unex¬ 
pected  nonaudit  fees  for  small  banks  relative  to  large  banks.  Recall  that  these  small  banks  are 
exempt  from  FDICIA  (as  well  as  Section  404  of  SOX),  which  mandated  assessments  of  the 
effectiveness  of  internal  control  over  financial  reporting  by  managers  and  auditors.  Furthermore, 
we  also  document  a  significant  negative  association  between  unexpected  total  fees  and  unexpected 
nonaudit  fees  and  income-decreasing  AEEP.  These  results  are  consistent  with  greater  earnings 
management  by  small  banks  that  pay  higher  abnormal  fees  to  the  auditor.  Collectively,  our  results 
suggest  that  auditor  independence  may  be  lower  when  auditor  fee  dependence  on  the  audit  client 
is  higher,  but  only  for  small  banks  that  are  not  subject  to  the  same  level  of  regulatory  scrutiny  as 
large  banks.  We  conjecture  that  auditors  of  small,  exempt  banks  may  not  be  as  concerned  with 
earnings  management  perhaps  because  the  failure  of  such  a  bank,  if  it  occurs,  will  cause  less 
damage  to  the  auditor’s  reputation  than  will  the  failure  of  a  large  bank.  Alternatively,  partners 
auditing  small,  exempt  banks  may  not  be  as  closely  monitored  internally  as  partners  auditing  large 
banks. 

Pooled  AEEP  Regressions 

For  consistency  with  prior  research  on  banks  (e.g.,  Ahmed  et  al.  1999;  Kanagaretnam  et  al. 
2004),  we  also  estimate  model  (3)  for  the  full  sample  (i.e.,  combining  income-increasing  and 
income-decreasing  AEEP)  and  present  these  results  in  Table  6.  The  coefficients  on  EXEMPT  X 
UTOTFEE  and  EXEMPT  X  UNAFEE  are  both  negative  and  significant  at  the  0.05  level,  indicat¬ 
ing  that  our  finding  of  a  positive  relation  between  higher  abnormal  auditor  fees  and  aggressive 
earnings  management  for  exempted  banks  is  robust. 

Association  between  Abnormal  LLP  and  Abnormal  Fee  Measures  in  the  Post-SOX  Era 

While  banks  with  total  assets  of  less  than  $500  million  (less  than  $1  billion  effective  2005) 
were  not  subject  to  the  annual  reporting  requirements  mandated  by  Section  36  of  the  FDI  Act,  the 
passage  of  SOX  (Section  404)  also  exempts  assessment  of  the  effectiveness  of  internal  control 
over  financial  reporting  by  managers  and  auditors  of  small  banks  (banks  with  public  float  less  than 
$75  million).  Therefore,  we  expect  our  results  on  the  association  between  AEEP  and  unexpected 
fees  observed  for  small  (exempted)  banks  to  be  similar  in  the  post-SOX  era.  We  examine  whether 
our  results  hold  for  a  class  of  banks  that  are  exempt  from  both  FDICIA  and  SOX.  Section  404  of 


Also,  note  that  EBP  is  positive  and  significant  in  both  panels  of  Table  5,  suggesting  that  firms  are  more  willing  to  take 
income-decreasing  LLP  when  they  have  high  earnings.  This  is  consistent  with  income  smoothing. 

We  thank  an  anonymous  reviewer  for  these  observations. 
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TABLE  6 


Results  of  Regressions  of  Pooled  ALLP  on  Abnormal  Fee  Measures 


Variable 

Abnormal 

Total  Fee 

Abnormal 
Nonaudit  Fee 

Intercept 

-0.0005 

0.0004 

(-0.87) 

(0.61) 

EXEMPT 

-0.0001 

-0.0001 

(-1.35) 

(-1.37) 

UTOTFEE 

0.0000 

(0.34) 

EXEMPT  X  UTOTFEE 

-0.0005*** 

(-2.59) 

UNAFEE 

-0.0001* 

(-1.69) 

EXEMPT  X  UNAFEE 

-0.0003** 

(-2.54) 

BIGS 

0.0000 

0.0000 

(0.40) 

(0.33) 

MB 

0.0000 

0.0000 

(1.26) 

(1.26) 

LMVE 

-0.0001* 

-0.0001* 

(-1.90) 

(-1.75) 

LOSS 

-0.0003 

-0.0002 

(-0.17) 

(-0.11) 

PASTEUP 

0.1345*** 

0.1327*** 

(4.36) 

(4.33) 

EBP 

0.0501*** 

0.0512*** 

(2.98) 

(3.00) 

TIERl 

0.0000 

0.0000 

(0.19) 

(0.21) 

TCAP 

0.0000 

0.0000 

(0.04) 

(0.10) 

Year  Controls 

Yes 

Yes 

n 

1618 

1618 

Adj. 

9.27% 

10.03% 

*,  **,  ***  Indicate  0.10,  0.05,  and  0.01  significance  levels,  respectively,  for  a  two-tailed  test. 

The  regression  model  is: 

ALLP  =  7o  +  yiEXEMPT+  y2UFEE  +  y-^EXEMPT  X  UFEE  +  y^BIG5  +  y^MB  +  y^LMVE  +  77LO5S  -1-  ysPASTLLP 
+  ygEBP  +  yioTIERl,^^  +  7,,rCAF,_i  +  YEARCONTROLS  +  e. 

t-statistics,  in  parentheses,  are  based  on  standard  errors  adjusted  for  firm-level  clustering. 

Please  see  Table  1  for  variable  definitions. 


SOX  became  effective  in  2004  for  accelerated  filers — i.e.,  firms  with  a  public  float  of  $75  million 
or  more.  We  re-estimate  model  (3)  for  the  years  2004  through  2006  by  replacing  EXEMPT  with 
NF  (non-accelerated  filers),  which  equals  1  for  banks  that  are  exempt  from  both  Section  404  of 
SOX  and  Section  36  of  the  FDI  Act,  and  0  otherwise.  The  results  of  this  specification  are  in  Table 

7. 
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We  report  results  for  the  pre-  and  post-Section  404  of  SOX  periods.  For  the  pre-Section  404 
period  (years  2000  through  2003),  we  interact  SMALL  with  UTOTFEE  and  UNAFEE.  For  the 
post-Section  404  period  (years  2004  through  2006)  we  interact  NE  with  UTOTEEE  and  UNAFEE. 
Table  7,  Panel  A  reports  the  results  for  income-increasing  (negative)  ALLP  and  Panel  B  reports  the 
results  for  income-decreasing  (positive)  ALLP.  We  find  that  for  income-increasing  ALLP,  the  sum 
of  the  coefficients  on  UTOTEEE  and  SMALL  X  UTOTFEE  and  on  UNAFEE  and  SMALL  X 
UNAEEE  are  negative  and  significant  at  the  0.05  level.  We  also  find  that,  for  income-decreasing 
(positive)  ALLP,  the  coefficients  on  SMALL  X  UTOTEEE  and  SMALL  X  UNAEEE  are  negative 
and  significant  at  the  0.05  level.  These  results  are  consistent  with  the  results  in  Tables  4  and  5, 
suggesting  that  there  is  greater  earnings  management  via  abnormal  LLP  by  small  banks  (less  than 
$500  million  in  total  assets)  that  pay  higher  abnormal  fees  to  the  auditor.  Turning  to  the  post- 
Section  404  period,  we  find  that,  for  income-increasing  ALLP,  the  coefficients  on  NF  X  UTOT¬ 
FEE  and  NE  X  UNAEEE  are  both  negative  and  significant  at  the  0.05  level.  We  do  not  find  a 
significant  coefficient  for  income-decreasing  ALLP.  Overall,  these  results  suggest  that  banks  that 
are  exempt  from  both  Section  36  of  the  FDI  Act  and  Section  404  of  SOX  and  pay  higher  abnormal 
fees  to  the  auditor  engage  in  greater  income-increasing  earnings  management  via  ALLP. 

Sensitivity  Tests 

We  conduct  several  sensitivity  checks  to  assess  the  robustness  of  these  results.  First,  we 
summarize  the  results  for  audit  fees  and  fee  ratio  reported  in  Tables  4  and  5.  Recall  that  our  main 
results  are  based  on  unexpected  total  fees  and  unexpected  nonaudit  fees;  however,  consistent  with 
prior  research  on  auditor  fees  for  industrial  firms  (Ashbaugh  et  al.  2003;  Srinidhi  and  Gul  2007), 
we  present  results  for  audit  fees  and  fee  ratio  as  additional  evidence.  For  income-increasing 
(negative)  ALLP,  neither  abnormal  audit  fees  nor  actual  audit  fees  is  significantly  associated  with 
ALLP.  On  the  other  hand,  unexpected  fee  ratio  has  a  strong  negative  (p  <  0.01)  relation  with 
income-increasing  ALLP  for  small  banks.  For  income-decreasing  (positive)  ALLP,  actual  audit 
fees  have  a  positive  (p  <  0.05  or  better)  association  with  ALLP  for  both  large  and  small  banks; 
however,  this  relation  does  not  hold  for  unexpected  audit  fees.  Abnormal  fee  ratio  is  negatively 
associated  with  income-increasing  ALLP  for  small  banks  (p  <  0.01),  but  fee  ratio  is  not  signifi¬ 
cantly  associated  with  income-increasing  ALLP. 

Second,  we  discuss  the  results  (untabulated)  for  separate  subsample  analyses  of  small  and 
large  banks.  For  income-increasing  ALLP,  our  main  results  for  unexpected  total  fees  and  unex¬ 
pected  audit  fees  hold  for  the  subsample  analysis.  More  specifically,  for  large  (non-exempted) 
banks,  there  is  no  significant  association  between  unexpected  total  fees  and  unexpected  nonaudit 
fees  and  income-increasing  ALLP.  For  small  (exempted)  banks,  the  estimated  coefficient  for 
unexpected  total  fees  is  —0.0006  (p  <  0.05),  and  for  unexpected  nonaudit  fees  the  estimated 
coefficient  is  —0.0002  (p  <  0.05),  consistent  with  our  main  results  reported  in  Table  4,  Panel  A. 
For  income-decreasing  ALLP,  the  only  significant  association  is  with  unexpected  nonaudit  fees  for 
small  (exempted)  banks  (estimated  coefficient  of  —0.0003,  p  <  0.01),  consistent  with  the  main 
results  reported  in  Table  5,  Panel  A. 

Third,  we  assess  the  sensitivity  of  our  results  to  omitting  observations  that  have  an  abnormal 
influence  on  our  regressions.  When  we  delete  observations  that  have  R-student  values  greater  than 
3  or  less  than  —3,  the  results  for  unexpected  total  fees  and  unexpected  nonaudit  fees  for  both 
income-increasing  and  income-decreasing  ALLP  are  similar  to  the  results  reported  in  Panel  A  of 
Tables  4  and  5.^^ 


Instead  of  deleting  the  top  1  percent  of  observations,  when  we  winsorize  the  key  variables  at  the  top  1  percent  for 
income-increasing  ALLP,  the  coefficient  on  EXEMPT  X  UTOTFEE  is  —0.0007  (significant  at  the  0.01  level).  Similarly, 
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Fourth,  we  examine  the  robustness  of  our  results  to  interacting  the  capital  management  and 
income  smoothing  variables  with  EXEMPT.  Using  income-increasing  ALEP  observations,  we 
re-estimate  model  (3)  with  the  following  interactions:  EXEMPT  X  EBP,  EXEMPT  X  TIERl,  and 
EXEMPT  X  TCAP.  Untabulated  results  indicate  that  neither  the  capital  ratios  nor  any  of  then- 
interactions  are  significant.  Additionally,  the  income  smoothing  variable  and  its  interaction  are  also 
not  significant.  However,  the  coefficients  on  EXEMPT  X  UTOTEEE  and  EXEMPT  X  UNAEEE 
continue  to  be  negative  and  significant  at  the  0.05  level. 

Finally,  we  examine  the  relation  between  abnormal  LLP  and  auditor  fees  using  an  alternate 
estimate  of  abnormal  LLP.  Ahmed  et  al.  (1999)  suggest  that  the  earnings  management  through 
LLP  reported  in  prior  research  is  conditional  on  the  inclusion  of  beginning  non-performing  loans 
{NPLt_i)  in  the  abnormal  LLP  model.  Our  main  results  are  robust  to  using  estimates  of  abnormal 
LLP  after  excluding  NPLf_x  from  model  (1). 

VI.  CONCLUSIONS 

Despite  the  economic  importance  of  the  banking  industry,  there  is  limited  research  on  the 
various  relationships  that  exist  between  banks  and  their  auditors.  We  provide  empirical  evidence 
on  the  relation  between  fees  paid  to  the  auditor  and  the  extent  of  earnings  management  via  loan 
loss  provisions  in  the  banking  industry.  In  particular,  we  study  a  setting  where  small  banks  are 
exempt  from  regulatory  scrutiny.  Section  36  of  the  FDI  Act,  which  was  intended  to  aid  early 
detection  of  problems  in  the  financial  management  of  insured  banks,  requires  an  assessment  of  the 
effectiveness  of  internal  control  over  financial  reporting  by  auditors.  However,  this  requirement 
does  not  apply  to  banks  with  total  assets  of  less  than  $1  billion  (less  than  $500  million  until  2004). 
We  study  whether  there  is  a  relation  between  earnings  management  via  loan  loss  provisions  and 
fees  paid  to  the  auditor,  particularly  for  banks  that  are  exempt  from  the  FDI  Act. 

We  report  several  important  findings.  First,  for  large  banks,  we  find  that  both  unexpected  total 
fees  and  unexpected  nonaudit  fees  are  not  related  to  either  income-increasing  or  income- 
decreasing  AJ.IP  These  findings  indicate  that  unexpected  auditor  fees  are  unrelated  to  earnings 
management  behavior  in  large  banks,  suggesting  that  auditor  fee  dependence  is  not  a  threat  to 
auditor  independence  in  large  banks.  In  contrast,  we  find  a  strong  negative  association  between 
income-increasing  (negative)  ALEP  and  both  unexpected  total  fees  and  unexpected  nonaudit  fees 
for  small  banks  that  are  exempt  from  Section  36  of  the  FDI  Act  and  Section  404  of  SOX. 
Furthermore,  we  also  document  a  strong  negative  association  between  unexpected  nonaudit  fees 
and  income-decreasing  (positive)  AEEP.  These  results  are  consistent  with  greater  earnings  man¬ 
agement  by  exempt  banks  that  pay  higher  abnormal  fees  to  the  auditor.  Collectively,  our  results 
suggest  that  auditor  independence  could  be  threatened  when  auditor  fee  dependence  on  the  audit 
client  is  higher,  but  only  for  small  banks  that  are  not  subject  to  the  same  level  of  regulatory 
scrutiny  as  larger  banks. 

Our  results  complement  the  findings  of  Altamuro  and  Beatty  (2010),  who  link  FDICIA  re¬ 
quirements  to  improved  loan  loss  provisioning.  We  also  find  that  loan  loss  provisions  of  banks  that 
are  subject  to  internal  control  requirements  under  FDICIA  or  SOX  are  independent  of  fees  paid  to 
auditors.  Additionally,  we  document  that  banks  that  are  exempted  from  internal  control  require¬ 
ments  under  FDICIA  or  SOX  are  more  likely  to  overstate  earnings  when  the  fees  paid  to  auditors 
are  high. 

Our  results  have  several  implications.  First,  they  inform  policymakers  that  auditor  fees  are 
related  to  earnings  management  for  small  but  not  large  banks.  This  suggests  that  any  potential 


the  coefficient  on  EXEMPT  X  UNAEEE  is  -0.0003  (significant  at  the  0.01  level).  For  income-decreasing  ALEP,  the 
coefficient  on  EXEMPT  X  UNAEEE  is  -0.0002  (significant  at  the  0.05  level).  Once  again,  we  find  that  unexpected  total 
and  nonaudit  fees  are  unrelated  to  ALEP  for  non-exempted  banks. 
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impairment  of  auditor  independence  through  LLP  could  be  a  problem  for  small  banks.  Second,  to 
the  extent  that  the  FDICIA  can  be  viewed  as  a  precursor  to  SOX  and  to  the  extent  that  our  results 
for  banks  are  generalizable  to  other  industries,  our  findings  of  increased  earnings  management  by 
small  banks  that  are  less  closely  regulated  are  relevant  to  the  debate  on  reducing  the  requirements 
of  SOX  for  small  firms.  Finally,  our  findings  are  timely  and  relevant  given  the  recent  hanking 
crisis  and  the  response  from  governments  around  the  world  contemplating  new  regulations.  In 
particular,  our  results  inform  policymakers  on  the  relationship  that  existed  between  fees  paid  to 
auditors  and  the  extent  of  earnings  management  in  banks  prior  to  the  current  banking  crisis. 
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ABSTRACT:  This  study  examines  whether  the  market  reacts  more  strongly  to  earnings 
forecast  revisions  when  financial  analysts  supplement  their  earnings  forecasts  with 
sales  forecasts.  1  find  that  earnings  forecast  revisions  supplemented  with  sales  forecast 
revisions  have  a  greater  impact  on  security  prices  than  do  stand-alone  earnings  fore¬ 
cast  revisions,  controlling  for  the  incremental  information  content  in  sales  forecasts. 
Supplemented  earnings  forecasts  are  more  accurate  ex  post,  controlling  for  other  indi¬ 
vidual  analyst  characteristics.  Results  are  robust  to  controlling  for  earnings  persistence 
and  time  effects.  Taken  as  a  whole,  financial  analysts  are  more  likely  to  supplement 
their  earnings  forecasts  with  sales  forecasts  when  they  have  better  information.  Supple¬ 
mentary  sales  forecasts  appear  to  lend  credibility  to  earnings  forecasts  because  finan¬ 
cial  analysts  provide  sales  forecasts  when  they  are  more  informed. 

Keywords:  financial  analyst;  sales  forecast;  revenue  forecast;  forecast  accuracy. 

Data  Availability:  All  data  are  available  from  the  sources  identified  in  the  text. 

1.  INTRODUCTION 

This  study  investigates  two  research  questions:  (1)  How  does  the  market  react  to  earnings 
forecast  revisions  when  individual  financial  analysts  justify  their  earnings  forecasts  with 
sales  forecasts?  (2)  Do  financial  analysts  provide  supplementary  sales  forecasts  when  they 
are  better  informed?  I  examine  whether  investors  react  more  strongly  to  earnings  forecast  revi¬ 
sions  supplemented  with  sales  forecast  revisions  (hereafter  “supplemented  earnings  forecast  revi¬ 
sions”)  than  to  revisions  not  supplemented  with  sales  forecast  revisions  (hereafter  “stand-alone 
earnings  forecast  revisions”).  I  define  an  earnings  forecast  revision  as  supplemented  by  a  sales 
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forecast  if  a  financial  analyst  revises  both  his/her  earnings  and  sales  forecasts  on  the  same  day  as 
reported  in  the  Institutional  Brokers  Estimate  System  (I/B/E/S)  database* 

I  define  forecast  credibility  as  the  extent  to  which  investors  believe  and  react  to  earnings 
forecasts.  I  measure  forecast  credibility  using  the  forecast  response  coefficient  to  an  earnings 
forecast  revision.  Supplementary  sales  forecasts  appear  to  increase  the  credibility  of  earnings 
forecasts  for  two  reasons.  First,  investors  can  better  evaluate  the  credibility  of  earnings  forecasts 
with  their  knowledge  of  the  business  and  other  macroeconomic  factors  when  financial  analysts 
reveal  sales  forecasts.  Sales  are  an  important  driver  of  firm  value  and  a  key  variable  to  investors 
engaged  in  fundamental  analysis.  For  example,  Penman  (2001,  503)  suggests  that  sales  forecasts 
contain  information  on  sales  growth  and  the  firm’s  marketing  plan,  which  is  not  contained  in 
earnings  forecasts.  Financial  analysts  convey  the  causes  of  the  earnings  forecast  revisions  through 
sales  forecasts.  Sales  forecasts  explain  whether  the  revisions  in  earnings  forecast  are  due  to  growth 
in  revenue,  increase  in  profit  margin,  or  cost  control.  Earnings  growth  supported  by  an  increase  in 
revenue  is  more  sustainable  because  growth  in  revenue  reflects  the  success  of  the  underlying 
product  differentiation  strategy  (Porter  1980).  Second,  sales  forecasts  are  ex  post  verifiable  at  the 
year-end.^  Because  investors  can  use  sales  forecasts  to  gauge  the  information  provided  by  indi¬ 
vidual  financial  analysts,  financial  analysts  are  more  likely  to  issue  supplementary  sales  forecasts 
when  they  are  more  informed. 

While  prior  research  mainly  focuses  on  financial  analysts’  earnings  forecasts  and  their  stock 
recommendations,  it  is  an  open  question  as  to  how  the  market  reacts  to  other  supplementary 
information  provided  by  financial  analysts.'*  My  first  set  of  tests  examines  the  association  between 
supplementary  sales  forecast  revisions  and  the  market  response  to  financial  analysts’  earnings 
forecast  revisions.^  On  the  one  hand,  if  supplementary  sales  forecasts  add  credibility  to  earnings 
forecast  revisions,  then  price  reactions  to  the  earnings  forecast  revisions  would  be  stronger,  con¬ 
trolling  for  the  information  content  of  sales  forecasts.  On  the  other  hand,  if  the  market  perceives 
supplementary  sales  forecasts  as  “cheap  talk”  to  generate  trading  volume,  then  sales  forecasts  and 
price  reactions  would  not  be  associated.  My  results  show  that  supplemented  earnings  forecast 
revisions  have  a  greater  impact  on  contemporaneous  stock  prices  than  do  stand-alone  earnings 
forecast  revisions,  controlling  for  the  independent  information  in  sales  forecast  revisions  and  other 
individual  analyst  characteristics.  The  difference  in  market  responses  suggests  that  investors  view 
supplemented  earnings  forecast  revisions  as  more  credible. 

I  then  examine  whether  the  financial  analysts  have  better  information  when  they  choose  to 
reveal  their  supplementary  sales  forecasts.  My  second  set  of  tests  examines  the  accuracy  of 
supplemented  earnings  forecasts,  controlling  for  other  individual  analyst  characteristics.  Because 
analysts’  private  information  is  unobservable,  I  control  for  other  confounding  factors  in  two  ways. 
First,  I  measure  forecast  accuracy  relative  to  other  analysts  issuing  forecasts  for  the  same  firm-year 
to  control  for  potential  differences  in  forecasting  difficulty  across  firm-years  (Clement  and  Tse 


'  For  the  purposes  of  this  study,  sales  forecast  and  revenue  forecast  carry  the  same  meaning  and  are  used  interchangeably 
unless  otherwise  stated. 

^  Ex  post  verifiable  forecasts  {hard  forecasts)  aie  those  forecasts  that  the  actual  values  are  observed  by  the  market  after  a 
definite  period  of  time,  for  example,  at  subsequent  earnings  announcement  dates.  In  contrast  to  hard  forecasts,  soft 
forecasts  are  those  that  are  not  specific  enough  to  be  compared  with  subsequent  realizations  (Hutton  et  al.  2003).  An 
example  of  a  soft  forecast  would  be  a  qualitative  discussion  of  a  firm’s  prospects  in  an  analyst  report. 

^  For  example,  Loh  and  Mian  (2006)  argue  that  quality  of  justification  of  earnings  forecasts  in  analyst  reports  is  part  of 
an  analyst’s  utility  function. 

Call  et  al.  (2009)  is  an  exception.  However,  they  do  not  examine  the  information  content  of  and  the  market  reactions  to 
cash  flow  forecasts. 

^  Because  forecasting  sales  is  the  first  step  to  forecasting  earnings,  it  is  unlikely  that  an  analyst  would  maintain  the  same 
sales  forecast  if  he/she  revises  his/her  own  earnings  forecast.  Therefore,  I  assume  that  it  is  a  decision  by  individual 
financial  analysts  whether  to  disclose  their  revised  sales  forecasts. 
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2005;  Herrmann  and  Thomas  2005).  Second,  I  directly  control  for  other  individual  analyst  and 
forecast  characteristics  (Mikhail  et  al.  1997;  Clement  1999;  Jacob  et  al.  1999;  Brown  2001; 
Clement  and  Tse  2003).  Consistent  with  my  hypotheses,  I  find  that  supplementary  sales  forecasts 
are  associated  with  more  accurate  earnings  forecasts.  Financial  analysts  are  more  likely  to  supple¬ 
ment  their  earnings  forecasts  with  sales  forecasts  when  they  have  better  information.  These  results 
also  suggest  that  supplementary  sales  forecasts  lend  credibility  to  earnings  forecasts  because 
financial  analysts  provide  sales  forecasts  when  they  are  more  informed. 

This  study  extends  the  findings  of  Baginski  et  al.  (2004)  who  demonstrate  that  management 
earnings  forecasts  supplemented  by  sales  forecasts  receive  larger  price  responses,  which  is  con¬ 
sistent  with  investors  perceiving  these  earnings  forecasts  as  more  credible.  Because  managers  and 
analysts  have  different  incentives  and  differential  access  to  information,  I  investigate  whether 
Baginski  et  al.’s  (2004)  results  extend  to  analysts’  earnings  forecasts. 

It  is  not  self-evident  that  investors  will  have  identical  interpretations  of  managers’  and  ana¬ 
lysts’  forecasts.  First,  firm  mangers  and  financial  analysts  belong  to  different  labor  markets  and  are 
subject  to  different  incentives.  On  the  one  hand,  management  compensation  and  career  outcome 
are  directly  tied  to  the  performance  of  the  company.  While  supplementary  information  could  be 
useful  to  the  market,  the  primary  concern  of  managers  is  not  to  convey  their  forecast  accuracy.  If 
the  company  is  not  performing  well,  then  management  would  have  incentives  to  avoid  shareholder 
discipline  by  attributing  the  poor  performance  to  factors  that  are  beyond  their  control.  On  the  other 
hand,  firm  performance  does  not  have  a  direct  effect  on  the  payoff  of  financial  analysts.  They  are 
compensated  for  the  amount  of  trading  commissions  generated.  As  a  result,  the  market  may  not 
react  to  supplementary  information  if  the  market  perceives  such  supplementary  sales  forecast  to  be 
merely  cheap  talk. 

Second,  firm  mangers  and  financial  analysts  have  differential  access  to  information.  While 
financial  analysts  have  a  competitive  edge  in  understanding  industry  information,  firm  managers 
have  access  to  firm-specific  proprietary  information  related  to  the  firms  they  are  running.  In  other 
words,  they  have  first-hand  information  relative  to  the  financial  analysts  and  have  discretion  on 
how  much  information  (if  any)  is  to  be  released  to  financial  analysts.  A  priori,  it  is  not  obvious 
whether  supplementary  information  provided  by  firm  mangers  and  financial  analysts  is  equally 
reliable.^ 

Baginski  et  al.  (2004)  regress  three-day  cumulative  abnormal  returns  on  the  interaction  be¬ 
tween  unexpected  earnings  and  an  indicator  variable  for  the  presence  of  sales  forecasts.  However, 
they  do  not  control  for  the  incremental  information  content  in  the  sales  forecasts.  I  include  both 
revisions  in  sales  forecasts  and  indicators  for  the  presence  of  sales  forecasts  to  disentangle  whether 
the  market  reactions  are  driven  by:  (1)  specific  information  in  sales  forecasts,  and/or  (2)  the  act  of 
disclosing  sales  forecasts.  My  study  also  goes  one  step  further  and  investigates  whether  these 
supplemented  earnings  forecasts  are  indeed  more  accurate  ex  post.  Because  the  role  of  financial 
analysts  has  increased  dramatically  over  time,  it  is  important  to  understand  the  nature  of  and  the 
market  responses  to  supplementary  sales  forecasts. 

Givoly  et  al.  (2009)  provide  evidence  that  cash  flow  forecasts  appear  to  be  a  naive  extension 
of  analysts’  earnings  forecasts.  They  show  that  analysts’  forecasts  of  cash  flows  are  of  limited 
information  content  and  are  only  weakly  associated  with  stock  returns.  My  evidence  suggests  that 
sales  forecasts,  unlike  cash  flow  forecasts,  provide  incremental  information  to  the  market.^ 


®  I  thank  Steve  Matsunaga  for  sharing  his  insights  on  the  differences  between  management  forecasts  and  analysts’ 
forecasts  with  me. 

^  One  possible  reason  is  that  from  a  financial  statement  analysis  perspective,  forecasting  sales  is  the  first  step  to  forecast¬ 
ing  earnings  (for  example,  see  Penman  [2001],  among  others).  In  other  words,  financial  analysts  derive  their  earnings 
forecasts /ram  their  sales  forecasts.  Givoly  et  al.’s  (2009)  evidence  suggests  cash  flow  forecasts  appear  to  be  a  naive 
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My  work  is  related  to  Rees  and  Sivaramakrishnan  (2007).^  However,  this  study  differs  in  two 
ways.  First,  while  Rees  and  Sivaramakrishnan  (2007)  examine  the  implication  of  meeting  or 
exceeding  revenue  forecasts  at  the  time  of  the  quarterly  earnings  announcement,  I  do  not  restrict 
my  sample  to  the  last  forecast  revisions  issued  by  financial  analysts  and  look  at  revisions  of  both 
earnings  and  supplementary  sales  forecasts  throughout  the  year.  Because  individual  analyst  fore¬ 
casts  are  a  vital  source  of  timely  information  for  investors,  it  is  important  to  examine  how 
supplementary  information  affects  the  immediate  price  impact  of  forecast  revisions.  Second,  I 
examine  why  the  market  reacts  more  strongly  to  supplemented  earnings  forecasts.  I  find  that 
supplemented  earnings  forecasts  are  more  informative  because  supplementary  sales  forecasts  are  a 
credible  signal  of  analysts’  accuracy. 

My  study  contributes  to  the  literature  in  three  ways.  First,  I  extend  the  literature  on  the  role  of 
and  the  market  reactions  to  information  that  supplements  earnings  forecasts.  I  examine  supple¬ 
mentary  sales  forecasts,  a  relatively  new  form  of  forecast  disseminated  by  analyst  tracking  ser¬ 
vices.  Second,  I  rule  out  the  competing  hypothesis  that  sales  forecasts  are  uninformative  cheap  talk 
provided  by  financial  analysts,  merely  hoping  to  stimulate  trading  volume  and  generate  trading 
comnussions.  Third,  I  provide  empirical  evidence  that  the  presence  of  supplementary  sales  fore¬ 
casts,  conditional  on  other  individual  analyst  and  forecast  characteristics,  could  be  used  to  identify 
the  more  accurate  and  informative  earnings  forecasts. 

Section  II  discusses  prior  literature  and  develops  hypotheses.  Section  III  describes  sample 
selection  criteria  and  provides  sample  descriptive  statistics.  Sections  IV  and  V  present  the  empiri¬ 
cal  results  for  tests  of  market  reaction  and  tests  of  accuracy  of  supplemented  versus  stand-alone 
earnings  forecasts.  Section  VI  presents  the  results  for  additional  robustness  tests.  Section  VII 
concludes. 


II.  RELATED  LITERATURE  AND  HYPOTHESES  DEVELOPMENT 

Prior  research  documents  the  market  responses  to  individual  analyst  earnings  forecast  revi¬ 
sions  (Lys  and  Sohn  1990;  Stickel  1991;  Park  and  Stice  2000).^  While  individual  analyst  earnings 
forecasts  are  informative,  earnings  forecasts  are  only  one  of  the  many  signals  about  firms’  pros¬ 
pects  provided  by  financial  analysts.  Francis  and  Soffer  (1997)  find  that  earnings  forecasts  and 
stock  recommendations  are  incrementally  informative  to  one  another.  When  stock  recommenda¬ 
tions  are  favorable,  investors  place  greater  reliance  on  earnings  forecast  revisions,  which  are  ex 
post  verifiable.  Bradshaw  (2002)  examines  a  sample  of  analyst  reports  and  documents  the  fre¬ 
quency  at  which  analysts  disclose  target  prices  as  justifications  for  their  stock  recommendations. 
He  finds  that  the  most  favorable  recommendations  and  target  prices  are  more  likely  to  be  justified 
by  quantitative  forecasts  such  as  price-earnings  ratios  and  expected  growth,  while  the  least  favor¬ 
able  recommendations  are  more  likely  to  be  justified  with  other  qualitative  statements.  These 
findings  suggest  that,  when  favorable  stock  recommendations  are  perceived  as  less  credible,  fi¬ 
nancial  analysts  have  the  incentives  to  justify  their  recommendations. 

Supplementary  sales  forecasts  could  be  useful  in  two  ways.  First,  they  convey  information 
about  the  prospects  of  the  firms,  independent  of  the  information  in  earnings  forecasts.  Second, 


extension  of  analysts’  earnings  forecasts,  i.e.,  financial  analysts  derive  their  cash  flow  forecasts  from  their  earnings 
forecasts. 

*  Rees  and  Sivaramakrishnan  (2007)  examine  whether  meeting  or  exceeding  revenue  forecasts  serves  as  an  additional  cue 
to  the  market  in  pricing  earnings  performance.  They  show  that  meeting  revenue  forecasts  is  of  greater  importance  to 
market  participants  than  the  magnitude  of  the  revenue  forecast  error  per  se. 

®  Lys  and  Sohn  (1990)  find  that  individual  earnings  forecasts  are  informative  to  the  market.  Stickel  (1991)  find  that 
earnings  forecast  revisions  affect  stock  prices,  but  prices  do  not  immediately  assimilate  the  information  in  the  revisions. 
Prices  continue  to  drift  in  the  direction  of  the  revision  for  about  six  months  after  the  revision.  Park  and  Stice  (2000)  find 
that  current  revision  response  coefficient  is  increasing  in  the  analyst’s  past  earnings  forecast  accuracy. 
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supplementary  sales  forecasts  increase  the  credibility  of  earnings  forecasts.  Asquith  et  al.  (2005) 
examine  the  market  reactions  to  analyst  reports,  finding  that  market  reactions  are  associated  with 
both  the  magnitude  of  earnings  forecast  revision  and  the  strength  of  analysts’  arguments  in  the 
analyst  reports. Although  Asquith  et  al.  (2005)  show  that  supplementary  information  in  analyst 
reports  provides  incremental  information  to  the  market,  they  do  not  distinguish  whether  the 
supplementary  information  provided  by  financial  analysts  conveys  information  independent  of 
earnings  forecasts  or  whether  it  adds  credibility  to  earnings  forecasts. 

Hutton  et  al.  (2003)  find  that  good  news  management  earnings  forecasts  are  informative  only 
when  supplemented  with  verifiable  forward-looking  statements.  Baginski  et  al.  (2004)  investigate 
market  reactions  to  explanation  (attribution)  for  the  management  forecasts.  They  classify  attribu¬ 
tions  into  two  types:  external  and  internal.  Internal  attributions  are  causal  factors  internal  to  the 
organization,  such  as  strategy  or  management  skill.  External  attributions  are  events  outside  the 
company,  such  as  competition  and  certain  economic  factors.  They  find  a  positive  relation  between 
security  price  reaction  and  management  forecast  with  external  attribution,  but  not  for  internal 
attribution.  Wasley  and  Wu  (2006)  find  that  managers  issue  cash  flow  forecasts  to  signal  good 
news  in  cash  flows  and  to  lend  credibility  to  good  news  in  earnings.  Taken  together,  these  findings 
suggest  that  verifiable  supplementary  information  lends  credibility  to  earnings  forecasts.  Because 
supplementary  sales  forecasts  justify  earnings  forecasts  and  are  ex  post  verifiable,  I  hypothesize 
that  they  increase  the  credibility  of  earnings  forecasts.  Thus,  my  first  hypothesis  is  as  follows: 

HI:  Supplementary  sales  forecasts  add  credibility  to  earnings  forecast  revisions. 

Perceived  forecast  accuracy  increases  with  analyst  reputation  (Shekel  1992;  Sinha  et  al. 
1997). Clement  and  Tse  (2003)  find  the  market  reaction  is  stronger  when  individual  analyst 
characteristics  predict  that  the  earnings  forecast  revisions  are  more  likely  to  be  accurate  ex  ante. 
Prior  literature  finds  that  individual  analyst  earnings  forecast  accuracy  increases  with  prior  accu¬ 
racy,  brokerage  size,  forecast  frequency,  and  firm  experience,  and  decreases  with  days  elapsed 
since  the  last  forecast,  forecast  horizon,  number  of  companies,  and  number  of  industries  the 
financial  analyst  follows  (Mikhail  et  al.  1997;  Clement  1999;  Jacob  et  al.  1999;  Brown  2001).^^ 
The  market  learns  about  the  ability  of  financial  analysts  and  updates  its  beliefs  about  their  per¬ 
ceived  accuracy  through  their  performance  and  action  over  time  (Chen  et  al.  2005). 

As  more  information  about  analyst  ability  is  revealed,  the  market  reacts  more  strongly,  re¬ 
sulting  in  higher  trading  volume  (Abarbanell  et  al.  1995).  Hayes  (1998)  shows  that  trading  volume 


Asquith  et  al.  (2005)  measure  the  level  of  analyst  justification  by  aggregating  the  number  of  positive  remarks  less  the 
number  of  negative  remarks  related  to  14  specific  criteria  about  the  firm  in  the  analyst  report:  revenue  growth,  earnings 
growth,  new  product  introductions,  new  projects,  cost  efficiencies,  expectations  met,  mergers  and  acquisitions,  repur¬ 
chase  programs,  industry  climate,  management,  international  operations,  leverage,  competition,  and  risk.  I  go  through  a 
sample  of  analyst  reports  that  include  revenue  forecasts  and  those  that  do  not.  I  find  both  types  of  analyst  reports  are  not 
substantially  different  from  each  other  in  terms  of  structure  and  length.  While  most  reports  discuss  sales  growth  and 
other  quantitative  or  qualitative  analysis  supporting  the  EPS  forecasts  and  stock  recommendation,  financial  analysts 
often  do  not  explicitly  disclose  their  revenue  forecasts  in  their  reports.  Some  reports  contain  detailed  sales  forecasts  by 
geographic  segments  and  product  lines,  and  explain  how  these  sales  forecasts  translate  into  the  EPS  forecasts,  assuming 
different  profit  margins  across  different  segments.  Operating  cash  flow  forecasts  are  not  as  common  as  sales  forecasts 
and  are  often  provided  only  when  the  financial  analysts  estimate  the  target  price  using  some  cash  flow-based  models. 
Shekel  (1992)  defines  that  an  analyst  has  a  superior  reputation  if  s/he  is  included  in  the  Institutional  Investor  All- 
American  Research  Team.  He  shows  that  earnings  forecast  revisions  by  All-American  analysts  have  a  larger  impact  on 
security  prices  than  do  revisions  by  other  analysts.  Sinha  et  al.  (1997)  use  past  forecast  accuracy  to  identify  superior 
analysts  and  find  that  superior  forecasting  ability  persists  over  time. 

Mikhail  et  al.  (1997)  document  a  decline  in  the  absolute  value  of  analyst  quarterly  forecast  errors  as  firm-specific 
experience  increases.  Clement  (1999)  finds  that  forecast  accuracy  is  positively  associated  with  analysts’  experience  and 
employer  size,  but  is  negatively  associated  with  the  numbers  of  firms  and  industries  followed  by  the  analyst.  Jacob  et  al. 
(1999)  report  that  analysts’  aptitude  and  brokerage  house  characteristics  are  associated  with  forecast  accuracy.  Brown 
(2001)  finds  that  past  performance  predicts  forecast  accuracy.  Clement  and  Tse  (2003)  find  that  forecast  frequency  is  as 
important  as  past  accuracy  in  explaining  future  accuracy. 
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increases  in  the  perceived  accuracy  of  the  forecast.  Because  financial  analysts  are  rewarded  for  the 
amount  of  trading  commissions  they  generate  for  their  brokerage  houses,  I  argue  that  the  more 
able  analysts  have  incentives  to  convey  their  ability  so  that  the  market  will  trade  on  their  analyst 
reports  (Schipper  1991;  Jackson  2005).^^  On  the  one  hand,  if  financial  analysts  provide  sales 
forecasts  to  enhance  credibility  when  they  are  more  informed,  then  the  presence  of  supplementary 
sales  forecasts  should  be  associated  with  more  accurate  earnings  forecasts,  controlling  for  other 
determinants  of  forecast  accuracy.  On  the  other  hand,  if  supplementary  sales  forecasts  are  issued 
by  uninformed  analysts  hoping  to  stimulate  trading  volume  and  generate  trading  commissions, 
then  supplementary  sales  forecasts  would  have  no  association  with  earnings  forecast  accuracy  ex 
post.  I  examine  whether  supplementary  sales  forecasts,  which  are  provided  to  the  investors  at  the 
discretion  of  financial  analysts,  could  be  an  ex  ante  signal  to  the  market  that  the  financial  analysts 
are  better  informed.  Thus,  my  second  hypothesis  is  as  follows: 

H2:  Earnings  forecasts  supplemented  with  sales  forecast  revisions  are  more  accurate  than 
stand-alone  earnings  forecasts. 

III.  SAMPLE  SELECTION  AND  DESCRIPTIVE  STATISTICS 
Sample  Selection 

I  obtain  data  on  analyst  forecasts  and  individual  analyst  characteristics  from  the  Institutional 
Brokers  Estimate  System  (I/B/E/S)  database,  financial  statements  data  and  earnings  announcement 
dates  from  Compustat,  and  stock  returns  data  from  the  Center  for  Research  on  Security  Prices 
(CRSP)  database.  I  restrict  my  analyses  to  annual  forecasts  to  avoid  the  complication  of  season¬ 
ality.  Forecast  revisions  in  the  initial  sample  are  required  to  have  corresponding  daily  security 
return  data  available  in  the  CRSP  database.  I  define  an  earnings  (sales)  forecast  revision  as  the 
newly  revised  earnings  (sales)  forecast  less  the  analyst’s  own  prior  earnings  (sales)  forecast. A 
sales  forecast  revision  is  defined  as  supplementary  when  a  financial  analyst  revises  both  his/her 
earnings  and  sales  forecasts  on  the  same  day  as  reported  in  the  I/B/E/S  detail  file.  I  measure  the 
information  content  of  a  supplementary  sales  forecast  using  three-day  size-adjusted  buy-and-hold 
return  around  the  revision  date.  Size-adjusted  buy-and-hold  return  is  the  raw  buy-and-hold  return 
of  the  firm  minus  the  mean  buy-and-hold  return  of  an  equal-weighted  portfolio  of  firms  in  the 
same  NYSE/ AMEX/NASDAQ  size  decile  over  the  three-day  holding  period. 

To  allow  for  variation  in  individual  analyst  and  revision  characteristics  at  the  time  of  the 
forecast  revisions,  I  perform  the  empirical  analysis  at  the  forecast  revision  level.  I  remove  earnings 
forecast  revisions  around  quarterly  earnings  announcements  (—1  to  +1)  because  the  price  re¬ 
sponses  to  the  news  in  the  earnings  announcements  might  confound  measures  of  price  responses 
to  earnings  forecast  revisions.  1  eliminate  forecast  revisions  after  the  fiscal  year-end  to  make  my 
results  consistent  and  comparable  to  other  results  documented  in  the  literature  (for  example, 
Clement  and  Tse  [2005],  among  others).  These  forecasts  are  likely  to  be  confounded  by  earnings 
pre-announcements  made  by  management  (Soffer  et  al.  2000).*^  Following  Gleason  and  Lee 
(2003),  in  order  to  mitigate  inflated  test  statistics  due  to  sample  dependence  induced  by  clustered 


This  is  known  as  the  “soft  dollar”  practice,  where  institutional  investors  intentionally  direct  a  portion  of  their  trades 
through  the  brokerage  houses  whose  financial  analysts  have  provided  the  trade-generating  research.  Because  brokerage 
firms’  profits  depend  directly  on  commission  revenues,  analyst  compensation  (usually  in  the  form  of  year-end  bonuses) 
is  implicitly  tied  to  the  amount  of  commissions  they  have  generated  from  their  research  (Irvine  2000,  2004). 

Prior  studies  (for  example,  Stickel  1990;  Park  and  Stice  2000;  Gleason  and  Lee  2003)  show  that  an  analyst’s  own  prior 
forecast  is  a  better  benchmark  than  the  consensus  forecast  for  measuring  the  amount  of  surprise  in  an  individual  analyst 
forecast  revision.  Results  are  robust  to  the  choice  of  benchmark. 

Forecast  revisions  after  the  fiscal  year  account  for  6  percent  of  all  the  individual  annual  earnings  forecasts.  My  results 
are  not  affected  by  whether  I  include  those  forecasts. 
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revisions,  I  create  a  non-overlapping  sample  by  including  revisions  that  have  no  other  revisions 
occurring  within  a  three-day  window.  In  other  words,  when  more  than  one  revision  occurs  in  a 
three-day  event  window,  I  randomly  pick  only  one  of  those  forecast  revisions.  The  non¬ 
overlapping  sample  contains  239,339  earnings  forecast  revisions  from  1998  to  2003.^^  For  regres¬ 
sions  with  individual  analyst  characteristics  as  independent  variables,  I  randomly  select  one  revi¬ 
sion  for  an  analyst  if  there  are  multiple  revisions  by  the  same  analyst  within  a  firm-year.  This 
mitigates  the  concern  that  the  sample  is  over-represented  by  multiple  revisions  from  the  same 
analyst.  This  reduces  the  sample  to  132,146  earnings  forecast  revisions.*^ 

Descriptive  Statistics 

Table  1  reports  descriptive  statistics  on  supplemented  and  stand-alone  earnings  forecast  revi¬ 
sions.  Both  earnings  and  sales  forecast  revisions  are  deflated  by  stock  prices  two  days  before  the 
revisions.  On  average,  financial  analysts  revise  their  earnings  forecasts  downward  (—0.0054  and 
—0.0057).  This  direction  is  consistent  with  firm  managers’  incentives  to  guide  analysts’  forecasts 
downward  (Matsumoto  2002),  so  that  firm  managers  could  meet  or  beat  consensus  analyst  fore¬ 
casts.  The  average  magnitude  of  earnings  forecast  revisions  (0.0076)  is  smaller  for  supplemented 
earnings  forecast  revision  than  for  stand-alone  earnings  forecast  revisions  (0.0086).  However, 
supplemented  earnings  forecast  revisions  result  in  a  stronger  magnitude  of  market  reactions  (6.68 
percent)  than  do  stand-alone  revisions  (5.21  percent).  Upward  earnings  forecast  revisions  are  more 
common  for  supplemented  revisions  (43.6  percent)  than  for  stand-alone  revisions  (40.6  percent). 
When  an  earnings  forecasts  revision  is  supplemented  with  a  sales  forecast  revision,  the  market  is 
more  likely  to  react  in  the  same  direction  as  the  earnings  forecast  revision.  For  example,  58.5 
percent  of  supplemented  earnings  forecast  revisions  are  associated  with  abnormal  returns  in  the 
same  direction  as  the  earnings  forecast  revisions,  as  compared  to  55.7  percent  for  stand-alone 
earnings  forecast  revisions. 

Taken  as  a  whole,  the  univariate  results  show  that  the  market  reacts  more  strongly  and  in  a 
more  consistent  manner  to  the  direction  of  earnings  forecast  revisions  when  they  are  supplemented 
with  sales  forecast  revisions.  However,  these  results  do  not  distinguish  whether  supplementary 
sales  forecasts  convey  information  independent  of  earnings  forecast  revisions  or  add  credibility  to 
earnings  forecasts.  I  examine  this  issue  using  multivariate  regression  analyses  in  the  next  section. 


IV.  MARKET  REACTIONS  TO  SUPPLEMENTARY  SALES  FORECASTS 
Research  Design 

In  this  section  I  test  whether  supplementary  sales  forecast  revisions  add  credibility  to  earnings 
forecast  revisions  (HI)  by  examining  whether  investors  react  more  strongly  to  them.  I  estimate  the 
following  regression  (subscripts  omitted): 

BHAR3  =  ySo  +  I3iE_REV  -l-  /32E_REV  *  SUP  PL  +  jS^S^REV  +s  ( 1 ) 

where: 


I  start  with  the  year  1998  because  it  is  the  first  year  with  a  significant  number  of  annual  sales  forecasts.  Pooled  ordinary 
least-squares  (OLS)  regression  is  applied  to  the  non-overlapping  sample  in  the  short-window  price  reactions  analysis 
because  by  construction  clustering  of  the  observations  is  mitigated. 

This  design  choice  is  made  to  be  conservative  such  that  the  statistical  significance  would  not  be  overestimated.  My 
results  are  robust  to  using  the  original  overlapping  sample  or  using  the  last  forecast  revisions  issued  by  the  financial 
analysts  in  the  firm-years. 
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This  table  presents  descriptive  statistics  on  supplemented  and  stand-alone  earnings  forecast  revisions.  Revision  in  earnings  (sales)  forecast  is  the  difference  between  the  revised 
earnings  (sales)  forecast  and  the  analyst’s  own  prior  earnings  (sales)  forecast  deflated  by  stock  price  two  days  before  revision.  Size-adjusted  return  is  the  compounded  buy-and-hold 


return  of  the  firm  minus  the  compounded  return  for  an  equal-weighted  portfolio  of  firms  in  the  same  NYSE/AMEX/NASDAQ  size  decile.  I  define  a  sales  forecast  revision  as 
supplementary  if  a  financial  analyst  revises  both  his/her  earnings  and  sales  forecasts  on  the  same  day  as  reported  in  the  I/B/E/S  detail  file.  Corresponding  t-statistics  for  parametric 
two-sample  t-tests  and  z-scores  for  nonparametric  Wilcoxon  rank-sum  tests  are  reported. 


Credibility  of  Financial  Analysts  ’Earnings  Forecasts 


2055 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


2056 


Keung 


BHAR3  =  three-day  size-adjusted  buy-and-hold  return  (—1  to  -fl)  surrounding  an  annual 
earnings  forecast  revision.  Size-adjusted  buy-and-hold  return  is  the  raw 
buy-and-hold  return  of  the  firm  minus  the  mean  buy-and-hold  return  of  an 
equal- weighted  portfolio  of  firms  in  the  same  NYSE/ AMEX/NASDAQ  size  decile 
over  the  holding  period; 

E_REV  =  revision  in  annual  earnings  forecast  relative  to  the  analyst’s  own  prior  forecast 
deflated  by  stock  price  two  days  before  revision; 

SUP  PL  =  1  if  the  revision  in  earnings  forecast  is  supplemented  with  revision  in  sales 
forecast,  and  0  otherwise;  and 

S_REV  =  revision  in  annual  sales  forecast  relative  to  the  analyst’s  own  prior  forecast 

deflated  by  stock  price  two  days  before  revision  if  it  is  revealed,  and  0  otherwise. 

In  Equation  (1),  I  examine  whether  the  supplementary  information  provided  by  financial  analysts 
conveys  information  independent  of  earnings  forecast  revisions,  or  whether  such  information 
conveys  information  on  the  credibility  of  the  earnings  forecasts.  I  include  revision  in  sales  forecast 
{S_REV)  as  a  separate  independent  variable  to  control  for  the  information  in  supplementary  sales 
forecast  revisions  independent  of  earnings  forecast  revisions.  I  use  the  interaction  between  earn¬ 
ings  forecast  revision  {E_REV)  and  the  indicator  variable  for  the  presence  of  supplementary  sales 
forecast  (SUPPL)  to  capture  the  incremental  effect  of  supplementary  sales  forecast  on  the  forecast 
response  coefficient. 

My  methodology  extends  that  used  by  Baginski  et  al.  (2004).  They  regress  three-day  CAR  on 
the  interaction  between  unexpected  earnings  and  an  indicator  variable  for  the  presence  of  sales 
forecasts.  However,  they  do  not  control  for  the  incremental  information  content  in  the  sales 
forecasts.  Including  both  revisions  in  sales  forecasts  and  indicators  for  the  presence  of  sales 
forecasts  allow  me  to  disentangle  whether  the  market  reactions  are  driven  by  (1)  specific  infor¬ 
mation  in  sales  forecasts  and/or  (2)  the  act  of  disclosing  sales  forecasts.  I  expect  that  the  magni¬ 
tude  of  sales  forecast  revision  is  positively  associated  with  market  reactions  (yS3  >  0)  and  that 
supplementary  sales  forecasts  add  to  the  credibility  of  earnings  forecasts  {^2  ^  0).  There  should 
be  stronger  market  reactions  for  the  same  amount  of  earnings  forecast  revisions  when  the  revised 
sales  forecasts  are  revealed. 

Abarbanell  et  al.  (1995)  argue  that  inventors’  response  to  earnings  forecast  revisions  increases 
in  the  expected  accuracy  of  the  forecast.  Because  ex  post  accuracy  of  an  earnings  forecasts  is 
unknown  to  the  market  when  a  financial  analyst  issues  a  forecast,  investors  use  information 
available  at  the  time  of  the  forecast  to  estimate  the  ex  ante  perceived  accuracy  of  the  earnings 
forecast  and  react  accordingly  (Clement  and  Tse  2003).  As  a  robustness  check,  I  control  for 


Sales  forecasts  could  provide  information  about  the  persistence  of  future  earnings  because  aggregate  earnings  mask  the 
information  on  the  persistence  of  different  revenue  and  expense  items  by  weighting  them  equally  in  the  calculation  of 
net  income  (Lambert  2004).  Because  I  also  find  a  positive  coefficient  on  sales  forecast  revisions  (ySj),  it  is  likely  that  the 
persistence  effect  has  been  controlled  for.  I  also  partition  my  sample  into  five  quintiles  according  to  earnings  growth, 
earnings  change,  and  earnings  persistence  as  measured  in  Beaver  et  al.  (1980).  My  main  results  hold  for  firms  from 
different  quintiles  of  these  firm  characteristics  (results  not  tabulated).  Therefore,  even  if  revisions  in  sales  forecasts  do 
not  fully  capture  information  about  persistence,  my  results  are  not  driven  by  the  possibility  that  supplemented  earnings 
forecast  revisions  are  perceived  as  more  persistent  than  stand-alone  earnings  forecast  revisions. 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


Credibility  of  Financial  Analysts  ’Earnings  Forecasts 


2057 


individual  analyst  and  revision  characteristics  that  are  associated  with  perceived  accuracy,  includ¬ 
ing  forecast  frequency  {FREQ),  forecast  horizon  {HORIZON),  days  elapsed  since  the  last  forecast 
{DAYS),  prior  earnings  forecast  error  {LAG_FE),  brokerage  size  {BROKER),  firm-specific  and 
general  analyst  experience  {FIRM_EXP  and  GEN__EXP,  respectively),  the  numbers  of  companies 
and  industries  followed  by  the  analyst  {COMPANY  and  INDUSTRY,  respectively),  and  the  level  of 
earnings  forecast  revision  innovation  {E_INNO)  (Clement  1999;  Jacob  et  al.  1999;  Brown  2001; 
Clement  and  Tse  2003;  Gleason  and  Lee  2003)/^  Following  Clement  and  Tse  (2005)  and  Her¬ 
rmann  and  Thomas  (2005),  I  measure  individual  analyst  characteristics  of  a  financial  analyst 
relative  to  the  characteristics  of  other  analysts  following  the  same  firm  in  the  same  year.  I  scale  the 
individual  analyst  characteristic  to  lie  between  0  and  1  by  computing  the  difference  between  the 
observed  value  of  the  characteristic  and  the  minimum  value  of  the  characteristic  for  the  analysts 
following  firm  j  in  year  t,  divided  by  the  range  of  the  characteristic  for  the  analysts  following  firm 
j  in  year  t.  The  scaled  individual  analyst  characteristic  for  a  revision  issued  by  analyst  i  for  firm  j 
in  year  t  is  given  by; 


CHAR:.- mmf  CHAR) 

SCALED  CHAR:.= - ^ ^ 

maxp{CHAR)  -  min  ,( C/M/?) 


(2) 


where  maxjfCHAR)  and  min^XC/M/?)  are  the  maximum  and  the  minimum  values,  respectively,  of 
the  individual  analyst  characteristic  for  the  analysts  following  firm  j  in  year  t.  This  scaling  pre¬ 
serves  the  relative  distances  among  the  individual  analyst  characteristics  for  firm  j  in  year  t  and 
facilitates  comparisons  of  regression  coefficients  across  firms  and  over  time.^° 

I  run  the  following  regression  (subscripts  omitted): 

BHAR3  =  ySo  +  PiE_REV+  p2S_REV+  f3^E_REV  *  DAYS  +  f3^E_REV *  HORIZON 
+  I3^E_REV  *  FREQ  +  f(,E_REV  *  LAG_FE  +  fi-jE_REV  *  NEW 
+  fisE-REV  *  BROKER  +  I3^E_REV  *  FIRM_EXP  +  f^oEJlEV  *  GEN_EXP 
+  /3nC_/?£F  *  COMPANY  -i-  fiy2E_REV  *  INDUSTRY  +  Pi2,E_REV  *  EJNNO 
+  fiuE-REV  *  SUPPL  +  8  ,  (3) 


Another  reason  for  including  individual  analyst  characteristics  is  that  supplementary  sales  forecasts  are  voluntary.  If 
supplemented  and  stand-alone  earnings  forecast  revisions  are  substantially  different  from  one  another,  this  would  result 
in  a  self-selection  bias.  To  access  the  severity  of  the  self-selection  bias,  I  used  the  Heckman  (1979)  approach  to  compute 
the  inverse  Mills  ratio  from  the  first-stage  Probit  model,  with  the  individual  analyst  characteristics  as  first-stage  inde¬ 
pendent  variables.  In  the  second-stage  accuracy  regression  model  (Equation  (4)),  the  inverse  Mills  ratio  is  added  as  an 
additional  control  variable.  The  inverse  Mills  ratio  is  insignificant  (coefficient  estimate  =  0.0276;  p  =  0.165).  The 
estimate  for  regression  coefficient  on  SUPPL  is  still  negative  and  significant  (coefficient  estimate  =  -0.780;  p  = 
0.031).  Correcting  for  self-selection  bias  does  not  appear  to  be  necessary.  Moreover,  the  possibility  of  presence  of 
self-selection  in  and  of  itself  does  not  necessarily  mean  that  OLS  estimates  are  more  biased  than  two-stage  estimates 
(Wooldridge  2002).  If  the  instruments  are  weak  and/or  partially  endogenous,  two-stage  instrumental  variable  estimation 
can  produce  estimates  that  are  more  biased  than  OLS  estimation.  Two-stage  estimates  also  have  poor  finite  sample 
properties.  Therefore,  instead  of  including  the  inverse  Mills  ratios  (Heckman  1979),  I  explicitly  control  for  potential 
determinants  of  supplementary  sales  forecasts  in  the  regressions  to  mitigate  the  endogeneity  problem.  Because  not  all 
determinants  of  disclosure  of  sales  forecasts  are  observable  to  the  econometrician,  I  could  not  totally  rule  out  the 
possibility  of  self-selection.  I  caution  the  reader  to  keep  this  in  mind  in  interpreting  my  results. 

I  delete  observations  from  firm-years  with  only  one  financial  analyst  issuing  earnings  forecasts  because  relative  accuracy 
is  not  measurable. 
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where: 

BHAR3ijt 

E_REVij, 

S_REViji 

DAYS^j, 

HORIZONijt 

EREQ.j, 

LAG_EE,j, 

NEW^j, 

BROKERij, 

ElRM_EXPij, 

GEN_EXPij, 

COMPANYij, 


=  three-day  size-adjusted  buy-and-hold  return  (—1  to  -1-1)  surrounding  an  annual 
earnings  forecast  revision.  Size-adjusted  buy-and-hold  return  is  the  raw 
buy-and-hold  return  of  the  firm  minus  the  mean  buy-and-hold  return  of  an 
equal-weighted  portfolio  of  firms  in  the  same  NYSE/ AMEX/NASDAQ  size 
decile  over  the  holding  period; 

=  revision  in  annual  earnings  forecast  relative  to  the  analyst’s  own  prior  forecast 
deflated  by  stock  price  two  days  before  revision; 

=  revision  in  annual  sales  forecast  relative  to  the  analyst’s  own  prior  forecast 
deflated  by  stock  price  two  days  before  revision  if  it  is  revealed,  and  0 
otherwise; 

=  number  of  days  elapsed  since  the  last  forecast  by  any  analyst  following  firm  j 
in  year  t,  calculated  as  the  number  of  days  between  analyst  /’s  forecast  of 
firm  fs  earnings  in  year  t  and  the  most  recent  preceding  forecast  of  firm  /s 
earnings  by  any  analyst,  minus  the  minimum  number  of  days  between  two 
adjacent  forecasts  of  firm  /s  earnings  by  any  two  analysts  in  year  t,  scaled 
by  the  range  of  days  between  two  adjacent  forecasts  of  firm  /s  earnings  in 
year  /; 

=  forecast  horizon,  calculated  as  the  number  of  days  from  the  forecast  date  to 
the  fiscal  year-end,  for  analyst  i  following  firm  j  in  year  t  minus  the  minimum 
forecast  horizon  for  analysts  who  follow  firm  j  in  year  t,  scaled  by  the  range  of 
forecast  horizons  for  analysts  following  firm  j  in  year  t; 

=  forecast  frequency,  calculated  as  the  number  of  forecasts  made  by  analyst  i  for 
firm  j  in  year  t  minus  the  minimum  number  forecasts  made  for  firm  j  in  year  t 
out  of  all  analysts  following  firm  j,  scaled  by  the  range  of  the  number  of 
forecasts  made  for  firm  j  issued  by  analysts  following  firm  j  in  year  t; 

=  absolute  forecast  error  of  last  earnings  forecast  of  analyst  i  following  firm  j  in 
year  t-1  minus  the  minimum  absolute  forecast  error  for  analysts  who  follow 
firm  j  in  year  t-l,  scaled  by  the  range  of  absolute  forecast  error  for  analysts 
following  firm  j  in  year  t- 1 ; 

=  1  if  the  analyst  i  first  appears  in  the  I/B/E/S  detail  file  in  year  t,  and  0 
otherwise; 

=  the  number  of  analysts  employed  by  the  broker  employing  analyst  i  following 
firm  j  in  year  t  minus  the  minimum  number  of  analysts  employed  by  brokers 
for  analysts  following  firm  j  in  year  t,  scaled  by  the  range  of  brokerage 
size  for  analysts  following  firm  j  in  year  t; 

=  firm-specific  experience,  which  is  the  number  of  years  analyst  i  has  been 
following  firm  /  in  year  t  minus  the  minimum  number  of  years  of  firm-specific 
experience  for  analysts  following  firm  j  in  year  t,  scaled  by  the  range  of 
years  of  firm-specific  experience  for  analysts  following  firm  j  in  year  t; 

=  general  experience,  which  is  the  number  of  years  analyst  i  has  been  identified 
in  the  I/B/E/S  detail  file  in  year  t  minus  the  minimum  number  of  years  of 
general  experience  for  analysts  following  firm  j  in  year  t,  scaled  by  the  range 
of  years  of  general  experience  for  analysts  following  firm  j  in  year  t; 
the  number  of  companies  analyst  i  follows  in  year  t,  calculated  as  the  number 
of  companies  followed  by  analyst  i  following  firm  j  in  year  t  minus  the 
minimum  number  of  companies  followed  by  analysts  who  follow  firm  j  in 
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year  t,  scaled  by  the  range  of  the  number  of  companies  followed  by  analysts 
following  firm  j  in  year  t; 

INDUSTRYijf  =  the  number  of  industries  analyst  i  follows  in  year  t,  calculated  as  the  number 
of  two-digit  SICs  followed  by  analyst  i  following  firm  j  in  year  t  minus  the 
minimum  number  of  two-digit  SICs  followed  by  analysts  who  follow  firm  j  in 
year  t,  scaled  by  the  range  of  the  number  of  two-digit  SICs  followed  by 
analysts  following  firm  j  in  year  t; 

E_INNOiji  =  1  if  the  revised  earnings  forecast  is  either  higher  or  lower  than  both  analyst’s 
own  prior  forecast  and  the  prior  day’s  consensus  forecast  (high  innovation), 
and  0  otherwise;  and 

SUPPLjjt  =  1  if  the  earnings  forecast  revision  is  supplemented  by  a  sales  forecast  revision 
issued  by  the  same  analyst  on  the  same  day,  and  0  otherwise. 

In  Equation  (3),  the  coefficient  on  the  interaction  term  (^14)  captures  the  incremental  effect  on 
the  market  reactions  to  supplemented  earnings  forecasts.  The  variable  SUPPL  equals  1  if  the 
revision  in  earnings  forecast  is  supplemented  with  sales  forecast  revision,  and  0  otherwise.  I 
expect  that  supplementary  sales  forecasts  increase  the  credibility  of  earnings  forecast  revisions  and 
lead  to  stronger  market  reactions  (ySi4  >  0),  controlling  for  the  information  content  of  sales  fore¬ 
casts  (/32  >  0).  Individual  analyst  characteristics  that  are  associated  with  increased  forecast  accu¬ 
racy  (e.g.,  prior  accuracy,  brokerage  size,  forecast  frequency,  and  firm-specific  and  general  expe¬ 
rience)  should  be  positively  associated  with  market  reactions  to  forecast  revisions,  while 
characteristics  associated  with  decreased  forecast  accuracy  (e.g.,  the  numbers  of  companies  and 
industries  followed  by  the  analyst)  should  be  negatively  associated  with  market  reactions  to 
forecast  revisions. 

Association  between  Contemporaneous  Market  Reaction  and  Supplementary  Sales  Forecast 
Revision 

Table  2,  Panel  A  reports  the  results  of  the  regression  analyses  to  examine  the  impacts  of 
supplementary  sales  forecast  revisions  on  stock  prices.  In  Model  I,  the  coefficient  on  earnings 
forecast  revisions  (ySi)  is  positive  (0.4042)  and  significant  (p  <  0.001).  This  confirms  the  positive 
relationship  between  contemporaneous  stock  returns  and  earnings  forecast  revisions  (Lys  and 
Sohn  1990;  Shekel  1991;  Park  and  Stice  2000).  The  coefficient  on  sales  forecast  revision  {fif}  is 
also  positive  and  significant  (0.0619;  p  <  0.001),  controlling  for  revision  in  earnings  forecast. 
Sales  forecast  revisions  are  incrementally  informative  to  the  market,  which  is  consistent  with  the 
findings  from  prior  research  that  sales  are  informative  about  stock  returns  (Ertimur  et  al.  2003; 
Ghosh  et  al.  2005). 

In  Model  II,  the  coefficient  on  the  interaction  term  between  earnings  forecast  revision  and  the 
indicator  variable  for  the  presence  of  supplementary  sales  forecasts  {pf)  is  positive  (0.2188)  and 
significant  (p  <  0.001).  The  market  reacts  more  strongly  to  supplemented  earnings  forecast  revi¬ 
sions  (0.4377)  than  to  stand-alone  earnings  forecast  revisions  (0.2189),  controlling  for  the  inde¬ 
pendent  information  in  sales  forecast  revisions.  The  reaction  to  earnings  forecast  revisions  is  twice 
as  strong  when  accompanied  by  revenue  forecast  revisions.  In  Model  III,  I  include  both  revision  in 
sales  forecasts  and  the  interaction  term  in  the  regression.  I  find  the  coefficient  on  the  interaction  is 
still  positive  and  significant  (0.2158;  p  <  0.001),  while  the  coefficient  on  sales  forecast  revision  is 
marginally  significant  (p  =  0.069).  My  findings  support  the  conclusion  that  the  market  reaction  is 
driven  by  both  the  specific  information  in  sales  forecast  revisions  and  the  act  of  disclosing  sales 
forecasts.  Supplementary  sales  forecast  revisions  appear  to  bolster  the  credibility  of  earnings 
forecasts. 
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I  further  investigate  whether  the  effect  of  a  supplementary  sales  forecast  revision  is  condi¬ 
tional  on  the  direction  of  the  earnings  forecast  revision.  Each  earnings  forecast  revision  is  classi¬ 
fied  as  an  upward  or  a  downward  revision  based  on  whether  the  revised  forecast  is  above  or  below 
the  prior  forecast.  Results  in  Model  IV  show  that  the  market  reacts  more  strongly  to  upward 
earnings  forecast  revisions  (0.5322)  than  to  downward  revisions  (0.1771).  However,  the  presence 
of  supplementary  sales  forecasts  increases  the  market  response  to  downward  earnings  forecast 
revisions  (0.2235)  to  a  greater  extent  than  it  increases  the  response  to  upward  earnings  forecast 
revisions  (0.1620).  There  is  no  market  reaction  to  the  independent  information  in  downward  sales 
forecast  revisions  after  controlling  for  earnings  forecast  revision. 

Table  2,  Panel  B  reports  additional  results  on  the  impact  of  supplementary  sales  forecast 
revisions  on  stock  prices.  I  control  for  other  individual  analyst  characteristics,  which  could  be 
potential  determinants  of  supplementary  sales  forecasts.  In  Model  V,  I  control  for  prior  forecast 
accuracy  {LAG_FE).  I  measure  prior  earnings  forecast  accuracy  using  the  absolute  forecast  error 
of  the  last  earnings  forecasts  provided  by  the  analyst.^^  The  coefficients  on  earnings  and  sales 
forecast  revisions  (ySj  and  /32,  respectively)  remain  positive  and  significant.  The  coefficient  on  the 
interaction  between  earnings  forecast  revision  and  the  presence  of  supplementary  sales  forecasts 
(^14)  is  positive  (0.3254)  and  significant  (p  <  0.001),  controlling  for  other  individual  analyst  and 
forecast  revision  characteristics.  The  coefficients  on  the  control  variables  are  consistent  with  the 
finding  in  Clement  and  Tse  (2003)  that  market  reactions  increase  in  ex  ante  perceived  accuracy. 
For  example,  days  since  last  forecasts  {DAYS),  forecast  frequency  {FREQ),  brokerage  size  {BRO¬ 
KER),  and  number  of  industries  followed  {INDUSTRY)  are  positively  associated  with  market 
reactions  to  earnings  forecast  revisions,  while  forecast  horizon  {HORIZON)  and  number  of  com¬ 
panies  followed  {COMPANY)  are  negatively  associated  with  market  reactions  to  earnings  forecast 
revisions. 

In  Model  VI,  I  retain  observations  from  financial  analysts  that  first  appear  in  the  I/B/E/S 
database  in  the  year  by  dropping  prior  forecast  accuracy  {LAG_FE).  Instead,  I  include  the  indi¬ 
cator  variable  NEW,  where  NEW  equals  1  if  the  financial  analyst  first  appears  in  the  EB/E/S 
database  in  that  year,  and  0  otherwise.  The  results  are  similar  to  those  from  Model  V.  In  summary, 
the  results  in  Table  2  suggest  that  the  market  reacts  more  strongly  to  supplemented  earnings 
forecasts.  Supplementary  sales  forecasts  add  to  the  credibility  of  earnings  forecast  revisions, 
controlling  for  independent  information  in  sales  forecasts  and  other  individual  analyst  and  revision 
characteristics.  In  Section  V  below  I  explore  the  causes  of  the  stronger  market  reactions  by 
investigating  whether  supplemented  earnings  forecasts  are  more  accurate  ex  post. 

V.  ASSOCIATION  BETWEEN  PROPERTIES  OF  EARNINGS  FORECAST 
REVISIONS  AND  SUPPLEMENTARY  SALES  FORECASTS 
Research  Design 

Following  prior  literature,  I  model  forecast  accuracy  as  a  function  of  forecast  frequency 
{FREQ),  forecast  horizon  {HORIZON),  days  elapsed  since  the  last  forecast  {DAYS),  prior  earnings 
forecast  accuracy  {LAG_FE),  brokerage  size  {BROKER),  analyst  experience  {FIRM_EXP  and 
GEN_EXP),  and  the  numbers  of  companies  and  industries  followed  by  the  analyst  {COMPANY 
and  INDUSTRY)  (Clement  1999;  Jacob  et  al.  1999;  Brown  2001;  Clement  and  Tse  2003).  I  also 
include  the  magnitudes  of  earnings  and  sales  forecast  revisions  {EJREV]  and  \S_REV\)  and  the 


Absolute  forecast  error  of  last  earnings  forecast  provided  by  the  analyst  might  be  a  noisy  measure  for  forecast  accuracy 
of  financial  analysts  because  last  forecasts  in  fiscal  years  are  issued  at  different  points  in  time  by  different  financial 

analysts. 
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level  of  earnings  forecast  revision  innovation  {E_INNO)  to  control  for  the  information  content  of 
the  earnings  and  the  sales  forecasts.  I  run  the  following  regression: 

\FEij,\  =  fio  +  PxDAYSijt  +  PjHORIZONij,  +  IS^EREQ^j,  +  l3^LAG_FEij,  +  /SsNEW^j, 

+  /3(,BROKERiji  +  p^FlRM_EXPij,  +  l3^GEN_EXPij,  +  /SgCOMPANVij^ 

+  /3ioINDUSTRYij,  +  fi^EJNNOij,  +  /S^KREV^j,]  +  /3,,\S_REVij,\  +  /3,,SUPPL,j, 

+  ^ijt  (4) 

where; 

I  FEij,\  =  absolute  value  of  forecast  error  for  analyst  i  who  follows  firm  j  in  year  t  minus 
the  minimum  absolute  forecast  error  for  the  analysts  following  firm  j  in  year  t, 
scaled  by  the  range  of  absolute  forecast  error  for  the  analysts  following  firm  j  in 
year  t. 

Definitions  of  other  independent  variables  are  the  same  as  in  previous  analyses.  The  test  variable, 
SUPPL,  is  an  indicator  variable  for  the  presence  of  a  supplementary  sales  forecast.  SUPPL  equals 
1  if  the  revision  in  earnings  forecast  is  supplemented  with  a  sales  forecast  revision,  and  0  other¬ 
wise.  I  expect  that  forecast  accuracy  increases  with  prior  accuracy  {LAG_EE),  brokerage  size 
(BROKER),  forecast  frequency  (FREQ),  analyst  experience  (F1RM_EXP  and  GEN_EXP),  and  the 
level  of  revision  innovation  (EJNNO),  but  decreases  with  days  elapsed  since  the  last  forecast 
(DAYS),  forecast  horizon  (HORIZON),  and  the  numbers  of  companies  and  industries  followed  by 
the  analyst  (COMPANY  and  INDUSTRY,  respectively)^^  If  earnings  forecasts  supplemented  with 
sales  forecast  revisions  are  more  accurate,  then  the  coefficient  on  SUPPL  (P\^  should  be  negative 
(H2).  A  negative  coefficient  means  that  supplementary  sales  forecasts  are  associated  with  earnings 
forecasts  with  lower  absolute  forecast  errors.  If  supplemented  earnings  forecasts  are  issued  by  the 
less  informed  financial  analysts,  then  the  presence  of  supplementary  sales  forecasts  should  not  be 
associated  with  earnings  forecast  accuracy. 

Univariate  Analysis  of  Supplemented  and  Stand-Alone  Earnings  Forecast  Revisions 

I  compare  the  means  of  absolute  forecast  error  and  other  individual  analyst  characteristics  for 
supplemented  and  stand-alone  earnings  forecast  revisions  in  Table  3,  Panel  A.  Most  of  the  indi¬ 
vidual  analyst  characteristics  differ  significantly  across  supplemented  and  stand-alone  earnings 
forecast  revisions  under  both  two-sample  t-tests  and  nonparametric  Wilcoxon  rank-sum  tests.^^ 
The  means  for  days  elapsed  since  the  last  forecast  (DAYS),  forecast  frequency  (FREQ),  number  of 
firms  followed  (COMPANY),  and  number  of  industries  followed  (INDUSTRY)  are  higher  for 
supplemented  earnings  forecast  revisions,  whereas  the  means  for  absolute  forecast  error  (IFEI), 
forecast  horizons  (HORIZON),  brokerage  size  (BROKER),  firm-specific  experience  (FRIM_EXP), 
and  general  experience  (GEN_EXP)  are  lower  for  supplemented  earnings  forecast  revisions.  Al- 


I  include  both  firm-specific  and  general  experienee  because  I  argue  that,  a  priori,  they  are  proxies  for  different  dimen¬ 
sions  of  analyst  attributes.  General  experience  could  proxy  for  an  analyst’s  general  industry  knowledge,  while  firm- 
specific  experience  could  proxy  for  an  analyst’s  familiarity  with  a  specific  firm.  For  the  same  reason  I  include  both 
numbers  of  companies  and  industries  followed  by  the  analyst.  Empirically,  firm-specific  experience  and  general  expe¬ 
rience  are  highly  correlated.  Nevertheless,  my  results  are  robust  to  including  both  of  them  or  dropping  either  one  of 
them  from  the  regressions. 

The  parametric  two-sample  t-test  tests  the  null  hypothesis  that  the  mean  difference  of  two  populations  is  zero,  assuming 
that  the  observations  are  drawn  from  normal  distributions  with  equal  variances.  The  nonparametric  Wilcoxon  rank  sum 
test  compares  the  locations  of  two  populations  to  determine  if  one  population  is  shifted  with  respect  to  the  other. 
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though  financial  analysts  who  provide  supplementary  sales  forecasts  are  more  likely  to  be  less 
experienced  and  employed  by  smaller  brokerage  houses,  supplemented  earnings  forecast  revisions 
are  more  accurate  than  stand-alone  earnings  forecast  revisions. 

Table  3,  Panel  B  reports  Pearson  correlations  among  individual  analyst  characteristics.  The 
presence  of  a  supplementary  sales  forecast  revision  {SUP PL)  is  positively  correlated  with  forecast 
accuracy  (negative  correlation  with  absolute  forecast  error,  \FE\).  Forecast  accuracy  is  negatively 
correlated  with  days  elapsed  since  the  last  forecast  (DAYS),  forecast  horizon  {HORIZON),  and  the 
numbers  of  companies  and  industries  followed  by  the  financial  analyst  {COMPANY  and  INDUS¬ 
TRY).  Prior  accuracy  is  not  significantly  correlated  with  the  accuracy  of  earnings  forecast  in  the 
current  year.  One  possible  reason  is  that  prior  research  measures  forecast  accuracy  using  the  last 
earnings  forecast  issued  by  a  financial  analyst  in  the  previous  year.  In  contrast,  I  randomly  pick  an 
earnings  forecast  issued  anytime  during  the  current  year.  If  a  more  able  financial  analyst  issues 
earnings  forecasts  earlier  in  the  year  at  the  cost  of  lower  accuracy,  then  the  accuracy  of  his/her 
earlier  forecasts  in  the  current  year  might  not  be  related  to  the  accuracy  of  his/her  last  forecast  in 
the  prior  year. 

Association  between  Earnings  Forecast  Accuracy  and  Supplementary  Sales  Forecasts 

Table  4  reports  regression  results  for  the  association  between  the  magnitude  of  earnings 
forecast  error  and  the  presence  of  a  supplementary  sales  forecast.  In  Model  I,  the  coefficient  on  the 
supplementary  sales  forecast  indicator  variable  {SUPPL)  is  negative.  This  supports  the  prediction 
(H2)  that  earnings  forecasts  supplemented  with  sales  forecast  revisions  are  more  accurate,  con¬ 
trolling  for  other  individual  analyst  characteristics.  Although  the  differences  in  forecast  accuracy 
seem  to  be  economically  small,  as  shown  in  Table  2,  market  reactions  are  twice  as  strong  for 
supplemented  earnings  forecast  revisions.  An  apparently  small  increment  in  ex  post  forecast  ac¬ 
curacy  could  trigger  a  much  stronger  ex  ante  market  reaction.  Note  that  the  coefficient  on  SUPPL 
picks  up  its  partial  effect  on  forecast  accuracy,  helping  to  explain  why  the  magnitude  of  the 
coefficient  seems  to  be  low.  Consistent  with  prior  literature,  I  also  find  that  forecast  accuracy 
increases  with  general  experience  and  decreases  with  days  lapsed  since  the  last  forecast,  forecast 
horizon,  and  forecast  frequency.  High-innovation  earnings  forecasts  are,  on  average,  more  accu¬ 
rate  than  low-innovation  earnings  forecasts. 

In  Model  II,  the  indicator  variable  NEW  equals  1  if  it  is  the  first  year  that  the  financial  analyst 
appears  in  the  I/B/E/S  detail  file,  and  0  otherwise.  The  results  suggest  that  earnings  forecasts  with 
supplementary  sales  forecast  revisions  are  more  accurate  even  if  I  include  earnings  forecasts 
provided  by  new  financial  analysts  in  the  sample.  In  Models  III  and  IV,  I  include  \S_REV\  as  an 
additional  independent  variable.  The  coefficient  on  \S_REV\  is  insignificant.  This  means  that  earn¬ 
ings  forecast  accuracy  is  not  driven  mechanically  by  the  magnitude  of  sales  forecast  revision. 

As  a  robustness  check,  I  repeat  my  analysis  using  the  last  earnings  forecast  by  an  analyst  in 
each  firm-year  (Sinha  et  al.  1997;  Clement  and  Tse  2005;  Herrmann  and  Thomas  2005).  The 
results  are  qualitatively  the  same  and  are  not  tabulated.  Furthermore,  I  estimate  Equation  (4)  using 
the  change  in  absolute  forecast  error  as  the  dependent  variable  (Clement  and  Tse  2005).  Consistent 
with  using  the  level  of  absolute  forecast  error  as  the  dependent  variable  in  Table  4,  untabulated 
results  show  that  the  regression  coefficient  on  the  indicator  variable  for  the  presence  of  supple¬ 
mentary  sales  forecasts  {SUPPL)  is  negative  and  statistically  significant,  suggesting  that  supple¬ 
mented  earnings  forecast  revisions  lead  to  a  greater  decrease  in  the  magnitude  of  forecast  errors. 
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VI.  ADDITIONAL  ANALYSES 
Initiation  and  Termination  of  Sales  Forecasts 

In  this  section  I  investigate  whether  earnings  forecasts  are  more  or  less  accurate  when  finan¬ 
cial  analysts  start  or  stop  providing  sales  forecasts.^^  I  define  two  additional  indicator  variables 
START  and  STOP  and  estimate  the  following  regression: 

\FEij,\  =  /3o  +  PiDAYSijt  +  P2HORIZONij,  +  /S^FREQij,  +  P^G^FEij,  + 

+  (3(,BROKER,j,  +  f32FIRM_EXPij,  +  fS^GEN __EXP  +  fSgCOMPANY^j, 
(SiqINDUSTRY^j,  +  yS]  ^EJNNOijt  +  /3n\E_REVij,\  +  jS^^SUPPLij^  +  /SuSTARTij, 

+  f3,sSTOP,j,  +  e,j,  (5) 

where: 

\EEijt\  =  absolute  value  of  forecast  error  for  analyst  i  who  follows  firm;  in  year  t  minus 
the  minimum  absolute  forecast  error  for  the  analysts  following  firm  j  in  year  t, 
scaled  by  the  range  of  absolute  forecast  error  for  the  analysts  following  firm  j  in 
year  t; 

STARTUP  =  1  if  analyst  i  issues  at  least  one  sales  forecast  for  firm  j  in  year  t  but  does  not  do 
so  in  year  t-1,  and  0  otherwise;  and 

STOP  ip  =  1  if  analyst  i  issues  at  least  one  sales  forecast  for  firm  j  in  year  t-\  but  does  not 
do  so  in  year  t,  and  0  otherwise. 

As  in  Equation  (4),  the  dependent  variable  is  the  absolute  value  of  forecast  error.  The  test  variables 
are  START  and  STOP.  Definitions  of  other  independent  variables  are  the  same  as  in  previous 
analyses.  A  negative  (positive)  coefficient  on  START  or  STOP  means  that  earnings  forecasts  are 
more  (less)  accurate  when  financial  analysts  initiate  or  terminate  supplementary  sales  forecasts, 
respectively. 

I  identify  24,972  cases  of  initiation  of  sales  forecasts  (START)  and  3,614  cases  of  termination 
(STOP)  at  the  firm-year-analyst  level.  Table  5  reports  the  results.  Consistent  with  results  in  Table 
4,  the  coefficients  on  SUPPL  are  negative  in  all  three  models.  In  other  words,  the  presence  of 
supplementary  sales  forecasts  improves  forecast  accuracy,  controlling  for  START  and  STOP.  After 
including  other  individual  analyst  characteristics  in  Models  II  and  III,  the  sums  of  the  coefficients 
on  SUPPL  and  START  are  negative  (-0.0221  and  -0.0220  for  Models  II  and  III,  respectively). 
The  presence  of  supplementary  sales  forecasts  plays  a  signaling  role  even  for  the  initial  issuance 
of  sales  forecasts.  This  also  suggests  that  there  is  further  improvement  in  earnings  forecast  accu¬ 
racy  beyond  the  year  of  initiation  of  sales  forecasts. 

The  coefficients  on  STOP  are  positive  but  insignificant  in  all  three  models.  One  possible 
reason  is  that  there  are  relatively  few  cases  (less  than  3  percent  of  total  observations)  of  termina¬ 
tion  of  sales  forecasts.  On  the  whole,  these  results  support  the  argument  there  is  an  improvement 
in  forecast  accuracy  when  financial  analysts  start  issuing  sales  forecasts. 

Earnings  Persistence  and  Firm  Characteristics 

To  further  assess  the  effects  of  earnings  persistence  on  supplementary  sales  forecasts,  I  esti¬ 
mate  the  following  regression: 


I  use  a  cross-sectional  regression  approach  to  examine  the  changes  in  earnings  forecast  accuracy  at  the  firm-analyst  level 
around  the  initiation  and  termination  of  sales  forecasts.  I  do  not  adopt  a  time-series  research  design  because  restricting 
the  analyses  to  analysts  that  have  survived  for  a  certain  period  of  time  may  induce  results  that  are  sensitive  to  survivor 
bias. 
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TABLE  5 

Association  between  Forecast  Accuracy  and  Initiation/Termination  of  Supplementary  Sales 

Forecasts 

\FEijt\  =Po+  PiDAYSij,  +  l32HORIZONiji  +  ^sFREQ^j,  +  f34LAG_FEij,  +  fisNEWip  +  li(,BROKER.t 
+  /37FIRM_EXPijt  +  l38GEN_EXPijc  +  ^gCOMPANY^,  +  pjolNDUSTRYiu  +  /3nEJNNO.t  +  /3i2\E_REVJ 
+  ^isSUPPLijt  +  t3i4STARfj,  +  fSjsSTOP^p  +  6^, 

Dependent  Variable  =  \FE\ 

I  II  III 


Coefficient 

p-value 

Coefficient 

p-value 

Coefficient 

p-value 

Intercept 

0.3251 

<0.001 

0.0725 

<0.001 

0.0720 

<0.001 

DAYS 

0.0494 

<0.001 

0.0508 

<0.001 

HORIZON 

0.4435 

<0.001 

0.4422 

<0.001 

FREQ 

0.0380 

<0.001 

0.0454 

<0.001 

LAG_FE 

0.0347 

<0.001 

NEW 

-0.0018 

0.394 

BROKER 

-0.0026 

0.282 

-0.0011 

0.598 

FIRM_EXP 

0.0045 

0.090 

0.0103 

<0.001 

GEN_EXP 

-0.0050 

0.055 

-0.0021 

0.362 

N_COMP 

-0.0143 

0.117 

-0.0076 

0.358 

N_1ND 

0.0200 

0.027 

0.0121 

0.143 

EJNNO 

0.0006 

0.750 

0.0037 

0.022 

\E_REV\ 

-1.0810 

<0.001 

-0.9645 

<0.001 

SUPPL 

-0.0095 

0.002 

-0.0332 

<0.001 

-0.0320 

<0.001 

START 

-0.0518 

<0.001 

0.0111 

0.012 

0.0100 

0.009 

STOP 

0.0511 

0.532 

0.0886 

0.242 

0.0890 

0.232 

F-value 

150.01 

<0.001 

1613.26 

<0.001 

2111.58 

<0.001 

Adj.  r2 

0.0032 

0.1651 

0.1741 

This  table  reports  the  association  between  forecast  accuracy  and  initiation/termination  of  supplementary  sales  forecasts, 
controlling  for  other  individual  analyst  characteristics.  I  report  the  coefficient  estimates  and  the  p-values  for  t-statistics 
from  pooled  OLS  regression. 

Variable  Definitions: 

START iji  =  1  if  analyst  i  issues  at  least  one  sales  forecast  for  firm  j  in  year  t  but  does  not  do  so  in  year  f-1,  and 
0  otherwise;  and 

STOPiji  =  1  if  analyst  i  issues  at  least  one  sales  forecast  for  firm  j  in  year  f-1  but  does  not  do  so  in  year  t,  and 
0  otherwise. 


See  Table  2  for  definitions  of  other  variables. 


^^^f+1  =  ySo  +  /3,EPS,  +  PjSUPPL,  *  EPS,  +  8,^1  (6) 

where  EPS,+]  is  earnings  per  share  before  extraordinary  items.  I  estimate  the  above  regression 
with  Huber- White  standard  errors  clustered  by  firm.  Untabulated  results  show  that  the  coefficient 
on  EPS,  iPi)  is  positively  significant  (coefficient  estimate  =  0.7841;  p  <  0.001),  but  the  coeffi¬ 
cient  {pf)  on  SUPPL,*  EPS,  is  insignificant  (coefficient  estimate  =  0.0012;  p  =  0.402).  My 
results  are  similar  when  including  other  individual  analyst  characteristics  as  control  variables. 
Therefore,  it  is  reasonable  to  conclude  that  the  primary  findings  are  robust  to  controlling  for 
earnings  persistence. 
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I  also  partition  the  sample  into  five  quintiles  according  to  each  of  the  following  firm  charac¬ 
teristics:  earnings  persistence,  earnings  growth,  earnings  volatility,  earnings  change,  firm  size,  and 
book-to-market  ratio.^^  I  run  separate  regressions  for  each  quintile  and  find  that  the  main  results 
hold  for  firms  from  different  quintiles  of  these  firm  characteristics  (results  not  tabulated). 

My  sample  consists  of  firm-years  from  firms  that  may  not  have  sales  forecasts.  However,  the 
reported  conclusions  are  robust  to  restricting  the  sample  to  earnings  forecast  revisions  from  firm- 
years  with  at  least  one  supplementary  sales  forecast  revision.  These  results  suggest  that  my  main 

results  are  not  driven  by  the  possibility  that  sales  forecasts  might  be  more  common  for  some 
26 

particular  firms. 

Time  Effects 

Although  the  reported  analyses  capture  relative  forecast  accuracy  and  individual  analyst  char¬ 
acteristics  relative  to  those  of  other  financial  analysts  following  the  same  firm  in  the  same  year,  1 
assess  the  robustness  of  these  results  to  including  a  time  trend  variable.  The  main  results  are 
qualitatively  the  same.  Results  are  also  robust  to  inclusion  of  year  indicators  for  fixed  time  effects. 

The  Directions  of  Earnings  Forecast  Revisions  and  Sales  Forecast  Revisions 

To  provide  further  evidence  on  whether  earnings  forecast  revisions  and  sales  forecast  revi¬ 
sions  are  in  the  same  direction,  I  define  two  indicator  variables,  CONFIRM  and  CONFLICT. 
CONFIRM  equals  1  if  the  revision  in  sales  forecast  is  in  the  same  direction  as  the  revision  in 
earnings  forecasts,  and  0  otherwise.  Similarly,  CONFLICT  equals  1  if  the  revision  in  sales  forecast 
is  in  the  opposite  direction  as  the  revision  in  earnings  forecasts,  and  0  otherwise.  I  identify  15,786 
cases  of  confirming  sales  forecasts  (CONFIRM)  and  4,146  cases  of  conflicting  sales  forecasts 
(CONFIRM)  at  the  firm-year-analyst  level.  Instead  of  including  SUPPL  as  the  test  variable  in  the 
regressions,  I  replace  SUPPL  with  the  variables  CONFIRM  and  CONFLICT.  Regression  coeffi¬ 
cients  on  both  CONFIRM  and  CONFLICT  are  negative  and  significant.  The  main  results  (untabu¬ 
lated)  are  essentially  the  same,  although  the  statistical  significance  on  the  CONFLICT  variable  is 
weaker.  I  also  remove  observations  for  which  the  revenue  and  the  earnings  forecast  revisions  are 
in  the  opposite  directions.  Again,  the  main  findings  are  the  same.  I  conclude  that  it  is  the  act  of 
disclosing  sales  forecasts,  not  the  content  of  the  sales  forecasts  that  accounts  for  the  primary 
results. 


VII.  CONCLUSION 

Although  a  vast  amount  of  academic  research  has  examined  different  aspects  of  financial 
analysts’  earnings  forecasts,  the  interaction  between  earnings  forecasts  and  other  supplementary 
information  provided  by  financial  analysts  is  largely  unexplored.  In  this  study,  I  examine  the 
nature  and  the  market  responses  to  financial  analysts’  supplementary  sales  forecasts.  I  find  that 
earnings  forecast  revisions  supplemented  with  sales  forecast  revisions  have  a  greater  impact  on 
security  prices,  controlling  for  the  incremental  information  content  in  sales  forecasts.  The  differ¬ 
ences  in  return  responses  suggest  that  investors  view  supplemented  earnings  forecast  revisions  as 
more  credible,  and  that  verifiable  supplementary  information  increases  the  credibility  of  earnings 
forecasts.  Furthermore,  supplemented  earnings  forecasts  are  more  accurate  ex  post,  suggesting  that 
financial  analysts  provide  sales  forecasts  to  convey  their  credibility.  To  the  extent  that  the  provi- 


“  Beaver  et  al.  (1980)  characterize  the  time-series  of  annual  earnings  as  a  first-order  moving  average  process  in  the  first 
differences,  IMA(1,1).  Following  their  approach,  I  measured  earnings  persistence  as  the  moving  average  parameter  {d) 
in  the  IMA(1,1)  process. 

Note  that  this  study  differs  from  prior  research  (e.g.,  DeFond  and  Hung  2003),  as  I  use  relative  forecast  accuracy  as 
dependent  variable  and  carry  out  the  analyses  at  the  analyst,  not  at  the  firm  level. 
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sion  of  sales  forecasts  is  associated  with  analyst  ability,  it  is  an  open  question  as  to  whether  less 
informed  financial  analysts  would  be  penalized  if  they  issue  ex  post  inaccurate  sales  forecasts  to 
mimic  their  more  informed  counterparts.  Although  sales  forecasts  are  one  possible  way  to  lend 
credibility  to  earnings  forecasts,  there  could  also  be  other  nonfinancial  information  disclosed  in 
analyst  reports  that  impacts  the  market  reactions  to  earnings  forecasts.  I  leave  this  to  future 
research. 
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ABSTRACT:  The  literature  provides  mixed  evidence  on  whether  investors  find  audit 
reports  modified  for  going  concern  reasons  to  be  useful.  Using  a  substantially  larger 
sample  than  previous  studies,  we  observe  negative  excess  returns  when  the  going 
concern  audit  report  (GCAR)  is  disclosed.  We  find  that  the  reaction  is  more  negative  if 
the  GCAR  cites  a  problem  with  obtaining  financing,  suggesting  that  the  GCAR  provides 
new  information  to  investors.  Also,  the  reaction  is  more  adverse  if  the  GCAR  triggers  a 
technical  violation  of  a  debt  covenant  that  restricts  the  firm  from  getting  a  GCAR.  The 
evidence  suggests  that  institutional  investors  drive  the  reaction  to  the  GCAR,  since 
there  is  no  detectable  reaction  at  low  levels  of  institutional  ownership.  The  market 
reaction  gets  more  negative  as  the  level  of  institutional  ownership  increases,  and  there 
is  a  decline  in  institutional  ownership  after  the  GCAR  is  issued.  We  attribute  these 
results  to  sophisticated  investors’  awareness  of  the  firm’s  financing  needs  and  the 
covenants  carried  by  the  firm’s  debt. 

Keywords:  audit  reports;  going  concern;  debt  covenants;  institutional  ownership. 

Data  Availability:  The  data  used  in  this  study  are  publicly  available  through  sources 
identified  in  the  study. 


I.  INTRODUCTION 

SAS  No.  59  (AICPA  1988)  requires  auditors  to  assess  a  client’s  going  concern  status.  If,  after 
considering  all  relevant  information,  the  auditor  has  substantial  doubt  about  the  entity’s 
ability  to  continue  as  a  going  concern,  then  the  auditor  is  required  to  modify  the  audit 
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report.*  We  refer  to  an  audit  report  modified  for  going  concern  reasons  as  a  “going  concern  audit 
report”  (GCAR).  There  is  a  long-standing  debate  on  the  usefulness  of  the  GCAR.  Auditors  have 
expertise  in  assurance  audits,  not  in  judging  the  going  concern  status  of  a  firm,  and  their  assess¬ 
ment  may  not  add  to  what  investors  already  know.  On  the  other  hand,  auditors  have  access  to 
information  unavailable  to  investors  and  can  reveal  this  information  through  the  GCAR. 

One  way  of  judging  the  usefulness  of  the  GCAR  is  to  see  whether  investors  react  to  GCAR 
announcements.  If  the  auditor’s  unfavorable  assessment  provides  new  information  on  the  firm, 
then  the  price  of  the  firm’s  securities  should  decline.  Several  event  studies  have  examined  the 
market  reaction  to  the  issuance  of  GCARs.^  While  early  studies  (Elliott  1982;  Dodd  et  al.  1984)  do 
not  detect  a  negative  market  reaction  to  qualified  reports,  including  GCARs,  later  studies  (Fleak 
and  Wilson  1994;  Jones  1996)  find  a  negative  reaction  to  GCARs.  More  recent  research  by 
Herbohn  et  al.  (2007),  using  an  Australian  sample,  and  Blay  and  Geiger  (2001)  do  not  report  a 
significant  negative  reaction  to  GCARs. 

Our  study  provides  evidence  on  this  issue,  using  a  substantially  larger  sample  than  those  used 
in  previous  studies.  We  find,  on  average,  a  significant  negative  reaction  to  the  announcement  of  a 
first-time  GCAR.  However,  our  primary  contribution  to  the  literature  lies  in  going  beyond  the 
initial  question  of  whether  investors  react  to  the  GCAR.  We  investigate  why  investors  react  to  the 
GCAR  and  who  (which  class  of  investors)  reacts  to  the  GCAR. 

One  reason  a  GCAR  may  elicit  a  reaction  from  investors  is  that  it  can  provide  new  informa¬ 
tion  on  the  status  of  the  client’s  negotiations  with  lenders  and  the  client’s  plans  to  raise  financing. 
Sometimes,  auditors  specifically  refer  to  financing  problems  in  the  GCAR.  We  find  that  investors 
react  more  adversely  to  GCARs  that  cite  financing  problems.  Further,  since  many  firms  carry  debt 
with  a  covenant  that  requires  the  firm  to  present  GCAR-free  financial  statements,  the  GCAR  can 
a  trigger  covenant  violation  that  results  in  stricter  loan  terms.  We  show  that  the  reaction  to  the 
GCAR  is  worse  for  firms  that  violate  a  covenant  when  they  receive  a  GCAR. 

We  also  consider  whether  the  reaction  to  the  GCAR  depends  upon  the  firm’s  ownership 
composition.  Kausar  et  al.  (2006)  present  evidence  that  institutional  investors  divest  shares  in 
firms  receiving  a  GCAR.  We  find  that  the  price  reaction  to  the  GCAR  varies  inversely  with  the 
level  of  institutional  ownership.  Also,  the  negative  market  reaction  for  firms  with  GCARs  citing 
financing  problems  or  for  firms  violating  a  debt  covenant  by  receiving  a  GCAR  is  observed  only 
at  higher  levels  of  institutional  ownership.  We  attribute  the  differences  to  institutional  investors 
likely  having  greater  knowledge  of  the  terms  of  the  firm’s  debt  contracts. 

Our  study  contributes  primarily  to  the  line  of  auditing  reseaich  on  GCARs.  First,  we  provide 
relatively  unambiguous  evidence  that,  on  average,  going  concern  audit  reports  provide  useful 
information.  Second,  we  provide  insight  into  the  reasons  that  investors  find  the  GCAR  to  be  useful 
by  showing  that  GCARs  that  cite  financing  problems  result  in  a  more  negative  reaction  than  do 
other  GCARs.  Third,  while  there  is  evidence  of  the  importance  of  restrictive  debt  covenants  in 
other  areas  of  accounting  research,  there  is  little  or  no  evidence  in  auditing.  We  show  that  audit- 
related  covenants  also  affect  investors’  decisions.  Fourth,  our  study  adds  to  the  literature  by 
showing  that  institutional  investors  use  the  information  in  audit  reports. 

In  the  next  section,  we  discuss  the  usefulness  of  GCARs  and  review  the  relevant  literature. 
We  expand  on  the  reasons  why  the  report  should  be  useful  to  investors  and  we  develop  hypoth- 


‘  The  Financial  Accounting  Standards  Board  has  an  ongoing  project  intended  to  make  managers,  not  just  auditors, 
responsible  for  identifying  and  disclosing  going  concern  uncertainties. 

^  For  convenience,  we  use  the  term  “GCAR”  to  refer  not  only  to  audit  reports  modified  for  going  concern  purposes,  the 
appropriate  terminology  under  SAS  No.  59,  but  also  to  “going  concern  qualifications,”  the  terminology  used  before  SAS 
No.  59. 
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eses.  In  Section  III,  we  present  the  sample,  descriptive  statistics,  hypothesis  tests,  and  results. 
Section  IV  includes  additional  analyses  and  robustness  tests.  In  Section  V,  we  present  some 
concluding  comments. 

II.  ARE  GOING-CONCERN  AUDIT  REPORTS  USEFUL? 

SAS  No.  59  (AICPA  1988)  directs  auditors  to  actively  assess  whether  the  client  is  a  going 
concern,  and,  if  there  is  substantial  doubt  about  the  client’s  going  concern  status,  to  modify  the 
audit  report  accordingly.^  The  usefulness  of  GCARs  has  long  been  questioned,  even  within  the 
accounting  profession  (Bellovary  et  al.  2006).  Auditors  have  unique  training  and  expertise  in 
audits,  not  in  assessing  the  firm’s  future  financial  prospects.  Further,  investors  are  able  to  form 
their  own  assessments  of  the  likelihood  that  the  firm  will  be  a  going  concern  using  a  variety  of 
information  sources,  including  financial  statements.  At  the  same  time,  auditors  have  access  to 
insider  information  (such  as  management  plans  and  the  status  of  client-lender  negotiations)  that 
could  allow  them  to  form  superior  judgments  to  those  formed  by  investors  who  do  not  have  access 
to  similar  information. 

Several  studies  have  investigated  the  market  reaction  to  GCARs."^  Some  of  these  studies  do 
not  look  at  GCARs  separately,  but  along  with  other  audit  report  modifications  or  qualifications. 
Elliott  (1982)  studies  firms  receiving  “subject-to”  audit  qualifications  and  finds  no  significant 
market  reaction  to  the  release  of  the  audit  qualification.  Dodd  et  al.  (1984)  also  find  no  evidence 
that  investors  react  adversely  to  audit  qualifications,  although  a  follow-up  paper  by  Dopuch  et  al. 
(1986)  reports  that  firms  that  have  qualified  opinions  that  are  reported  in  the  business  press 
experience  negative  excess  returns  around  the  report  date. 

Jones  (1996)  observes  negative  returns  around  the  announcement  date  for  68  firms  receiving 
GCARs,  with  the  return  worsening  for  more  unexpected  GCARs.  Similarly,  Fleak  and  Wilson 
(1994)  find  that  114  firms  receiving  GCARs  experience  negative  returns  when  compared  with 
distressed  firms  receiving  unmodified  opinions.  In  contrast,  Blay  and  Geiger  (2001)  find  a  statis¬ 
tically  insignificant  market  reaction,  on  average,  to  a  sample  of  121  firms  receiving  GCARs. ^  In  a 
study  of  229  Australian  firms,  Herbohn  et  al.  (2007)  find  no  short-term  market  reaction  to  the 
release  of  the  annual  report  containing  the  GCAR.^ 

In  summary,  results  are  mixed  on  the  question  of  whether  investors  react  adversely  to  the 
issuance  of  going  concern  audit  reports.  We  expect  a  GCAR  to  result  in  a  negative  investor 
reaction  for  two  reasons.  First,  the  auditor  may  provide  new  information  to  investors  in  the  GCAR. 
While  investors  can  extract  information  about  the  firm’s  financial  condition  efficiently  from  finan¬ 
cial  statements,  SAS  No.  59  directs  auditors  to  go  beyond  the  financial  statements  in  assessing  the 
client’s  going  concern  status  and  to  look  at  management’s  plans  to  address  liquidity,  including 


^  Ellingsen  et  al.  (1989)  provide  a  detailed  comparison  between  SAS  No.  59  and  its  predecessor,  SAS  No.  34,  which 
merely  required  auditors  to  ensure  that  going  concern  uncertainties  that  come  to  light  during  the  normal  course  of  an 
audit  are  adequately  disclosed. 

Reports  in  the  business  press  commonly  claim  that  the  disclosure  of  a  GCAR  results  in  a  negative  investor  reaction  (e.g., 
Carms  2000). 

^  Blay  and  Geiger  (2001)  observe  a  difference  in  returns  between  firms  in  their  sample  that  subsequently  went  bankrupt 
and  firms  that  survived.  The  authors  attribute  their  results  to  differential  market  expectations  for  the  two  groups;  that  is, 
investors  may  already  have  anticipated  impending  bankruptcy  for  firms  that  went  bankrupt  following  the  GCAR. 

®  Of  the  event  studies  cited  here  that  use  samples  of  firms  receiving  GCARs,  Jones  (1996),  Blay  and  Geiger  (2001),  and 
Herbohn  et  al.  (2007)  study  only  first-time  GCARs;  Fleak  and  Wilson  (1994)  include  both  first-time  and  repeat  GCARs. 
Some  studies  treat  the  first  release  of  information  (either  a  press  release  or  the  filing  of  the  10-K)  on  the  GCAR  as  the 
event  date  (Blay  and  Geiger  2001;  Herbohn  et  al.  (2007));  others  use  the  annual  report  or  10-K  filing  date  as  the  event 
date.  Blay  and  Geiger  (2001)  use  size-adjusted  returns  to  measure  the  market’s  reaction  to  the  event;  Herbohn  et  al. 
(2007)  use  market-adjusted  returns;  other  studies  use  risk-adjusted  returns. 
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financing  plans,  and  the  likelihood  that  these  plans  will  adequately  address  the  firm  s  problems. 
Much  of  this  information  is  not  publicly  available,  and  by  issuing  a  GCAR  the  auditor  may  reveal 
this  information  and  change  investors’  beliefs.  Hopwood  et  al.  (1989)  show  that  the  audit  report 
provides  a  statistically  significant  contribution  to  a  bankruptcy  prediction  model.  Also,  Chen  and 
Church  (1996)  and  Holder- Webb  and  Wilkins  (2000)  find  that  GCARs  reduce  the  surprise  asso¬ 
ciated  with  bankruptcy  announcements,  suggesting  that  investors  incorporate  the  information  in  a 
GCAR  in  assessing  the  likelihood  of  bankruptcy. 

Second,  we  argue  that  GCARs  can  impose  contracting  costs  on  firms,  which  is  likely  to  result 
in  an  adverse  market  reaction.  We  focus  on  GCARs  that  result  in  technical  violation  of  debt 
covenants.  We  discuss  this  explanation  later  in  this  section. 

Where  most  previous  studies  of  GCARs  are  limited  by  relatively  small  sample  sizes,  our 
study  uses  a  large  sample  of  1,194  firms  that  receive  first-time  GCARs.  Improved  data  availability 
allows  us  to  identify  the  first  time  that  investors  learn  of  the  GCAR  more  precisely  than  done  in 
many  previous  studies.  With  these  improvements,  we  expect  to  show  that  investors  value  the 
auditor’s  assessment  of  the  firm’s  going  concern  status.  Our  first  hypothesis,  in  the  alternative 
form,  follows. 

HI:  Investors  react  negatively  to  the  announcement  of  a  GCAR. 

While  our  paper  contributes  to  the  literature  in  providing  large-sample  evidence  on  the 
GCAR’s  usefulness  to  investors,  its  main  contribution  lies  in  exploring  reasons  for  the  market’s 
reaction.  We  investigate  whether  the  informativeness  of  a  GCAR  depends  upon  the  reason  pro¬ 
vided  by  the  auditor  for  issuing  the  modified  report,  the  presence  of  a  GCAR-restricting  debt 
covenant,  and  the  level  of  institutional  ownership. 

In  issuing  a  GCAR,  the  auditor  reveals  his/her  assessment  of  the  likelihood  that  the  firm  will 
survive  beyond  a  year,  and  in  doing  so  may  implicitly  reveal  information  about  financing  plans 
and  the  status  of  the  firm’s  dealings  with  lenders  and  others.  Sometimes,  however,  the  auditor 
explicitly  provides  information  on  the  firm’s  difficulties  in  servicing  debt  or  in  raising  capital.  In 
a  GCAR,  the  auditor  typically  provides  an  explanation  for  why  the  audit  report  was  modified. 
Often,  the  reason  provided  is  very  general  (for  example,  that  the  firm  has  sustained  recurring 
operating  losses)  and  does  not  reveal  anything  that  is  not  already  apparent  from  financial  state¬ 
ments.  However,  some  audit  reports  are  more  specific  and  attribute  the  auditor’s  decision  to  issue 
a  going  concern  modification  to  particular  operating  or  financing  problems.  If  the  auditor  cites  the 
firm’s  financing  problems,  including  difficulties  in  negotiating  with  existing  lenders  or  problems  in 
obtaining  new  financing,  then  we  expect  that  investors  will  see  the  GCAR  as  conveying  the 
auditor’s  pessimism  about  the  firm’s  ability  to  raise  needed  capital,  and  will  react  more  negatively 
to  the  GCAR  than  if  the  GCAR  does  not  mention  financing  problems. 

H2:  Investors  react  more  negatively  to  GCARs  that  specifically  cite  financing  problems  than 
to  GCARs  that  do  not  cite  financing  problems. 

A  GCAR  potentially  affects  the  firm’s  contracting  costs  in  several  ways.  It  could  cause  stock 
exchanges  to  question  whether  the  firm  should  continue  to  be  listed.*  Lenders  may  adjust  the  terms 
of  their  loans,  credit-rating  agencies  may  adjust  their  ratings,  and  suppliers  may  curtail  credit  to 


’  Norris  (2003)  provides  an  anecdote  of  an  instance  where  the  critical  determinant  in  the  auditor’s  decision  to  issue  a 
GCAR  was  the  likelihood  that  the  client  (Polaroid  Corp.)  would  not  obtain  refinancing. 

^  For  example,  New  York  Stock  Exchange  rules  indicate  that  receiving  a  GCAR  may  provide  cause  for  a  firm  to  be 
delisted.  Both  the  NASDAQ  and  the  American  Stock  Exchange  have  recently  (since  2004)  required  listed  firms  to 
inform  investors  of  the  receipt  of  a  GCAR  by  issuing  a  press  release  within  seven  days  following  the  10-K  filing. 
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firms  receiving  GCARs.^  In  all,  the  GCAR  can  severely  constrain  the  firm’s  ability  to  raise  capital 
and  continue  to  operate.  Investors  are  likely  to  anticipate  the  increased  contracting  costs  that 
follow  GCARs  and  to  react  negatively  to  the  announcement  of  a  GCAR. 

We  focus  on  the  existence  of  a  restrictive  debt  covenant  requiring  the  firm  to  prepare  financial 
statements  without  a  GCAR.  If  the  debt  carries  such  a  covenant,  then  the  issuance  of  a  GCAR  to 
the  borrower  would  likely  lead  to  higher  contracting  costs.  Beneish  and  Press  (1993)  find  that 
technical  violation  of  accounting-related  covenants  results  in  higher  interest  costs,  debt¬ 
restructuring  costs,  and  increased  lender  control.  As  a  result,  investors  react  negatively  to  reports 
of  technical  violations  of  covenants  (Beneish  and  Press  1995).  Since  investors  are  aware  of 
covenants,  we  expect  that  violation  of  a  covenant  restricting  the  firm  from  receiving  a  GCAR  will 
have  an  adverse  result  on  stock  prices.  This  leads  to  our  third  hypothesis: 

H3:  Investors  react  more  negatively  to  GCARs  issued  to  firms  that  have  a  debt  covenant  that 
restricts  the  firm  from  receiving  a  GCAR  than  to  GCARs  issued  to  firms  that  do  not  have 
such  a  covenant. 

Kausar  et  al.  (2009)  find  that  U.S.  firms  receiving  GCARs  experience  negative  returns  in  the 
year  following  the  GCAR  disclosures. They  attribute  this  apparent  under-reaction  on  the  part  of 
investors  to  market  frictions,  specifically  high  transaction  costs  associated  with  trading  in  GCAR- 
receiving  firms.  In  a  related  study,  Kausar  et  al.  (2006)  find  that  institutional  investors  react 
differently  from  individual  investors  to  the  GCAR.  Institutional  investors  reduce  their  holdings  in 
firms  receiving  GCARs. 

Prior  studies  have  identified  differences  in  the  way  that  institutional  investors  and  individual 
investors  react  to  accounting  information.  Several  potential  reasons  have  been  identified  for  this 
differential  reaction.  Two  common  reasons  offered  are:  (1)  institutional  investors  possess  infor¬ 
mation  advantages,  such  as  greater  predisclosure  information  (e.g.,  Utama  and  Cready  1997);  and 
(2)  there  are  differences  in  the  level  of  sophistication  of  institutional  and  individual  investors, 
stemming  in  part  from  differences  in  the  amount  of  time  and  resources  that  these  two  investor 
groups  can  afford  to  spend  on  their  investment  decisions  (e.g.,  Walther  1997). 

If  institutional  investors  have  an  informational  advantage  with  respect  to  the  GCAR,  then  this 
advantage  should  be  reflected  in  these  investors  knowing  about  the  GCAR  before  it  is  announced 
and  reacting  before  the  GCAR  is  announced,  rather  than  reacting  after  it  is  announced.  We  should 
then  observe  the  adverse  reaction  to  the  announcement  of  a  GCAR  being  mitigated  by  increasing 
levels  of  institutional  ownership.  However,  even  if  institutional  investors  had  predisclosure  infor¬ 
mation  about  the  GCAR  pre-Regulation  FD,  it  is  likely  that  this  advantage  would  be  largely 
dissipated  following  Regulation  FD.  Since  most  of  our  sample  is  drawn  from  the  post-Regulation 
FD  period,  this  advantage,  if  it  exists,  should  be  muted  for  our  sample. 

Even  absent  information  advantages,  institutional  investors  could  use  their  superior  resources 
and  expertise  to  make  different  investment  decisions  from  individual  investors.  For  example, 
institutional  investors  may  be  better  aware  of  the  status  and  terms  of  the  firm’s  debt  contracts, 
including  whether  the  debt  carries  a  covenant  restricting  a  GCAR  and  the  likely  costs  of  violating 
this  covenant. 


®  For  example,  in  2002,  Standard  &  Poor’s  cut  GenTek  Inc.’s  corporate  credit  rating,  citing  the  company’s  poor  financial 
performance  and  receipt  of  a  GCAR  {Business  Recorder  2002).  handler  (2004)  cites  a  company  (Spiegel  Inc.)  that 
delayed  filing  its  10-K  because  disclosing  that  the  firm  had  received  a  GCAR  “would  have  jeopardized  Spiegel  by 
unnerving  its  suppliers  and  disrupting  [the  CEO’s]  negotiations  with  the  banks.” 

Similarly,  Taffler  et  al.  (2004)  compute  buy-and-hold  abnormal  returns  for  a  portfolio  of  U.K.  firms  in  the  12  months 
following  their  receipt  of  a  GCAR,  and  find  that  these  firms  substantially  underperform  reference  portfolios.  Ogneva  and 
Subramanyam  (2007),  in  contrast,  using  a  sample  of  U.S.  firms,  suggest  that  the  Taffler  et  al.  (2004)  results  depend  on 
the  model  used  to  generate  abnormal  returns. 
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Note  that  the  information  advantage  and  investor  sophistication  arguments  point  to  different 
predictions  for  the  relationship  between  the  level  of  institutional  holdings  and  the  announcement 
period  reaction  to  the  GCAR.  If  the  investor  sophistication  argument  prevails,  then  we  expect 
institutional  investors  to  react  more  negatively  to  the  GCAR  than  individual  investors,  and  the 
market  reaction  to  the  GCAR  should  become  more  adverse  with  increasing  levels  of  institutional 
holdings.  We  believe  the  investor  sophistication  argument  is  more  persuasive  and  we  express  H4 
accordingly. 

H4:  The  investor  reaction  to  the  announcement  of  a  GCAR  is  negatively  related  to  the  level 
of  institutional  ownership  in  the  firm. 

III.  METHOD  AND  TESTS 

The  sample  for  the  study  consists  of  firms  that  receive  an  initial  GCAR  in  the  period  1995  to 
2006.  We  identify  firms  receiving  an  initial  GCAR  (that  is,  firms  receiving  a  GCAR  that  did  not 
receive  a  GCAR  in  the  previous  year)  by  searching  the  Compact  Disclosure  database,  the  Audit 
Analytics  database,  Compustat,  and  the  business  media,  and  then  confirming  the  existence  of  a 
first-time  GCAR  by  reading  audit  reports  in  current  and  previous  10-K  filings.  We  exclude  all 
American  Depositary  Receipts  (ADRs).  We  include  only  the  first  GCAR  received  by  the  firm  in 
our  sample,  even  if  the  firm  obtains  a  GCAR,  or  several  consecutive  GCARs,  is  then  issued  an 
unmodified  opinion,  and  later  receives  a  GCAR  once  more.  Consequently,  each  firm  appears  only 
once  in  our  sample.  We  apply  this  rule  since  we  believe  market  expectations  and  security  price 
reactions  will  be  different  for  firms  receiving  continuing  GCARs. 

Our  initial  sample  consists  of  4,541  firms  with  first-time  GCARs.  Table  1,  Panel  A  details  how 
we  arrived  at  the  final  sample.  We  exclude  firms  that  do  not  have  data  on  CRSP  and  Compustat. 
We  also  omit  firms  that  are  already  in  bankruptcy.  We  drop  very  small  firms  (firms  with  less  than 
$1  million  in  either  total  assets  or  market  value  of  equity). Our  final  sample  consists  of  1,194 
firms.  Table  1,  Panel  B  provides  the  frequency  of  observations  by  year.  The  highest  frequency  of 
GCARs  is  observed  in  the  year  2000. 

Table  2  provides  descriptive  statistics  for  the  firms  in  the  sample.  Market  prices  are  obtained 
from  CRSP,  institutional  ownership  data  are  obtained  from  the  CD  A/Spectrum  Institutional  Hold¬ 
ings  database,  and  filing  dates  are  obtained  through  the  SEC’s  EDGAR  database  or  Lexis-Nexis. 
On  average,  sample  firms  receiving  GCARs  are  small  (mean  total  assets  =  $165.7  million;  mean 
market  value  of  equity  =  $50.7  million).  The  firms  are  very  distressed,  as  should  be  expected  for 
GCAR-receiving  firms,  with  negative  and  declining  profitability,  negative  cash  generating  ability, 
and  high  leverage.  The  level  of  institutional  ownership  averages  13.0  percent.  SEC  rules  specify 
that  firms  must  file  their  10-K  reports  within  a  certain  number  of  days  of  the  end  of  the  fiscal  year. 
We  find  that  nearly  one  half  (48.9  percent)  of  the  sample  have  material  delays  in  filing  their  10-K, 
which  we  define  as  filing  more  than  five  days  late;  68.2  percent  of  the  sample  firms  have  Big  4 
auditors.  We  also  calculate  the  earnings  surprise  for  the  497  firms  in  the  sample  for  which  we  have 
analyst  forecast  data.  The  earnings  surprise,  scaled  by  stock  price,  has  a  mean  of  —0.62  and  a 
median  of  —0.07.  The  low  mean  likely  reflects  the  nature  of  the  GCAR  sample  (highly  distressed 
firms  with  low  stock  prices). 

Market  Reaction  to  First  GCAR  Disclosure 

We  use  the  earliest  date  on  which  the  GCAR  is  announced  to  investors  as  the  event  date  to 
determine  whether  there  was  a  market  response  to  the  GCAR  announcement.  Some  firms  an- 


"  The  excluded  small  firms  have  the  following  characteristics  (all  medians):  total  assets:  $1.52  million;  market  value  of 
equity:  $0.58  million;  earnings  before  interest  and  taxes:  $(2.60)  million;  stock  price:  $0.21  per  share. 
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TABLE  1 

Sample  Selection  and  Distribution  by  Year 


Panel  A:  Sample  Selection 

Sample _ Frequency 


Firms  receiving  initial  going  concern  audit  reports  from  1995-2006  4,541 

-  Firms  without  required  stock  price  data  2,130 

-  Firms  without  required  financial  data  1,044 

-  Firms  with  assets  or  market  value  under  $1  million  96 

-  Firms  that  declare  bankruptcy  prior  to  the  filing  of  Form  10-K  77 

Full  Sample  1,194 


Panel  B:  Sample  Distribution  by  Year 

Year 

Frequency 

Percentage 

1995 

66 

5.5% 

1996 

109 

9.1% 

1997 

101 

8.5% 

1998 

131 

11.0% 

1999 

79 

6.6% 

2000 

169 

14.2% 

2001 

133 

11.1% 

2002 

121 

10.1% 

2003 

70 

5.9% 

2004 

67 

5.6% 

2005 

77 

6.4% 

2006 

71 

6.0% 

Total 

1,194 

100.0% 

nounce  the  issuance  of  the  GCAR  prior  to  the  release  of  the  10-K  through  a  press  release  (“Early 
Disclosers”),  but  in  most  cases  investors  first  learn  of  the  GCAR  in  the  10-K  (“10-K  Disclosers”). 
For  Early  Disclosers,  we  use  the  press  release  date  as  the  event  date.  For  10-K  Disclosers,  we 
use  the  10-K  filing  date  as  the  event  date,  since  this  is  the  first  date  that  investors  become  aware 
of  the  GCAR,  absent  information  leakage. 

To  identify  firms  that  belong  to  the  Early  Disclosers  sample,  we  review  (through  Lexis-Nexis) 
all  press  releases  placed  on  newswires  by  each  GCAR  firm  prior  to  the  filing  of  the  10-K  an¬ 
nouncing  the  GCAR.  We  find  that  105  of  the  1,194  firms  issue  a  press  release  at  least  three  days 


It  is  unclear  why  some  firms  make  early  disclosures  of  GCARs.  Early  disclosers  in  our  sample  are  not  different  from 
other  sample  firms  in  terms  of  profitability,  financial  distress,  or  in  the  likelihood  they  employ  a  Big  4  auditor.  However, 
they  tend  to  be  larger  than  the  other  firms  and  to  have  more  analysts  following  them.  Early  disclosers  also  are  more 
likely  to  have  made  an  accounting  restatement  in  the  past  year.  The  larger  size  and  analyst  following  may  be  consistent 
with  arguments  in  the  voluntary  disclosure  literature  that  managers  voluntarily  disclose  bad  news  because  of  reputation 
and  litigation  concerns  (e.g.,  Skinner  1994). 
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TABLE  2 


Descriptive  Statistics 


Full  Sample  (n  =  1,194) 

Mean 

Standard 

Deviation 

Lower 

Quartile 

Median 

Upper 

Quartile 

Assets  ($  millions) 

165.7 

758.9 

9.0 

25.5 

90.9 

Market  value  of  equity  ($  millions) 

50.7 

100.5 

7.5 

19.0 

47.4 

Earnings  before  interest  and  taxes 

-0.60 

0.91 

-0.81 

-0.29 

-0.06 

Change  in  income 

-0.27 

0.79 

-0.44 

-0.16 

0.00 

Cash  flow  from  operations  ($  millions) 

-7.15 

48.9 

-10.45 

-3.22 

-0.20 

Return  on  Assets 

-0.83 

1.20 

-1.06 

-0.49 

-0.19 

Z-score 

2.03 

3.27 

0.54 

1.27 

3.40 

Exit  Value  ($  millions) 

-2.75 

90.4 

-1.71 

2.32 

11.59 

Earnings  Surprise  (n  =  497) 

-0.62 

4.16 

-0.47 

-0.07 

0.01 

Debt/Assets 

0.77 

0.76 

0.45 

0.70 

0.93 

Institutional  Ownership 

0.13 

0.16 

0.01 

0.07 

0.20 

Late  10-K  filers 

48.9% 

— 

— 

— 

— 

Big  4  auditors 

68.2% 

— 

— 

— 

— 

Total  assets,  market  value  of  equity,  earnings  before 

interest  and 

taxes,  change  in  income,  cash 

flow  from  operations,  return 

on  assets,  Exit  Value,  and  debt/assets  are  all  based  on  year-end  or  annual  numbers  for  the  period  for  which  the  GCAR  is 
issued.  Change  in  income  is  scaled  by  total  assets.  Earnings  before  interest  and  taxes  is  scaled  by  assets  and  adjusted  for 
the  industry  average.  Return  on  assets  is  net  income/total  assets.  Z-score  is  a  composite  measure  of  financial  distress  based 
upon  Zmijewski  (1984).  Exit  Value  is  the  firm’s  anticipated  liquidation  value  as  computed  based  upon  Berger  et  al.  (1996). 
Earning  Surprise  is  computed  as  (actual  EPS  -  mean  forecasted  EPS)/stock  price  at  beginning  of  the  fourth  quarter. 
Debt/Assets  is  total  liabilities  divided  by  total  assets.  Institutional  ownership  is  the  fraction  of  outstanding  common  stock 
owned  by  institutions.  Late  10-K  filers  indicates  the  percentage  of  firms  that  file  10-Ks  at  least  five  days  after  the  statutory 
filing  date.  Big  4  auditors  indicates  if  the  firm  is  audited  by  a  Big  4  audit  firm  or  one  of  its  predecessors. 


prior  to  the  10-K  date,  indicating  that  their  auditors  will  issue  a  GCAR  for  the  current  year.  Many 
of  these  disclosures  are  provided  in  early  earnings  announcements.^^  On  average,  these  firms 
disclose  the  GCAR  in  a  press  release  11.6  days  before  their  10-K  filing. 

In  Table  3,  we  report  daily  size-adjusted  excess  returns  for  the  event  date  and  the  five  days 
preceding  and  succeeding  the  event  date  for  the  1,194  sample  firms.'"^  Mean  excess  returns  for  Day 
0,  Day  -fl  and  Day  +2  are  significantly  negative,  with  the  most  negative  return  taking  place  on 
Day  -HI  (-3.49  percent). The  cumulative  three-day  excess  return  (0,  +2)  is  -6.28  percent.’® 


A  very  small  number  of  firms  issue  press  releases  that  announce  only  the  GCAR.  These  firms  provide  a  “pure”  sample, 
in  the  sense  that  there  is  no  other  information  included  in  the  press  release  that  may  explain  investors’  reaction. 
However,  there  are  too  few  of  these  firms  to  provide  any  detailed  analysis.  In  the  full  sample,  35  firms  issue  press 
releases  containing  GCAR  announcements  but  not  earnings  news.  Only  16  of  these  firms  issue  the  press  release  either 
before  or  concurrent  with  (up  to  one  day  following)  the  10-K  date.  The  mean  three-day  cumulative  excess  return  (0,  +2) 
for  the  16  firms  issuing  early  or  timely  press  releases  is  significantly  negative  (-8.52  percent,  p  <  0.05),  but  the  mean 
cumulative  excess  return  for  the  firms  issuing  late  press  releases  is  not  significantly  different  from  zero. 

Daily  excess  returns  are  computed  by  taking  the  firm’s  daily  return  and  subtracting  the  corresponding  size-decile 
portfolio  return  for  the  day  from  CRSR  We  choose  to  use  size-adjusted  returns  because  the  sample  is  dominated  by 
small  firms.  A  long  line  of  studies,  stretching  back  to  Banz  (1981),  shows  a  relationship  between  firm  size  and  market 
returns.  Standard  Student’s  t-statistics  are  used  to  test  whether  the  computed  daily  excess  return  are  significantly 
different  from  zero. 

It  is  possible  that  the  larger  magnitude  of  negative  returns  obtained  on  day  f-i-1,  relative  to  day  t,  is  explained  by  the 
availability  of  the  information  to  investors.  For  the  firms  in  our  sample  for  which  the  event  date  is  the  10-K  filing  date, 
the  information  may  have  been  available  to  investors  only  the  next  day.  Similarly,  some  firms  announcing  GCARs  in 
press  releases  may  have  made  the  disclosure  after  markets  closed  for  the  day. 
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TABLE  3 

Daily  Excess  Returns  Related  to  the  GCAR  Disclosure 

All  Firms  (n  =  1,194) 

10-K  Disclosers  Sample  (n 

=  1,089) 

Std. 

Std. 

% 

Day 

Mean  Dev. 

Median 

%  Negative 

Mean  Dev. 

Median 

Negative 

-5 

0.0059  0.1082 

-0.0019 

51.4% 

0.0066  0.1098 

-0.0022 

52.2% 

-4 

0.0023  0.1029 

-0.0004 

51.2% 

0.0029  0.1033 

-0.0003 

50.8% 

-3 

-0.0017  0.1064 

-0.0011 

52.1% 

-0.0015  0.1071 

-0.0018 

51.7% 

-2 

0.0011  0.1324 

-0.0021 

52.9% 

0.0009  0.1355 

-0.0023 

52.8% 

-1 

0.0004  0.1164 

-0.0020 

52.5% 

0.0008  0.1139 

-0.0017 

52.1% 

0 

-0.0150**  0.1301 

-0.0080 

59.2%** 

-0.0113**  0.1289 

-0.0076 

58.4%* 

1 

-0.0349**  0.1353 

-0.0179 

63.0%** 

-0.0323**  0.1315 

-0.0174 

63.2%** 

2 

-0.0129**  0.1217 

-0.0074 

58.4%** 

-0.0112**  0.1202 

-0.0073 

57.9%* 

3 

0.0042  0.1196 

-0.0017 

52.9% 

0.0038  0.1128 

-0.0010 

51.7% 

4 

0.0049  0.1296 

-0.0010 

51.5% 

0.0039  0.1304 

-0.0014 

52.9% 

5 

0.0004  0.1135 

-0.0035 

53.0% 

0.0002  0.1084 

-0.0029 

54.2% 

Early  Disclosers  Sample  (n  =  105) 

Day 

Mean 

Std.  Dev. 

Median 

%  Negative 

-5 

-0.0009 

0.0901 

0.0007 

47.6% 

-4 

-0.0041 

0.0981 

-0.0007 

52.3% 

-3 

-0.0043 

0.1002 

0.0031 

49.5% 

-2 

0.0031 

0.0953 

-0.0013 

54.2% 

-1 

-0.0034 

0.1409 

-0.0084 

57.1% 

0 

-0.0543** 

0.1367 

-0.0199 

67.6%* 

1 

-0.0622** 

0.1676 

-0.0298 

61.0% 

2 

-0.0299* 

0.1355 

-0.0091 

63.3% 

3 

0.0084 

0.1742 

-0.0077 

57.1% 

4 

0.0152 

0.1217 

0.0026 

45.7% 

5 

0.0030 

0.1577 

-0.0086 

54.3% 

*  ** 

p  <  0.05,  and  p  <  0.01,  respectively,  two-tailed. 

The  event  date  is  the  earliest  GCAR  disclosure  date  (for  10-K  Disclosers,  the  10-K  filing  date;  for  Early  Disclosers,  the 

GCAR-disclosing  press  release  date).  The  table  presents  statistics  for  daily  size-adjusted  excess  returns.  Significance  levels 

are  for  t-tests  for  differences  in  the 

mean  daily  excess  return  from 

zero  or  for  Chi-square 

tests  for  differences  in  the 

percentage  of  negative  excess  returns  in  a  given  day  from  the  average  for  the  21  days  around  the  event  date  for  sample 

firms. 

The  average  percentage  of  firms  incurring  negative  excess  returns  each  day  over  the  21  days  from 
Day  -10  through  Day  +10  (where  Day  0  is  the  event  date)  is  54.0  percent.  On  Day  0,  Day  +1, 
and  Day  +2,  the  percentages  of  firms  with  negative  excess  returns  are  59.2  percent,  63.0  percent, 
and  58.4  percent,  respectively,  all  significantly  higher  than  the  21 -day  average.  Table  3  reports 
daily  returns  separately  for  the  samples  of  10-K  Disclosers  and  Early  Disclosers,  as  well.  The 


The  cumulative  excess  return  for  the  announcement  period  (0,  +2)  is  computed  by  adding  the  daily  excess  returns  for 
the  three  days  beginning  with  the  event  day. 
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lO-K  Disclosers  have  a  cumulative  three-day  excess  return  of  —5.48  percent.  The  Early  Dis- 
closers  experience  a  mean  cumulative  three-day  (0,  +2)  excess  return  of  —14.64  percent. 

The  results  shown  in  Table  3  suggest  a  negative  investor  response  to  the  GCAR  announce¬ 
ment,  consistent  with  Hl.^*  However,  the  computed  announcement  period  return  may  be  contami¬ 
nated  by  other  information  that  is  released  at  the  same  time.  Like  other  studies  investigating  the 
market  reaction  to  GCARs,  we  are  unable  to  precisely  separate  the  GCAR  news  from  concurrent 
disclosures.  For  the  majority  of  firms  in  the  sample,  the  GCAR  news  becomes  available  simulta¬ 
neously  with  other  10-K  information.  Even  for  firms  that  announce  the  GCAR  early  in  a  press 
release,  there  is  generally  other  information  released  at  the  same  time,  such  as  annual  earnings.  In 
later  analyses,  we  attempt  to  mitigate  this  problem  and  provide  greater  confidence  that  the  ob¬ 
served  negative  reaction  is  due  to  the  GCAR. 

Cited  Reasons  for  the  GCAR 

The  GCAR  generally  carries  a  brief  explanation  for  why  the  auditor  chose  to  modify  the  audit 
report.  We  review  the  explanations  for  the  GCARs  in  our  sample  and  classify  explanations  into 
four  categories:  (1)  Poor  financial  performance.  This  category  includes  explanations  that  cite  the 
financial  condition  or  performance  of  the  company,  including  references  to  the  firm’s  present  or 
recurring  losses,  negative  cash  flows,  poor  working  capital  situation,  or  negative  equity.  (2)  Fi¬ 
nancing  problems.  In  this  category,  we  include  references  to  debt  defaults,  debt  covenant  viola¬ 
tions,  loss  of  credit  facilities,  or  the  need  for,  or  problems  in  securing,  additional  financing. (3) 
Operating  problems.  This  category  includes  references  to  specific  operating  problems,  such  as  the 
loss  of  a  key  customer,  problems  with  suppliers,  or  business  combination  issues.^^  (4)  Other.  All 
other  reasons  cited  for  the  GCAR  are  included  in  this  category  (such  as  litigation  and  regulatory 
issues). 

Table  4,  Panel  A  shows  the  frequency  with  which  the  auditor  attributes  the  GCAR  issuance  to 
each  of  these  reasons.  Since  a  GCAR  can  cite  more  than  one  reason,  the  total  frequency  adds  up 
to  more  than  1,194  for  the  full  sample.  Only  in  15  cases  does  the  auditor  not  provide  a  reason  for 
the  GCAR.  The  most  commonly  cited  reason  is  poor  financial  performance  (75  percent  of  firms), 
but  in  a  significant  number  of  cases  the  auditor  cites  financing  problems  (35.5  percent)  or  oper¬ 
ating  problems  (21.2  percent).  Table  4  also  breaks  out  the  frequency  separately  for  the  10-K 
Disclosers  sample  and  the  Early  Disclosers  sample.  The  most  notable  difference  between  these 
two  samples  is  the  higher  frequency  of  early  disclosing  firms,  where  the  auditor  cites  financing 
problems  (52.4  percent,  against  33.9  percent  for  the  10-K  Disclosers  sample). 

These  different  reasons  for  the  GCAR  cited  by  the  auditor  likely  have  different  information 
content  for  investors.  A  citation  of  poor  financial  performance,  for  example,  does  not  go  beyond 


Twelve  firms  in  the  sample  announce  the  GCAR  in  press  releases  on  the  day  preceding  or  two  days  preceding  the  10-K 
filing.  We  include  these  firms  in  the  10-K  Disclosers  sample  since  the  event  window  includes  the  10-K  filing.  Excluding 
these  firms  does  not  alter  any  of  the  reported  results  for  the  10-K  Disclosers  sample. 

The  investor  reaction  to  the  GCAR  announcement  is  conditioned  upon  the  extent  to  which  the  announcement  was 
anticipated.  Investors  may  anticipate  the  GCAR  because  of  the  information  they  have  from  interim  financial  reports  and 
other  sources  about  the  financial  condition  of  the  firm.  To  the  extent  that  the  GCAR  is  anticipated,  it  will  bias  against 
observing  a  negative  reaction,  since  investors  will  already  have  priced  in  the  information. 

As  an  example  of  a  financing  problem,  the  auditor’s  report  for  Blockbuster  Inc.  (not  in  the  sample)  for  the  fiscal  year 
2008  states  the  following:  “[T]he  risk  the  Company  may  not  successfully  complete  a  refinancing  of  its  credit  facility 
scheduled  to  mature  in  August  2009  and  obtain  related  amendments  of  financial  covenants  included  therein,  and/or  the 
risk  the  Company  may  not  have  adequate  liquidity  to  fund  their  operations  raise  substantial  doubt  about  the  Company’s 
ability  to  continue  as  a  going  concern.” 

An  example  of  a  GCAR  attributed  to  an  operating  problem  is  provided  by  Pinnacle  Airlines  Corp.’s  GCAR  for  the  fiscal 
year  2005.  The  auditor  notes:  “[T]he  uncertainty  regarding  the  future  of  the  Company’s  contract  with  its  primary 
customer  due  to  this  customer  seeking  protection  under  Chapter  11  of  the  U.S.  Bankruptcy  Code  raises  substantial  doubt 
about  the  Company’s  ability  to  continue  as  a  going  concern.” 
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Panel  A  shows  the  frequency  with  which  the  auditor  cites  certain  reasons  for  issuing  the  GCAR  in  the  audit  report  (for  the  10-K  Disclosers  Sample)  or  the  press  release  (for  the  Early 
Disclosers  Sample).  The  percentage  is  based  on  the  sample  size.  Since  multiple  reasons  can  be  cited  in  a  GCAR,  the  aggregate  of  the  frequencies  exceeds  the  sample  size.  Panel  B 
shows  the  frequency  with  which  debt  covenants  require  firms  to  prepare  audited  financial  statements  as  well  as  the  frequencies  of  restrictions  on  the  audit  report. 
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what  can  be  ascertained  from  the  financial  statements  and  is  unlikely  to  convey  new  information 
to  investors.  In  contrast,  if  the  auditor  indicates  that  the  firm  is  having  problems  related  to  its 
financing,  the  disclosure  may  be  incrementally  informative  to  investors.  Operating  problems  and 
other  specific  reasons  cited  by  auditors  may  also  be  news  to  some  investors,  but  in  general  we 
expect  that  these  will  already  be  known  to  the  market.  We  focus  on  financing  problems  in  our 
analysis. 

Debt  Covenants  and  GCARs 

In  order  to  study  the  contracting  consequences  of  the  GCAR,  we  focus  on  debt  covenants.  We 
search  the  Dealscan  database  and  the  material  contracts  exhibit  in  10-K  filings,  using  Lexis/Nexis 
to  identify  covenants  related  to  the  GCAR.  Table  4,  Panel  B  reports  the  results.  We  are  unable  to 
find  any  restrictive  covenants  related  to  financial  statements  or  the  auditor’s  report  for  421  firms  in 
the  full  sample.  In  the  remaining  773  instances,  the  firm  is  required  to  submit  timely  audited 
financial  statements  to  creditors. 

Of  the  firms  that  are  required  to  provide  audited  financial  statements,  193  firms  are  subject  to 
a  requirement  that  the  audit  report  should  not  be  qualified  or  modified  for  going  concern  reasons. 
In  92  additional  cases,  the  requirement  is  for  an  “unqualified”  audit  report.  Since  a  GCAR  is  an 
unqualified  audit  report  according  to  auditing  standards,  we  do  not  view  these  92  covenants  as 
restricting  GCARs.  Covenants  in  the  remaining  488  firms  require  timely  audited  financial  state¬ 
ments,  but  impose  no  restrictions  on  the  audit  report.  Some  of  these  firms  have  special  wording  on 
the  financial  statements:  21  firms  require  that  the  financial  statements  be  prepared  on  a  going 
concern  basis;  49  firms  have  other  wording  with  respect  to  the  quality  of  financial  statements  (e.g., 
“presents  fairly,”  “fair  and  accurate,”  or  “in  good  faith”). 

In  our  analysis,  we  classify  only  the  193  firms  that  are  required  to  present  an  audit  report  that 
is  not  modified  for  going  concern  reasons  as  having  a  restrictive  covenant  prohibiting  the  GCAR. 
Descriptive  statistics  (untabulated)  show  that  these  firms  tend  to  be  larger  (mean  assets  =  $272.3 
million)  than  the  other  firms  in  the  sample  (mean  assets  =  $148.6  million)  and  to  have  lower 
levels  of  losses  (mean  return  on  assets  =  —0.664,  as  compared  to  -0.857).  In  addition,  they  are 
much  more  likely  to  have  Big  4  auditors  (82.3  percent  versus  65.4  percent). 

Table  4,  Panel  B  also  breaks  down  the  debt  covenant  requirement  separately  for  the  10-K 
Disclosers  and  the  Early  Disclosers  samples.  A  higher  percentage  of  the  early  disclosing  firms 
(72.4  percent  versus  64.0  percent)  have  covenants  that  require  the  firm  to  prepare  audited  financial 
statements,  and  a  substantially  higher  percentage  (34.3  percent  against  14.4  percent)  have  cov¬ 
enants  that  restrict  the  firm  from  getting  a  GCAR. 

Multivariate  Analyses 

We  use  OLS  regressions  to  test  our  hypotheses  related  to  the  informativeness  of  GCARs.  We 
begin  by  controlling  for  several  additional  variables  that  could  explain  the  investor  reaction  on  the 
event  date.  We  estimate  the  following  model: 

CAR  =  a  +  PiLMValue  +  ^2EBIT  +  /S^SIncome  +  ^^CFOPS -f-  fi^Zscore  +  /3^ExitValue 

+  ^jLateFiler  -l-  e .  ( 1 ) 

The  event  date  is  the  GCAR  disclosure  date.  The  dependent  variable  {CAR)  measures  the  cumu- 


Using  Lexis-Nexis,  we  search  Exhibit  10  (material  contracts)  of  10-K  filings  for  all  companies  in  the  sample  starting  in 
1993.  We  expect  to  detect  covenants  in  any  debt  taken  in  1993  or  later.  However,  it  is  possible  that  some  firms  in  our 
sample  had  debt  prior  to  1993  that  included  covenants  related  to  financial  statements,  and  that  we  fail  to  detect  these 
firms.  These  firms  are  misclassified  in  the  analysis,  and  the  potential  misclassification  biases  against  our  finding  sig¬ 
nificant  results  in  tests  for  GCCovenant. 
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lative  size-adjusted  excess  return  over  the  three  days  beginning  with  the  event  date,  which  is  the 
date  the  GCAR  is  first  announced.  The  independent  variables  LMValue,  EBIT,  Slncome,  CFOPS, 
Zscore,  and  ExitValue  capture  aspects  of  financial  performance  or  condition  that  may  explain  the 
market’s  reaction  to  GCAR-accompanying  disclosures.  LateFiler  is  an  additional  control  variable 
that  may  explain  the  market’s  reaction  in  the  event  window.  Consistent  with  HI,  we  expect  a 
negative  intercept,  which  would  indicate  that  the  market  reacts  negatively  to  the  GCAR  even  after 
controlling  for  contemporaneous  disclosures  of  financial  condition. 

LMValue,  the  natural  log  of  the  market  value  of  equity,  is  a  proxy  for  firm  size,  and  is 
measured  at  the  end  of  the  fiscal  year  for  which  the  firm  receives  a  GCAR,  that  is,  several  weeks 
before  the  GCAR  is  announced.  This  variable  controls  for  size  effects  not  fully  captured  by  our 
use  of  size-adjusted  returns.  We  expect  firm  size  to  proxy  for  the  firm’s  information  environment. 
Since  larger  firms  are  likely  to  have  richer  information  environments,  we  would  expect  less 
surprise  associated  with  their  GCAR  disclosures,  resulting  iri  a  positive  coefficient  for  LMValue. 
EBIT  is  the  firm’s  earnings  before  interest  and  taxes,  scaled  by  total  assets,  less  the  industry  mean; 
Slncome  is  the  change  in  net  income  scaled  by  total  assets;  CFOPS  is  cash  flow  from  operations 
scaled  by  total  assets.  EBIT  and  CFOPS  are  measured  using  annual  numbers  for  the  period  for 
which  the  GCAR  is  issued,  while  Slncome  is  the  change  in  net  income  from  the  previous  year  to 
the  GCAR  year.^^  Zscore  is  a  measure  of  financial  distress,  computed  using  the  Zmijewski  (1984) 
distress  score  metric.^^  In  general,  we  expect  the  dependent  variable  to  be  positively  associated 
with  financial  performance,  so  coefficients  for  EBIT,  Slncome,  and  CFOPS  should  be  positive  and 
the  coefficient  for  Zscore  should  be  negative. 

ExitValue  represents  the  exit  value  of  assets,  computed  according  to  the  Berger  et  al.  (1996) 
formula,  scaled  by  the  market  value  of  equity.  ExitValue  is  measured  at  the  end  of  the  fiscal  year 
for  which  the  GCAR  was  issued. We  expect  investors  to  revise  the  probability  of  bankruptcy 
upward  in  highly  distressed  companies  and  consequently  to  place  greater  weight  on  the  exit  value 
(e.g.,  Barth  et  al.  1998;  Collins  et  al.  1999).  As  a  result,  ySg  should  be  positive. 

LateFiler  is  a  dichotomous  variable  identifying  firms  that  file  the  10-K  after  the  SEC-specified 

25 

filing  date.  LateFiler  is  assigned  a  value  of  1  if  the  firm’s  10-K  filing  is  more  than  five  days  late. 
Late  filing  companies  are  known  to  be  poorer  performers  (e.g.,  Alford  et  al.  1994).  If  the  firm  files 
a  late  10-K,  then  the  market  will  likely  anticipate  that  the  financial  statements  contain  bad  news, 
and  perhaps  even  that  the  audit  process  has  been  extended  because  a  GCAR  is  being  contem¬ 
plated.  As  a  result,  some  of  the  bad  news  would  already  have  been  impounded  in  the  stock  price 
during  the  period  of  the  delay,  resulting  in  a  less  adverse  reaction  to  the  10-K  filing.  We  would 
then  expect  Pj  to  be  positive.  We  drop  LateFiler  in  estimating  the  model  for  the  Early  Disclosers 
sample  since  the  late  filing  has  not  yet  happened  at  the  event  date  for  this  sample. 

We  add  five  variables  to  Model  (1)  in  our  second  specification.  Three  of  these  variables 
{CitedDebt,  GCCovenant,  and  InstOwn)  allow  us  to  test  H2,  H3,  and  H4,  respectively. 


Some  year-end  values  may  not  be  available  to  investors  on  the  event  date  for  the  Early  Disclosers  sample.  Still,  we  use 
yecU'-end  values  for  the  Early  Disclosers  firms  because  a  substantial  majority  of  the  press  releases  disclosing  the  GCAR 
early  also  provide  annual  earnings  and  balance  sheet  data. 

The  Zmijewski  (1984)  model  includes  three  variables:  return  on  assets,  debt/assets,  and  the  current  ratio. 

Berger  et  al.  (1996)  derive  the  following  formula:  Exit  Value  =  1.0  *  Cash  -I-  1.0  *  Marketable  Securities  +  0.72  * 
Receivables  +  0.55  *  Inventory  -I-  0.54  *  Fixed  Assets  —  1.0  *  Payables  —  1.0  *  Total  Debt.  Like  Berger  et  al.  (1996) 
and  Collins  et  al.  (1999),  we  replace  negative  estimated  values  with  zero,  since  a  negative  exit  value  has  no  economic 
meaning  for  a  limited  liability  company. 

Until  2002,  all  firms  were  expected  to  file  their  10-K  reports  within  90  days  from  the  fiscal  year-end.  Starting  from 
December  15,  2002,  this  period  was  shortened  to  five  days  for  accelerated  filers  (mainly  firms  with  common  equity  float 
exceeding  $75  million).  In  2006,  the  filing  period  was  shortened  further  for  large  firms. 
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CAR  =  a  +  /B^LM  Value  +  I32EBIT  +  pT^SIncome  +  fi^CFOPS  +  fS^Zscore  +  /3(,ExitValue 
+  PqLateEiler  +  f3^Big4  +  figCitedDebt  +  PioESCovenant  +  jS^GCCovenant 
+  f3i2lnstOwn  +  s.  (2) 

We  include  Big4  (1  if  the  firm  employs  a  Big  4  auditor,  and  0  otherwise)  because  prior  studies 
have  shown  that  investors  view  Big  4  auditors  as  being  more  credible  than  smaller  auditors.^^  For 
example,  Geiger  and  Rama  (2006)  find  that  Big  4-issued  audit  reports  are  more  accurate  in 
bankruptcy  prediction  (that  is,  there  are  fewer  Type  I  and  Type  II  errors)  than  reports  issued  by 
smaller  auditing  firms.  It  is  plausible  that  investors’  reactions  to  the  GCAR  depend  upon  the 
credibility  of  the  issuing  auditor.  Thus,  investors  may  see  it  as  more  likely  that  a  firm  will  go 
bankrupt  if  it  receives  a  GCAR  issued  by  a  Big  4  auditor,  leading  to  a  more  adverse  market 
reaction  for  Big  4-issued  GCARs. 

CitedDebt  represents  the  reason  for  the  GCAR  cited  in  the  audit  report.  CitedDebt  equals  1  if 
the  auditor  attributes  the  GCAR  to  a  financing  problem  and  0  otherwise.  A  negative  coefficient 
will  support  H2,  indicating  that  investors  react  more  adversely  to  GCARs  issued  because  of 
financing  problems. 

FSCovenant  and  GCCovenant  represent  financial  statement-related  covenants.  FSCovenant 
equals  1  if  the  firm  has  lending  agreements  that  require  timely  audited  financial  statements  (and  0 
otherwise).  GCCovenant  is  coded  as  1  if  a  covenant  specifies  that  the  firm  should  not  receive  a 
GCAR  (and  0  otherwise). Although  we  do  not  have  a  hypothesis  related  to  FSCovenant,  we 
expect  the  coefficient  for  FSCovenant  to  be  negative.  Lenders  requiring  audited  financial  state¬ 
ments  are  likely  to  be  stricter  than  lenders  that  do  not  require  audited  statements.  A  negative 
coefficient  for  GCCovenant  would  indicate  that  firms  for  which  the  GCAR  violates  a  covenant 
experience  an  even  more  negative  investor  reaction  than  firms  whose  lenders  just  require  financial 
statements,  supporting  H3. 

InstOwn  indicates  the  percentage  of  outstanding  common  stock  held  by  institutions  in  aggre¬ 
gate.  InstOwn  indicates  the  level  of  institutional  holdings  at  the  calendar  quarter-end  preceding  the 
calendar  quarter  in  which  the  GCAR  is  announced.  Following  H4,  we  expect  institutional  inves¬ 
tors  to  react  more  strongly  to  a  GCAR  than  individual  investors.  The  coefficient  for  InstOwn 
should  be  negative.  A  positive  coefficient  would  be  consistent  with  the  information  advantage 
argument  dominating  the  investor  sophistication  argument. 

We  estimate  Models  (1)  and  (2)  on  the  Early  Disclosers  sample  (105  firms)  and  the  10-K 
Disclosers  sample  (1,089  firms).^*’^®  Table  5  reports  the  regression  results.  We  first  discuss  the 
results  for  the  10-K  Disclosers  sample.  The  intercept  in  Model  (1),  which  includes  only  variables 


“  We  refer  to  the  tier  of  largest  auditors  as  the  Big  4  in  this  study.  The  tier  includes  six  firms  in  the  initial  years  of  the 
sample  until  the  1998  merger  between  Price  Waterhouse  and  Coopers  &  Lybrand,  and  then  five  firms  until  the  breakup 
of  Arthur  Andersen  in  2002. 

Note  that  only  firms  that  are  required  to  present  audited  financial  statements  to  lenders  can  have  the  additional  covenant 
prohibiting  a  GCAR.  As  Table  4,  Panel  B  shows,  there  are  773  firms  that  are  required  by  lenders  to  prepare  audited 
financial  statements  (i.e.,  FSCovenant  =  1).  Of  these,  193  firms  are  required  to  have  audit  reports  without  a  GCAR  (i.e., 
GCCovenant  =  1),  whereas  580  firms  are  not  (i.e.,  GCCovenant  =  0). 

We  winsorize  all  continuous  independent  variables  used  in  this  regression  (and  in  the  other  regressions  used  in  this 
paper)  at  the  1st  and  99th  percentiles.  The  results  reported  in  the  tables  reflect  this  winsorizing  procedure.  In  robustness 
tests,  we  use  alternate  winsorizing  rules  (0.5th  and  99.5th  percentiles;  2nd  and  98th  percentiles)  and  obtain  similar 
results.  We  also  identify  observations  that  have  a  large  effect  on  regression  coefficient  estimates  (following  Belsley  et  al. 
1980)  and  delete  the  observations  (1  percent  of  the  sample)  with  the  largest  effect.  The  only  notable  difference  in  the 
results  is  a  decline  in  the  significance  level  for  FSCovenant  (p  =  0.06). 

We  test  for  multicollinearity  in  all  the  regressions  we  estimate  in  this  study,  using  variance  inflation  factors.  Variance 
inflation  factors  are  well  within  acceptable  levels,  the  highest  value  being  2.96. 
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that  control  for  financial  information  that  may  explain  the  market  reaction  on  the  event  date,  is 
significantly  negative,  as  expected  under  HI.  None  of  the  control  variables  are  significant. 

The  intercept  is  not  significant  with  the  inclusion  of  the  additional  variables  in  Model  (2).  The 
control  variables  again  are  not  significant,  as  in  Model  (1).  CitedDebt  has  a  negative  and  signifi¬ 
cant  coefficient,  suggesting  that  GCARs  that  cite  financing  problems  induce  a  more  negative 
reaction.  This  finding  supports  H2.  FSCovenant  has  a  significantly  negative  coefficient,  indicating 
that  firms  with  covenants  requiring  audited  financial  statements  experience  a  more  negative  reac¬ 
tion  do  firms  without  such  a  requirement,  but,  consistent  with  H3,  the  sign  and  significance  of  the 
GCCovenant  coefficient  point  to  an  even  more  negative  reaction  if  the  firm’s  audit  report  is 
required  to  be  free  of  a  GCAR.  Finally,  consistent  with  H4,  the  coefficient  for  InstOwn  is  negative 
and  significant,  showing  that  the  abnormal  stock  returns  in  the  event  window  are  more  negative  in 
firms  with  higher  levels  of  institutional  holdings. 

We  estimate  Models  (1)  and  (2)  on  the  sample  of  105  Early  Disclosers  firms,  as  well.  We 
make  two  changes  to  the  models  for  this  sample.  Late  Filer  is  not  included,  since  we  measure 
returns  around  the  press  release  date  and  not  the  10-K  date.  CitedDebt  is  defined  differently  for 
this  sample,  since  the  actual  GCAR  is  not  visible  to  investors  at  the  time  of  the  press  release.  For 
early  disclosers,  we  define  CitedDebt  as  1  (and  0  otherwise)  if  managers  attribute  the  GCAR  to  a 
financing  problem  when  they  disclose  the  impending  GCAR  in  the  press  release. 

The  regression  results  show  small  differences  from  the  results  for  the  larger  sample  of  firms 
that  disclosed  the  GCAR  only  in  the  10-K.  LMValue  (firm  size)  and  Slncome  (the  change  in 
profitability)  have  positive  and  significant  coefficients.  Among  the  variables  of  interest,  GCCov¬ 
enant  has  a  significant  coefficient,  even  in  the  presence  of  a  significant  coefficient  for  FSCovenant, 
and  CitedDebt  and  InstOwn  are  also  significant  in  the  expected  direction. 

Overall,  the  analysis  of  returns  on  the  first  announcement  of  the  GCAR  supports  the  position 
that  the  GCAR  is  informative  to  investors.  We  observe  negative  returns  on  the  announcement,  and 
these  returns  continue  to  be  negative  after  controlling  for  the  firm’s  financial  performance  (HI). 
Returns  are  more  negative  for  firms  where  the  auditor  cites  financing  problems  as  being  the  reason 
for  the  GCAR  (H2)  and  for  firms  that  violate  a  debt  covenant  by  receiving  a  GCAR  (H3).  Also, 
returns  are  increasingly  negative  as  the  level  of  institutional  holdings  increases  (H4). 

Controlling  for  Earnings  Surprise 

If  firms  make  an  early  disclosure  of  the  GCAR  and  the  disclosure  is  included  in  an  earnings 
announcement,  then  investors  will  likely  react  to  the  earnings  surprise  in  the  announcement  win¬ 
dow.  If  firms  do  not  make  an  early  earnings  announcement,  then  investors  may  react  to  the 
earnings  surprise  in  the  10-K  event  window.  It  is  important,  then,  to  control  for  earnings  surprise 
in  our  analysis.  The  availability  of  earnings  forecast  data  proves  to  be  a  major  constraint.  Of  the 
1,194  firms  in  our  sample,  we  are  able  to  obtain  earnings  forecast  data  from  I/B/E/S  for  only  497 
firms.  We  re-estimate  Models  (1)  and  (2)  on  firms  with  earnings  forecast  data  (“partial  sample’’)  to 
see  whether  the  results  in  Table  5  hold  after  controlling  for  earnings  surprise. 

Surprise  is  computed  as  the  actual  EPS  less  the  forecasted  EPS,  scaled  by  the  stock  price  at 
the  beginning  of  the  fourth  quarter,  where  the  forecasted  EPS  is  the  last  consensus  analyst  forecast 
reported  on  I/B/E/S  before  the  GCAR  disclosure.  As  Table  2  shows,  the  majority  of  firms  in  the 
partial  sample  have  negative  earnings  surprises.  Untabulated  comparisons  of  descriptive  statistics 
show  that  firms  in  the  partial  sample  are  larger  than  firms  in  the  full  sample  (mean  total  assets  = 
$267.5  million;  mean  market  value  of  equity  =  $74.5  million),  but  are  similarly  distressed. 
Consistent  with  the  larger  size,  firms  in  the  partial  sample  are  more  likely  to  engage  Big  4  auditors 
(87.1  percent).  They  have  a  higher  frequency  of  covenants  restricting  the  firm  from  presenting 
financial  statements  with  a  GCAR  (23.0  percent).  The  highest  frequency  of  GCARs  is  observed  in 
the  year  2000,  as  for  firms  in  the  full  sample. 
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The  cumulative  size-adjusted  excess  return  in  the  three-day  window  (0,  +2)  commencing 
with  the  event  date  for  firms  in  the  partial  sample  is  —8.37  percent  (compared  with  —6.28  percent 
for  the  firms  in  the  full  sample).  72  firms  with  forecast  data  issue  a  press  release  disclosing  the 
GCAR  at  least  three  days  prior  to  the  10-K  filing  date.  The  cumulative  three-day  return  for  these 
firms  is  —14.34  percent  (as  compared  to  —14.64  percent  for  105  early  disclosers  in  the  full 
sample). 

We  re-estimate  the  regressions  in  Table  5,  adding  Surprise  to  Models  (1)  and  (2)  (labeled 
Models  la  and  2a,  respectively).  Table  6  presents  the  regression  results  using  the  partial  sample. 
The  coefficient  for  Surprise  is  significant  in  every  estimation  in  the  expected  positive  direction  and 
the  increases  materially.  The  results  are  otherwise  very  similar  to  those  reported  in  Table  5,  the 
main  difference  being  that  the  control  variables  LMValue  and  SIncome  are  not  significant  for  the 
Early  Disclosers  sample.  The  intercept  is  significant  in  Model  (l)a  in  each  case,  showing  a 
negative  reaction  to  the  GCAR  announcement  even  after  controlling  for  information  on  financial 
performance  that  may  become  available  at  the  time.  However,  consistent  with  Table  5,  the  inter¬ 
cept  is  not  significant  in  either  estimation  of  Model  (2)a.  CitedDebt,  FSCovenant,  GCCovenant, 
and  InstOwn  have  significant  negative  coefficients  in  Model  (2)a,  both  for  the  10-K  Disclosers 
sample  and  the  Early  Disclosers  sample. 

Effect  of  Institutional  Holdings 

The  results  in  Tables  5  and  6  suggest  that  the  level  of  institutional  ownership  is  important  in 
explaining  investors’  reaction  to  the  GCAR  disclosure.  We  explore  the  issue  further  in  this  section 
by  partitioning  the  sample  into  two  groups  based  on  the  level  of  institutional  holdings.  We  do  a 
median  split  and  form  two  subsamples:  “low  institutional  ownership”  firms  and  “high  institutional 
ownership”  firms.  Although  we  use  the  label  “high  institutional  ownership”  to  describe  the  group 
with  larger  institutional  holdings,  the  median  for  the  sample  as  a  whole  is  only  7.0  percent  (as 
reported  in  Table  2),  so  the  threshold  to  enter  the  high  institutional  ownership  group  is  relatively 
low  by  contemporary  levels  of  institutional  investment.  Firms  in  the  low  institutional  ownership 
subsample  are  generally  smaller  than  firms  in  the  large  institutional  ownership  subsample.  Low 
institutional  holdings  firms  are  more  likely  to  be  late  in  their  10-K  filing  and  to  have  non-Big  4 
auditors  (untabulated). 

When  we  study  the  daily  size-adjusted  excess  returns  around  the  event  date  for  the  two 
subsamples,  it  is  clear  that  firms  with  high  institutional  holdings  drive  the  results  in  Table  3.  Table 
7  shows  the  daily  excess  returns  for  the  two  subsamples.  A  significant  mean  excess  return  is 
evident  only  on  Day  4- 1  for  firms  with  low  institutional  holdings,  and  a  disproportionate  number 
of  firms  in  this  group  report  a  significant  negative  excess  return  only  on  that  day.  The  three-day 
cumulative  excess  return,  beginning  with  the  event  day,  is  -3.05  percent.  In  contrast,  significant 
negative  returns  are  observed  on  Day  0,  Day  +1,  and  Day  +2  for  firms  with  high  institutional 
holdings.  Significantly  more  firms  report  a  negative  excess  return  on  each  of  these  days,  compared 
to  the  21 -day  average.  The  three-day  cumulative  excess  return  is  -9.53  percent. 

We  estimate  Models  (1)  and  (2)  separately  for  the  two  subsamples,  including  both  10-K 
Disclosers  and  Early  Disclosers,  with  the  event  date  being  the  date  of  the  first  disclosure.  The 
results  are  reported  in  Table  8,  showing  that  the  intercept  is  significantly  negative  in  both  the  low 
and  high  institutional  ownership  subsamples.  That  is,  after  controlling  for  other  financial  informa¬ 
tion,  there  is  a  negative  return  attributable  to  the  GCAR. 

When  we  estimate  Model  (2)  on  the  low  institutional  holdings  subsample,  none  of  the  inde¬ 
pendent  variables  are  significant.  However,  when  we  estimate  Model  (2)  on  the  high  institutional 
ownership  subsample,  the  coefficients  for  the  variables  of  interest  {CitedDebt,  GCCovenant,  and 
InstOwn)  are  all  statistically  significant,  as  in  Table  5,  consistent  with  H2,  H3,  and  H4,  respec¬ 
tively.  FSCovenant  has  a  significantly  negative  coefficient,  as  well.  The  evidence  suggests  that 
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sophisticated  institutional  investors  may  be  reacting  to  the  auditor’s  assessment  of  the  firms’ 
financing  problems,  as  well  as  to  additional  costs  imposed  because  the  GCAR  violates  a  debt 
covenant.  Interestingly,  the  coefficient  for  Big4  is  negative  and  significant.  Recall  that  this  coef¬ 
ficient  is  not  significant  when  the  regression  is  estimated  on  the  full  sample  in  Table  5.  GCARs 
issued  by  Big  4  auditors  are  likely  viewed  as  more  adverse  than  GCARs  issued  by  non-Big  4 
auditors  in  the  high  institutional  ownership  subsample. 

The  results  reported  in  Table  8  for  the  low  institutional  holdings  sample  indicate  that  there  is 
some  negative  reaction  to  the  GCAR  announcement  at  low  levels  of  institutional  ownership.  These 
results  are  sensitive  to  the  cutoff  employed  to  determine  low  holdings.  When  low  institutional 
holdings  are  defined  as  the  bottom  tertile  of  the  sample  (<1.5  percent  of  the  firm’s  outstanding 
shares)  or  the  bottom  quartile  (<1.2  percent),  the  intercept  is  not  significant  (p  =  0.26  and  p  = 
0.20,  respectively),  indicating  that  there  is  no  reaction  to  the  GCAR  in  firms  with  such  low  levels 
of  institutional  ownership.  Overall,  these  results  suggest  that  institutional  investors  may  be  largely 
responsible  for  the  negative  reaction  to  GCARs  that  we  observe. 

Institutional  Holdings  before  and  after  GCAR 

The  analysis  in  the  preceding  subsection  shows  that  the  market  reaction  to  the  GCAR  is 
associated  with  the  level  of  institutional  holdings  in  the  firm.  The  next  analysis  provides  additional 
evidence  that  the  GCAR  affects  institutional  holders’  investment  decisions.  This  analysis  follows 
the  work  of  Kausar  et  al.  (2006),  who  find  a  decline  in  net  institutional  holdings  in  firms  that 
receive  a  GCAR.  We  extend  Kausar  et  al.’s  (2006)  study  to  determine  whether  the  decline  in 
institutional  holdings  depends  upon  the  characteristics  of  the  GCAR.  Recall  that  institutional 
holdings  data  are  obtained  originally  from  quarterly  13F  filings.  We  compare  the  institutional 
holdings  for  the  firm  at  two  points  in  time:  the  quarter-end  that  precedes  the  calendar  quarter  in 
which  the  GCAR  is  disclosed,  and  the  end  of  the  first  full  quarter  after  the  GCAR  is  disclosed.^° 
Table  9  shows  that  the  mean  institutional  ownership  drops  from  13.3  percent  in  the  quarter-end 
preceding  the  GCAR  to  8.6  percent  at  the  end  of  the  quarter  following  the  GCAR.  The  decline  is 
statistically  significant,  and  confirms  the  result  reported  by  Kausar  et  al.  (2006). 

We  partition  the  sample  by  whether  the  GCAR  cites  a  financing  problem.  The  424  firms  that 
are  issued  GCARs  citing  a  financing  problem  have  a  higher  level  of  institutional  ownership  before 
the  GCAR,  on  average  (14.7  percent,  versus  12.5  percent  for  firms  where  the  GCAR  does  not  cite 
a  financing  problem),  but  they  experience  a  significantly  greater  decline  in  institutional  ownership. 
We  partition  the  sample  also  according  to  whether  the  firm  has  a  covenant  restricting  the  GCAR. 
The  level  of  institutional  ownership  for  the  firms  in  this  group  (mean  =  21.2  percent)  is  initially 
much  higher  than  for  firms  without  such  a  covenant  (mean  =11.8  percent).  There  is  no  statistical 
difference  in  the  level  of  institutional  ownership  following  the  GCAR  because  of  a  steep  drop  in 
the  institutional  ownership  in  firms  where  the  GCAR  violates  a  covenant. 

In  untabulated  results,  we  also  estimate  a  regression  using  the  change  in  institutional  owner¬ 
ship  as  the  dependent  variable  and  GCCovenant  (the  presence  of  a  covenant  restricting  the  GCAR) 
and  CitedDebt  (the  citing  of  financing  problems  in  the  GCAR)  as  the  main  independent  variables. 
We  control  for  size  (LMValue),  financial  performance  {EBIT,  SIncome,  CFOPS,  Zscore,  and  Ex- 
itValue),  auditor  type  {Big4),  and  whether  there  is  a  covenant  requiring  financial  statements  {ES- 
Covenant).  Of  the  variables  of  interest,  only  the  coefficient  for  GCCovenant  is  statistically  sig¬ 
nificant.  It  has  a  negative  sign,  consistent  with  the  univariate  analysis  in  Table  9. 


For  example,  for  a  firm  disclosing  a  GCAR  on  March  15,  2005,  we  would  study  the  change  in  institutional  ownership 
from  December  31,  2004  to  June  30,  2005. 
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TABLE  9 


Changes  in  Institutional  Ownership  Following  GCAR 


Mean 

Mean 

Institutional 

Institutional 

Ownership 

Ownership 

n 

before  GCAR 

after  GCAR 

Difference 

All  Firms 

1,194 

0.133 

0.086 

-0.046 

t-statistic 

5.68  ** 

Reasons  Cited  for  GCAR 

GCARs  citing  financing  problems 

424 

0.147 

0.084 

-0.063 

GCARs  not  citing  financing  problems 

770 

0.125 

0.088 

-0.037 

Test  of  differences 

1,194 

2.33  * 

0.50 

3.13  ** 

Covenant  Restricting  the  GCAR 


Firms  with  covenant  restricting  GCAR 

193 

0.212 

0.095 

-0.117 

Firms  without  covenant  restricting  GCAR 

1,001 

0.118 

0.085 

-0.033 

Test  of  differences 

1,194 

5.70  ** 

0.79 

5.66  ** 

Institutional  Ownership 

Firms  with  low  institutional  ownership 

597 

0.021 

0.032 

0.011 

Firms  with  high  institutional  ownership 

597 

0.244 

0.141 

-0.103 

Test  of  differences 

1,194 

34.50  ** 

14.83  ** 

15.37  ** 

*,  **  p  <  0.05,  and  p  <  0.01,  respectively,  two-tailed. 

The  table  shows  the  mean  level  of  institutional  ownership  as  a  proportion  of  total  common  shares  outstanding  in  firms 
before  and  after  they  are  given  a  GCAR.  t-statistics  are  for  tests  for  differences  in  means  before  and  after  the  GCAR  in  the 
case  of  the  full  sample,  and  for  differences  between  subgroup  means  in  other  cases. 


These  results  are  driven  by  firms  with  higher  levels  of  institutional  ownership.  Table  9  shows 
that  when  we  do  a  median  split  of  the  sample  along  institutional  ownership,  as  in  Tables  7  and  8, 
there  is  little  change  in  the  mean  institutional  ownership  following  the  GCAR  in  the  low  institu¬ 
tional  ownership  subsample,  but  a  substantial  drop  (from  24.4  percent  to  14.1  percent)  in  the  high 
ownership  subsample.  We  estimate  a  regression  with  the  change  in  institutional  holdings  as  the 
dependent  variable  in  each  subsample.  Results  for  the  high  institutional  ownership  subsample  are 
similar  to  the  results  reported  for  the  sample  as  a  whole,  but  in  the  low  institutional  subsample 
neither  GCCovenant  nor  CitedDebt  have  significant  coefficients  (p  =  0.48  and  0.60,  respectively). 

Table  9  does  not  directly  test  institutional  selling  in  response  to  the  GCAR,  because  the 
change  in  institutional  ownership  levels  around  the  time  the  GCAR  is  issued  does  not  necessarily 
mean  that  institutional  owners  reacted  to  the  GCAR.  Still,  the  evidence  is  consistent  with  institu¬ 
tions  being  net  sellers  of  stock  in  GCAR-receiving  firms,  and  in  particular,  in  firms  receiving 
GCARs  that  cite  financing  problems  and  in  firms  that  have  a  debt  covenant  that  restricts  them  from 
receiving  a  GCAR. 

IV.  ADDITIONAL  ANALYSES  AND  ROBUSTNESS  TESTS 
GCAR  Type  I  Errors 

It  has  long  been  speculated  that  a  GCAR  may  increase  the  likelihood  of  bankruptcy.  Prior 
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studies  have  referred  to  this  as  the  “self-fulfilling  prophecy”  effect  of  the  GCAR  (e.g.,  Mutchler 
1984;  Louwers  et  al.  1999).  Several  studies  have  investigated  the  GCAR’s  accuracy  in  predicting 
the  likelihood  that  a  firm  will  go  bankrupt  (e.g.,  Geiger  and  Rama  2006).  We  extend  this  literature 
by  considering  whether  auditors  are  less  likely  to  make  Type  I  errors  (issuing  GCARs  to  firms  that 
survive  for  more  than  one  year)  for  GCARs  with  certain  characteristics  than  for  others.  We 
conduct  this  analysis  to  rule  out  the  possibility  that  the  more  adverse  market  reaction  observed  for 
GCARs  citing  financing  problems  and  GCARs  violating  covenants  is  due  to  these  firms  having  a 
greater  likelihood  of  going  bankrupt  than  other  firms  receiving  GCARs. 

We  begin  by  determining  the  post-GCAR  status  of  the  companies  in  the  sample.  We  find  that 
174  of  the  1,194  firms  file  for  bankruptcy  (Chapter  7  or  11)  at  the  federal  or  state  level  in  the 
12-month  period  following  the  date  of  the  financial  statements  for  which  the  auditor  issued  a 
GCAR,  a  Type  I  error  rate  of  85.4  percent.  To  determine  whether  Type  I  errors  are  lower  for 
certain  types  of  GCARs,  we  estimate  a  logistic  model  where  the  dependent  variable.  Bankrupt  (1 
if  bankrupt,  and  0  otherwise)  indicates  whether  the  firm  goes  bankrupt  in  the  12  months  following 
the  GCAR.  CitedDebt  and  GCCovenant  are  the  variables  of  interest  in  this  analysis.  We  use 
FSCovenant  to  ensure  that  GCCovenant  is  not  simply  picking  up  firms  with  covenants  requiring 
audited  financial  statements.  As  control  variables  we  use  LMValue,  since  larger  firms  may  have 
greater  ability  to  survive,  EBIT,  SIncome,  CFOPS,  Zscore,  ExitValue,  and  LateFiler  to  represent 
financial  condition,  and  Big4,  since  Big  4  auditors  may  have  superior  accuracy  in  issuing  GCARs. 
The  variables  of  interest  are  not  significant  (lowest  p  =  0.16). 

Another  potential  adverse  economic  consequence  a  firm  faces  is  delisting.  As  many  as  467  of 
the  1,194  firms  in  our  sample  (39.1  percent)  are  delisted  during  the  12  months  following  the  date 
of  the  financial  statements  carrying  the  GCAR.  We  test  whether  the  likelihood  of  delisting  depends 
upon  the  characteristics  of  the  GCAR.  To  do  this,  we  replace  the  dependent  variable  in  the 
bankruptcy  model  described  above  with  Delist,  where  Delist  equals  1  if  the  firm  is  delisted  from 
the  stock  exchange  in  the  12  months  following  the  GCAR.  Again,  the  GCAR-related  variables  do 
not  have  significant  coefficients  (lowest  p  =  0.24).  The  likelihood  that  a  firm  will  go  bankrupt  or 
delist  is  not  greater  for  firms  with  GCARs  citing  financing  problems  or  firms  with  GCARs  that 
violate  debt  covenants. 

Potentially  Confounding  Disclosures 

Given  the  nature  of  the  firms  in  the  sample  and  their  financial  condition  at  the  time  of  the 
GCAR  disclosure,  it  is  plausible  that  the  negative  reaction  we  detect  is  related  to  other  significant 
firm-specific  events  that  are  concurrently  disclosed  to  investors.  To  rule  out  this  potential  expla¬ 
nation,  we  review  all  8-K  filings  made  by  each  firm  in  the  sample  during  the  period  starting  five 
days  before  the  event  date  and  ending  five  days  after.  We  retain  in  the  sample  those  firms  that  file 
8-Ks,  reproducing  previously  disseminated  earnings  or  GCAR  announcements,  but  drop  147  firms 
reporting  other  events.  The  principal  events  reported  in  the  filings  by  omitted  firms  are  financing 
issues,  purchase  or  sale  of  assets,  and  manager  or  director  changes.  We  re-estimate  Models  (1)  and 
(2)  on  a  sample  that  excludes  these  firms.  The  intercept  is  statistically  significant  in  Model  (1)  for 
the  10-K  Disclosers  sample.  The  coefficients  for  the  variables  of  interest  in  Model  (2)  are  signifi¬ 
cant  at  previous  levels,  as  shown  in  Table  5. 

We  also  review  the  Early  Disclosers  sample  to  identify  significant  disclosures  of  other  events. 
We  identify  10  firms  with  significant  events  that  were  not  previously  disclosed  (3  executive 
turnovers,  2  litigation  events,  3  loan  defaults,  and  2  mergers).  We  delete  these  firms  from  the  Early 
Disclosers  sample  and  re-estimate  regressions  in  Table  5  for  the  remaining  95  firms.  The  intercept 
is  statistically  significant  in  Model  (1)  and  the  coefficients  for  the  variables  of  interest  in  Model  (2) 
are  significant  at  previous  levels. 
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GCAR-Related  Variables  and  Returns  at  Earnings  Announcement 

A  potential  concern  is  that  the  GCAR-related  variables  (GCCovsnant  and  CitedDebt)  may 
proxy  for  factors  that  are  associated  with  the  firm’s  financial  condition  and  that  we  have  not 
adequately  captured  with  our  control  variables.  We  address  this  concern  by  estimating  Models  (2) 
and  2a  at  another  significant  event  date  when  the  GCAR  should  not  be  relevant.  The  sample  for 
this  analysis  consists  of  349  firms  in  the  full  sample  of  firms  (191  firms  in  the  partial  sample)  that 
first  announce  the  GCAR  at  the  10-K  date,  but  that  previously  issue  earnings  announcements.  The 
event  date  is  the  earnings  announcement  date.  Since  the  GCAR  is  not  known  at  this  time,  the 
GCAR-related  variables  should  be  associated  with  excess  returns  in  the  event  window  (0,  +2) 
only  if  they  are  surrogates  for  financial  distress.  The  only  statistically  significant  coefficients  when 
we  estimate  Models  (2)  and  2a  are  for  the  control  variables  representing  size  and  earnings  surprise 
(Model  2:  LMValue;  Model  (2)a:  LMValue  and  Surprise).  It  seems  unlikely  that  GCCovenant  and 
CitedDebt  represent  other  factors  associated  with  financial  distress. 

Additional  Variables 

Recall  that  we  expect  investors  to  react  adversely  to  GCARs  that  cite  financing  problems.  We 
do  not  expect  the  other  commonly  cited  reasons  (see  Table  4,  Panel  A)  to  be  incrementally 
informative  because  the  cited  circumstances  are  likely  to  be  known  to  investors.  Still,  we  indi¬ 
vidually  test  to  see  whether  each  reason  is  informative,  using  a  regression  in  the  form  of  Model 
(2).  We  do  not  find  an  incremental  negative  effect  on  returns  for  any  reason  except  financing 
problems  (lowest  p  =  0.13). 

Table  4,  Panel  B  indicates  that  there  are  92  firms  with  covenants  that  require  that  the  audit 
report  be  “unqualified.”  We  do  not  include  these  with  the  195  firms  carrying  covenants  that 
prohibit  a  GCAR  in  our  main  tests  because  auditing  standards  treat  the  GCAR  as  an  unqualified 
audit  report.  However,  it  is  possible  that  a  lender  that  prohibits  a  qualified  opinion  intends  to 
prohibit  a  GCAR,  since  the  GCAR  is  sometimes  referred  to  as  a  “going  concern  qualification.” 
When  we  add  a  variable  to  Model  (2)  to  indicate  whether  the  firm  has  a  covenant  requiring  an 
unqualified  report,  the  coefficient  is  not  significant  (p  =  0.35),  suggesting  that  investors  do  not 
react  more  adversely  to  the  GCAR  announcement  if  such  a  covenant  is  present. 

We  include  several  additional  variables  in  Models  (1)  and  (2)  as  potential  explanatory  vari¬ 
ables  for  the  event  period  returns.  Restate  incorporates  adverse  expectations  that  investors  may 
have  for  restating  firms.  PreReturn  captures  pre-event  returns  and  is  used  as  a  proxy  for  investors 
anticipation  of  the  GCAR.  We  use  Bankrupt  since  investors  may  make  a  distinction  at  the  time  of 
the  GCAR  announcement  between  firms  that  will  imminently  go  bankrupt  and  those  that  will  not 
(Blay  and  Geiger  2001).  Similarly,  Delist  is  included  because  investors  may  anticipate  that  addi¬ 
tional  costs  will  be  borne  by  firms  that  are  likely  to  be  delisted.^'  The  coefficients  for  Restate, 
PreReturn,  Bankrupt,  and  Delist  are  not  significant  at  the  thresholds  we  use  in  this  paper  in  any  of 
the  model-sample  combinations  used  in  Table  5  (lowest  p  =  0.10  for  Restate,  in  Model  (2)  for 
both  samples).  The  variables  of  interest  continue  to  be  significant  at  the  same  levels  as  in  Table  5. 

In  order  to  see  whether  there  is  a  difference  in  the  market  reaction  to  the  GCAR  in  the  pre- 
and  post- Regulation  FD  periods,  we  use  a  variable  PostFD,  coded  as  1  for  all  GCARs  with  event 
dates  following  the  effective  date  of  Regulation  FD.  We  also  use  interaction  terms  PostFD  * 
CitedDebt,  PostFD  *  GCCovenant,  and  PostFD  *  InstOwn.  This  analysis  is  motivated  by  the 
information  advantage  argument  that  suggests  that  in  the  period  before  Regulation  FD,  institu- 


Restate  equals  1  if  the  firm  reports  a  restatement  in  the  12  months  preceding  the  event  date,  and  0  otherwise;  PreReturn 
is  measured  as  the  stock  return  over  the  period  starting  from  the  end  of  the  fiscal  year  for  which  the  GCAR  is  obtained 
and  culminating  two  days  prior  to  the  event  date;  Bankrupt  is  as  defined  earlier;  and  Delist  is  as  defined  earlier. 
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tional  owners  may  react  to  the  information  in  the  GCAR  before,  rather  than  during,  the  event  date 
because  of  access  to  predisclosure  information.  We  re-estimate  Models  (1)  and  (2).  The  intercept 
in  Model  (1)  and  the  independent  variables  of  interest  in  Model  (2)  are  significant  at  previous 
levels.  The  additional  variable  and  interaction  terms  used  are  not  significant  (lowest  p  =  0.26), 
indicating  there  is  no  difference  in  the  reaction  of  individual  investors  at  the  announcement  date 
before  and  after  Regulation  FD. 

Alternative  Measurements  of  Variables 

For  235  firms  in  the  partial  sample,  there  is  no  prior  press  release  announcing  earnings,  and 
the  10-K  provides  the  first  news  on  earnings.  For  the  remaining  firms,  there  is  a  prior  earnings 
release,  and  earnings  are  no  longer  a  surprise.  For  these  firms.  Surprise  arguably  should  be  coded 
as  0.  Treating  Surprise  as  0  for  these  firms  does  not  change  the  statistical  significance  of  Surprise 
or  alter  other  results  in  Table  6. 

Our  tests  use  a  five-day  grace  period  for  LateFiler.  In  robustness  tests,  we  classify  a  firm  as 
a  late  filer  if  it  is  one,  two,  three,  or  four  days  late  in  filing.  We  also  use  two  alternative  measures 
of  financial  distress  to  replace  the  Zmijewski  (1984)  distress  metric:  Altman’s  Z-score  (Altman 
1968)  and  a  metric  based  on  the  Hillegeist  et  al.  (2004)  financial  distress  model.  Our  results  are 
robust  to  the  use  of  these  alternative  measures  (i.e.,  coefficients  for  the  variables  of  interest 
continue  to  be  significant  at  reported  levels). 

We  repeat  all  our  tests  using  buy-and-hold  abnormal  returns  (BHAR)  rather  than  cumulative 
abnormal  returns  (CAR)  and  obtain  essentially  similar  results.  The  coefficients  for  the  variables  of 
interest  are  significant  at  reported  levels  with  one  exception:  in  Table  5,  in  the  10-K  Disclosers 
sample,  the  coefficient  for  GCCovenant  is  significant  at  a  lower  level  (p  =  0.06)  than  reported 
previously.  Since  this  is  a  short-window  study,  we  choose  to  report  CAR  in  the  tables  to  be 
consistent  with  other  short-window  studies,  particularly  prior  GCAR  event  studies. 

V.  CONCLUSIONS 

There  have  been  questions  raised  in  the  past  about  whether  auditors  should  be  required  to 
make  going  concern  judgments.  One  view  has  been  that  GCARs  do  not  provide  investors  with 
new  information,  since  investors  can  make  going  concern  assessments  adequately  using  other 
sources  of  information.  Others  have  argued  that  the  auditor’s  access  to  private  information  regard¬ 
ing  the  client  makes  the  GCAR  informative.  Prior  research  that  attempts  to  ascertain  the  useful¬ 
ness  of  the  GCAR  by  studying  the  market  reaction  to  the  GCAR  announcement  provides  mixed 
results.  The  sample  in  this  study  is  substantially  larger  than  samples  used  previously  and  covers 
many  years.  We  find  that  investors  react  to  the  auditor’s  assessment  and  adjust  their  valuations 
accordingly.  In  addition,  we  show  that  the  market  reaction  to  the  GCAR  depends  on  certain  GCAR 
attributes.  Mixed  results  in  prior  studies  could  be  due  to  differences  in  sample  composition  with 
respect  to  the  reasons  cited  by  the  auditor  for  the  GCAR,  the  presence  of  covenants  prohibiting  the 
GCAR,  and  the  level  of  institutional  ownership. 

Our  results  suggest  that  the  informativeness  of  the  GCAR  depends  upon  its  content.  Auditors 
generally  cite  a  reason  for  issuing  the  GCAR.  Very  often,  the  cited  reason  is  the  firm’s  distressed 
financial  condition,  which  may  not  add  to  what  investors  can  already  glean  from  the  financial 
statements.  We  find  that  the  reaction  is  more  adverse  for  firms  when  the  auditor  attributes  the 
GCAR  to  problems  related  to  obtaining  financing. 

It  has  been  widely  speculated  that  GCARs  may  have  a  self-fulfilling  prophecy  effect;  that  is, 
the  auditor’s  decision  to  issue  a  GCAR  results  in  firms  finding  it  difficult  to  obtain  capital  and  to 
carry  on  business  as  usual  and,  that  as  a  consequence,  the  GCAR  leads  to  failure.  Our  study  shows 
one  specific  way  in  which  the  GCAR  affects  contracting  costs.  A  significant  number  of  firms  have 
lending  agreements  that  include  covenants  that  specifically  require  that  the  firm  not  obtain  a 
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GCAR.  Consistent  with  other  studies  that  find  that  investors  are  sensitive  to  the  violation  of  debt 
covenants,  investors  react  more  negatively  when  the  firm  violates  a  covenant  requiring  GCAR-free 
financial  statements.  However,  neither  the  auditor’s  attribution  of  the  GCAR  to  financing  problems 
nor  the  violation  of  a  covenant  requiring  GCAR-free  financial  statements  increases  the  likelihood 
that  the  firm  will  go  bankrupt  or  delist  from  the  stock  exchange,  relative  to  other  firms  receiving 
GCARs.  In  reacting  negatively  to  these  two  types  of  GCARs,  investors  are  recognizing  potential 
increases  in  the  cost  of  obtaining  capital  rather  than  to  imminent  bankruptcy  or  delisting. 

Finally,  we  find  that  the  reaction  to  the  GCAR  is  more  negative  when  the  level  of  institutional 
holdings  in  the  firm  is  high.  Interestingly,  there  is  no  reaction  at  all  to  the  GCAR  in  firms  with  very 
low  levels  of  institutional  ownership.  At  higher  levels  of  institutional  ownership,  the  magnitude  of 
negative  reaction  increases  for  firms  where  the  GCAR  cites  financing  problems  and  firms  for 
which  the  GCAR  results  in  a  debt  covenant  violation.  Also,  when  we  study  the  level  of  institu¬ 
tional  ownership  before  and  after  the  GCAR,  we  find  significant  net  institutional  selling  in  these 
types  of  firms.  We  attribute  these  results  to  sophisticated  investors’  awareness  of  the  firm’s  financ¬ 
ing  situation  and  the  covenants  carried  by  the  firm’s  debt. 
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ABSTRACT:  This  study  examines  the  effects  of  financial  reporting  complexity  on  inves¬ 
tors’  trading  behavior.  I  find  that  more  complex  (longer  and  less  readable)  filings  are 
associated  with  lower  overall  trading,  and  that  this  relationship  appears  due  to  a  reduc¬ 
tion  in  small  investors’  trading  activity.  Additional  evidence  suggests  that  the  association 
between  report  complexity  and  lower  abnormal  trading  is  driven  by  both  cross-sectional 
variation  in  firms’  disclosure  attributes  and  variations  in  disclosure  complexity  over  time. 

Given  regulatory  concerns  over  plain  English  disclosures  and  the  trend  toward  more 
disclosure,  my  investigation  into  the  effects  of  reporting  complexity  on  small  and  large 
investors  should  be  of  interest  to  regulators  concerned  with  reporting  clarity  and  level¬ 
ing  the  playing  field  across  classes  of  investors. 

Keywords:  disclosure;  readability;  plain  Engiish;  trading  voiume;  smaii  and  large 
investors. 

JEL  Classifications:  G30;  G12;  M41;  M48. 

I.  INTRODUCTION 

The  securities  industry  has  historically  been  driven  by  the  fundamental  concept  that  “all 
investors,  whether  large  institutions  or  private  individuals,  should  have  access  to  certain 
basic  facts  about  an  investment  prior  to  buying  it,  and  so  long  as  they  hold  it”  (SEC  2010, 
under  “Introduction”).  Consistent  with  this  philosophy,  investors  are  increasingly  able  to  access 
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more  data  from  longer  public  filings.'  However,  this  trend  may  not  necessarily  aid  investors  if  it 
is  more  costly  to  extract  useful  information  from  larger  and  more  complex  disclosures  (Bloomfield 
2002). 

Although  there  is  evidence  regarding  the  effects  of  reporting  length  on  long-term  market 
prices  (You  and  Zhang  2009),  the  effects  on  individual  market  participants  and  aggregate  trading 
volume  remain  unexamined.  As  such,  I  investigate  the  effects  of  reporting  complexity  (length  and 
readability)  on  small  and  large  investors’  trading  behavior  (volume  and  consensus)  around  10-K 
filings.  This  investigation  should  interest  regulators  concerned  with  both  reporting  clarity  and 
leveling  the  playing  field  across  classes  of  investors.^ 

I  first  consider  the  effect  of  more  complex  reports  on  abnormal  trading  around  the  10-K  filing. 

I  hypothesize  that,  faced  with  more  complex  reports,  investors  may  elect  not  to  process  the  report 
because  doing  so  is  too  costly.  Consequently,  these  investors  will  not  initiate  trades  in  response  to 
the  report  (Grossman  and  Stiglitz  1980;  Bloomfield  2002).  After  controlling  for  information  con¬ 
tent,  I  predict  that  more  complex  filings  will  lead  to  lower  trading  volume,  and  these  effects  will 
be  most  pronounced  among  smaller  investors  (due  to  their  limited  processing  capabilities  and 
expertise).  Although  theory  predicts  that  higher  processing  costs  affect  investor  trading  behavior, 
a  priori  it  is  unclear  whether  the  trend  toward  more  complex  filings  will  affect  overall  investor 
trading  behavior.  First,  contemporaneous  advances  in  technology  are  likely  to  dampen  overall 
processing  costs,  making  it  easier  for  all  investors  to  process  more  complex  reports.  Second,  the 
increased  availability  of  other  information  sources  (e.g.,  media  and  analyst  reports)  could  mitigate 
the  effects  of  these  attributes  on  investor  trading  behavior.^ 

I  examine  approximately  13,000  10-K  reports  filed  between  1995  and  2006  and  find  evidence 
consistent  with  more  complex  reports  affecting  trade  behavior  around  the  filing.  Specifically,  I  find 
that  (1)  more  complex  reports  are  associated  with  lower  levels  of  aggregate  trading  volume;  and 
(2)  the  relationship  between  report  complexity  and  trade  activity  appears  to  be  driven  by  a  reduc¬ 
tion  in  small  investor  trade  volume.  These  results  are  robust  to  controlling  for  the  information 
content  of  the  report  and  factors  such  as  profitability  and  persistence  that  have  been  shown  to  vary 
with  reporting  complexity.  Overall,  the  evidence  is  consistent  with  more  complex  reports  being 
more  costly  to  process,  with  the  effects  being  most  pronounced  among  small  traders. 

In  addition  to  trading  volume,  I  also  investigate  whether  more  complex  reports  lead  to  more 
dispersion  in  the  inferences  drawn  by  those  investors  who  do  elect  to  trade.  Based  on  prior 
experimental  evidence  (Barron  et  al.  2004),  I  predict  that  when  reports  are  more  complex,  small 
investors  will  interpret  the  information  in  a  more  diverse  manner.  Consistent  with  this  prediction, 
I  find  that  more  complex  reports  are  associated  with  a  decrease  in  consensus  among  small  inves¬ 
tors,  but  not  among  large  investors. 

Admittedly,  reporting  complexity  is  a  broad  notion  that  captures  both  differences  in  relative 
report  complexity  across  firms  as  well  as  variations  in  disclosure  attributes  over  time.  I  attempt  to 
isolate  these  potentially  different  effects  by  performing  both  within-year  and  within-firm  analyses. 


’  Radin  (2007)  points  out  that  increases  in  report  length  can  be  traced  to  new  disclosures  devoted  to  risk,  compensation, 
pension  accounting,  and  stock  options.  It  is  also  plausible  that  firms  have  voluntarily  increased  disclosure  over  time. 
Bloomfield  (2008)  provides  an  excellent  review  of  the  potential  theories  why  managers  may  alter  their  reporting  in 
certain  settings.  The  focus  of  this  study  is  how  report  complexity  affects  different  investors’  trade  behavior  and  not 
necessarily  the  causes  of  increased  disclosure  over  time. 

^  SEC  concerns  over  report  clarity  are  highlighted  in  the  Plain  English  Handbook  (SEC  1998),  while  concerns  over  a  level 
playing  field  are  the  focus  of  recent  regulatory  initiatives  (e.g.,  limit  disclosure  practices  that  discriminate  across  classes 
of  investors  (SEC  2000a)  and  regulate  the  fair  execution  of  trades  (SEC  2000b). 

^  Advances  in  technology  are  likely  to  assist  all  investors  with  accessing  and  processing  more  complex  reports.  However, 
large  investors  may  be  more  likely  to  take  advantage  of  new  parsing  tools  (e.g.,  Perl)  to  analyze  reports,  whereas  less 
sophisticated  investors  may  rely  more  on  public  information  from  sources  such  as  Yahoo!  Finance.  These  differences 
could  further  exacerbate  the  processing  differential  between  small  and  large  investors. 
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The  results  show  the  association  between  more  complex  reports  and  lower  abnormal  trading 
appears  to  be  associated  with  both  cross-sectional  variation  in  firms’  disclosure  attributes  (e.g., 
operational  complexity)  in  a  given  year  as  well  as  variations  in  disclosure  length  over  time  (e.g., 
increased  disclosure  due  to  regulation)  for  a  given  firm. 

While  the  evidence  in  this  study  substantiates  some  regulatory  concerns  regarding  the  length 
and  the  readability  of  mandatory  filings  (SEC  1998),  it  is  unclear  a  priori  which  of  these  com¬ 
plexity  attributes  have  a  larger  effect  on  investors.  Hence,  in  addition  to  investigating  the  effects  of 
length  and  readability  separately,  I  also  investigate  the  effects  of  both  measures  when  included  in 
the  same  model.  I  find  that  when  both  measures  are  analyzed  simultaneously,  the  effects  of  longer 
reports  remain  significant,  while  the  readability  measures  become  insignificant.  This  finding  sug¬ 
gests  the  measures  are  substitutes,  but  the  effects  of  longer  reports  dominate  the  readability  of  the 
filings. 

The  evidence  in  this  study  is  generally  consistent  with  the  theory  of  more  complex  filings 
being  too  costly  for  small  investors  to  process  in  the  short  window  surrounding  the  filing  date. 
This  evidence  should  prove  useful  to  both  academics  and  regulators.  The  findings  specifically 
challenge  some  long-standing  regulatory  assumptions  that  requiring  more  disclosure  will  not  only 
aid  investors  in  their  trading  decisions,  but  also  help  level  the  playing  field  between  small  and 
large  investors.  Thus,  although  the  SEC  has  made  progress  in  assisting  the  small  investor  in 
gaining  electronic  access  to  financial  information  (Asthana  et  al.  2004),  simply  making  more  data 
available  may  not  benefit  small  investors  if  the  reports  are  too  long  and  complex  to  process. 

Section  11  provides  additional  background  and  hypothesis  development.  Section  III  outlines 
the  research  design.  Sample  selection  and  descriptive  data  are  provided  in  Section  IV.  Empirical 
results  are  summarized  in  Section  V.  Section  VI  provides  additional  analyses,  and  Section  VII 
concludes. 


11.  BACKGROUND  AND  HYPOTHESIS  DEVELPMENT 

Efforts  by  the  SEC  to  make  financial  data  more  readable  and  understandable  date  back  to  the 
Securities  Act  of  1933  (Firtel  1999).  In  1969,  the  SEC  released  the  Wheat  Report,  which  indicated 
that  the  length  and  complexity  of  prospectuses  prevented  the  average  investor  from  readily  un¬ 
derstanding  them.  The  report  went  on  to  recommend  against  unnecessarily  long,  complex  and/or 
verbose  writing.  These  concerns  over  lengthy  reports  were  reiterated  by  Arthur  Levitt  in  his  1997 
remarks  to  the  Securities  Regulation  Institute,  when  fie  asserted  that  “[i]n  many  cases,  the  problem 
is  not  a  lack  of  information;  quite  the  opposite.  Too  much  information  can  be  as  much  a  problem 
as  too  little.  More  disclosure  does  not  always  mean  better  disclosure”  (Levitt  1997,  under  “More 
Timely  and  Useful  Disclosure  for  Investors”). 

Under  Levitt’s  leadership,  the  SEC  adopted  the  1998  plain  English  regulation,  SEC  Rule 
421(d),  which  required  issuers  to  use  plain  English  principles  in  the  design  of  prospectuses  (SEC 
1998).  Despite  this  regulation  and  concerns  that  financial  reports  have  simply  become  too  long  for 
investors  to  process  (Paredes  2003),  there  has  been  a  substantial  increase  in  report  length  over  the 
past  decade  (Li  2008).  Whether  this  additional  data  is  beneficial  to  investors  is  unclear.  In  fact, 
some  accounting  practitioners  argue  that  the  useful  information  disclosed  in  annual  filings  is  now 
hidden  among  a  plethora  of  boilerplate,  redundant,  inunaterial,  or  even  irrelevant  data,  making 
disclosures  increasingly  difficult  to  process  (Radin  2007). 

Related  Literature 

Despite  the  importance  practitioners  and  regulators  have  placed  on  disclosure  clarity  and 
length,  there  is  little  large-scale  empirical  evidence  of  effects  of  these  attributes  on  financial 
statement  accessibility.  Most  of  the  early  work  in  this  area  investigates  the  readability  of  financial 
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statements  and  footnotes  in  small  sample  sizes.  Jones  and  Shoemaker  (1994)  summarize  this 
literature,  concluding  that  financial  disclosures  are  difficult  to  read  and  “inaccessible”  to  a  large 
proportion  of  unsophisticated  investors. 

Recent  readability  research  focuses  primarily  on  managerial  incentives  to  disclose  more  or 
less  readable  reports  under  different  circumstances.  Li  (2008)  documents  an  association  between 
length  and  readability  of  10-K  filings  and  both  profitability  and  earnings  persistence,  while  Nelson 
and  Pritchard  (2007)  find  that  firms  subject  to  greater  shareholder  litigation  use  more  readable 
language  in  their  disclosures  and  avoid  boilerplate  warnings.  Both  studies  focus  on  the  managerial 
discretion  in  reporting  attributes,  but  neither  study  links  the  effects  of  reporting  complexity  to 
investor  behavior. 

Another  line  of  research  investigates  the  market  reaction  to  report  complexity.  You  and  Zhang 
(2009)  document  that  the  market  under-reaction  to  10-K  filings  is  stronger  for  firms  with  longer 
10-K  filings.  Other  linguistics  research  on  financial  disclosures  investigates  the  market  response  to 
optimistic  and  pessimistic  tone  in  media  (Tetlock  2007;  Tetldck  et  al.  2008)  and  earnings  press 
releases  (Davis  et  al.  2007).  Li  (2009)  finds  that  MD&A  tone  and  accruals  predict  future  earnings 
and  stock  returns.  Overall,  research  on  length,  readability,  and  tonality  focuses  on  market  re¬ 
sponses,  but  does  not  address  whether  these  reporting  attributes  affect  the  specific  trade  behavior 
of  large  and  small  investors. 

A  concurrent  working  paper  by  Loughran  and  McDonald  (2010)  also  investigates  report 
complexity  of  10-K  filings.  As  discussed  in  greater  detail  in  Section  VI,  there  are  several  differ¬ 
ences  in  the  research  design  and  readability  proxies  between  this  study  and  Loughran  and  Mc¬ 
Donald  (2010).  Despite  these  differences,  both  studies  provide  evidence  that  improvements  in 
report  readability  lead  to  increased  trading  activity  among  small  investors.  Further,  both  studies 
provide  evidence  that  the  effects  are  most  evident  when  readability  is  measured  using  proxies 
based  on  the  SEC’s  plain  English  guidelines.  The  two  studies  differ  in  that  Loughran  and  Mc¬ 
Donald  (2010)  focus  primarily  on  report  readability,  while  the  focus  of  the  current  study  encom¬ 
passes  the  effects  of  both  length  and  readability.'^  This  distinction  is  most  apparent  in  the  evidence 
provided  by  this  study  that  the  effects  of  report  readability  are  subsumed  by  the  effects  of  length 
when  both  measures  are  simultaneously  examined.  Therefore,  this  study  suggests  that  the  effects 
of  length  on  investor  trading  behavior  are  more  important  than  the  effects  of  readability. 

Finally,  in  addition  to  the  direct  contributions  to  the  emerging  literature  on  reporting  com¬ 
plexity,  this  study  also  adds  to  several  large  bodies  of  research  that  (1)  demonstrate  the  benefits  of 
more  informative  disclosures  (Lang  and  Lundholm  1996;  Botosan  1997;  Francis  et  al.  2002),  (2) 
examine  price  and  trading  volume  reactions  to  information  releases  (Beaver  1968;  Morse  1981; 
Cready  and  Mynatt  1991;  Asthana  and  Balsam  2001;  Griffin  2003;  Li  and  Ramesh  2009),  and  (3) 
document  the  differential  trading  behavior  of  small  and  large  investors  to  various  information 
events  (Bhattacharya  2001;  Shanthikumar  2004;  Asthana  et  al.  2004;  Allee  et  al.  2007;  Bhatta- 
charya  et  al.  2007;  Mikhail  et  al.  2007).  Although  this  literature  provides  a  necessary  background 
for  the  research  conducted  in  this  study,  it  does  not  provide  any  direct  evidence  on  the  effects  of 
report  complexity  on  different  classes  of  investors. 

Hypothesis  Development 
Trading  Volume 

Consistent  with  evidence  that  trading  volume  is  the  most  visible  indicator  of  investors’  re¬ 
sponse  to  public  disclosures  (Cready  and  Hurtt  2002),  I  focus  on  abnormal  trading  activity  around 


^  Loughran  and  McDonald  (2010)  do  include  a  measure  of  length  in  their  regressions.  However,  the  authors  investigate 
the  effect  of  a  change  in  readability  on  changes  in  abnormal  trading  and  include  the  level  (not  change)  in  length.  Thus, 
any  deviations  in  length  over  time  are  not  directly  examined  in  their  analysis. 
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the  10-K  filing  to  assess  investors  response  to  variations  in  report  complexity.  I  hypothesize  that 
faced  with  more  complex  filings,  some  investors  will  elect  not  to  process  the  report  because  doing 
so  is  too  costly.  Consequently,  these  investors  will  not  initiate  trades  in  response  to  the  report 
(Grossman  and  Stiglitz  1980;  Bloomfield  2002).  After  controlling  for  the  information  content  in 
the  report,  I  expect  that  more  complex  filings  will  be  associated  with  lower  total  trading  volume. 
Formally,  the  first  hypothesis  (in  alternative  form)  is  as  follows: 

Hla:  Total  abnormal  trading  around  a  10-K  filing  is  lower  when  reports  are  more  complex 
(e.g.,  longer/less  readable). 

Although  more  complex  reports  are  likely  to  have  some  effect  on  all  traders,  I  predict  that  the 
effects  of  these  attributes  will  be  most  pronounced  in  the  small  investor  group.  This  prediction  is 
consistent  with  prior  experimental  research  that  suggests  small  (nonprofessional)  investors  lack 
investment  expertise  and  have  ill-defined  valuation  models  (Maines  and  McDaniel  2000).  Specifi¬ 
cally,  these  non-professional  investors  tend  to  read  financial  statements  in  the  order  presented  and 
are  therefore  more  likely  to  be  affected  by  more  complex  reports  than  more  sophisticated  investors 
(analysts),  who  use  directed  information  search  strategies  to  analyze  financial  statements  (Bouw- 
man  et  al.  1987;  Hunton  and  McEwen  1997). 

Although  experimental  evidence  suggests  that  small  investors  are  most  likely  to  be  affected 
by  report  attributes,  other  research  indicates  that  even  sophisticated  investors  are  affected  by 
information  complexity.  For  instance,  prior  experimental  research  documents  that  the  way  finan¬ 
cial  statement  data  is  presented  can  influence  analysts’  judgments  (Hopkins  1996;  Hirst  and  Hop¬ 
kins  1998).  Consistent  with  these  experiments,  Plumlee  (2003)  finds  that  analysts  assimilate  in¬ 
formation  in  less  complex  tax  changes  to  greater  extent  than  they  assimilate  the  more  complex 
changes,  while  a  concurrent  study  by  Lehavy  et  al.  (2009)  documents  that  less  readable  reports  are 
associated  with  less  accurate  analyst  forecasts  and  greater  dispersion.  In  sum,  these  studies  suggest 
that  even  professional  investors  may  not  be  immune  to  report  complexity.  Thus,  although  the 
effects  of  complex  reports  are  likely  to  be  most  pronounced  among  smaller  investors,  there  is 
reason  to  believe  that  larger,  more  sophisticated  investors  could  also  be  affected.  Hence,  I  hypoth¬ 
esize  the  following  (in  alternative  form): 

Hlb:  The  effects  of  report  complexity  (i.e.,  longer/less  readable)  on  abnormal  trading  are 
most  pronounced  among  small  investors. 

Trading  Consensus 

In  addition  to  trading  volume,  I  also  investigate  whether  more  complex  reports  lead  to  dis¬ 
persion  in  the  inferences  drawn  by  those  investors  who  do  elect  to  trade.  Specifically,  I  investigate 
whether  small  (large)  investors  process  the  data  in  a  different  way  from  other  small  (large) 
investors  when  reports  are  more  complex.  This  investigation  contributes  to  a  large  stream  of 
research  on  the  causes  and  implications  of  disagreement.^  Furthermore,  this  study  builds  on  prior 
experimental  evidence  that  finds  that  increases  in  disclosure  lead  nonprofessional  investors  to 
generate  (or  infer)  private  information,  while  sophisticated  professional  investors  are  unaffected 
by  the  increased  disclosure  (Barron  et  al.  2004).  The  authors  interpret  this  evidence  as  larger 
investors  (smaller  investors)  being  more  homogeneous  (diverse)  with  respect  to  their  training  and 


^  Most  analytical  models  suggest  that  disagreement  of  some  form  drives  trading  around  disclosure  releases  (Kim  and 
Verrecchia  1991,  1994,  1997;  Kandel  and  Pearson  1995).  Karpoff  (1986)  raises  the  possibility  that  this  disagreement 
could  stem  from  information  interpreted  differently  by  market  agents.  Subsequent  empirical  work  by  Barron  (1995)  and 
Bamber  et  al.  (1997,  1999)  provides  evidence  that  the  trading  around  earnings  announcements  is  at  least  partially 
attributable  to  newfound  disagreement.  Recent  analytical  work  by  Bloomfield  and  Fischer  (2009)  highlights  the  impor¬ 
tance  of  studying  disagreement,  since  different  forms  of  disagreement  among  investors  can  affect  a  firm’s  cost  of  capital. 
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occupational  selection  and  therefore  more  (less)  likely  to  have  common  interpretations  of  disclo¬ 
sures.  I  investigate  whether  these  findings  regarding  small  and  large  investor  behavior  hold  in  an 
archival  market-based  study. 

As  discussed  in  greater  detail  later,  I  measure  small  and  large  investors’  trading  consensus  as 
the  absolute  value  of  daily  net  buyer  initiated  (buyer  minus  seller  initiated)  shares  traded,  deflated 
by  total  shares  (buyer  plus  seller  initiated)  traded  on  the  same  day.  If  complexity  results  in  a  high 
level  of  disagreement  within  an  investor  group,  then  buyer  and  seller  initiated  trades  would  offset 
and  trading  consensus  would  approach  0.  Alternatively,  if  investors  agree  about  the  information 
content  of  the  report,  then  they  are  more  likely  to  trade  in  the  same  direction  and  trading  consensus 
would  approach  1.  Based  on  these  arguments,  I  hypothesize  that,  conditional  on  trade,  more 
complex  reports  will  lead  to  more  disagreement  within  an  investor  class.  Formally,  I  hypothesize 
the  following  (stated  in  alternative  form): 

H2:  Trading  consensus  within  a  class  of  investors  is  lower  when  10-K  reports  are  more 
complex  (e.g.,  longer/less  readable). 

III.  RESEARCH  DESIGN 

The  research  design  consists  of  several  stages.  The  first  stage  examines  trading  responses  to 
complex  10-K  filings  using  abnormal  volume  in  aggregate  as  well  as  the  specific  trading  behavior 
of  small  and  large  investors.  The  second  stage  examines  the  effects  of  length  on  small  and  large 
investor  trading  consensus.  Finally,  I  examine  whether  these  associations  are  driven  by  cross- 
sectional  variation  in  firms’  disclosure  attributes  and/or  variations  in  disclosure  length  over  time. 

Total  Abnormal  Trading 

To  examine  the  effects  of  complex  reports  on  the  combined  trading  activity  of  small,  medium, 
and  large  investor  groups,  I  estimate  the  following  regression  model: 


AVOLi,  =  aQ  +  ayCOMPLEXi,  +  a2AFTEADn  +  a^AFTEXDi,  +  a^ABS_RETu  +  a^PVOLit 

+  +  ayMTBi,  +  a^EARNINGSi,  +  agNY_EARNINGSit  +  a^QNA_FOLLi, 

+  xNBSEGii  -I-  ai2NGSEGij  +  ( 1 ) 

where  the  dependent  variable,  AVOL,  is  the  measure  of  abnormal  trading  volume.  The  use  of 
abnormal  volume  (versus  raw  volume)  should  mitigate  the  effect  of  both  technological  trading 
changes  (e.g.,  increased  online  trading)  as  well  as  other  factors,  such  as  increases  in  decimalized 
trading  during  the  sample  period  (Barber  et  al.  2009).  Consistent  with  Asthana  et  al.  (2004),  I 
define  abnormal  trade,  AVOL,  as  the  mean  daily  trading  volume  during  the  event  period  (—1,  3) 
minus  the  mean  daily  trading  volume  during  the  non-filing  period  (—49,  —5),  deflated  by  the 
standard  deviation  of  daily  trading  volume  during  the  non-filing  period  (—49,  —5).^ 

To  determine  whether  the  event  windows  implemented  above  are  reasonable,  I  examine  the 
average  daily  trade  volume  scaled  by  shares  outstanding  classified  for  each  investor  group  (i.e., 
small  and  large  traders)  in  the  period  surrounding  the  filing.  Figure  1,  Panel  A  provides  confirming 
evidence  that  most  of  the  small  trading  volume  related  to  the  10-K  filing  appears  to  be  captured  in 


®  In  addition  to  abnormal  volume,  1  also  examine  the  effects  of  report  complexity  on  abnormal  number  of  transactions, 
ANUM.  Results  using  the  number  of  transactions  are  quantitatively  similar  to  the  volume  results  (untabulated).  I  also 
measure  excess  volume  as  in  Barron  et  al.  (2005),  where  EXVOL  (EXNUM)  is  defined  as  the  natural  log  of  the 
cumulative  trading  volume  (transactions)  over  the  five-day  event  period  (—1,  3)  minus  the  natural  log  of  the  median 
volume  (transactions)  for  contiguous  five-day  periods  during  the  non-filing  period  (—49,  —5).  Results  using  EXVOL 
{EXNUM)  are  consistent  with  the  AVOL  results,  where  small  investors  are  significantly  affected  by  more  complex 
reports  and  (in  most  cases)  small  investors  are  more  affected  than  larger  investors. 
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FIGURE  1 

Average  Daily  Trading  Volume  around  10-K  Filing  Dates 
Panel  A:  Small  Investors’  Daily  Trading  Volume 


Panel  B:  Large  Investors’  Daily  Trading  Volume 


Daily  trading  volume  is  the  number  of  shares  traded  scaled  by  total  number  of  shares  outstanding  by  investor 
group  (i.e.,  small  or  large).  Panel  A  (B)  plots  the  mean  of  small  (large)  trade  volume. 


the  —  1 ,  3  day  event  period.  Panel  B  reveals  that  the  trading  behavior  for  large  investors  appears 
to  be  less  concentrated  around  the  actual  filing  date,  which  is  consistent  with  large  investors 
relying  less  on  the  information  in  the  10-K  filing,  potentially  due  to  their  ability  to  gain  access  to 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


2114 


Miller 


similar  information  prior  to  the  release  of  the  actual  filing.  Although  the  short  event  window 
appears  to  capture  the  trading  specifically  related  to  the  10-K  filing,  I  provide  supplementary 
analysis  of  longer  windows  in  Section  VI. 

COMPLEX  is  the  relevant  measure  of  length  or  readability.  Report  length  is  measured  as 
WORDS  or  WORDS +TABLES.  Because  of  skewness  in  the  raw  number  of  words,  I  follow  Li 
(2008)  and  define  WORDS  as  the  logarithm  of  the  number  of  words  in  the  entire  document. 
Although  this  measure  provides  a  useful  metric  of  the  amount  of  data  in  text  format,  it  fails  to 
incorporate  data  included  in  tables.  Therefore,  I  also  examine  WORDS +TABLES,  which  is  defined 
as  the  natural  log  of  the  total  words  plus  table  cells  contained  in  the  document.  Since  the  ability  to 
count  the  number  of  table  cells  requires  compliance  with  a  specific  HTML  standard,  the  measure 
is  only  available  for  a  subset  of  firms  after  1999.  Appendix  A  provides  details  on  the  methods  used 
to  create  these  variables. 

I  first  measure  readability  using  the  Fog  Index,  READ_FOG,  which  was  originally  developed 
by  Gunning  (1952).  This  measure  provides  a  simple,  well-known,  and  widely  accepted  formula  for 
measuring  readability.  However,  the  Fog  Index  was  not  developed  to  specifically  measure  the 
attributes  of  financial  reporting  complexity.  Thus,  I  also  use  a  proprietary  computational  software 
program,  StyleWriter — Plain  English  Editor,  to  develop  a  unique  multidimensional  measure  of 
financial  readability  based  on  the  writing  factors  outlined  by  the  SEC  plain  English  guidelines, 
READ_PE^  As  described  in  more  detail  in  Appendix  B,  READJPE  is  designed  similar  to  READ- 
_EOG  so  that  higher  readability  scores  are  interpreted  as  less  readable  reports. 

In  the  first  set  of  tests,  I  control  for  potential  firm  characteristics  that  may  affect  abnormal 
trading.  For  instance,  Asthana  et  al.  (2004)  point  out  the  importance  of  controlling  for  the  effect  of 
the  timing  of  the  10-K  filing  on  trading  activity.  To  control  for  the  effects  of  earnings  information 
disclosed  prior  to  the  10-K  release,  I  include  the  variable  AETEAD,  which  is  defined  as  the  number 
of  days  after  the  preliminary  earnings  announcement  date  that  the  10-K  is  filed.  I  expect  that  the 
longer  after  the  earnings  announcement  the  10-K  is  filed,  the  lower  the  abnormal  trading  around 
the  filing. 

Prior  research  also  shows  that  firms  delay  releasing  bad  news  (Kross  and  Schroeder  1984) 
and,  consequently,  the  timeliness  of  that  release  affects  the  market  reaction  (Chambers  and  Pen¬ 
man  1984).  Based  on  these  findings,  I  also  include  AETEXD,  which  is  defined  as  the  number  of 
days  the  current  10-K  is  filed  after  the  date  of  the  previous  year’s  10-K  filing.  I  expect  a  positive 
association  between  the  length  of  the  reporting  delay  after  the  expected  filing  date  (i.e.,  previous 
year’s  filing  date),  and  abnormal  trading  volume. 

To  control  for  the  information  content  of  the  report,  I  include  ABS_RET,  defined  as  the 
absolute  value  of  the  firm’s  abnormal  (market  model)  return  calculated  over  the  five-day  event 
window  (—1,  3).  I  expect  that  greater  information  content  will  lead  to  higher  abnormal  trading 
volume  (Bamber  and  Cheon  1995;  Bamber  et  al.  1997).  1  control  for  uncertainty  in  the  period  prior 
to  the  10-K  filing  by  including  PVOL,  defined  as  the  standard  deviation  of  stock  price  in  the  period 
prior  to  the  filing  (—49,  —5).  Theory  does  not  provide  a  clear  prediction  on  the  effects  of  PVOL. 
I  control  for  firm  size  by  including  the  independent  variable  MV,  the  logarithm  of  the  market  value 


’  Consistent  with  prior  literature  (Li  2008;  Collins-Thompson  and  Callan  2005;  Muresan  et  al.  2006),  I  measure  Fog 
readability  as  the  {(words  per  sentence  +  percent  of  complex  words) *  *  .4).  Additional  information  on  how  Fog  read¬ 
ability  is  calculated  is  provided  in  Appendix  B. 

*  I  create  the  plain  English  readability  measure,  defined  as  the  number  of  errors  identified  by  Stylewriter  that  match  the 
plain  English  concerns  of  the  SEC.  Specifically,  I  measure  the  number  of  errors  defined  by  the  program  as  [(Passive 
Verbs  -I-  Hidden  Verbs  -f  Overwriting  -I-  Legal  Words  &  Jargon  +  Tautologies)  *  10]  and  scale  by  the  approximate 
number  of  sentences  in  the  document  (number  of  words/average  sentence  length).  The  scalar  parses  out  the  effects  of  the 
document  length,  but  perhaps  more  importantly  addresses  the  SEC’s  concern  about  sentence  length  (longer  sentences 
decrease  the  denominator  and  thus  increase  the  readability  measure). 
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of  equity  at  the  end  of  the  fiscal  period.  The  effects  of  MV  on  abnormal  trading  volume  are  not 
straightforward.  Larger  firms  are  likely  to  have  better  information  environments  (Atiase  1985; 
Bamber  1987),  so  firm  size  could  be  associated  with  lower  abnormal  trading  around  the  filing. 
However,  large  firms  could  also  attract  more  attention-based  trading,  leading  to  higher  abnormal 
volume  around  the  filing  (Merton  1987).  Thus,  my  prediction  on  MV  is  nondirectional.  I  also 
control  for  a  firm  s  future  growth  opportunities  on  volume,  MTB,  defined  as  the  market  value  to 
book  value  of  equity  at  the  close  of  the  current  fiscal  year.  Theory  does  not  provide  a  clear 
prediction  on  MTB. 

Li  (2008)  documents  that  firms  with  lower  earnings  tend  to  have  longer  and  less  readable 
reports.  To  control  for  the  possibility  that  LENGTH  is  merely  proxying  for  earnings  information, 
I  follow  Li  (2008)  and  include  EARNINGS,  which  is  defined  as  operating  earnings  scaled  by  total 
assets.  Li  (2008)  also  documents  that  firms  with  shorter  reports  tend  to  have  higher  earnings 
persistence.  Specifically,  Li  finds  that  shorter  reports  are  associated  with  a  greater  ability  to  predict 
subsequent  earnings.  In  order  to  address  the  possibility  that  report  length  is  simply  proxying  for 
information  related  to  next  year’s  earnings,  I  include  NY__EARNINGS  as  a  proxy  for  the  expecta¬ 
tion  of  future  earnings.^  Theory  does  not  provide  a  clear  prediction  on  the  effects  EARNINGS  or 
NY_EARNINGS  on  abnormal  volume. 

I  expect  that  there  will  be  less  new  information  in  the  10-K  filings  for  firms  with  greater 
analyst  coverage.  As  such,  I  include  NA_FOLL,  which  I  define  as  the  log  of  1  plus  the  number  of 
analysts  as  reported  on  I/B/E/S  detail  and  excluded  files.  My  expectation  is  that,  all  else  equal, 
higher  analyst  following  will  be  associated  with  lower  abnormal  trade  volume.  I  attempt  to  control 
for  underlying  firm  complexity  in  this  analysis  by  including  the  logarithm  of  1  plus  the  number  of 
business  segments  (NBSEG)  and  geographic  segments  (NGSEG),  as  reported  at  the  end  of  the 
fiscal  period. Theory  does  not  provide  a  clear  prediction  on  the  effects  NBSEG  or  NGSEG  on 
abnormal  volume.  To  minimize  the  effects  of  outliers,  all  variables  are  winsorized  by  year  at  the 
top  and  bottom  1  percent.  Finally,  all  regressions  are  performed  with  clustered  robust  standard 
errors  (Huber  1967;  White  1980)  to  control  for  both  within  firm  and  within  year  correlation.  Table 
1  summarizes  all  variable  definitions. 

Small  and  Large  Investors  Abnormal  Trading 

In  addition  to  examining  the  total  trade  activity  of  all  investors,  I  also  examine  the  specific 
trade  activity  of  small  and  large  investors  by  estimating  the  following  equation: 


AVOLif  =  ao  +  SML  *  [aiCOMPLEXij  +  a2AETEADij  -i-  a^AETEXDu  +  a^ABS_RETif 
+  a^RVOLi,  +  a^MVij  +  a^MEB^  +  a^EARNINGSit  +  a^Y_EARNINGSit 


+  a^oNA^FOLLit  -i-  anNESEG^  +  a^fVGSEGi^  +  y8o  +  *  [piCOMPLEXi^ 

+  /32AFTEAD,,  +  fi^AFTEXD,,  +  p^ABS_RETu  +  +  f^^MV^,  +  tSjMTBi, 

+  p^EARNINGSit  +  /3gNY_EARNINGSii  -t  /3^o^A_FOLLii  +  (3^  ^NESEG^, 

+  f,2NGSEGi,]  +  s,,.  (2) 


®  Perfect  foreknowledge  of  next  year’s  earnings  is  admittedly  a  strong  assumption.  However,  the  use  of  next  year’s 
earnings  as  a  proxy  for  persistence  (ability  of  report  attributes  to  predict  next  year’s  earnings)  does  not  appear  to  bias 
toward  finding  that  more  complex  reports  affect  trading  behavior.  Further,  the  exclusion  of  this  control  variable  does  not 
affect  the  results. 

In  addition  to  NBSEG  and  NGSEG,  I  also  add  controls  to  all  regressions  for  whether  a  firm  had  a  merger  during  the  year 
as  this  would  also  increase  firm  complexity.  I  find  that  the  merger  control  does  not  affect  the  significance  of  the  results 
reported  in  this  study  (untabulated). 
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Variable 

TABLE  1 

Variable  Definitions 

Definitions  (Alphabetical  Order) 

ABSJRET 

Market  adjusted  return,  defined  as  the  absolute  value  of  the  abnormal  (market 
model)  share  price  return  over  the  event  period  (—1,  3). 

AFTEAD 

Number  of  days  after  the  earnings  announcement  date,  defined  as  the  number  of 
calendar  days  the  10-K  is  filed  after  the  preliminary  earnings  announcement  date. 

AFTEXD 

Number  of  days  after  the  expected  filing  date,  defined  as  the  number  of  calendar 
days  the  10-K  is  filed  after  the  expected  filing  date  (last  year’s  10-K  filing  date). 

AVOL 

Abnormal  trading  volume,  defined  as  the  mean  daily  trading  volume  during  the 
event  period  (—1,  3)  minus  the  mean  daily  trading  volume  during  the  non-filing 
period  (-49,-5^  deflated  by  the  standard  deviation  of  daily  trading  volume  during 
the  non-filing  period  (-49,-5). 

COMPLEX 

Relevant  measure  of  reporting  complexity  (i.e.,  WORDS,  WORDS +TABLES, 
READ_FOG,  or  READJPE). 

EARNINGS 

Operating  earnings  scaled  by  assets,  defined  as  Compustat  items  (datal78/data6). 

ERG 

Large  trades:  When  suffixed  to  a  variable,  implies  large  trades  (greater  than  or 
equal  to  $50,000). 

NA_FOLL 

Number  of  analysts  following  the  firm,  natural  logarithm  of  1  plus  the  number  of 
unique  analysts  providing  a  forecast  from  the  I/B/E/S  Detail  and  Excluded  Files. 

MV 

Market  value  of  equity,  defined  as  the  natural  logarithm  of  market  value  of 
common  equity  at  the  close  of  the  current  fiscal  year  (data25  *  data  199)  from 
Compustat. 

MTB 

Market-to-book,  defined  as  the  market  value  of  the  firm’s  equity  divided  by  its 
book  value  ((data24  *  data25)/data60)  from  Compustat. 

NBSEG 

Number  of  business  segments,  defined  as  the  natural  log  of  1  plus  the  number  of 
business  segments. 

NGSEG 

Number  of  geographic  segments,  defined  as  the  natural  log  of  1  plus  the  number 
of  geographic  segments. 

NY_EARNINGS 

Next  year’s  operating  earnings  scaled  by  assets,  defined  as  Compustat  items  (data 
178/data  6). 

PVOL 

Price  volatility,  defined  as  the  standard  deviation  of  the  firm’s  stock  price  prior  to 
the  10-K  filing  (-49,-5). 

READ_FOG 

Fog  index,  calculated  as  [(words  per  sentence  -1-  percent  of  complex  words)  *  .40] 
using  Perl’s  En  Fathom  module. 

READ_PE 

Plain  English  index,  calculated  using  Stylewriter  software  as  described  in 

Appendix  2  as  10  *  ((#  of  errors  attributable  to:  Passive  Verbs  -t-  Hidden  Verbs  + 
Overwriting  +  Legal  Words  &  Jargon  -1-  Tautologies)/(number  of  sentences)). 

SML 

Small  trades:  When  suffixed  to  a  variable,  implies  small  trades  (if  share  price  is  < 
$50,  then  dollar  amount  of  trade  is  less  than  or  equal  to  $5,000;  if  share  price 
>50,  but  less  than  or  equal  to  100,  then  dollar  amount  of  trade  is  less  than  or 
equal  to  100  *  price;  if  share  price  is  greater  than  100,  then  the  firm  is  excluded 
from  the  sample). 

WORDS +TABLES 

Natural  logarithm  of  the  total  of  number  of  words  plus  the  number  of  table  cells 
included  in  each  document. 

(continued  on  next  page) 
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TABLE  1  (continued) 


Variable 

TRADE_  CONS 

WORDS 


_ _ Definitions  (Alphabetical  Order) 

Trade  Consensus,  defined  as  the  absolute  value  of  the  daily  net  buys  during  the 
event  period  (buys  minus  sales),  deflated  by  the  total  number  of  shares  traded 
(buys  plus  sales)  during  the  event  period. 

Natural  logarithm  of  the  number  of  words  included  in  the  10-K  report. 


Consistent  with  Bhattacharya  (2001),  trades  are  assumed  to  be  made  by  small  investors  if  the 
dollar  amount  of  the  trade  is  less  than  or  equal  to  $5,000;  when  the  dollar  amount  of  the  trade  is 
greater  than  or  equal  to  $50,000,  the  trade  is  assumed  to  be  made  by  a  large  investor.  Consistent 
with  prior  literature,  I  ignore  medium-sized  trades  to  create  a  “buffer-zone”  between  the  trading 
activities  of  small  and  large  investors.”  Consistent  with  Asthana  et  al.  (2004),  I  modify  this 
formulation  so  that  for  firms  with  a  share  price  greater  than  $50, 1  define  small  trades  as  those  less 
than  or  equal  to  100  times  the  share  price. However,  in  order  to  keep  a  sufficient  buffer  between 
small  and  large  trades,  I  exclude  firms  with  a  share  price  greater  than  $100  (i.e.,  the  maximum 
small  trade  in  my  sample  is  $10,000). 

Equation  (2)  represents  the  stacking  of  two  regressions  that  allows  statistical  tests  of  differ¬ 
ences  in  coefficient  estimates  across  the  small  and  large  investor  groups.  SML  (LRG)  is  an  indi¬ 
cator  variable  equal  to  1  when  the  dependent  variable  represents  the  abnormal  trading  behavior  for 
small  (large)  investors,  and  0  otherwise.  Accordingly,  the  a  (0)  coefficients  measure  associations 
between  small  (large)  investor  trade  behavior  and  the  independent  variables. My  expectation  for 
both  small  and  large  trade  groups  is  that  the  coefficients  on  COMPLEX  will  be  negative  if 
increases  in  complexity  result  in  increased  processing  costs  leading  to  lower  abnormal  trade.  I  also 
examine  whether  small  investors  are  more  likely  to  be  affected  by  more  complex  reports.  Accord¬ 
ingly,  I  provide  a  test  of  coefficients  across  SML  *  COMPLEX  and  LRG  *  COMPLEX  and  expect 
the  effects  of  report  complexity  to  be  greater  for  small  investors. 

Trading  Consensus 

I  also  examine  whether  increases  in  report  complexity  lead  investors  within  the  same  class 
(e.g.,  small  investors)  to  draw  different  inferences  from  the  same  reported  information.  To  gauge 
the  amount  of  trade  consensus  by  investor  group  around  a  10-K  filing,  I  first  classify  all  buyer  and 
seller  initiated  trades  using  the  standard  Lee  and  Ready  (1991)  algorithm.*"^  Next,  I  create  a 


"  Lee  and  Radhakrishna  (2000)  find  that  these  trade  size  bins  perform  well  In  identifying  trades  initiated  by  individual 
investors  and  institutions.  Eliminating  medium-sized  trades  increases  the  power  of  the  test,  since  large  investors  may  try 
to  break  up  their  trades  to  disguise  their  identity  (Kyle  1985;  Meulbroek  1992;  Barclay  and  Warner  1993)  but  for  a 
variety  of  reasons  are  unlikely  to  make  very  small  trades  (Bhattachai7a  et  al.  2007). 

Consistent  with  prior  literature,  I  also  exclude  the  opening  trade  because  it  is  often  the  sum  of  multiple  orders  and 
including  it  could  add  noise  to  the  measures  (Lee  and  Ready  1991;  Lee  1992;  Bhattacharya  et  al.  2007).  Further,  I  only 
include  trades  with  a  “regular  sales”  condition  code.  Bhattacharya  et  al.  (2007,  587)  point  out  that  these  “trades  result 
from  continuous  two-sided  auctions  involving  market  orders,  limit  orders,  and  buys  and  sells  against  the  specialists’ 
inventories.  This  is  not  the  case  when  the  condition  code  indicates  something  other  than  a  ‘regular  sale’  (e.g.,  large  block 
trades  or  stopped  orders).” 

See  Maddala  (2001)  for  discussion  of  stacked  regressions.  Since  the  technique  captures  any  (potential)  correlations 
across  the  error  terms,  stacking  allows  statistical  tests  of  coefficients  across  equations  (i.e.,  small  and  large  investor 
groups). 

This  algorithm,  validated  by  Lee  and  Radhakrishna  (2000),  classifies  trades  above  (below)  the  bid/ask  midpoint  as  buyer 
(seller)  initiated.  Trades  occurring  at  the  midpoint  are  classified  using  the  tick  test,  which  compares  the  trade  price  to 
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variable  TRADEjCONS,  defined  as  the  absolute  value  of  daily  net  buyer  initiated  shares  traded 
(buyer  minus  seller  initiated)  deflated  by  total  volume  of  shares  traded  (absolute  value  of  buyer 
plus  seller  initiated)  on  the  same  day  for  each  investor  group  (e.g.,  small) I  then  replace  the 
dependent  variable,  AVOL,  in  Equation  (2)  with  TRADE_CONS. 

If  small  or  large  investors  process  the  10-K  and  agree  (disagree)  on  the  information’s  effect  on 
the  firm’s  terminal  value,  they  are  likely  to  trade  in  a  similar  (different)  direction  to  other  investors 
in  that  same  group  leading  to  higher  (lower)  TRADE_CONS.  As  predicted  in  H2,  I  expect  more 
complex  reports  to  increase  disagreement  for  both  small  and  large  investor  groups.  Hence,  I 
predict  negative  coefficients  on  all  measures  of  COMPLEX. 

IV.  SAMPLE  AND  DESCRIPTIVE  DATA 

Sample  Collection 

I  gather  the  sample  by  downloading  all  10-K  and  10-K405  filings  on  EDGAR  from  1994  to 
2006.*^  I  require  a  firm-year  match  on  PERMNO  (from  CRSP),  GVKEY  (from  Compustat),  CIK 
(from  EDGAR),  and  TAQ  data  for  small  and  large  volume  around  the  10-K  filing. Consistent 
with  prior  research,  I  eliminate  observations  where  the  ticker  identifier  on  TAQ  changed  during  the 
year,  the  stock  price  was  less  than  one  dollar,  or  the  firm  had  a  stock  split  or  issued  a  stock 
dividend  during  the  year.  Using  the  process  described  in  Appendix  A,  I  also  eliminate  all  obser¬ 
vations  with  insufficient  text  remaining  after  I  eliminated  the  header,  table,  and  other  information. 
In  order  to  calculate  the  AFTEXD  variable,  I  require  the  prior  year  filing  date  to  be  available  on 
EDGAR,  which  eliminates  all  1994  observations  and  a  significant  number  of  other  observations. 
Finally,  I  eliminate  observations  where  there  is  missing  Compustat  data  (e.g.,  MV,  ASSETS)  or 
where  analyst  following  is  unavailable  on  I/B/E/S.  The  final  sample  consists  of  3,809  unique  firms 
with  12,771  firm  year  observations.  This  number  reduces  to  4,724  observations  for  WORDS 
+TABLES  analysis.  Table  2  provides  a  detailed  description  of  the  sample  selection  procedure. 

Sample  Characteristics 

Table  3  provides  sample  characteristics.  Panel  A  provides  descriptive  statistics  for  several 
variables  of  interest.  The  positive  aggregate  mean  for  AVOL  is  0.17,  and  the  means  for  small 
investor  AVOL  and  large  investor  AVOL  are  0.22  and  0.18,  respectively.  The  positive  value  for 
small  investor  AVOL  is  consistent  with  survey  evidence  small  traders  appear  to  trade  on  the 
information  contained  in  the  10-K.  The  mean  for  small  investor  TRADEjCONS  is  0.24,  whereas 
the  mean  for  large  investor  TRADEjCONS  is  0.52.  These  averages  are  consistent  with  the  general 
notion  that  small  traders  are  more  likely  to  have  disagreement  surrounding  information  events  than 


adjacent  trades.  Under  the  tick  test,  a  trade  is  classified  as  buyer  (seller)  initiated  when  the  price  is  higher  (lower)  than 
the  price  of  the  previous  trade.  In  cases  when  the  price  is  the  same  as  the  previous  trade  (a  zero  tick),  the  classification 
of  buyer  (seller)  initiated  trades  defaults  to  the  last  trade  where  there  was  a  price  change. 

The  limit  order  trader  on  the  other  side  of  the  buyer  or  seller  initiated  trade  could  be  an  investor  from  the  same  or  a 
different  size  class.  For  example,  the  other  side  of  a  small  buyer  or  seller  initiated  trade  could  be  another  small  investor, 
the  market  maker,  or  a  large  or  medium  passive  trader  whose  limit  order  is  broken  up  into  smaller  lots.  The  important 
factor  is  not  whether  the  limit  order  investor  on  the  other  side  of  the  buyer/seller  initiated  transaction  is  a  small  or  large 
trader,  but  rather,  whether  the  transaction  was  initiated  as  a  buy  or  sale  by  a  small  or  large  trader. 

Li  and  Ramesh  (2009)  show  that  the  stock  market  and  volume  reaction  for  quarterly  and  small  business  filings  disappear 
after  controlling  for  the  concurrent  release  of  earnings  information.  As  such,  I  exclude  small  business  filers  and  restrict 
my  investigation  to  annual  filings.  Forms  10-K  and  10-K405  are  identical  in  substance,  except  that  form  10-K405 
indicates  that  an  officer  or  director  failed  to  file  a  Form  4  (or  Form  3  or  5)  within  the  required  time  period. 

For  comparative  purposes  across  small  and  large  investors,  I  ensure  that  each  finn  analyzed  has  both  small  and  large 
trade  activity  by  eliminating  a  trivial  number  of  observations  that  have  no  trade  for  an  investor  type  during  the  control 
and  five-day  window  around  the  10-K  filing.  The  results  are  unaffected  by  the  inclusion  of  these  observations. 

All  AVOL  coefficients  are  negative  at  the  median,  which  is  consistent  with  Asthana  et  al.  (2004)  and  consistent  with  the 
earnings  releases  having  a  significant  impact  on  pre-period  volume. 
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TABLE  2 
Sample  Selection 


No.  Firm- Year 
Observations 


10-K  Filings  from  EDGAR  (1994-2006)  with  GVKEY  and  PERMO  Identifiers  21  189 

and  TAQ  data 

less  where  the  TAQ  ticker  changed  during  the  year  (129) 

less  where  there  was  a  stock  split  or  stock  dividend  (1,523) 

less  where  the  stock  price  was  less  than  $1  (122) 

less  where  the  filing  is  missing  LENGTH  and  EOG  from  Perl  En  Fathom  (4) 

less  where  there  is  missing  prior  year  filing  date  (AETEXD)  (3,763) 

less  missing  Compustat  data  (MV,  ASSETS,  NBSEG,  NGSEG,  etc.)  (1,002) 

less  missing  I/B/E/S  data  to  calculate  analyst  following  (1,875) 

Observations  available  for  primary  analysis  12,771 

Observations  available  for  #  of  table  cells  in  10-K  available  4  724 

(WORDS +TABLES)  calculation 


The  ability  to  count  the  number  of  table  cells  requires  firms  to  comply  with  a  specific  HTML  standard.  Hence,  WORDS 
+TABLES  is  only  available  for  a  subset  of  firms  after  1999. 


larger  more  sophisticated  investors.  The  mean  ABSJRET  is  4.8  percent,  indicating  an  abnormal 
market  reaction  during  the  10-K  filing  period.  The  mean  WORDS  is  10.39,  which  translates  into  an 
average  of  32,533  words  per  10-K,  while  the  mean  Fog  Index  is  19.94,  which  classifies  the  vast 
majority  of  financial  statements  as  unreadable  according  to  most  interpretations  of  the  index. 

Panel  B  (C)  of  Table  3  provides  information  on  report  length  (readability)  across  time.  Con¬ 
sistent  with  Li  (2008),  there  is  a  significant  increase  in  WORDS  and  WORDS +TABLES  over  the 
sample  period  investigated.^^  Consistent  with  the  increase  in  length,  Panel  C  reveals  that  there  is 
also  a  significant  increase  in  READ_FOG,  indicating  that  reports  have  become  less  readable  over 
time.  However,  this  pattern  in  readability  is  not  evident  in  READ_PE,  where  there  is  no  significant 
difference  in  readability  during  the  sample  period.  In  summary,  there  appears  to  be  a  significant 
increase  in  report  length  over  time,  but  the  evidence  for  a  corresponding  change  in  readability  is 
only  evident  when  the  Fog  measure  of  readability  is  employed. 

V.  REGRESSION  RESULTS 

Total  Trading  Activity 

Table  4  reports  the  multivariate  regression  results  for  the  effects  of  complexity  on  abnormal 
trading  volume  (AVOL).  Overall,  I  find  evidence  consistent  with  more  complex  reports  affecting 
total  trading  volume.  Specifically,  both  coefficients  related  to  longer  reports  as  reported  in  columns 
(1)  and  (2)  of  Panel  A  in  Table  4  are  significantly  negative  (p  <  .01).^°  While  both  measure  of 
readability  are  negative,  only  the  readability  measure  based  on  plain  English  attributes  as  pre¬ 
scribed  by  the  SEC  is  significant  (p  <  .01). 

Although  the  majority  of  control  variables  are  significant  in  the  expected  direction  in  Table  4, 
Panel  A,  some  of  the  controls  warrant  further  discussion.  Specifically,  the  significant  positive 


In  untabulated  analysis,  I  find  a  high  correlation  between  WORDS  and  WORDS  +TABLES  (85  percent  Pearson  and  96 
percent  Spearman),  which  suggests  that  the  quantity  of  data  in  tables  is  proportional  to  the  length  of  text. 

All  p-values  reported  in  the  study  are  presented  as  two-tailed  when  no  prediction  is  given  and  one-tailed  when  predicted. 
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coefficient  on  MV  suggests  that  firm  size  is  likely  to  be  correlated  with  more  familiarity  and,  thus, 
greater  abnormal  trading  surrounding  the  filing.  The  coefficient  on  MTB  is  significantly  negative  in 
all  regressions,  suggesting  that  10-K  filings  may  contain  less  relative  information  to  traders  in¬ 
vesting  in  growth  firms.  The  remaining  control  variables  appear  to  be  significant  in  the  predicted 
direction  or  do  not  exhibit  enough  of  a  consistent  pattern  to  warrant  discussion. 

The  analysis  in  Table  4,  Panel  A  documents  an  association  between  report  complexity  and 
abnormal  trading  volume.  However,  it  is  unclear  whether  this  association  is  driven  by  cross- 
sectional  variation  in  firms’  disclosure  attributes  and/or  variations  in  disclosure  complexity  over 
time.  Figure  2  reveals  that  longer  and  less  readable  reports  tend  to  be  concentrated  in  industries 
that  are  generally  more  complex  (e.g.,  utilities  or  financial  institutions).  To  address  whether  cross- 
sectional  differences  in  length  and  readability  lead  to  abnormal  timing  while  holding  time  con¬ 
stant,  I  add  year  fixed  effects  to  regression  1.  The  results  in  Table  4,  Panel  B  are  consistent  with 
those  in  Panel  A,  where  LENGTH,  WORDS +TABLES,  and  READJPE  are  all  significantly  nega¬ 
tive  (p  <  .01).  It  appears  that  the  association  between  report  complexity  and  abnormal  trading  is 
attributable  to  cross-sectional  differences  in  firms’  reporting  after  controlling  for  time. 

Based  on  the  previous  analysis,  it  appears  that  firms  with  more  complex  reports  are  associated 
with  less  trading  volume.  However,  the  question  of  whether  the  significant  increase  in  report 
length  during  the  sample  period  affects  investor  trading  behavior  remains.  In  order  to  examine 
whether  variations  in  reporting  complexity  over  time  impact  trading  volume  after  holding  cross 
sectional  differences  in  disclosure  constant,  I  add  firm  fixed  effects  to  regression  Equation  (1).^^ 
This  analysis  enables  me  to  investigate  whether  the  variations  of  a  particular  firm’s  report  length 
over  time  lead  to  variations  in  trading  volume  and  transactions  for  that  particular  firm’s  investors. 
Table  4,  Panel  C  provides  the  results  from  the  within-firm  analysis,  where  the  coefficients  on 
WORDS,  WORDS +TABLES,  and  READ_PE  are  once  again  significantly  negative  (although 
slightly  weaker  than  in  other  panels). 

Overall,  the  results  in  Table  4  provide  evidence  consistent  with  a  decrease  in  total  trading 
activity  when  reports  are  more  complex.  Within-year  and  within-firm  analyses  suggest  that  the 
association  between  more  complex  reports  and  lower  abnormal  trading  is  driven  by  both  cross 
sectional  differences  in  firm  reporting  complexity  as  well  as  variation  in  disclosure  complexity 
over  time.  From  an  economic  perspective,  after  controlling  for  firm  fixed  effects,  I  find  that  a 
55-page  increase  in  a  firms  10-K  filing  (roughly  equivalent  to  increase  in  report  length  from  1995 
to  2006)  results  in  a  10.1  percent  reduction  in  total  abnormal  volume  around  the  filing.^^ 

Small  and  Large  Investor  Trading  Activity 

Table  5  reports  the  multivariate  regression  results  for  the  effects  of  report  complexity  on  small 
and  large  investors’  abnormal  trading  volume.  The  evidence  is  consistent  with  the  results  from 
Table  4  being  driven  by  small  investor  trade.  Columns  (1)  and  (2)  of  Panel  A  show  that  the  effects 


In  my  primary  analyses,  I  examine  firm  and  year  fixed  effects  separately,  as  including  both  effects  simultaneously  would 
likely  eliminate  any  meaningful  variation  in  the  complexity  measures.  Specifically,  including  both  effects  would  provide 
a  test  of  the  effects  of  report  complexity  on  trading  activity  relative  to  the  average  complexity  of  the  same  firm  and 
relative  to  the  average  of  other  firms  in  the  same  year'.  Consistent  with  controlling  away  firms’  increases  in  complexity 
measures  over  time,  the  results  after  including  both  fixed  effects  in  the  same  model  are  greatly  diminished.  While  the 
effects  of  longer  reports  on  total  and  small  investor  abnormal  volume  remaining  significant  at  conventional  levels,  all  the 
analyses  involving  readabihty  and  trading  consensus  are  insignificant. 

In  untabulated  results,  I  examine  the  effects  of  complexity  on  the  short-term  abnormal  market  reaction  and  find 
significant  evidence  that  longer  reports  (and  to  a  lesser  extent  less  readable  reports)  have  more  muted  market  reactions. 
This  is  consistent  with  Bloomfield’s  (2002)  and  You  and  Zhang’s  (2009)  long-window  return  evidence  that  information 
that  is  hard  to  extract  from  financial  statements  will  not  be  reflected  immediately  in  stock  prices. 

The  number  of  words  increased  from  a  median  of  approximately  24,000  words  in  1995  to  a  median  of  approximately 
39,000  words  in  2006.  The  page  estimate  provided  assumes  270  words  per  page,  which  is  the  approximate  number  of 
words  contained  on  a  standard  8-1/2  by  11  page  that  is  double-spaced  with  12-point  font. 
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FIGURE  2 

Report  Length  by  Industry  Classification 


Panel  A:  Report  Length  Measured  as  Number  of  Words 


Panel  B:  Report  Readability  Measured  as  READ  PE  (Plain  English) 
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Report  Readability  by  Industry 

READ_PE 

23 

22.5 
22 

21.5 
21 

20.5 
20 


This  figure  provides  a  summary  of  average  report  length  by  the  major  SIC  code  groups  as  identified  by  the  US. 
Census  Bureau  (http://www.census.gov/epcd/naics/nsic2ndx.htm).  Panel  A  (B)  plots  the  average  report  length 
using  number  of  words  (plain  English  readability). 


of  longer  reports  are  significant  for  small  investors,  wherein  both  the  coefficients  on  SML  * 
WORDS  and  SML  *  WORDS +TABLES  are  significant  (p  <  .01).  There  is  marginally  significant 
evidence  that  large  investors  are  affected  by  longer  reports,  reported  in  column  (2)  (p  =  .09). 
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The  Effects  of  Reporting  Complexity  on  Small  and  Large  Investor  Trading 


2127 


03 

+ 

C/5 


I 


I 


3 


I 

02 


a,  B 

S.sf 

K  VI 


13 

< 

"o 

o 


C 

CQ 

Ph 


'sO 

(N 

O 

o 

o 

* 

* 

CN 

as 

o 

o 

d 


CO 

wo 

o 

d 

* 

* 

* 

CO 

CO 

o 

d 


* 

CO 

CO 

CO 

o 

d 

* 

On 

CO 

CN 

o 

d 


o 

d 


* 

* 

* 

CNJ 

as 

o 

d 

I 


>< 


o 

u 


* 

X- 

X- 

•X- 

•X* 

X- 

X- 

* 

* 

■X- 

* 

X- 

0 

CN 

- - 1 

m 

CO 

SO 

-t 

Os 

0 

wo 

Os  0 

in  0 

^  SO 

^  wo 

^  WO 

0  0 

^  r- 

Tj-  ^ 

^  wo 

CM  ^ 

00 
CO  ON 

^  Os 
r-  VO 

^  WO 

ov 

^  00 

^  0 
0 

r\i  0 

00  0 

so  WO 

p  0 

>0  0 

WO  q 

-t  0 

as  ^ 

CM  0 

^  p 

^  0 

^  d 

^  d 

1  * 

^•X* 

^  d 

d  d 

1  * 

1— 1  0 

1 

CM  d 

1  xl 

d  d 

1 

^  d 

""xl 

wo  d 

ci  0 

* 

■X- 

•x* 

'^x- 

'^x- 

X- 

* 

* 

* 

* 

•X- 

X- 

X- 

X- 

so 

CO 

so 

CM 

VO 

0 

CM 

.. — V  \o 

^  CN 

^  wo 

^  r-- 

^  0 

^  00 

^  CO 

n— s  WO 

^  VO 

00 

0 

Os  0 

CN  00 

CO  0 

i>  in 

'i>0 

wo 

^  CM 

CO 

Ov  ^ 

CM  CM 

0  0 

00  0 

q  00 

^  0 

^  0 

p  p 

wo  CO 

\o  — < 

0  0 

-t  0 

l>  0 

d 

1  1 

CO  d 
! 

■ci  ^ 

d 

ci- 

G  d 

cm'  d 

1 

CM  d 

1  1 

CM  d 

1 

s  ^ 

1 

1  1 

1 

* 

* 

* 

X- 

X 

* 

* 

* 

X- 

X 

* 

cs 

•x- 

* 

X- 

X 

wo 

00 

00 

00 

CM 

Ov 

r- 

f— ( 

wo 

-t 

wo 

^  'sO 

CO 

- — ,  m 

^  SO 

^  VO 

^  CO 

^  OV 

^  0 

^  wo 

00  0 

so  0 

CO  so 

0  0 

(N  0 

1-H 

00  0 

VO  00 

r--  00 

Os  CM 

0 

^  0 

0 

f"  CO 

p  p 

0 

-t  0 

wo  0 

r-  -H 

wo  0 

q  q 

CM  0 

CO  d 

wo  d 

s  ^ 

wo  d 

d  d 

d  d 

CM  d 

d  d 

d 

wo  d 

1 

d 

1  * 

^-x- 

^  1 

1  ^ 

1  1 

1  xl 

! 

"'"xl 

1 

* 

* 

'^x- 

'^x- 

X 

* 

•X* 

* 

* 

X- 

x- 

X 

I-H 

CO 

CN 

CO 

00 

VO 

CO 

CO 

VO 

VO 

On 

^  so 

^  CN 

/—-V 

^  so 

t-H 

^  wo 

^  00 

^  CO 

^  CO 

X - s  -t 

CN  0 

as  0 

CO  r-- 

^  0 

0 

\0  ’-H 

1-^  CO 

CO  CO 

CM  00 

t-- 

3  CM 

r->  0 

P  p 

wo  0 

wo  0 

CM  0 

00  0 

CO  CO 

wo  ^ 

(M  0 

l>  0 

VO  0 

CN  d 

CO  d 

ri 

d 

d 

CM  d 

CM  d 

(M  d 

3 

1  1 

1 

1 

I  1 

1 

* 

•X- 

X 

* 

* 

•X- 

X 

* 

* 

* 

•X- 

•X* 

X- 

X 

X 

so 

CO 

r- 

as 

CM 

00 

CO 

r- 

c^ 

r- 

wo 

^  00 

^  CO 

^  00 

^  0 

^  VO 

/--“V 

^  OV 

^  CM 

^  CO 

X— S  CO 

r-  0 

so  0 

wo  1> 

so  CO 

as  0 

1— 1  1— ( 

00  ^ 

so  -t 

^  00 

CO  CO 

CO  0 

^  0 

CO  c^ 

^  0 

CO  0 

0  0 

VO  0 

0  CM 

as  0 

i>  0 

00  0 

^  d 

d 

ZZ' 

so  d 

CM  d 

d  d 

^  d 

s  ^ 

^  d 

CM  d 

r-J  d 

1  ^ 

1  * 

"^•x- 

1 

1  -xJ 

1  ^ 

1 

^  1 

1  X 

1 

* 

'^■x- 

■' — ' 

'^x 

* 

* 

* 

* 

X- 

X 

Os 

so 

0 

<N 

Os 

-t 

CM 

00 

-t 

r-' 

^  as 

^  (N 

^  so 

^  VO 

^  -t 

^  wo 

^  00 

n-N  CO 

as  0 

CO  0 

SO 

CM  0 

00 

VO  CO 

00  ^ 

^  00 

ilT  \o 

0  CO 

wo  CM 

^  P 

wo  q 

q  Os 

SO  0 

CM  0 

P  P 

0  0 

^  1-H 

so  0 

-H  0 

00  0 

CO  d 

CO  d 

0^  00 

d  d 

CO  d 

^  d 

3 

^  d 

^  d 

CM  d 

1  1 

1 

1 

1 

1 

^  1 

1 

1 

•X- 

* 

•x* 

X- 

X 

* 

* 

X- 

X- 

X 

* 

* 

X* 

X- 

X 

CO 

r- 

0 

CO 

VO 

wo 

wo 

00 

^  wo 

^  0 

^  wo 

^  VO 

^  VO 

^  CM 

n--s  00 

^  Os 

^  r- 

0 

C<^  0 

00 

wo  0 

^  0 

CO  ^ 

^  r- 

VO  00 

0  00 

CM  ^ 

-t  0 

^  p 

P  q 

CO 

p  p 

-It  0 

VO  0 

wo  0 

r-  ^ 

VO  0 

P  q 

P  q 

wo  d 

1 

S 

wo  d 

d  d 

d  d 

CM  d 

d  d 

-t  d 

wo  d 

1-H  0 

1 

1  1 

I  1 

1  1 

1 

1 

1 

' - ^  1 

•5f 

* 

* 

* 

X- 

* 

X 

* 

* 

* 

* 

X- 

X- 

X* 

X 

•X- 

* 

* 

* 

X- 

X- 

x- 

X 

CN 

CO 

CM 

CM 

so 

1-H 

VO 

CO 

wo 

00 

^  CN 

^  SO 

^  r- 

^  r- 

^  r- 

n-N 

^  r-* 

^  0 

0  0 

Os  0 

so  0 

so  0 

^  r-- 

00  0 

-t  CO 

ro  00 

CM  ^ 

so  CM 

0  q 

P  q 

^  0 

CM  0 

p  q 

VO  CO 

00  ^ 

m  0 

r--  0 

r-  0 

^  d 

CO  d 

1 

C'  ^ 

^  d 

wo  d 

CM  d 

CO  d 

CNI  0 

CM  d 

d  d 

1  1 

! 

1 

1 

1  1 

1 

^  1 

1  1 

1 

1 

+ 

+ 

c^- 

+ 

o* 

c- 

O'- 

1 

(>• 

c^- 

0 

>:} 

1 

AFTEAD 

AFTEXD 

tq 

5 

>2 

CQ 

PVOL 

MV 

MTB 

§ 

a; 

P 

0 

^1 

§ 

NBSEG 

NGSEG 

CN 


O  '~D 
0-^0 
300 


^  ^  c?, 
m  (N 
ro  m  O 

o  d  o 

!  ^  I 


CN 

as 

d 

I 


wo 

^  00 


so 

o 

d 

I 


Os 

O 


^  \0  SO 
wo  CO  O 

d  d  d 


00 

00 

d 

I 


V 

R 


o 

o 


Id 

3 


d 

d 


03 


4-1 

o 


Oi  P 


60 

a 

a. 

Si 

s; 

o 


s 

•S 

o 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


Panel  B:  Within- Year  Analysis 


2128 


Miller 


Ctl 


Q 

til 

as 


as 

as 

3 


as 

O 

as 

3 


i 


as 

O 

as 

3 


fO 


®>  El 

op 

S  VI 


o  00  jJS 
d  -S  d 


* 

* 

* 

iri 

00 

o 

o 

d 

I 


o 

I 


Os 

VO 


O 

CN 


* 

* 

m 

ON 

o 

o 

d 

I 


* 

* 


’ — '  i) 
ON  ^ 

(N 


NO 

(N 


CS 

o 

d 

I 


(N 

I 


_ 

VO 

VO 

(Z) 

o 

o 

vq 

kP 

o 

d 

CN 

d 

* 

* 

* 

oo 

NO  ON 
O  Tl- 

d  r4 
I  I 


yri  ^  ^ 

O  NO  o 

O 

d  d 


* 

* 

m 

5  r'l' 

O  •'d- 

d 
I  I 


5 

o 


o 

>r> 

o 

d 

1 


00 

O 

O  (N 

o  ds 


o 

3 


><l 

oo 
*  ^ 

°o 

M  o 


as 

as 


O 

o 

I 

as 

as 


(A 

s 

< 

E 

la 

I 

fi 

2 


u 

"w 

a 

CQ 

CL, 


as 

o 

as 


as 

O 

as 

3 


as 

O 

as 

3 


I 


ro 

(N 

O 

o 

d 

I 


* 

NO 

CN 

o 

o 

d 

I 


o 


o 
o 

-H  O 
ON  d 


<N 

NO 


fO 


o 

c/5 

o 

o 

o 

o 

d 

d  ^ 

ON 

d 

<s 


O'! 

o  ^ 

O  (N 

o  o 
d  d 


* 

o  ^ 

•T)  ON 

q 

d  ’ — ' 

I  I 


* 

* 

* 

00 

o 

d  o 
d  d 
I  I 


«>  8S 

OJ  >0 

a”  d 


* 

* 

ON 

NO  ^ 
CN  NO 

q  q 
d  r4 
I  I 


S.SP 

Kc« 


§ 

I 

O 


00 

-J 

O 

£ 

o 

o 


(N 

lO 

o 

d 

I 


ON 

>sS) 

i 

(N 

as. 

o 

O 

On 

c/5 

1-H 

oo 

O 

CN 

o 

d 

d 

ON 

d 

CtS 


o 

2 

I 

§ 

I 

oo 

*  ^ 
-j  h 

°o 

to  O 


'  .22  ^ 


bT  S 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


ion,  except  when  a  specific  fixed  effect  is  already  included  in  the  model. 


The  Effects  of  Reporting  Complexity  on  Small  and  Large  Investor  Trading 


2129 


Further,  small  investors  are  more  affected  by  longer  reports  than  large  investors  as  the  differences 
in  coefficients  in  columns  (1)  and  (2)  on  SML  *  COMPLEX  and  LRG  *  COMPLEX  are  both 
significant  (p  <  .01). 

I  find  that  the  effects  of  less  readable  reports  also  appear  to  be  driven  by  small  investors. 
Specifically,  columns  (3)  and  (4)  of  Table  5,  Panel  A  show  the  effects  of  less  readable  reports  for 
small  investors,  where  both  the  coefficients  on  SML  *  READ_EOG  and  SML  *  READ_PE  are  both 
significant  (p  <  .01).  There  is  no  evidence  of  less  readable  reports  reducing  the  trading  behavior 
of  large  investors.  Further,  small  investors  are  more  affected  by  less  readable  reports  than  large 
investors;  the  differences  in  coefficients  in  columns  (3)  and  (4)  on  SML  *  COMPLEX  and  LRG  * 
COMPLEX  are  both  significant  (p  <  .01). 

As  with  Table  4,  most  of  the  control  variables  in  Table  5  are  significant  in  the  expected 
direction.  However,  some  of  the  specific  coefficients  on  small  and  large  investors  warrant  addi¬ 
tional  discussion.  Specifically,  coefficients  on  MV  are  significantly  positive  only  for  small  inves¬ 
tors,  which  is  consistent  with  small  investors  trading  more  around  10-K  filings  that  are  larger 
potentially  because  these  firms  are  more  familiar.  The  coefficients  on  MTB  are  significantly  nega¬ 
tive  for  both  small  and  large  investors,  suggesting  that  the  10-K  filing  for  growth  firms  contains 
less  pertinent  information  for  both  sets  of  investors.  The  coefficients  on  EARNINGS  are  positive 
for  small  trade  activity  in  three  of  the  four  regressions,  which  indicates  that  small  investors  trade 
more  when  earnings  are  higher.  Furthermore,  the  coefficients  on  NY_EARNINGS  are  significantly 
negative  (positive)  for  small  (large)  investors  in  most  regressions.  These  differences  suggest  that 
small  (large)  investors’  trade  less  (more)  when  the  filings  contain  information  related  to  positive 
future  earnings,  which  could  be  interpreted  as  more  sophisticated  investors  taking  more  of  a 
long-term  perspective. 

Another  interesting  insight  provided  in  Table  5  are  the  coefficients  on  LRG  *  COMPLEX 
when  READ_EOG  and  READ_PE  are  investigated.  The  positive  association  between  the  readabil¬ 
ity  measures  and  increased  abnormal  trading  is  inconsistent  with  the  prediction  and  appears  to  be 
consistent  with  large  investors  being  attracted  to  more  complex  reports.  This  attraction  could  be 
due  to  large  investors  trading  more  either  because  they  have  a  processing  advantage  over  other 
investors,  or  because  certain  firms  cater  to  institutional  investors  by  making  their  reports  more 
technical. 

To  address  whether  this  association  is  consistent  with  investors’  overall  trading  strategies,  I 
investigate  whether  large  (small)  investors  are  associated  with  firms  that  have  more  (less)  complex 
filings.  I  first  create  a  measure  of  trading  volume  consistent  with  You  and  Zhang  (2009)  by  scaling 
the  average  daily  trading  volume  during  the  pre-  and  event  periods  by  the  total  shares  outstanding. 
I  then  compare  the  trading  volume  by  event  period  across  terciles  of  report  length  and 
readability.^"^  The  results  are  graphed  in  Figure  3(a)  and  3(b),  where  Panel  A  (C)  reveals  that  small 
investors  trade  more  in  both  the  pre-  and  event  periods  when  reports  are  shorter  (more  readable). 
In  un tabulated  results,  small  investor  trading  volume  in  the  tercile  of  short  reports  in  Panel  A  is 
significantly  higher  than  the  trading  volume  for  the  tercile  of  long  reports  (i.e.,  WORDS)  during 
both  the  pre-  and  event  periods  (p  =  .06  and  p  =  .02,  respectively).  Small  investor  trading  volume 
is  also  higher  for  more  readable  reports  in  Panel  C,  but  the  differences  are  insignificant. 

In  contrast  to  small  investor  trading  being  associated  with  shorter  reports.  Panels  B  and  D  of 
Figure  3(b)  show  that  large  trade  volume  is  actually  greater  for  firms  with  longer  and  less  readable 
reports.  Specifically,  in  Panel  B  the  large  investor  trade  is  significantly  greater  for  long  reports 
relative  to  shorter  reports  in  both  the  pre-  and  event  periods  (p  <  .01;  untabulated).  Similar 


Terciling  by  year  enables  me  to  compare  firms  within  a  given  year,  while  parsing  out  the  effects  of  decimalized  trading 
in  the  latter  periods.  I  use  terciles,  based  on  evidence  provided  by  Lys  and  Sabino  (1992), 
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FIGURE  3a 

Trading  by  Tercile  of  Report  Complexity 


Panel  A:  Small  Investors’  Daily  Trading  Volume  by  Tercile  of  WORDS 
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Panel  B:  Large  Investors’  Daily  Trading  Volume  by  Tercile  of  WORDS 
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Daily  trading  volume  is  the  number  of  shares  traded  scaled  by  total  number  of  shares  outstanding  by  investor 
group  (i.e.,  small  or  large).  Panel  A  (B)  plots  the  mean  of  small  (large)  trade  volume  based  on  WORDS. 


evidence  exists  in  Panel  D,  wherein  large  investors  trade  significantly  more  during  both  the  pre- 
and  event  periods  when  reports  are  less  readable  (p  <  .01  and  p  =  .03,  respectively).^^  Overall, 


Results  using  the  alternative  measures  of  WORDS +TABLES  are  similar,  but  more  significant  for  all  the  relationships 
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FIGURE  3b 

Trading  by  Tercile  of  Report  Complexity 


Panel  C:  Small  Investors’  Daily  Trading  volume  by  Tercile  of  RE AD  PE 
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Panel  D:  Large  Investors’  Daily  Trading  Volume  by  Tercile  of  READ_PE 
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Panel  C  (D)  plots  the  mean  of  small  (large)  trade  volume  based  on  READjPE.  Terciles  are  created  by  the 
appropriate  report  complexity  variable  (i.e.,  WORDS  or  READJPE)  within  each  year  to  control  for  the  increased 
proportion  of  small  trade  to  total  trade  volume  during  the  latter  part  of  the  sample.  The  middle  tercile  is  elimi¬ 
nated  to  limit  the  comparison  to  more  extreme  groups. 


examined  in  Figure  2.  Similarly,  the  results  for  READJFOG  are  also  more  significant  for  large  investors,  but  are 
insignificant  in  the  predicted  direction  for  small  investors. 
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the  evidence  in  Figure  3(a)  and  Figure  3(b)and  supports  the  notion  that  small  (large)  investors  are 
associated  with  short  (long)  reports  in  both  the  pre-  and  event  period.^® 

I  further  examine  whether  cross-sectional  differences  in  reporting  complexity  (while  holding 
time  constant)  affects  small  and  large  investor  trading  behavior  in  Panel  B  of  Table  5.  Consistent 
with  Panel  A,  the  coefficients  on  SML  *  COMPLEX  columns  (1)  through  (4)  are  negative  and 
significant.  I  also  find  that  the  coefficients  on  LRG  *  COMPLEX  are  negative  and  significant  when 
the  length  is  measured  as  the  total  of  both  textual  and  tabulated  content  in  column  (2).  This  finding 
is  consistent  with  more  sophisticated  investors  being  affected  by  more  complex  reports  issued  by 
certain  firms.  I  continue  to  find  evidence  that  more  complex  reports  are  more  likely  to  lead  to 
lower  abnormal  trade  for  small  investors  relative  to  large  investors,  except  when  examining  the 
effects  of  WORDS +TABLES.  However,  this  lack  of  difference  between  large  and  small  investors 
in  column  (2)  appears  more  related  to  large  investors  being  affected  by  the  complexity  as  mea¬ 
sured  by  the  text  and  tabulated  information  than  by  a  lack  of  impact  on  small  investors.  Overall, 
the  evidence  from  Table  5,  Panel  B  is  consistent  with  small  investors  trading  less  in  the  short- 
window  around  firms  with  10-K  filings  when  firms’  reports  are  more  complex. 

I  also  investigate  whether  small  investors  are  affected  by  deviations  in  reporting  complexity 
over  time  (holding  the  firm  constant)  in  Table  5,  Panel  C.  Overall,  the  results  are  consistent  with 
the  earlier  panels,  although  the  significance  levels  of  the  complexity  measures  are  diminished  in 
all  the  regressions  and  the  specific  effects  on  READ_EOG  (  READ_PE)  are  insignificant  (weakly 
significant).  The  weaker  findings  on  READ_PE  are  most  likely  due  to  limited  variation  in  the 
measure  over  time  (see  Table  3,  Panel  C).  In  summation,  the  evidence  from  Panel  C  supports  the 
notion  that  increases  in  report  complexity  over  time  have  a  significant  impact  on  small  investors. 

Overall,  the  results  in  Table  4  and  Table  5  provide  evidence  consistent  with  a  decrease  in  total 
trading  activity  when  reports  are  more  complex.  This  evidence  appears  due  primarily  to  small 
investor  trading  behavior,  although  there  is  some  evidence  that  length  also  has  an  effect  on  large 
investors.  Further  within-year  and  within-firm  analyses  suggest  that  the  association  between  com¬ 
plex  reports  and  lower  abnormal  trading  appears  to  be  driven  by  both  underlying  firm  reporting 
complexity  as  well  as  variation  in  disclosure  complexity  over  time. 

Trading  Consensus 

The  negative  association  between  report  complexity  and  small  investor  trade  volume  in  the 
previous  section  is  consistent  with  more  complex  reports  increasing  processing  costs.  I  next 
investigate  whether  small  (large)  investors  process  data  in  a  different  way  from  other  small  (large) 
investors  when  reports  are  more  complex  by  replacing  the  dependent  variable  in  Equation  (2)  with 
TRADE_CONS.  Table  6,  Panel  A  provides  regression  results  for  small  and  large  investors’  con¬ 
sensus  trade  behavior.  Overall,  the  results  are  consistent  with  more  complex  reports  leading  to 
more  disagreement  among  small  investors  who  elect  to  trade.  For  example,  the  coefficients  on 
SML  *  COMPLEX  in  columns  (1),  (2),  (3),  and  (4)  are  all  negative  and  significant  (all  p  <  .01). 
The  coefficients  on  LRG  *  COMPLEX  are  unexpectedly  positive,  which  is  consistent  with  large 
investors  processing  complex  disclosures  in  a  similar  manner  to  other  large  investors  when  reports 
are  more  complex.  Finally,  I  find  support  in  all  four  columns  that  more  complex  reports  result  in 


Although  it  is  possible  that  report  length  is  a  criterion  investors  use  to  base  their  decisions  on  which  firms  to  follow, 
investors  are  more  likely  to  start  following  a  firm  based  on  attributes  such  as  familiarity  (Merton  1987). 
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greater  disagreement  among  small  investors  thanamong  large  investors. 

As  with  abnormal  trading  volume  regressions,  the  majority  of  control  variables  are  significant 
in  the  expected  direction  in  Table  6,  but  some  of  these  coefficients  also  warrant  further  discussion. 
Specifically,  PVOL  is  significant  in  different  directions  for  small  and  large  investors,  implying  that 
the  10-K  resolves  prior  disagreement  for  small  filers  but  leads  to  lower  consensus  for  large 
investors.  However,  this  interpretation  should  be  taken  with  caution,  as  large  investors  could 
merely  be  unraveling  their  positions  based  on  their  different  priors  about  the  10-K  filing.  The 
coefficients  on  both  MV  and  MTB  are  significantly  negative  in  all  but  one  of  the  regressions, 
suggesting  that  there  is  more  disagreement  among  investors  of  large  and  growth  firms.  Finally,  the 
positive  coefficients  on  EARNINGS  can  be  interpreted  as  greater  consensus  when  the  underlying 
profitability  is  positive,  whereas  the  negative  coefficients  on  LRG  *  NY_EARNINGS  suggests 
disagreement  among  large  investors  regarding  the  forward-looking  information  in  the  filing. 

Consistent  with  the  analysis  on  trading  activity.  Table  6  also  reports  the  regression  equations 
including  both  within-year  and  within-firm  analyses.  The  coefficients  on  SML  *  COMPLEX  are 
significantly  negative  in  all  four  of  the  reported  regressions  in  Panel  B.  Furthermore,  the  evidence 
in  all  four  regressions  suggests  that  small  traders  are  more  likely  to  disagree  than  their  larger,  more 
sophisticated  counterparts.  Table  6,  Panel  C  provides  similar  evidence  of  an  association  between 
report  complexity  and  lower  consensus  trade  for  small  investors.  However,  the  evidence  is  weaker 
in  all  such  regressions,  and  the  evidence  for  READ_PE,  which  has  little  variation  over  time,  is 
insignificant.  Overall,  the  evidence  in  this  section  reveals  that  more  complex  reports  appear  to  be 
associated  with  a  small  but  statistically  significant  decrease  in  consensus  among  small  investors. 
For  perspective,  a  55-page  increase  results  in  a  decrease  in  small  investor  consensus  from  a 
median  value  of  19.9  percent  down  to  19.5  percent. 

In  sum,  the  findings  related  to  consensus  are  consistent  with  experimental  evidence  in  Barron 
et  al.  (2004),  where  increases  in  disclosure  lead  nonprofessional  investors  to  generate  (or  infer) 
private  information.  However,  as  opposed  to  Barron  et  al.’s  (2004)  finding  that  sophisticated 
professional  investors  are  unaffected  by  the  increased  disclosure,  there  is  some  evidence  that  large 
investors  are  actually  more  likely  to  have  increased  consensus  when  more  data  is  provided  and  the 
writing  is  more  technical  (less  readable). 

VI.  ADDITIONAL  ANALYSES 
Combined  Reporting  Attributes 

In  addition  to  separately  examining  the  effects  of  length  and  readability,  I  also  re-run  all 
analyses  including  both  measures  of  complexity  in  the  same  model.  When  both  length  and  read¬ 
ability  are  included  in  the  regressions,  the  effects  of  longer  reports  are  similar  to  those  reported  in 
Table  4,  Table  5,  and  Table  6,  albeit  slightly  weaker  in  a  few  regressions.  However,  the  effects 
of  readability  are  insignificant  in  the  predicted  direction  in  all  but  one  regression.  This  evidence  is 


Small  and  large  investors  are  likely  to  have  different  levels  of  prior  information.  Although  these  potential  differences  in 
private  information  between  large  and  small  investor  groups  do  not  affect  the  within-group  comparison,  they  do  make 
it  difficult  to  interpret  the  implications  of  the  differential  consensus  between  small  and  large  investors  on  the  abnormal 
trading  results.  Specifically,  if  large  investors  have  more  private  information,  they  may  be  more  likely  to  process 
information  in  a  similar  manner  because  they  have  more  precise  priors  leading  to  greater  consensus.  As  such,  it  is 
difficult  to  determine  whether  large  investors’  lower  trading  reaction  to  the  filings  (compared  to  small  investors)  is  due 
to  common  processing  or  superior  processing  capabilities. 

The  coefficients  and  significance  levels  on  measures  of  length  coefficients  are  not  significantly  affected  by  the  inclusion 
of  the  readability  metrics.  Specifically,  when  the  readability  measures  are  included,  all  the  coefficients  on  WORDS  and 
WORDS +TABLES  in  the  pooled  regressions  are  significant  at  the  1  percent  level  (one-tailed),  except  when  READ_PE 
is  examined  in  the  total  abnormal  volume  regressions  where  the  coefficients  are  significant  at  the  5  percent  and  10 
percent  levels,  respectively.  The  coefficients  on  measures  of  length  are  quantitatively  similar  for  the  within-firm  and 
within-year  as  well. 
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consistent  with  both  measures  of  reporting  complexity  serving  as  substitutes,  with  the  effects  of 
longer  reports  appearing  to  dominate  the  readability  of  the  reports  when  both  measures  of  com¬ 
plexity  are  analyzed  simultaneously. 

This  analysis  is  useful  in  highlighting  the  differences  between  this  study  and  a  concurrent 
study  by  Loughran  and  McDonald  (2010),  who  also  investigate  the  reporting  attributes  of  the  10-K 
filings  during  a  similar  time  frame.  While  their  study  focuses  primarily  on  the  readability  (not 
length)  of  financial  statements,  the  two  studies  overlap  in  the  investigation  of  small  investor  trade 
surrounding  the  filing.  Both  studies  document  that  after  implementing  more  sophisticated  mea¬ 
sures  of  readability  (plain  English),  improvements  in  readability  lead  to  increased  small  investor 
trading  activity.  The  studies  differ  in  the  strength  of  that  relation  after  simultaneously  investigating 
alternative  measures  of  length. 

The  discrepancy  between  the  two  studies  could  stem  from  several  differences  in  composition. 
For  instance,  the  studies  differ  in  sample  size,  size  cut-offs  for  small  investors,  inclusion  of  certain 
control  variables,  trading  windows,  and  the  measurement  of  plain  English  readability.  Perhaps 
most  importantly,  differences  could  stem  from  research  design  choices.  Specifically,  Loughran  and 
McDonald  (2010)  investigate  a  change  in  abnormal  trading  but  include  level-control  variables. 
Hence,  any  deviations  in  firm  characteristics  (e.g.,  size,  complexity)  or  report  characteristics  (e.g., 
length)  over  time  are  uncontrolled  in  their  analysis.  Additionally,  as  opposed  to  the  abnormal 
volume  measure  used  in  this  study,  they  use  a  dependent  variable  that  is  not  de-trended  to  account 
for  the  increase  in  decimalized  trading  during  the  latter  part  of  the  sample  they  investigate.  In  sum, 
the  evidence  across  both  studies  is  consistent  with  small  investors  being  affected  by  less  readable 
reports,  but  differences  in  proprietary  measures  of  plain  English  readability  and  research  design 
choices  make  it  difficult  to  directly  compare  differences  in  significance  of  that  association  when 
accounting  for  report  length. 

Short-Term  versus  Long-Term  Trading  Effects 

As  previously  discussed,  the  evidence  in  Figure  1  suggests  that  the  majority  of  trading  occurs 
in  the  five-day  window  surrounding  the  filing.  As  such,  this  study’s  primary  focus  is  on  the  effects 
report  complexity  on  investor  trading  activity  in  the  short  window  around  the  filing.  The  under¬ 
lying  assumption  is  that,  as  processing  costs  increase,  small  investors  would  continue  to  follow 
certain  stocks  but  would  elect  to  delay  their  trades  either  to  give  them  time  to  process  the  reports 
or  to  time  their  trades  to  coincide  with  easier  to  access  information  events. 

To  determine  whether  the  trading  effects  documented  above  are  short-term  or  long-term  in 
nature,  I  re-run  the  analyses  in  Table  4,  Table  5,  and  Table  6  after  increasing  the  trading  window 
to  10-  and  20-day  periods.  When  the  event  window  is  lengthened  to  10  days,  the  coefficients  on 
small  investors  remain  negative  in  the  abnormal  volume  regressions.  However,  the  coefficients 
and  significance  is  diminished  on  most  tests  and  become  insignificant  at  conventional  levels  in 
about  half  the  regressions.  When  the  windows  are  expanded  to  20  days,  the  significance  on  the  all 
the  coefficients  is  further  diminished.  In  contrast  to  the  abnormal  volume  regressions,  the  results 
on  TRADEjCONS  are  relatively  unaffected  by  lengthening  the  windows,  as  all  the  coefficients 
remain  negative  and  significant  at  the  1  percent  level  for  both  the  10-  and  20-day  windows  for  all 
measures  of  report  complexity.  Overall,  the  evidence  suggests  that  while  the  effects  of  report 
complexity  are  observable  in  longer  windows,  they  are  most  pronounced  in  the  short  window 
around  the  filing. 

Other  Information  Releases 
Concurrent  Information  Releases 

As  with  any  event  study,  it  is  possible  that  other  information  released  during  the  event 
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window  could  affect  the  results.  To  provide  additional  assurance  that  the  earnings  announcements 
released  in  the  short  window  surrounding  the  10-K  filing  are  not  affecting  the  results,  I  eliminate 
all  observations  where  AFTEAD  is  less  than  or  equal  to  five  days  and  find  that  the  results  are 
unaffected  by  removing  these  observations  (untabulated).  It  is  possible  that  other  concurrent 
information  releases  may  also  affect  trading  behavior.  However,  in  this  setting,  concurrent  infor¬ 
mation  releases  would  likely  act  as  a  filtering  mechanism  that  would  bias  against  finding  an  effect 
on  information  processing. 

Prior  Information  Releases 

During  the  latter  part  of  the  sample  period  investors  likely  have  greater  access  to  alternative 
sources  of  information  (e.g.,  financial  blogs)  that  may  pre-empt  the  information  in  the  10-K  filing. 
These  alternative  sources  of  information  released  prior  to  the  filing  could  lead  to  increased  trade 
during  the  control  window  and  decreased  trade  during  the  event  window.  Due  to  the  trend  in  report 
length,  this  possibility  could  provide  a  potential  alternative  explanation  for  the  findings  in  this 
study.  To  rule  out  this  alternative,  I  examine  variations  in  the  ratio  of  daily  average  trading  volume 
during  the  pre-period  divided  by  the  event  period  by  year  during  the  sample  period.  I  find  that  the 
ratio  of  pre-period/event  period  trading  behavior  significantly  decreases  for  small  investors  from 
the  beginning  to  the  end  of  the  sample  period  (p  <  .01;  untabulated),  while  there  is  no  significant 
change  in  the  proportion  of  large  investor  pre-volume  to  event  period  trade.  In  summary,  there  is 
no  evidence  that  increased  access  to  information  by  either  small  or  large  investors  during  the  latter 
part  of  the  sample  period  affects  trading  behavior  in  a  manner  that  would  bias  toward  the  findings. 

MD&A  Reporting  Complexity 

In  my  primary  analyses,  I  examine  the  length  of  the  entire  10-K  document  including  MD&A, 
footnotes,  contracts,  and  other  exhibits.  However,  it  is  possible  that  the  effects  are  driven  by 
certain  parts  of  the  document,  such  as  the  MD&A  section.  To  address  this  issue,  I  obtain  MD&A 
measures  from  Li  (2009)  and  examine  the  MD&A  section  separately.  I  find  only  weak  evidence  of 
an  effect  of  MD&A-specific  reporting  complexity  on  investor  trading,  which  suggests  that  the 
complexity  of  the  MD&A  section  is  not  driving  the  results. 

VII.  CONCLUSION 

The  evidence  in  this  study  is  consistent  with  more  complex  filings  being  too  costly  for  some 
investors  to  process.  Specifically,  I  find  evidence  that  more  complex  filings  are  associated  with 
reduced  trading  activity  and  lower  consensus  for  small  investors,  but  have  only  a  limited  impact 
on  large  investors.  I  perform  additional  within-firm  and  within-year  analyses  and  find  similar 
results,  which  suggest  that  the  association  between  more  complex  reports  and  lower  abnormal 
trading  appears  driven  by  both  relative  finn  reporting  complexity  as  well  as  deviations  in  reporting 
complexity  over  time.  Additional  analysis  reveals  that  the  effects  of  readability  and  length  appear 
to  act  as  substitutes,  with  longer  reports  appearing  to  dominate  the  readability  of  the  reports  when 
analyzing  both  measures  of  complexity  simultaneously. 

The  evidence  provided  in  this  study  should  prove  useful  to  regulators.  For  instance,  the  results 
highlight  the  importance  of  considering  processing  costs  in  future  disclosure  regulation  and,  there¬ 
fore,  challenge  the  assumption  that  more  disclosure  helps  level  the  playing  field  between  small  and 
large  investors.  Further,  although  the  SEC  has  made  progress  in  assisting  small  investors  in 
gaining  electronic  access  to  financial  information  (Asthana  et  al.  2004),  simply  making  more  data 
publicly  available  may  not  necessarily  aid  small  investors,  as  the  increased  cost  of  processing  the 
additional  data  makes  the  10-K  filings  too  costly  for  many  small  investors  to  process.  Finally, 
initiatives  to  reduce  the  processing  costs  of  financial  filings  may  serve  as  an  important  factor  in 
ensuring  information  is  impounded  quickly  into  market  prices. 
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APPENDIX  A 

10-K  FILING  EDIT  PROCEDURES 

This  appendix  provides  details  of  the  methods  used  to  download  and  edit  the  10-K  filings.  I 
first  download  all  10-K  reports  from  EDGAR  and  then  edit  the  filings  before  calculating  the 
readability  scores.  I  follow  the  process  documented  in  Li  (2008),  with  some  slight  modifications. 

I  delete  the  heading  information  that  is  contained  between  (SEC-HEADER)  and  (/SEC- 
HEADER)  and  all  text  that  begins  with  (TABLE)  and  ends  with  (/TABLE).^®  I  also  eliminate  all 
lines  that  contain  (S)  or  (C)  or  special  characters  (...)  and  (&..),  along  with  all  lines  that  contain 
strings  such  as  (TEXT),  (DOCUMENT),  (PAGE),  (TYPE),  or  /PRIVACY-ENHANCED/.  Li 
(2008)  eliminates  all  paragraphs  containing  these  tags.  However,  these  strings  often  fail  to  contain 
end  tags  making  it  difficult  to  delete  specific  sections.  To  address  this  issue,  I  eliminate  only  the 
lines  containing  this  coding. 

Further,  I  convert  embedded  HTML  code  to  proper  ASCII  characters  and  run  a  Perl  package 
to  strip  all  remaining  HTML  tags  (for  further  information  see  http:/www.ascii.cl/htmlcodes.htm). 
All  "  -jpgj  ”  “-pdf.”  and  “.gif’  files  are  eliminated.  Finally,  I  delete  all  paragraphs  (defined  as  two 
lines  containing  text  with  a  blank  line  before  and  after)  with  more  than  50  percent  non-alphabetic 
characters  to  ensure  that  all  tables,  tabulated  texts,  and  financial  statements  are  eliminated.  Al¬ 
though  this  approach  is  not  perfect  in  eliminating  all  numeric  or  HTML  code,  my  review  of  a  large 
number  of  these  filings  indicates  that  the  code  does  a  reasonable  job  at  editing  the  text.  Further,  the 
En  Fathom  module  with  Perl  attempts  to  distinguish  words  from  acronyms  and  other  characters, 
which  should  minimize  any  noise  attributable  to  remaining  HTML  code.  There  is  no  reason  to 
believe  that  any  remaining  HTML  code  biases  the  findings  in  this  study. 

Word  and  Table  Counts 

I  run  the  En  Fathom  module  in  Perl  to  identify  the  number  of  words  in  the  document.  To 
count  the  number  of  cells  included  in  the  document,  I  locate  all  of  the  HTML  tags  that  correspond 
to  the  syntax  (TABLE  or  (table  (prior  to  their  deletion  as  documented  above).  The  number  of  cells 
within  a  table  is  then  computed  by  counting  the  number  of  (TD  or  (td  tags  found  within  a  table. 
Finally,  I  sum  the  number  of  cells  across  all  tables  and  add  that  count  to  the  number  of  words  in 
the  document  to  create  the  WORDS +TABLES  variable. 

Financial  Statements  Included  by  Reference 

Finally,  to  ensure  that  the  results  in  this  study  are  not  affected  by  firms  that  include  their 
financial  statement  by  reference,  I  use  a  PERL  script  to  identify  these  firms  and  find  that  the  results 
are  unaffected  by  eliminating  these  observations  (untabulated). 


APPENDIX  B 

CALCULATION  OF  FOG  AND  PLAIN  ENGLISH  READABILITY  MEASURES 

This  appendix  explains  the  details  of  measuring  length  and  creating  readability  indices  after 
the  text  has  been  edited  (as  described  in  Appendix  A). 


I  follow  Li  (2008)  and  delete  tables  “that  begin  with  (TABLE)  and  end  with  (/TABLE).”  However,  I  find  that  certain 
reports  contain  significant  amounts  of  text  between  table  tags.  For  example,  in  Lowe’s  Companies  1994  10-K  (http:// 
www.sec.gOv/Archives/edgar/data/60667/0000060667-94-000015.txt),  large  portions  of  text,  including  the  entire 
MD&A  section,  are  eliminated  under  this  approach.  Hence,  I  repeat  the  entire  analysis  without  deleting  the  information 
between  the  table  tags  (but  keeping  the  requirement  to  delete  all  paragraphs  with  more  than  50  percent  non-alphabetic 
characters).  The  results  are  quantitatively  similar  using  both  approaches. 
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Fog  Readability 

After  the  file  has  gone  through  the  edit  process  described  in  Appendix  A,  the  text  is  then 
analyzed  using  the  En  Fathom  package  in  Perl.  This  package  calculates  a  variety  of  textual 
statistics,  including  number  of  characters,  number  of  words,  percent  of  complex  words,  number  of 
sentences,  number  of  text  lines,  number  of  paragraphs,  syllables  per  word,  and  words  per  sentence. 
Based  on  these  statistics,  the  package  produces  the  Fog  readability  index. 

Plain  English  Readability 

I  compute  the  plain  English  readability  score  used  in  this  study  by  identifying  the  specific 
plain  English  problems  highlighted  by  the  SEC  and  matching  them  to  the  specific  variables 
identified  by  a  proprietary  text-editing  program  called  StyleWriter  (http .7/www.  style writer- 
usa.com). 


Plain  English  Writing  Factors 

SEC  Plain  English  Problems 

Passive  voice 
Weak/Hidden  verbs 
Superfluous  words 
Legal  and  financial  jargon 
Numerous  defined  terms 
Abstract  words 
Unnecessary  details 
Long  sentences 
Unreadable  design  and  layout 


StyleWriter  Plain  English  Measure 

Passive  verbs 
Hidden  verbs 

Overwriting  (overuse  of  qualifying  words) 
Legal  words  and  Jargon/ Abstract  words 
NA 

Jargon/Abstract  words 
Tautologies/Overwriting 
Number  of  words/ Avg.  sentence  length 
NA 


This  software  is  designed  as  an  add-in  editing  program  to  be  used  in  text  processing  software 
and,  thus,  requires  the  document  to  be  opened  in  a  “word  processor”  before  the  document  can  be 
analyzed.  This  limitation  results  in  several  additional  processing  requirements. 

First,  each  document  must  be  opened  in  text-processing  software  and  then  individually  ana¬ 
lyzed  by  StyleWriter.  I  automate  this  process  using  a  keyboard/mouse  click  macro  to  open  and 
score  each  document,  but  the  need  to  open  and  analyze  each  document  individually  severely  limits 
the  number  of  documents  that  can  be  efficiently  processed.  Second,  the  Perl  output  in  .txt  format 
has  a  “f”  symbol  (i.e.,  carriage  return)  at  the  end  of  each  line.  This  means  StyleWriter  sees  each 
line  as  a  paragraph,  which  invalidates  several  of  the  plain  English  results.  Hence,  prior  to  pro¬ 
cessing  the  documents,  I  use  a  Perl  script  to  replace  all  f  with  a  space  and  keep  all  instances  where 
there  were  two  consecutive  K  symbols  (i.e.,  actual  paragraph  break).  Finally,  the  program  is 
designed  to  write  out  limited  summary  statistics  to  allow  writers  to  improve  their  writing  style. 
However,  as  originally  designed,  the  program  does  not  output  the  frequency  of  all  plain  English 
violations.  Therefore,  I  worked  with  StyleWriter  programmers  to  modify  the  program  to  enable  me 
to  export  the  relevant  plain  English  violations  for  each  document. 
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ABSTRACT:  Legal  scholars  and  accounting  practitioners  have  expressed  concern  that 
the  U.S.  legal  system  might,  in  cases  of  undetected  fraud,  effectively  penalize  auditors 
for  identifying  and  investigating  fraud  risks  (AlCPA  2004;  Coffee  2004;  Golden  et  al. 
2006).  This  study  draws  on  counterfactual  reasoning  theory  to  provide  experimental 
evidence  indicating  that  this  concern  is  warranted.  Consistent  with  counterfactual  rea¬ 
soning  theory,  I  find  that  evaluators  in  a  natural  (i.e.,  between-participants)  environment 
are  more  likely  to  hold  auditors  liable  for  failing  to  detect  fraud  when  the  auditors 
investigated  for  the  perpetrated  fraud,  relative  to  when  the  auditors  did  not  investigate 
for  the  fraud.  Findings  from  a  less  natural  within-participants  experiment,  however, 
indicate  that  the  between-participant  findings  likely  were  unintentional:  That  is,  the 
same  evaluators,  when  later  asked  to  judge  alternative  levels  of  fraud  investigation 
simultaneously,  are  less  likely  to  hold  auditors  liable  for  failing  to  detect  fraud  when  the 
auditors  investigated  for  the  fraud,  relative  to  when  the  auditors  did  not  investigate  for 
the  fraud. 

Keywords:  fraud  risks;  auditor  liabiiity;  counterfactual  reasoning  theory;  negative 
affect. 

Data  Availability:  Data  and  experimentai  materials  used  in  this  study  are  available 
upon  request. 

I.  INTRODUCTION 

Auditing  standards  require  auditors  to  identify  fraud  risks  during  the  planning  stage  of  their 
audits  and  then  design  audit  procedures  to  investigate  the  identified  risks  (AICPA  2002, 
2006).  While  identifying  and  investigating  fraud  risks  should  help  auditors  detect  fraud, 
several  publications  suggest  such  actions  could  increase  auditors’  liability  in  the  event  of  an  audit 
failure.  In  particular,  a  SAS  No.  99  practice  aide  advises  auditors  against  documenting  the  specific 
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fraud  risks  they  identify  during  the  required  fraud  brainstorming  sessions  because  doing  so  may 
provide  unnecessary  litigation  exposure  to  the  firm  (AICPA  2004).  In  addition,  a  recent  forensic 
accounting  guide,  created  by  a  partner  in  a  Big  4  forensic  accounting  practice,  warns  that  enhanc¬ 
ing  audit  procedures  to  meet  the  public’s  heightened  expectations  of  auditors  to  detect  fraud  may 
further  raise  expectations  and  the  accompanying  risk  of  litigation  (Golden  et  al.  2006).  Similar 
concern  has  also  been  expressed  in  the  law  literature.  Specifically,  Coffee  (2004)  claims  that 
auditors’  best  defense  in  cases  of  undetected  fraud  is  ignorance,  such  that  auditors  have  an  incen¬ 
tive  not  to  inquire  too  closely — lest  one  acquire  information  to  put  one  on  notice. 

The  objective  of  this  study  is  to  leverage  relevant  psychology  theory  to  empirically  test 
whether  the  above  concerns  are  warranted.  Counterfactual  reasoning  theory  indicates  that  inves¬ 
tigating  fraud  risks  likely  can  increase  auditors’  liability  in  the  event  of  an  audit  failure  due  to 
undetected  fraud.  In  particular,  based  on  prior  research  examining  the  antecedents  and  conse¬ 
quences  of  counterfactual  reasoning  (Kahneman  and  Miller  1986;  Miller  and  McFarland  1986; 
Macrae  1992;  Sherman  and  McConnell  1995),  I  predict  that  evaluators  in  cases  of  undetected 
fraud  will  experience  more  intense  counterfactual  thoughts  of  what  the  auditors  could  have  done 
differently  to  detect  the  fraud  and  will  judge  auditors  more  harshly  when  the  auditors  performed 
audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  investigate  for 
the  fraud. 

To  test  these  predictions,  I  conduct  an  experiment  in  which  participants  serving  in  the  role  of 
individual  jurors  evaluate  auditors  who  failed  to  detect  fraud  during  a  recent  audit.  I  manipulate 
whether  the  auditors  performed  audit  procedures  to  investigate  a  specific  fraud  risk  related  to  the 
perpetrated  fraud.  After  reading  the  case,  participants  decide,  first  in  a  between-participants  envi¬ 
ronment  and  then  in  a  within-participants  environment,  whether  the  auditors  were  negligent  and 
therefore  liable  for  plaintiff  losses. 

Several  factors  suggest  that  jurors  in  real-world  auditor  negligence  lawsuits  make  auditor 
liability  decisions  in  environments  more  similar  to  a  between-participants  environment  than  a 
within-participants  experimental  environment.  Specifically,  the  unique  risk  factors  surrounding 
individual  real-world  audits  and  jurors’  general  lack  of  auditing  experience  and  limited  knowledge 
of  audit  procedures  suggest  that  jurors  in  real-world  auditor  negligence  lawsuits  are  not  able  to 
compare  alternative  levels  of  auditor  investigation  across  identical  circumstances.  Thus,  partici¬ 
pants’  between-participants  responses  provide  a  more  realistic  indication  of  jurors’  assessments  of 
auditor  liability  in  cases  of  undetected  fraud  than  participants’  within-participants  responses.  This 
view  is  consistent  with  prior  experimental  studies  of  jurors’  assessments  of  auditor  liability,  which 
test  their  hypotheses  in  between-participants  environments  (Kadous  2000,  2001;  Lowe  et  al.  2002; 
Clarkson  et  al.  2002). 

Within-participants  responses  nevertheless  are  informative.  Specifically,  the  salience  of  ma¬ 
nipulated  factors  in  within-participants  designs  enables  participants  to  identify  and  correct  incon¬ 
sistencies  between  their  natural  (i.e.,  between-participants)  judgments  and  their  beliefs  about  how 
their  judgments  should  vary  with  levels  of  a  manipulated  factor.  In  this  sense,  participants’  re¬ 
sponses  in  within-participants  environments  provide  an  indication  as  to  whether  their  between- 
participants  judgments  are  intentional  (Kahneman  and  Tversky  1996). 

Consistent  with  counterfactual  reasoning  theory,  1  find  that  evaluators  in  a  between- 
participants  environment  experience  more  intense  thoughts  of  what  the  auditors  could  have  done 
differently  to  detect  the  fraud  and  are  more  likely  to  hold  auditors  liable  for  plaintiff  losses  when 
the  auditors  performed  audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors 
did  not  investigate  for  the  fraud.  The  observed  between-participants  decision  pattern,  however, 
appears  to  be  unintentional,  as  I  find  that  the  same  evaluators,  when  subsequently  placed  in  a 
within-participants  environment,  are  less  likely  to  hold  auditors  liable  for  plaintiff  losses  when  the 
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auditors  performed  audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did 
not  investigate  for  the  fraud. 

This  study  makes  several  important  contributions  to  the  accounting  literature.  Scholars  and 
practitioners  have  expressed  concern  that  the  U.S.  legal  system  might,  in  cases  of  undetected 
fraud,  effectively  penalize  auditors  for  investigating  fraud  risks  (Coffee  2004;  Golden  et  al.  2006). 
This  study  informs  their  arguments  with  relevant  psychology  theory  and  provides  evidence  to 
indicate  whether  such  concern  is  warranted.  Results  indicate  that  performing  audit  procedures  to 
investigate  specific  fraud  risks  can  increase  jurors’  assessments  of  auditor  liability  in  cases  of 
undetected  fraud.  This  study  therefore  supports  the  validity  of  recently  expressed  concerns  regard¬ 
ing  the  potential  legal  hazards  of  investigating  fraud  risks. 

In  addition,  by  examining  evaluator  judgment  in  both  between-  and  within-participants  envi¬ 
ronments,  this  study  advances  the  current  understanding  of  evaluators’  assessments  of  auditor 
liability  in  cases  of  undetected  fraud.  The  study’s  within-participants  results  demonstrate  that 
evaluators,  when  able  to  see  and  compare  alternative  levels  of  fraud  investigation  simultaneously, 
understand  that  higher  levels  of  fraud  investigation  indicate  higher  levels  of  auditor  care  and, 
therefore,  imply  lower  levels  of  auditor  liability.  The  between-participants  results,  in  contrast, 
demonstrate  that  evaluators  have  difficulty  applying  this  standard  in  the  more  natural  setting  where 
alternative  levels  of  auditor  investigation  under  identical  circumstances  likely  do  not  exist  and  are, 
therefore,  unavailable  for  comparison. 

Improving  evaluators’  liability  decisions  requires  that  evaluators  understand  normative  stan¬ 
dards  and  are  willing  to  apply  those  standards  (Wegener  et  al.  2000).  The  within-participants 
results  demonstrate  that  evaluators  in  cases  of  undetected  fraud  meet  these  two  criteria.  This  study 
therefore  suggests  that  the  observed  between-participants  decision  pattern  potentially  could  be 
reversed.  Additional  empirical  research  could  be  directed  at  identifying  an  effective  debiasing 
method  to  reverse  evaluators’  unintentional  tendency,  when  evaluating  individual  cases  of  unde¬ 
tected  fraud,  to  judge  auditors  who  performed  audit  procedures  to  investigate  for  the  fraud  more 
harshly  than  auditors  who  did  not  investigate  for  the  fraud. 

Section  II  develops  theory  and  presents  the  study’s  hypotheses.  Section  III  outlines  the  ex¬ 
perimental  design  and  method.  Section  IV  presents  results.  The  final  section  summarizes,  identifies 
directions  for  future  research,  and  discusses  limitations. 

II.  THEORY  AND  HYPOTHESES 

Background 

Taking  actions  that  increase  the  quality  of  an  audit  should,  based  on  liability  laws,  reduce 
auditor  liability  in  the  event  of  an  audit  failure  (Causey  and  Causey  1991;  Kadous  2000).  Thus, 
economic  models  of  auditor  liability  assume  that  taking  actions  to  increase  the  quality  of  an  audit 
will  reduce  auditor  liability  in  the  event  of  an  audit  failure  (Smith  and  Tidrick  1998;  Radhakrish- 
nan  1999).  Economic  models  of  auditor  liability  therefore  imply  that  performing  audit  procedures 
to  investigate  fraud  risks  (assuming  such  actions  increase  audit  quality)  will  reduce  auditor  liabil¬ 
ity  in  the  event  of  an  audit  failure. 

Empirical  research,  however,  suggests  that  evaluators  often  lack  the  knowledge  and/or  ability 
to  make  decisions  that  are  consistent  with  legal  standards  (Robbennolt  et  al.  2003).  Information 
disclosed  during  a  trial  therefore  does  not  always  affect  juror  decision-making  in  a  manner  that  is 
consistent  with  legal  standards.  In  an  audit  litigation  setting,  Kadous  (2000)  finds  that  juror 
standards  of  auditor  care  increase  with  the  severity  of  the  consequences  associated  with  an  audit 
failure  (despite  legal  standards  that  indicate  juror  standards  of  care  should  remain  independent  of 
consequence  severity).  This  finding  suggests  that  jurors  and  other  lay  evaluators  lack  a  clear 
understanding  of  what  auditors  should  be  held  responsible  for  and  instead  rely  on  the  single  case 
they  are  presented  to  establish  the  appropriate  standard  of  care.  As  a  result,  taking  actions  that 
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increase  the  quality  of  an  audit  might  not  decrease  auditors’  liability  in  the  event  of  an  audit  failure 
(Kadous  2000).  With  respect  to  fraud,  performing  audit  procedures  to  investigate  fraud  risks  might 
not  decrease  and  could,  as  Coffee  (2004)  and  Golden  et  al.  (2006)  suggest,  increase  auditors’ 
liability  in  the  event  of  an  audit  failure  due  to  undetected  fraud. 

Below,  I  review  prior  research  examining  the  antecedents  and  consequences  of  counterfactual 
reasoning  and  then  apply  the  main  findings  to  the  current  study’s  audit  litigation  setting.  As  I 
discuss  below,  prior  psychology  research  suggests  that  counterfactual  reasoning  will  significantly 
affect  evaluators’  assessments  of  auditor  liability  in  cases  of  undetected  fraud.  Counterfactual 
reasoning  theory,  therefore,  provides  an  appropriate  basis  for  developing  expectations  regarding 
the  effects  of  investigating  fraud  risks  on  evaluators’  assessments  of  auditor  liability  in  cases  of 
undetected  fraud. 

Antecedents  of  Counterfactual  Reasoning 

In  the  context  of  harmful  events,  counterfactual  reasoning  refers  to  imagining  how  prior 
outcomes  could  have  been  avoided  (Roese  1997;  Alicke  et  al.  2008).  Prior  research  on  accidental 
injury/death  litigation  indicates  that  juror  assessments  of  defendant  liability  increase  with  Jurors’ 
counterfactual  thoughts  (Miller  and  McFarland  1986;  Bothwell  and  Duhon  1994).  Analogously,  in 
cases  of  undetected  financial  statement  fraud,  evaluators’  assessments  of  auditor  liability  likely 
will  increase  with  the  extent  of  their  counterfactual  thoughts  as  to  what  the  auditors  could  have 
done  differently  to  detect  the  fraud. 

The  extent  to  which  evaluators  engage  in  counterfactual  reasoning  following  negative  out¬ 
comes  depends  on  the  perceived  mutability  of  the  prior  events  (Kahneman  and  Miller  1986). 
Specifically,  the  easier  it  is  for  evaluators  to  imagine  more  favorable  alternative  outcomes  to  prior 
events,  the  more  evaluators  will  engage  in  counterfactual  reasoning.  As  I  discuss  below,  various 
factors  affect  the  pereeived  mutability  of  prior  events  (Roese  1997).  Consequently,  in  cases  of 
undetected  fraud,  the  extent  to  which  evaluators  engage  in  counterfactual  reasoning  (and  therefore 
the  effect  of  counterfactual  reasoning  on  evaluators’  assessments  of  auditor  liability)  likely  de¬ 
pends  on  the  presence  or  absence  of  several  factors  that  affect  the  perceived  mutability  of  prior 
events. 

One  factor  that  affects  the  perceived  mutability  of  prior  events  is  outcome  closeness.  Specifi¬ 
cally,  it  is  easier  to  mentally  alter  prior  negative  events  and  imagine  more  favorable  alternative 
outcomes  when  prior  events  occur  by  narrow  as  opposed  to  wide  margins  (Kahneman  and  Miller 
1986).  Close  outcomes  (i.e.,  near  misses)  facilitate  the  imagination  of  alternative  outcomes  be¬ 
cause  it  is  easier  to  imagine  alternative  outcomes  when  one  can  identify  specific  occurrences  that, 
if  altered,  would  have  changed  the  outcome  of  the  event  in  question.  Since  identifying  such  critical 
occurrences  is  easier  when  negative  outcomes  occur  by  narrow,  as  opposed  to  wide,  margins,  it  is 
easier  to  imagine  alternative  outcomes  following  near  misses  (Kahneman  and  Miller  1986). 

In  addition  to  close  outcomes,  prior  research  also  indicates  that  actions  are  easier  to  mutate 
than  non-actions.  Consequently,  counterfactual  reasoning  increases  following  actions  as  opposed 
to  non-actions  (Gleicher  et  al.  1990;  Sherman  and  McConnell  1995).  Prior  studies  offer  several 
explanations  for  the  effect  of  action  (as  opposed  to  inaction)  on  the  extent  of  evaluators’  counter- 
factual  thoughts  (Landman  1987).  One  explanation  relevant  to  this  study  is  that  action  directs 
evaluators’  attention  to,  and  therefore  highlights  individuals’  causal  roles  in,  prior  negative  out¬ 
comes  (Greyer  and  Gurhan  1997).  This  increases  the  belief  that  altering  individuals’  behavior 
likely  would  result  in  a  more  favorable  outcome,  which  makes  it  easier  to  imagine  such  an 
alternative  outcome.  Thus,  evaluators’  counterfactual  thoughts  of  what  individuals  could  have 
done  differently  to  avert  a  negative  outcome  tend  to  increase  following  actions  as  opposed  to 
non-actions  (Greyer  and  Gurhan  1997). 
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Both  factors  discussed  above  suggest  that  evaluators  in  cases  of  undetected  fraud  will  expe¬ 
rience  greater  counterfactual  thoughts  of  what  the  auditors  could  have  done  differently  to  detect 
the  fraud  when  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud,  relative  to 
when  the  auditors  did  not  investigate  for  the  fraud.  Specifically,  I  predict  that,  in  cases  of  unde¬ 
tected  fraud,  evaluators  will  perceive  auditors  who  performed  audit  procedures  to  investigate  for 
the  fraud  to  have  been  closer  to  detecting  the  fraud  than  auditors  who  did  not  investigate  for  the 
fraud.  Thus,  based  on  prior  research  examining  the  effect  of  outcome  closeness  on  counterfactual 
reasoning,  I  expect  evaluators’  thoughts  of  what  the  auditors  could  have  done  differently  to  detect 
the  fraud  to  increase  when  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud.  In 
addition,  prior  research  examining  the  effect  of  actions  (versus  non-actions)  on  counterfactual 
reasoning  also  indicates  that  evaluators’  thoughts  of  what  the  auditors  could  have  done  differently 
to  detect  the  fraud  will  increase  when  the  auditors  performed  audit  procedures  to  investigate  for 
the  fraud  (i.e.,  took  action),  relative  to  when  the  auditors  did  not  investigate  for  the  fraud  (i.e.,  did 
not  take  action). 

When  examining  the  extent  of  evaluators’  counterfactual  thoughts,  prior  research  considers 
several  factors,  including  both  frequency  and  intensity  (Roese  1994;  Sanna  and  Turley-Ames 
2000).  Sanna  and  Turley-Ames  (2000)  find  that  counterfactual  intensity  explains  variance  in 
affective  reactions  beyond  that  explained  by  counterfactual  frequency,  while  counterfactual  fre¬ 
quency  does  not  account  for  variation  beyond  that  of  counterfactual  intensity.  Accordingly,  I 
primarily  consider  counterfactual  intensity  as  opposed  to  counterfactual  frequency.  In  particular,  I 
expect  evaluators  in  cases  of  undetected  fraud  to  experience  more  intense  counterfactual  thoughts 
of  what  the  auditors  could  have  done  differently  to  detect  the  fraud  when  the  auditors  performed 
audit  procedures  to  investigate  for  the  fraud. 

HI:  Evaluators  in  cases  of  undetected  fraud  will  experience  more  intense  thoughts  of  what 
the  auditors  could  have  done  differently  to  detect  the  fraud  when  the  auditors  performed 
audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  inves¬ 
tigate  for  the  fraud. 

Consequences  of  Counterfactual  Reasoning 

As  noted  above,  evaluators’  assessments  of  individuals’  responsibility  for  prior  negative  out¬ 
comes  increase  with  the  extent  of  their  counterfactual  thoughts  (Miller  and  McFarland  1986; 
Macrae  1992;  Bothwell  and  Duhon  1994).  Prior  research  demonstrates  that  the  consequences  of 
counterfactual  reasoning  are  widespread,  including  emotions  and  causal  inferences  (Roese  1997). 
Thus,  several  affective  and  cognitive  mechanisms  could  mediate  the  relationship  between  coun¬ 
terfactual  reasoning  and  evaluator  assessments  of  individuals’  liability  for  prior  negative  out¬ 
comes.  In  the  following  paragraphs,  I  examine  several  specific  consequences  of  counterfactual 
reasoning  that  I  expect  to  increase  evaluators’  assessments  of  auditor  liability  in  cases  of  audit 
failure  due  to  undetected  fraud. 

One  frequently  cited  reason  why  counterfactual  reasoning  increases  evaluators’  liability  as¬ 
sessments  is  that  counterfactual  reasoning  amplifies  evaluators’  emotional  (affective)  reactions  to 
prior  negative  outcomes  (Robbennolt  and  Sobus  1997).  Counterfactual  reasoning  (i.e.,  thinking 
about  what  could  have  been  done  differently  to  avoid  a  negative  outcome)  increases  the  belief  that 
a  prior  outcome  could  have  been  avoided  (McMullen  et  al.  1995;  Roese  1997).  The  belief  that  a 
prior  negative  outcome  could  have  been  avoided  makes  the  actual  outcome  appear  more  negative 
and  poignant  (Roese  1997),  which  in  turn  amplifies  evaluators’  affective  responses  to  the  event 
(Markman  et  al.  1993;  Medvec  et  al.  1995;  Roese  and  Olson  1995;  Alicke  et  al.  2008).  Conse¬ 
quently,  evaluators’  affective  responses  to  prior  negative  outcomes  increase  with  the  intensity  of 
their  counterfactual  thoughts  (Sanna  and  Turley-Ames  2000). 
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Increased  affective  reactions  among  evaluators  in  litigation  settings  have  been  found  to  result 
in  antipathy  for  defendants  and  sympathy  for  plaintiffs  (Macrae  1992).  Therefore,  I  expect  more 
intense  counterfactual  thoughts  among  evaluators  (and  thus  greater  affective  reactions)  to  result  in 
greater  antipathy  for  defendants  and  sympathy  for  plaintiffs.  Prior  research  indicates  that  negative 
affect  directed  toward  defendants  increases  the  harshness  of  evaluators’  assessments  of  defendant 
liability  (Macrae  1992).  In  an  audit  litigation  setting,  Kadous  (2001)  finds  that  evaluators  in  cases 
of  undetected  fraud  use  negative  affect  as  a  cue  to  auditor  blameworthiness.’  In  addition.  Miller 
and  McFarland  (1986)  report  that  increased  sympathy  for  plaintiffs  results  in  harsher  judgments  of 
defendants.  Consequently,  in  cases  of  undetected  fraud,  the  combination  of  increased  antipathy  for 
the  auditors  and  increased  sympathy  for  the  plaintiff  likely  will  result  in  more  severe  judgments  of 
auditor  liability.  Therefore,  I  expect  more  intense  counterfactual  thoughts  (and  thus  greater  auditor 
antipathy/plaintiff  sympathy  among  evaluators)  to  result  in  harsher  evaluations  of  the  auditors  in 
cases  of  undetected  fraud. 

Prior  psychology  research  also  indicates  that  the  effects  of  counterfactual  reasoning  extend 
beyond  evaluators’  affective  reactions  (Roese  1997).  Therefore,  in  cases  of  undetected  fraud,  the 
intensity  of  evaluators’  counterfactual  thoughts  likely  affects  cognitive  assessments  that  are  rel¬ 
evant  to  the  evaluators’  ultimate  assessments  of  auditor  •  liability.  One  such  assessment  likely 
influenced  by  counterfactual  reasoning  is  evaluators’  assessment  of  auditor  competence.  In  par¬ 
ticular,  when  auditors  fail  to  detect  fraud,  thinking  about  what  the  auditors  could  have  done 
differently  to  detect  the  fraud  could  highlight  the  apparent  necessity  of  the  omitted  procedures  and, 
thus,  decrease  evaluators’  assessments  of  auditor  competence.  Consequently,  in  cases  of  undetec¬ 
ted  fraud,  I  expect  evaluators’  assessments  of  auditor  competence  to  directly  decrease  with  the 
intensity  of  their  thoughts  of  what  the  auditors  could  have  done  differently  to  detect  the  fraud. 

In  addition,  prior  research  suggests  that  counterfactual  reasoning  could  indirectly  decrease 
evaluators’  assessments  of  auditor  competence  through  evaluators’  affective  reactions.  There  are 
several  reasons  why  evaluators’  affective  reactions  could  affect  their  assessments  of  auditor  com¬ 
petence.  First,  prior  research  indicates  that  information  that  is  linked  with  an  individual’s  current 
affective  state  is  more  likely  to  be  activated,  recalled,  and  used  in  cognitive  tasks,  which  leads  to 
congruency  between  the  individual’s  affective  state  and  her  judgments  (Bower  1981;  Bower  and 
Forgas  2001).  Thus,  auditor  antipathy/plaintiff  sympathy  could  result  in  a  disproportionate  weight¬ 
ing  of  case  information  that  is  unfavorable  to  the  auditor  and  ultimately  decrease  evaluators’ 
assessments  of  auditor  competence.  Second,  evaluators  could  view  their  affective  reactions  as 
information  regarding  their  assessment  of  auditor  competence  (Schwarz  and  Clore  1983;  Kadous 
2001).  Specifically,  evaluators  could  view  negative  feelings  toward  the  auditors  as  an  indication 
that  the  auditors  did  not  perform  the  audit  competently.  Based  on  these  reasons,  I  expect  more 
intense  counterfactual  thoughts  to  indirectly  decrease  evaluators’  assessments  of  auditor  compe¬ 
tence  through  affective  reactions. 

Finally,  counterfactual  thoughts  imply  causal  connections  that  can  directly  affect  assessments 
of  blame  (Roese  1997).  For  example,  in  cases  of  undetected  fraud,  thinking  about  what  an  auditor 
could  have  done  differently  to  detect  the  fraud  indicates  that  the  audit  failure  would  not  have 
occurred  if  the  auditor  would  have  acted  differently.  In  turn,  the  belief  that  the  audit  failure  would 
not  have  occurred  if  the  auditor  would  have  acted  differently  implies  that  the  audit  failure  was 
caused  by  the  auditors.  Such  causal  implications  naturally  increase  evaluators’  assessments  of 
blame  (Roese  1997).  Thus,  beyond  the  effects  of  counterfactual  reasoning  on  evaluators’  affective 


'  Cornell  et  al.  (2009)  report  that,  by  issuing  an  apology  in  cases  of  audit  failure,  auditors  can  effectively  reduce 
evaluators’  affective  reactions  to  an  audit  failure,  which  decreases  evaluators’  need  to  assign  blame  and  ultimately 
reduces  the  severity  of  their  assessments  of  auditor  negligence.  This  finding  provides  further  empirical  support  for  the 
link  between  evaluators’  affective  responses  and  their  assessments  of  auditor  negligence  in  cases  of  audit  failure. 
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reactions  and  their  assessments  of  auditor  competence,  I  expect  evaluators’  assessments  of  auditor 
liability  to  directly  increase  with  the  intensity  of  their  counterfactual  thoughts. 

Summary  of  the  Antecedents  and  Consequences  of  Counterfactual  Reasoning 

Prior  research,  for  several  reasons,  suggests  that  evaluators  in  cases  of  undetected  fraud  will 
experience  more  intense  counterfactual  thoughts  of  what  the  auditors  could  have  done  differently 
to  detect  the  fraud  when  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud, 
relative  to  when  the  auditors  did  not  investigate  for  the  fraud  (HI).  As  a  result,  I  expect  evaluators 
to  be  more  likely  to  experience  antipathy  for  the  auditors,  to  be  more  likely  to  experience  sym¬ 
pathy  for  the  plaintiff,  and  to  have  lower  assessments  of  auditor  competence  when  the  auditors 
performed  audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did  not 
investigate  for  the  fraud.  Due  to  these  effects,  and  also  due  to  direct  effects  of  counterfactual 
reasoning  on  evaluators’  liability  assessments,  I  expect  evaluators’  between-participants  assess¬ 
ments  of  auditor  liability  to  be  more  severe  when  the  auditors  performed  audit  procedures  to 
investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  investigate  for  the  fraud.  Figure  1 
presents  a  model  of  the  hypothesized  effects  of  investigating  fraud  risks  on  evaluators’  assess¬ 
ments  of  auditor  liability  in  cases  of  undetected  fraud. 

H2:  Evaluators’  between-participants  assessments  of  auditor  liability  in  cases  of  undetected 
fraud  will  be  more  severe  when  the  auditors  performed  audit  procedures  to  investigate 
for  the  fraud,  relative  to  when  the  auditors  did  not  perform  audit  procedures  to  investi¬ 
gate  for  the  fraud. 

Importantly,  while  prior  research  is  consistent  with  H2,  the  prediction  does  not  lack  tension. 
In  studying  the  effects  of  counterfactual  reasoning  on  juror  assessments  of  defendant  liability. 
Miller  and  McFarland  (1986)  manipulate  outcome  closeness  (which,  in  turn,  positively  affects  the 
extent  of  evaluators’  counterfactual  thoughts)  through  a  factor  that  is  unrelated  to  the  defendant’s 
level  of  care.  If  one  holds  defendant  care  constant,  theory  clearly  predicts  that  increased  counter- 
factual  reasoning  will  result  in  harsher  liability  assessments  of  defendants.  In  the  current  study’s 
audit  litigation  setting,  however,  the  same  factor  expected  to  increase  the  intensity  of  evaluator 
counterfactual  thoughts  (i.e.,  whether  the  auditors  performed  audit  procedures  to  investigate  for 
the  fraud)  also  increases  the  level  of  care  provided  by  the  auditors.  That  is,  in  this  setting,  auditors 
provide  a  higher  level  of  care  when  the  evaluators’  counterfactual  reasoning  is  expected  to  be 
relatively  high,  and  a  lower  level  of  care  when  their  counterfactual  reasoning  is  expected  to  be 
relatively  low.  As  such,  the  higher  level  of  care  provided  by  auditors  could  offset  the  effects  of 
increased  counterfactual  reasoning  on  evaluators’  assessments  of  auditor  liability.  Consequently,  it 
is  not  self-evident  that  the  results  of  the  experiment  will  support  H2. 

Within-Participants  Assessments  of  Auditor  Liability 

In  addition  to  examining  evaluators’  between-participants  assessments  of  auditor  liability,  I 
also  examine  evaluators’  within-participants  assessments  of  auditor  liability.  The  purpose  of  ex¬ 
amining  participants’  within-participants  assessments  is  to  examine  whether  evaluators’  assess¬ 
ments  of  auditor  liability  in  the  more  natural  between-participants  environment  are  intentional 
(Kahneman  and  Tversky  1996).  In  the  following  paragraphs,  I  consider  whether  evaluators’  as¬ 
sessments  of  auditor  liability  in  a  within-participants  environment  likely  vary  from  their  assess¬ 
ments  of  auditor  liability  in  a  between-participants  environment. 

Hypothesis  2  predicts  that  evaluators’  between-participants  assessments  of  auditor  liability  in 
cases  of  undetected  fraud  will  be  more  severe  when  the  auditors  performed  audit  procedures  to 
investigate  for  the  fraud.  One  reason  why  this  prediction  might  not  hold,  however,  is  that  evalu¬ 
ators  could  have  more  favorable  assessments  of  the  level  of  care  provided  by  the  auditors  when  the 
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FIGURE  1 

Hypothesized  Model  of  the  Effects  of  Investigating  Fraud  Risks  on  Evaluator  Assessments  of 

Auditor  Liability 


This  figure  illustrates  my  proposed  model  of  how  performing  audit  procedures  to  investigate  fraud  risks  can 
affect  evaluators’  assessments  of  auditor  liability  in  cases  of  undetected  fraud.  The  parenthetical  comment  next 
to  each  link  represents  the  expected  coefficient  sign. 

“Investigation  for  fraud”  refers  to  whether  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud. 
“Counterfactual  intensity”  refers  to  the  intensity  of  participants’  thoughts  of  what  the  auditors  could  have  done 
differently  to  detect  the  fraud. 

“Affective  reactions  to  the  case”  refers  to  participants’  affective  reactions  to  the  case.  Positive  (negative)  num¬ 
bers  are  indicative  of  affective  reactions  that  are  favorable  to  the  auditor  (plaintiff). 

“Assessments  of  auditor  competence”  refers  to  participants’  assessments  of  how  competent  the  auditors  were  in 
their  performance  of  the  audit. 

“Assessments  of  auditor  negligence”  refers  to  participants’  probability  assessments  of  how  likely  it  was  that  the 
auditors  were  negligent  in  their  performance  of  the  audit. 


auditors  performed  audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did 
not  investigate  for  the  fraud.  Nevertheless,  I  expect  more  intense  counterfactual  thoughts  of  what 
the  auditors  could  have  done  differently  to  detect  the  fraud,  in  a  between-participants  environment, 
to  adversely  affect  evaluators’  assessments  of  the  level  of  care  provided  by  the  auditors. 

In  a  within-participants  environment,  however,  I  do  not  expect  more  intense  counterfactual 
thoughts  to  adversely  affect  evaluators’  assessments  of  the  level  of  care  provided  by  the  auditors. 
The  results  of  a  survey  by  Frank  et  al.  (2001)  suggest  that  non-accountant  evaluators  believe  that 
auditors  who  investigate  for  fraud  provide  a  higher  level  of  care  than  auditors  who  did  not 
investigate  for  fraud,  ceteris  paribus.  Therefore,  when  evaluators  in  cases  of  undetected  fraud  are 
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placed  in  an  environment  where  they  can  directly  compare  auditors  who  investigated  for  the  fraud 
versus  auditors  who,  under  identical  circumstances,  did  not  investigate  for  the  fraud,  I  expect 
evaluators  to  conclude  that  the  auditors  who  investigated  for  the  fraud  provided  a  higher  level  of 
care  than  the  auditors  who  did  not.  Thus,  I  expect  evaluators’  within-participants  assessments  of 
auditor  liability  to  be  less  severe  when  the  auditors  performed  audit  procedures  to  investigate  for 
the  fraud,  relative  to  when  the  auditors  did  not  investigate  for  the  fraud. 

H3:  Evaluators’  within-participants  assessments  of  auditor  liability  in  cases  of  undetected 
fraud  will  be  less  severe  when  the  auditors  performed  audit  procedures  to  investigate  for 
the  fraud,  relative  to  when  the  auditors  did  not  perform  audit  procedures  to  investigate 
for  the  fraud. 

III.  EXPERIMENTAL  DESIGN  AND  METHOD 

To  test  HI  and  H2,  I  employ  a  between-participants  design  that  manipulates  whether,  and  to 
what  extent,  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud  (no  investigation, 
low  investigation,  and  high  investigation).  To  test  H3,  I  employ  a  within-participants  design  that 
includes  the  same  manipulations  as  in  the  between-participants  setting.  Thus,  to  test  HI,  H2,  and 
H3, 1  compare  participant  responses  across  the  following  conditions:  (1)  no  investigation;  (2)  low 
investigation;  and  (3)  high  investigation. 

Participants 

Two  hundred  twenty-nine  undergraduate  students  participated  in  the  study.  Participants  were 
recruited  at  two  U.S.  state  universities  from  a  wide  array  of  non-business  school  courses.  To 
motivate  effort,  participants  were  truthfully  told  that  participants  who  correctly  answered  at  least 
85  percent  of  a  series  of  review  questions  would  be  entered  in  a  cash  lottery.  Prior  research  finds 
that  verdicts  from  student  participants  do  not  differ  from  more  diverse  samples  of  participants 
(Bomstein  1999;  Zickafoose  and  Bomstein  1999).  Similarly,  in  an  audit  litigation  setting,  Kadous 
(2001)  finds  no  significant  differences  in  the  judgments  made  by  student  participants  versus 
participants  selected  for  jury  duty.  Thus,  students  appear  to  be  appropriate  participants  for  this 
study. 

Experimental  Procedures 

Participants  were  first  randomly  assigned  to  experimental  conditions  (see  “Independent  Vari¬ 
ables”  below)  and  were  given  an  experimental  case  to  read  and  complete.^  The  case  packet 
provides  background  information  regarding  the  nature  and  purpose  of  financial  statements  and  the 
external  audit  process,  the  details  of  an  audit  of  a  fictional  mining  company,  the  details  of  a 
financial  statement  fraud  the  auditors  failed  to  detect,  and  the  transcript  from  a  subsequent  auditor 
negligence  lawsuit.  The  undetected  financial  statement  fraud  pertains  to  the  mining  company’s 
intentional  understatement  of  its  environmental  restoration  liability.  The  mining  company  perpe¬ 
trated  the  fraud  by  misrepresenting  to  the  auditors  the  costs  they  expected  to  incur  to  reclaim  one 
of  their  inactive  quarries.  The  transcript  from  the  lawsuit  includes  opening  and  closing  statements 
by  both  plaintiff  and  defense  attorneys  and  testimony  by  equally  credentialed  expert  witnesses  for 
both  the  plaintiff  and  defense. 

After  reading  the  case,  participants  make  several  assessments  of  auditor  liability  for  plaintiff 
damages  (see  dependent  variables).  Then,  participants  complete  a  post-experimental  survey  (see 
mediating  variables).  Finally,  participants  complete  a  within-participants  questionnaire,  in  which 
participants  read  different  scenarios  describing  how  the  auditors,  in  the  same  circumstance  as  the 


^  The  experimental  case  is  adapted  from  the  cases  used  in  Peecher  and  Piercey  (2008)  and  Kadous  (2000). 
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auditor  described  in  the  first  case,  performed  the  audit  differently  but  still  failed  to  detect  the 
fraud. ^  The  audit  scenarios  presented  in  the  within-participants  setting  correspond  to  the  condi¬ 
tions  the  participants  did  not  see  in  the  first  experimental  case.  After  each  scenario,  participants 
make  assessments  of  auditor  liability  for  the  auditors  in  the  given  scenario. 

Independent  Variables 

I  first  manipulate  whether  the  auditors  performed  audit  procedures  to  investigate  for  the 
perpetrated  fraud.  In  the  no-investigation  condition,  the  auditors  do  not  identify  the  fraud  as  a 
fraud  risk  and,  thus,  do  not  perform  audit  procedures  to  investigate  for  the  fraud  (no-investigation 
condition).  In  the  investigation  conditions,  the  auditors  perform  audit  procedures  to  investigate  for 
the  fraud.  In  one  investigation  condition,  the  auditors  investigate  for  the  fraud  by  making  inquiries 
of  the  company’s  president  (low-investigation  condition).  In  the  other  investigation  condition,  the 
auditors  make  inquiries  of  the  company  president  and  also  hire  a  geology  firm  to  test  the  soil  from 
a  sample  of  the  company’s  quarries  for  hazardous  materials  that  could  increase  their  environmen¬ 
tal  restoration  liability  (high-investigation  condition).  I  include  two  investigation  conditions  to 
examine  whether  the  effect  of  performing  audit  procedures  to  investigate  fraud  risks  on  evalua¬ 
tors’  assessments  of  auditor  liability  in  cases  of  undetected  fraud  varies  across  different  levels  of 
auditor  investigation. 

Dependent  Measures 

The  experiment  examines  the  intensity  of  evaluators’  counterfactual  thoughts  of  what  the 
auditors  could  have  done  differently  to  detect  the  fraud.  To  measure  the  intensity  of  participants’ 
counterfactual  thoughts,  participants  are  asked  on  a  post-experimental  survey  how  seriously,  on  a 
scale  from  0  to  100,  they  thought  about  what  the  auditors  could  have  done  differently  to  detect  the 
fraud.  Sanna  and  Turley-Ames  (2000)  use  a  similar  measure  of  counterfactual  intensity.  The 
intensity  of  participants’  counterfactual  thoughts  is  the  dependent  variable  of  interest  for  HI,  and 
is  expected  to  mediate  the  effect  predicted  in  H2. 

The  experiment  also  examines  three  measures  of  evaluators’  assessments  of  auditor  liability 
for  plaintiff  losses  in  cases  of  undetected  fraud.  The  first  measure  asks  participants  to  indicate 
whether  the  auditors  were  negligent  in  performing  the  audit  (yes,  no).  The  second  measure  asks 
participants  to  indicate  how  probable  (0-100  scale)  it  is  that  the  auditors  were  negligent  in  per¬ 
forming  the  audit.  Participants  who  indicate  that  the  auditors  were  negligent  are  then  asked  to 
indicate  the  amount  of  compensatory  damages  (0-$9  million)  they  would  require  the  auditors  to 
pay  to  the  plaintiff. 

Participants’  yes/no  decisions  of  auditor  negligence  and  their  probability  assessments  of  au¬ 
ditor  negligence  are  the  primary  dependent  variables  of  interest  for  H2  and  H3,  as  those  two 
variables  are  more  direct  measures  of  evaluators’  assessments  of  whether  the  auditors  are  liable  for 
plaintiff  losses,  which  is  the  study’s  construct  of  interest.  Participants’  damage  assessments,  while 
a  less  direct  measure  of  evaluations  of  negligence,  are  measured  and  reported  to  indicate  the 
potential  economic  significance  of  the  effects  of  differences  in  participants’  assessments  of  auditor 
negligence  across  conditions."^ 

Proposed  Mediating  Variables 

The  following  mediating  variables  were  measured  on  a  post-experimental  survey:  (1)  the 
direction  and  strength  of  participants’  affective  reactions  to  the  auditors  and  plaintiff;  and  (2) 


2 

I  counterbalanced  the  presentation  order  of  the  audit  scenarios  to  control  for  order  effects. 

Kadous  (2000)  notes  that  it  is  likely  that  once  a  juror  has  determined  that  an  auditor  is  liable  for  a  plaintiff’s  loss,  the 
juror  will  use  a  second  set  of  factors  to  determine  the  amount  of  the  liability. 
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participants’  assessments  of  auditor  competence.  To  measure  participants’  affective  reactions,  the 
survey  asks  participants  to  indicate  the  direction  and  strength  of  their  feelings  toward  the  auditors 
on  a  scale  from  —100  to  100  (—100  =  very  negative  feelings  to  100  =  very  positive  feelings). 
The  survey  then  asks  participants  to  indicate  the  direction  and  strength  of  their  feelings  toward  the 
plaintiff  on  the  same  scale.  I  subtract  participants’  responses  regarding  their  feelings  for  the 
plaintiff  from  their  responses  regarding  their  feelings  for  the  auditor  to  create  an  overall  measure 
of  participants’  affective  reactions  to  the  case  (—200  =  very  strong  pro-plaintiff  affective  reaction; 
0  =  neutral  overall  affective  reaction;  200  =  very  strong  pro-auditor  affective  reaction).^  To 
measure  participants’  assessments  of  auditor  competence,  the  survey  asks  participants  to  indicate 
how  competent,  on  a  scale  from  0  to  100,  they  believe  the  auditors  were  in  their  performance  of 
the  audit. 


IV.  RESULTS 

The  experimental  case  includes  several  questions  to  examine  the  salience  of  the  manipula¬ 
tions.  In  particular,  the  case  asks  participants  about  the  nature  of  the  auditors’  procedures  to 
investigate  for  the  perpetrated  fraud.  Participant  responses  to  these  questions  indicate  that  they 
attended  to  and  understood  the  manipulations.  Excluding  data  from  participants  who  did  not 
answer  the  manipulation  checks  correctly  does  not  change  the  inferences  drawn  from  tests  of  HI, 
H2,  or  H3. 

The  experiment  was  conducted  at  different  locations  on  separate  dates  approximately  22 
months  apart.  The  additional  participant  responses  provided  by  the  second  data  set  allow  for  a 
more  accurate  structural-equations  based  analysis  of  this  study’s  mediating  variables  (Lei  and 
Lomax  2005).  The  inferences  drawn  from  tests  of  HI,  H2,  and  H3  are  the  same  across  both  data 
sets.  In  addition,  there  are  no  significant  differences  in  participant  responses  for  any  dependent  or 
mediating  variables  across  the  two  data  sets.  Thus,  I  combine  participants’  responses  from  both 
data  sets  when  reporting  the  results  of  tests  of  HI,  H2,  H3,  and  the  results  of  the  study’s  structural 
equation-based  path  analysis. 

Test  of  HI 

To  test  HI,  I  compare  the  intensity  of  participants’  counterfactual  thoughts  when  the  auditors 
performed  audit  procedures  to  investigate  for  the  fraud  relative  to  when  the  auditors  did  not 
investigate  for  the  fraud  (see  Table  1).  Participants’  counterfactual  thoughts  are  more  intense  in  the 
high-investigation  condition  (Mean  (M)  =  60.79)  than  in  the  no-investigation  condition  (M  = 
45.86),  (ti5o  =  3.57,  p  <  0.001,  two-tailed).  In  addition,  participants’  counterfactual  thoughts  are 
more  intense  in  the  low-investigation  condition  (M  =  63.00)  than  in  the  no-investigation  condition 
(M  =  45.86),  (ti5i  =  4.14,  p  <  0.001,  two-tailed).  Thearefore,  supporting  HI,  results  indicate  that 
evaluators  in  cases  of  undetected  fraud  experience  more  intense  thoughts  of  what  the  auditors 
could  have  done  differently  to  detect  the  fraud  when  the  auditors  performed  audit  procedures  to 
investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  investigate  for  the  fraud.  Further¬ 
more,  this  result  holds  across  two  levels  of  fraud  investigation. 


^  Macrae  (1992)  demonstrates  that  counterfactual  reasoning  affects  evaluators’  affective  reactions  to  both  the  plaintiff  and 
the  defense  in  civil  litigation  contexts.  Thus,  the  effect  of  evaluators’  affective  reactions  on  their  assessments  of  auditor 
liability  likely  depends  on  the  evaluators’  affective  reactions  to  both  the  auditors  and  the  plaintiff.  Specifically,  m 
evaluator’s  negative  feelings  toward  the  auditors  will  be  more  likely  to  increase  her  assessment  of  auditor  liability  if  the 
evaluator  has  positive  feelings  toward  the  plaintiff  than  if  she  has  negative  feelings  toward  the  plaintiff.  Similarly,  an 
evaluator’s  positive  feelings  toward  the  plaintiff  will  be  more  likely  to  increase  her  assessment  of  auditor  liability  if  the 
evaluator  has  negative  feelings  toward  the  auditors  than  if  she  has  positive  feelings  toward  the  auditors 
®  The  intensity  of  participants’  counterfactual  thoughts  did  not  significantly  vary  across  the  low-  and  high-mvestigation 
conditions  (ti5i  =  0.546,  p  =  0.586,  two-tailed). 
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TABLE  1 

Between-Partidpants  Results 


Panel  A:  Descriptive  Statistics 


Experimental  Condition 

Counterfactual 
Intensity 
Mean 
(Std.  Dev.) 

Negligent 

(Yes/No) 

Proportion 

(Percent) 

Probability 
Negligent 
Mean 
(Std.  Dev.) 

Damages 
Awarded 
Mean 
(Std.  Dev.) 

No  investigation  (n  =  76) 

45.86 

30/76 

40.63 

$1.94mil 

(26.35) 

(39.47) 

(27.21) 

(2.72) 

Low  investigation  (n  =  77) 

63.00 

54/77 

51.75 

$3.82mil 

(24.86) 

(70.13) 

(24.97) 

(3.09) 

High  investigation  (n  =  76) 

60.79 

55/76 

53.03 

$3.56mil 

(25.19) 

(72.36) 

(25.06) 

(2.69) 

Column  total 

56.58 

139/229 

48.48 

$3. 11  mil 

(26.48) 

(60.70) 

(26.25) 

(2.95) 

Panel  B:  Planned  Contrasts  HI  and  H2 

No  Investigation  versus  High  Investigation 

HI;  Counterfactual  intensity 

t  =  3.57 

p  <  0.001 

H2:  Negligent  (yes/no) 

16.68 

p  <  0.001 

H2:  Probability  Negligent 

t  =  2.92 

p  =  0.004 

H2:  Damages  Awarded 

t  =  3.67 

p  <  0.001 

Panel  C:  Planned  Contrasts  HI  and  H2 

No  Investigation  versus  Low  Investigation 

HI:  Counterfactual  Intensity 

t  =  4.14 

p  <  0.001 

H2:  Negligent  (yes/no) 

/=  14.52 

p  <  0.001 

H2:  Probability  Negligent 

t  =  2.64 

p  =  0.009 

H2:  Damages  Awarded 

t  =  3.92 

p  <  0.001 

Panel  A  reports  the  descriptive  statistics  for  participants’  responses  regarding  the  intensity  of  participants’  counterfactual 
thoughts  and  their  between-participants  assessments  of  auditor  liability  across  experimental  conditions.  The  Counterfactual 
Intensity  column  reports  participants’  responses  regarding  the  intensity  of  their  counterfactual  thoughts  regarding  what  the 
auditors  could  have  done  differently  to  detect  the  fraud.  The  Negligent  (Yes/No)  column  reports  participants’  responses  as 
to  whether  (yes/no)  the  auditors  were  negligent  in  performing  the  audit.  The  Probability  Negligent  column  reports  partici¬ 
pants’  probability  assessments  of  whether  the  auditors  were  negligent.  The  Damages  Awarded  column  reports  the  amount 
of  damages  participants’  awarded  to  the  plaintiff. 

In  the  no-investigation  condition,  the  auditors  did  not  identify  the  perpetrated  fraud  as  a  fraud  risk  and  did  not  perform 
audit  procedures  to  investigate  the  fraud  risk.  In  the  low-investigation  condition,  the  auditors  identified  the  perpetrated 
fraud  as  a  fraud  risk  and  investigated  for  the  fraud  exclusively  by  making  inquiries  of  the  mining  company’s  president.  In 
the  high-investigation  condition,  the  auditors  investigated  for  the  fraud  by  making  inquiries  of  the  president  and  also  by 
hiring  a  geology  firm  to  test  a  random  sample  of  the  mining  company’s  inactive  quarries  for  the  presence  of  hazardous 
materials  that  might  cause  the  EPA  to  require  the  mining  company  to  use  a  more  expensive  restoration  method. 

Panel  B  reports  the  results  of  the  planned  comparisons  between  participants’  between-participants  responses  in  the  no¬ 
investigation  versus  the  high-investigation  conditions.  The  reported  p-value  remains  significant  at  a  Bonferroni  adjusted 
family-wise  a  of  .05. 

Panel  C  reports  the  results  of  the  planned  comparisons  between  participants’  between-participants  responses  in  the  no¬ 
investigation  versus  the  low-investigation  conditions.  The  reported  p-value  remains  significant  at  a  Bonferroni  adjusted 
family-wise  a  of  .05. 
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Test  of  H2 

To  test  H2, 1  compare  participants’  between-participants  assessments  of  auditor  liability  when 
the  auditors  performed  audit  procedures  to  investigate  for  the  fraud  relative  to  when  the  auditors 
did  not  investigate  for  the  fraud  (see  Table  1).  Participants  are  more  likely  to  hnd  the  auditors 
negligent  in  the  high-investigation  condition  (72.36  percent)  than  in  the  no-investigation  condition 
(39.47  percent),  (1,  n  =  152)  =  16.68,  p  <  0.001.  Next,  participants’  probability  assessments 
of  auditor  negligence  are  greater  in  the  high-investigation  condition  (M  =  53.03)  than  in  the 
no-investigation  condition  (M  =  40.63),  (tjso  =  2.92,  p  =  0.004,  two-tailed).  Finally,  the  monetary 
damages  awarded  to  the  plaintiff  are  greater  in  the  high-investigation  condition  (M  =  $3.56  mil) 
than  in  the  no-investigation  condition  (M  =  $1.94  mil),  (tj48  =  3.67,  p  <  0.001,  two-tailed). 

In  addition,  participants  are  more  likely  to  find  the  auditors  negligent  in  the  low-investigation 
condition  (70.13  percent)  than  in  the  no-investigation  condition  (39.47  percent),  (1,  n  =  153)  = 
14.52,  p  <  0.001.  Next,  participants’  probability  assessments  of  auditor  negligence  are  greater  in 
the  low-investigation  condition  (M  =  51.75)  than  in  the  no-investigation  condition  (M  =  40.63), 
(ti5i  =  2.64,  p  =  0.009,  two-tailed).  Finally,  monetary  damages  awarded  to  the  plaintiff  are  greater 
in  the  low-investigation  condition  (M  =  $3.82  mil)  than  in  the  no-investigation  condition  (M  = 
$1.94  mil),  (ti45  =  3.92,  p  <  0.001,  two-tailed).  Therefore,  participants’  between-participants  as¬ 
sessments  of  auditor  liability  are  consistently  more  severe  when  the  auditors  performed  audit 
procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  investigate  for  the 
fraud.  In  addition,  these  results  hold  across  two  levels  of  fraud  investigation.^  These  findings 
support  H2. 

Structural  Equations-Based  Path  Analysis 

The  theory  underlying  this  study  holds  that  particular  factors  mediate  the  relationship  between 
whether  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud  and  the  participants’ 
assessments  of  auditor  liability  in  cases  of  undetected  fraud.  Table  2  reports  descriptive  statistics 
for  these  mediating  variables.  Table  3  reports  simple  correlations  among  the  mediators,  partici¬ 
pants’  probability  assessments  of  auditor  negligence  and  the  manipulated  variable  (i.e.,  whether 
the  auditors  investigated  for  the  fraud).  To  examine  the  hypothesized  relationships  between  the 
auditors’  decision  to  investigate  for  the  fraud,  the  study’s  mediating  variables,  and  participants’ 
probability  assessments  of  auditor  negligence,  I  use  structural  equations-based  path  analysis. 

The  study’s  primary  construct  of  interest  is  evaluators’  assessments  of  auditor  negligence,  as 
opposed  to  evaluators’  subsequent  assessments  of  damage  amounts.  Therefore,  I  examine  partici¬ 
pants’  assessments  of  auditor  negligence,  as  opposed  to  their  damage  assessments,  in  the  structural 
equation-based  path  analysis.  I  examine  participants  probability  assessments  of  auditor  negli¬ 
gence,  as  opposed  to  participants’  yes/no  decisions  of  auditor  negligence,  because  structural  equa¬ 
tion  models  generally  use  continuous  variables  as  opposed  to  binary  categorical  variables,  as 
continuous  variables  are  less  likely  than  categorical  variables  to  violate  normality  assumptions 
within  structural  equation  models  (Kline  1998). 


’  Participant  assessments  of  auditor  liability  did  not  vary  across  the  low-  and  high-investigation  conditions  regarding 
judgments  (yes/no)  of  auditor  negligence  (1,  n  =  153)  =  0.09,  p  =  0.759),  probability  assessments  of  auditor 
negligence  6,51  =  0.315,  p  =  0.753),  or  damage  assessments  6147  =  0.55,  p  -  0.583). 

*  It  is  possible  to  use  categorical  variables  in  standard  structural  equation  models  if  there  is  not  a  severe  violation  of  the 
normality  assumption.  A  generally  accepted  rule  is  that  a  categorical  variable  can  be  used  in  a  standard  structoal 
equation  model  if  the  distribution’s  kurtosis  does  not  exceed  -l-/~1.5  (Schumacher  and  Lomax  2004).  The  kurtosis  of 
the  distribution  of  participants’  yes/no  decisions  of  auditor  negligence  is  -1.82,  which  is  outside  of  the  acceptable 

range. 
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TABLE  2 

Descriptive  Statistics  for  Mediating  Variables 


Experimental  Condition 

Counterfactual 

Intensity 

Mean 
(Std.  Dev.) 

Participants’ 
Affective  Reactions 
Mean 
(Std.  Dev.) 

Participants’ 
Assessments 
of  Auditor 
Competence 
Mean 
(Std.  Dev.) 

No  investigation  (n  =  76) 

45.86 

-3.03 

68.68 

(26.35) 

(45.54) 

(20.98) 

Low  investigation  (n  =  77) 

63 

-23.34 

60.29 

(24.86) 

(48.27) 

(21.45) 

High  investigation  (n  =  76) 

60.79 

-18.11 

62.28 

(25.19) 

(46.65) 

(23.07) 

Column  Total 

56.58 

-14.86 

63.73 

(26.48) 

(47.43) 

(22.05) 

This  table  reports  the  mean  and  (standard  deviation)  of  the  study’s  proposed  mediating  variables  by  condition.  See  Table 
1  for  a  description  of  the  experimental  conditions. 

Descriptions  of  mediating  variables  are  below. 

“Counterfactual  Intensity”  refers  to  participants’  responses  on  a  scale  from  0  to  100  as  to  how  seriously  they  thought  about 
what  the  auditors  could  have  done  differently  to  prevent  the  audit  failure. 

“Participants’  Affective  Reactions”  refers  to  the  strength  of  participants’  overall  affective  reactions  to  the  case  (-200  = 
very  strong  pro-plaintiff  reaction,  0  =  neutral  overall  reaction,  200  =  very  strong  pro-auditor  reaction). 

“Participants’  Assessments  of  Auditor  Competence”  refers  to  participants’  assessments  on  a  scale  from  0  to  100  as  to  the 
competence  of  the  auditors. 


In  performing  the  structural  equations-based  path  analysis,  I  combine  participants’  responses 
in  the  high-  and  low-investigation  conditions  into  one  group.  Combining  the  high-  and  low- 
investigation  conditions  is  appropriate  because  the  study’s  theory  and  hypotheses  pertain  to 
whether  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud,  as  opposed  to  the 
extent  of  the  procedures  performed.  In  addition,  there  are  no  significant  differences  between  the 
high-  and  low-investigation  conditions  regarding  the  proposed  mediating  variables  or  dependent 
variables.  Examining  the  low-  and  high-investigation  conditions  individually  does  not  change  the 
inferences  from  the  path  model. 

A  likelihood  ratio  Chi-square  test  indicates  the  model-implied  covariance  matrix  does  not 
differ  from  the  observed  covariance  matrix  (3,  n  =  229)  =  5.01,  p  =  0.171.  The  Tucker-Lewis 
Index  indicates  the  proportion  of  improvement  of  the  fit  of  the  model  over  a  null  model  is  0.960 
and  the  incremental  fit  index  is  0.988,  both  of  which  are  above  the  standard  cutoff  value  of  an 
acceptable  fit  of  0.90  (Kline  1998).  Thus,  the  model  describes  the  relationships  in  the  data  well.  I 
use  t-tests  to  test  the  significance  of  each  path  coefficient. 

Figure  2  presents  the  path  model  with  unstandardized  coefficients  for  each  hypothesized  link. 
Consistent  with  HI,  the  relationship  between  whether  the  auditors  investigated  for  the  fraud  and 
the  intensity  of  participants’  counterfactual  thoughts  of  what  the  auditors  could  have  done  differ¬ 
ently  to  detect  the  fraud  (link  1)  is  significantly  positive  (t227  =  4.51,  p  <  0.001,  two-tailed).  The 
relationship  between  the  intensity  of  participants’  counterfactual  thoughts  and  their  affective  re¬ 
actions  (link  2)  is  significantly  negative— i.e.,  as  participants’  counterfactual  thoughts  increased  in 
intensity,  their  affective  reactions  were  less  favorable  to  the  auditor,  (t227  =  4.89,  p  <  0.001, 
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TABLE  3 

Correlation  Matrix  for  Mediating  Variables 
(between-participants  environment) 


Auditor 

Investigation 

Counter- 

factual 

Intensity 

Participants’ 

Affective 

Reactions 

Participants’ 
Assessment 
of  Auditor 
Competence 

Probability 
Assessment 
of  Auditor 
Negligence 

Auditor  investigation 
Counterfactual  intensity 
Participants’  affective  reactions 
Participants’  assessments  of 

1 

0.286** 

-0.176** 

-0.159* 

1 

-0.308** 

-0.249** 

1 

0.338** 

1 

auditor  competence 
Assessment  of  the  probability 

0.211** 

0.320** 

-0.361** 

-0.528** 

1 

of  auditor  negligence 


*,  **  Indicate  the  estimated  coefficients  are  significantly  different  from  0  at  5  percent  and  1  percent  levels  respectively,  in 
two-tailed  tests. 

This  table  reports  the  Pearson  correlations  between  the  study’s  independent  variable  i.e.,  whether  the  auditors  investigated 
for  the  perpetrated  fraud,  the  study’s  proposed  mediating  variables,  and  participants’  probability  assessments  of  auditor 
negligence. 

“Auditor  investigation”  refers  to  whether  the  auditors  performed  audit  procedures  to  investigate  for  the  perpetrated  fraud. 
“Counterfactual  intensity”  refers  to  participants’  responses  on  a  scale  from  0  to  100  as  to  how  seriously  they  thought  about 
what  the  auditors  could  have  done  differently  to  prevent  the  audit  failure. 

“Participants’  affective  reactions”  refers  to  the  strength  of  participants’  overall  affective  reactions  to  the  case  (-200  =  very 
strong  pro-plaintiff  reaction,  0  =  neutral  overall  reaction,  200  =  very  strong  pro-auditor  reaction). 

“Participants’  assessments  of  auditor  competence”  refers  to  participants’  assessments  on  a  scale  from  0  to  100  as  to  the 
competence  of  the  auditors. 

“Assessment  of  the  probability  of  auditor  negligence”  refers  to  participants’  assessments  of  the  probability  that  the  auditors 
were  negligent  in  their  performance  of  the  audit. 


two-tailed).  The  relationship  between  the  intensity  of  participants’  counterfactual  thoughts  and 
participants’  assessments  of  auditor  competence  (link  3)  is  significantly  negative  (t227  =  2.48,  p  = 
0.013,  two-tailed).  The  relationship  between  evaluators’  affective  reactions  and  their  assessments 
of  auditor  competence  (link  4)  is  significantly  positive  (t227  =  4.46,  p  <  0.001,  two-tailed),  indi¬ 
cating  that  participants’  assessments  of  auditor  competence  increased  (decreased)  as  their  affective 
reactions  were  more  (less)  favorable  to  the  auditor. 

As  expected,  the  direct  relationship  between  the  intensity  of  participants  counterfactual 
thoughts  and  their  probability  assessments  of  auditor  negligence  (link  5)  is  significantly  positive 
(t227  =  2.82,  p  =  0.005,  two-tailed).  The  relationship  between  participants’  affective  reactions  to 
the  case  and  their  assessments  of  auditor  negligence  (link  6)  is  significantly  negative  (t227  =  2.80, 
p  =  0.005,  two-tailed) — i.e.,  as  participants’  affective  reactions  were  less  favorable  to  the  auditor, 
their  assessments  of  auditor  negligence  increased.  The  relationship  between  participants  assess¬ 
ments  of  auditor  competence  and  their  assessments  of  auditor  negligence  (link  7)  is  significantly 
negative  (t227  =  7.44,  p  <  0.001,  two-tailed).^ 


®  A  separate  model  that  includes  a  direct  fink  between  whether  the  auditors  investigated  for  the  fraud  and  participant 
assessments  of  auditor  liability  (along  with  the  links  between  the  proposed  mediating  variable  as  found  in  Figure  2) 
indicates  the  direct  relationship  between  whether  the  auditors  investigated  for  the  fraud  and  participants’  probability 
assessments  of  auditor  negligence  is  not  significant  1.33,  p  =  0.183,  two-tailed).  However,  if  one  removes  the 

proposed  mediating  variables  from  the  model,  then  the  direct  relationship  between  whether  the  auditors  investigated  for 
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In  summary,  consistent  with  the  hypothesized  predictions,  path  analysis  indicates  that,  in 
cases  of  undetected  fraud,  evaluators  experience  more  intense  thoughts  of  what  the  auditors  could 
have  done  differently  to  detect  the  fraud  when  the  auditors  performed  audit  procedures  to  inves¬ 
tigate  for  the  fraud.  As  a  result,  evaluators’  affective  reactions  to  the  case  are  less  favorable  to  the 
auditors  and  their  assessments  of  auditor  competence  are  lower  when  the  auditors  performed  audit 
procedures  to  investigate  for  the  fraud.  In  turn,  these  effects  increase  evaluators’  probability 
assessments  of  auditor  negligence. 

Sensitivity  analysis  was  conducted  by  calculating  modification  indices  for  the  model.  Modi¬ 
fication  indices  estimate  the  increase  in  the  overall  model’s  fit  by  adding  specific  paths  not  in¬ 
cluded  in  the  current  model.  Thus,  modification  indices  indicate  whether  model  components  are 
connected  in  a  way  not  specified  by  the  model  (Kadous  et  al.  2005).  An  examination  of  modifi¬ 
cation  indices  suggests  that  no  additional  paths  will  increase  the  model’s  fit,  using  the  conven¬ 
tional  0.05  level  of  statistical  significance. 

Additional  Analysis 

Recall  from  Section  II,  prior  research  suggests  two  potential  reasons  why  evaluators  have 
more  intense  counterfactual  thoughts  of  what  the  auditors  could  have  done  differently  to  detect  the 
fraud  when  the  auditors  investigated  for  the  fraud.  The  first  reason  pertains  to  outcome  closeness 
(Kahneman  and  Miller  1986)  and  the  second  pertains  to  the  relationship  between  action  (versus 
inaction)  and  counterfactual  reasoning  (Sherman  and  McConnell  1995).  To  investigate  the  effects 
of  evaluators’  perceptions  of  outcome  closeness,  I  measure,  on  the  post-experimental  survey, 
participant  perceptions  of  how  close  the  auditors  were  to  detecting  the  fraud  (0  to  100  scale). 
Consistent  with  expectations,  participants  perceived  auditors  who  investigated  for  the  fraud  as 
having  been  closer  to  detecting  the  fraud  than  auditors  who  did  not  (t227  =  4.86,  p  <  0.001, 
two-tailed).  In  addition,  participants’  perceptions  of  outcome  closeness  are  positively  correlated 
with  the  intensity  of  their  counterfactual  thoughts  (p  =  0.061,  two-tailed).  However,  the  relation¬ 
ship  between  participants’  perceptions  of  outcome  closeness  and  the  intensity  of  their  counterfac¬ 
tual  thoughts  is  not  significant  when  the  analysis  controls  for  whether  the  auditors  investigated  for 
the  fraud  (t227  =  0.72,  p  =  0.473,  two-tailed).  This  result  indicates  that  more  intense  evaluator 
counterfactual  thoughts  when  the  auditors  investigated  for  the  fraud  (link  1)  are  not  attributable  to 
increased  perceptions  of  outcome  closeness,  but  rather  to  the  fact  the  auditors  took  action  to 
investigate  for  the  fraud  (Gleicher  et  al.  1990;  Sherman  and  McConnell  1995).  Kahneman  and 
Miller  (1986)  assert  that  the  intensity  of  evaluators’  counterfactual  thoughts  increases  with  the 
mutability  of  prior  events.  The  results  suggest  that  the  fact  that  the  auditors  took  action  to  inves¬ 
tigate  for  the  fraud  makes  the  auditors’  failure  to  detect  the  fraud  more  mutable.  Specifically, 
results  suggest  that  when  evaluators  know  that  the  auditors  took  action  to  investigate  for  a  perpe¬ 
trated  fraud,  it  is  relatively  easy  for  the  evaluators  to  imagine  the  auditors  adjusting  their  audit 
procedures  in  manner  that  would  have  allowed  them  to  detect  the  fraud.  However,  when  evalua¬ 
tors  know  that  the  auditors  did  not  identify  the  perpetrated  fraud  as  a  fraud  risk  and,  thus,  did  not 


the  fraud  and  participants’  probability  assessments  of  auditor  negligence  is  significantly  positive  6227  =  3.27,  p  =  0.001, 
two-tailed).  These  results  suggest  the  effect  of  investigating  fraud  risks  on  participants’  assessments  of  auditor  negli¬ 
gence  in  cases  of  undetected  fraud  is  mediated  by  the  intensity  of  participants’  counterfactual  thoughts,  their  affective 
reactions  to  the  auditor  and  plaintiff,  and  their  assessments  of  auditor  competence.  Sobel  tests  for  mediation  indicate  that 
each  mediating  variable  is  a  significant  mediator  when  examined  individually.  First,  the  intensity  of  participants’ 
counterfactual  thoughts,  when  examined  individually,  mediates  the  effect  of  auditor  investigation  on  evaluators’  assess¬ 
ments  of  auditor  negligence  in  cases  of  undetected  fraud  (Z  =  3.11,  p  =  0.002,  two-tailed).  Next,  participants’  affective 
reactions,  when  examined  individually,  mediate  the  effect  of  auditor  investigation  on  evaluators’  assessments  of  auditor 
negligence  (Z  =  2.42,  p  =  0.016,  two-tailed).  Finally,  participants’  assessments  of  auditor  competence,  when  examined 
individually,  mediate  the  effect  of  auditor  investigation  on  evaluators’  assessments  of  auditor  negligence  (Z  =  2.34  p  = 
0.019,  two-tailed). 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


Can  Identifying  and  Investigating  Fraud  Risks  Increase  Auditors  ’Liability? 


2161 


FIGURE  2 

Empirical  Model  of  the  Effects  of  Investigating  Fraud  Risks  on  Evaluator  Assessments  of 

Auditor  Liability 


This  figure  shows  the  results  of  the  path  analysis  that  simultaneously  tests  the  relationships  among  the  proposed 
mediating  variables.  Unstandardized  path  coefficient  and  corresponding  p-value  (two-tailed)  from  t-tests  are 
shown  next  to  each  path.  Overall  goodness  of  fit  is  measured  through  the  Tucker-Lewis  index  (96.0  percent), 
which  measures  the  proportion  of  improvement  of  the  fit  of  the  model  over  a  null  model,  and  confirmed  with  a 
Chi-square  test,  x^(3,n  =  229)  =  5.01,  p  =  0.171,  and  an  incremental  fit  index  (98.8  percent). 

“Investigation  for  fraud”  refers  to  whether  the  auditors  performed  audit  procedures  to  investigate  for  the  fraud. 
In  performing  the  above  path  analysis,  I  combined  participants’  responses  in  the  high-  and  low-investigation 
conditions  into  one  group.  Combining  the  high-  and  low-investigation  conditions  is  appropriate  because  the 
paper’s  theory  and  hypotheses  pertain  to  whether  the  auditors  performed  audit  procedures  to  investigate  for  the 
fraud,  as  opposed  to  the  extent  of  the  procedures.  In  addition,  there  are  no  significant  differences  between  the 
high-  and  low-investigation  conditions  regarding  the  proposed  mediating  variables  or  dependent  variables.  Ex¬ 
amining  the  low-  and  high-investigation  conditions  individually  does  not  change  the  inferences  from  the  path 
model. 

“Counterfactual  intensity”  refers  to  the  intensity  of  participants’  thoughts  of  what  the  auditors  could  have  done 
differently  to  detect  the  fraud. 

“Affective  reactions  to  the  case”  refers  to  participants  affective  reactions  to  the  case.  Positive  (negative)  num¬ 
bers  are  indicative  of  affective  reactions  that  are  favorable  to  the  auditor  (plaintiff). 

“Assessments  of  auditor  competence”  refers  to  participants’  assessments  of  how  competent  the  auditors  were  in 
their  performance  of  the  audit. 

“Assessments  of  auditor  negligence”  refers  to  participants’  probability  assessments  of  how  likely  it  was  that  the 
auditors  were  negligent  in  their  performance  of  the  audit. 
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investigate  for  the  fraud,  it  is  relatively  difficult  for  the  evaluators  to  imagine  the  auditors  first 
identifying  the  fraud  as  a  fraud  risk  and  then  deciding  to  perform  the  correct  set  of  audit  proce¬ 
dures  necessary  to  detect  the  fraud. 

Test  of  H3 

To  test  H3,  I  compare  participants’  within-participants  assessments  of  auditor  liability  when 
the  auditors  investigated  for  the  fraud  relative  to  when  the  auditors  did  not  investigate  for  the  fraud 
(see  Table  4).  A  Chi-square  test  from  a  generalized  estimating  equation  model  indicates  that  the 
probability  of  participants  in  a  within-participants  environment  finding  the  auditors  negligent  is 


TABLE  4 


Within-Participants  Results 


Experimental  Condition 

Negligent 

(Yes/No) 

Proportion 

(Percent) 

Probability 

Negligent 

Mean 
(Std.  Dev.) 

Damages 
Awarded 
Mean 
(Std.  Dev.) 

No  investigation  (n  =  225) 

187/225 

68.25 

$5.28mil 

(83.11) 

(27.00) 

(3.04) 

Low  investigation  (n  =  227) 

154/227 

55.35 

$3.92mil 

(67.84) 

(31.05) 

(3.25) 

High  investigation  (n  =  226) 

75/226 

28.03 

$1.78mil 

(33.19) 

(30.24) 

(2.74) 

Column  Total 

416/678 

50.58 

$3.65mil 

(61.36) 

(33.88) 

(3.34) 

Panel  B:  Planned  Contrasts  H3 
No  Investigation  versus  High  Investigation 

H3:  Negligent  (yes/no)  ;^  =  86.86  p  <  0.001 

H3:  Probability  negligent  t  =  14.25  p  <  0.001 

H3:  Damages  awarded  t  =  12.10  p  <  0.001 


Panel  C:  Planned  Contrasts  H3 
No  Investigation  versus  Low  Investigation 

H3:  Negligent  (yes/no)  12.53  p  <  0.001 

H3:  Probability  negligent  t  =  4.75  p  <  0.001 

H3:  Damages  awarded  t  =  4.64  p  <  0.001 


Panel  A  reports  the  descriptive  statistics  for  participants’  responses  regarding  the  intensity  of  participants’  counterfactual 
thoughts  and  their  within-participants  assessments  of  auditor  liability  across  experimental  conditions.  See  Table  1  for  a 
description  of  experimental  conditions  and  auditor  liability  assessments. 

Panel  B  reports  the  results  of  the  planned  comparisons  between  participants’  between-participants  responses  in  the  no¬ 
investigation  versus  the  high-investigation  conditions.  Chi-square  tests  are  produced  by  generalized  estimating  equation 
and  t-tests  come  from  pairwise  comparison  from  a  repeated  measure  ANOVA.  The  reported  p-value  remains  significant  at 
a  Bonferroni  adjusted  family-wise  a  of  .05. 

Panel  C  reports  the  results  of  the  planned  comparisons  between  participants’  between-participants  responses  in  the  no¬ 
investigation  versus  the  low-investigation  conditions.  Chi-square  tests  are  produced  by  generalized  estimating  equation  and 
t-tests  come  from  pairwise  comparison  from  a  repeated  measure  ANOVA.  The  reported  p-value  remains  significant  at  a 
Bonferroni  adjusted  family-wise  a  of  .05. 
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smaller  in  the  high-investigation  condition  (33.19  percent)  than  in  the  no-investigation  condition 
(83.11  percent)  (^  =  86.86,  p  <  0.001).**^  Second,  pairwise  comparison  from  a  repeated  measures 
ANOVA  indicates  that  participants’  within-participants  probability  assessments  of  auditor  negli¬ 
gence  are  smaller  in  the  high-investigation  condition  (M  =  28.03)  than  in  the  no-investigation 
condition  (M  =  68.25),  (t262=  14.25,  p  <  0.001,  two-tailed).  Finally,  pairwise  comparison  from  a 
repeated  measures  ANOVA  indicates  that  participants’  within-participants  damage  awards  are 
smaller  in  the  high-investigation  condition  (M  =  $1.78  mil)  than  in  the  no-investigation  condition 
(M  =  $5.28  mil),  (t26o=  12.10,  p  <  0.001,  two-tailed). 

In  addition,  a  Chi-square  test  from  a  generalized  estimating  equation  model  indicates  that  the 
probability  of  participants  in  a  within-participants  environment  finding  the  auditors  negligent  is 
smaller  in  the  low-investigation  condition  (67.84  percent)  than  in  the  no-investigation  condition 
(83.11  percent),  {x^=  12.53,  p  <  0.001).  Second,  pairwise  comparison  from  a  repeated  measures 
ANOVA  indicates  that  participants’  within-participants  probability  assessments  of  auditor  negli¬ 
gence  are  smaller  in  the  low-investigation  condition  (M  =  55.35)  than  in  the  no-investigation 
condition  (M  =  68.25),  (t262  =  4.75,  p  <  0.001,  two-tailed).  Finally,  pairwise  comparison  from  a 
repeated  measures  ANOVA  indicates  that  participants’  within-participants  monetary  damage 
awards  are  smaller  in  the  low-investigation  condition  (M  =  $3.92  mil)  than  in  the  no-investigation 
condition  (M  =  $5.28  mil),  (t26o  =  4.64,  p  <  0.001,  two-tailed).  Therefore,  participants’  within- 
participants  assessments  of  auditor  liability  are  consistently  less  severe  when  the  auditors  per¬ 
formed  audit  procedures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  inves¬ 
tigate  for  the  fraud.  These  results  support  H3.'* 

V.  SUMMARY  AND  DISCUSSION 

Practitioners  and  legal  scholars  have  expressed  concern  that  the  U.S.  legal  system  might,  in 
cases  of  undetected  fraud,  effectively  penalize  auditors  for  identifying  and  investigating  fraud 
risks  (AICPA  2004;  Coffee  2004;  Golden  et  al.  2006).  This  study  provides  relevant  psychology 
theory  and  experimental  evidence  to  indicate  whether  such  concern  is  warranted.  Consistent  with 
counterfactual  reasoning  theory,  evaluators  in  a  natural  (i.e.,  between-participants)  setting  are  more 
likely  to  hold  auditors  liable  for  failing  to  detect  fraud  when  the  auditors  performed  audit  proce¬ 
dures  to  investigate  for  the  fraud,  relative  to  when  the  auditors  did  not  investigate  for  the  fraud. 
Therefore,  the  study  suggests  that  recent  concern  regarding  the  potential  legal  hazards  of  investi¬ 
gating  fraud  risks  is  warranted.  While  I  do  not  examine  auditors’  overall  incentives  to  investigate 
fraud  risks,  the  possibility  of  the  legal  system,  in  any  circumstance,  effectively  penalizing  auditors 
for  investigating  fraud  risks  is  troubling.  Future  research  is  therefore  needed  to  further  assess  the 
severity  of  this  finding. 

In  addition,  I  find  that  evaluators  in  a  within-participants  setting  are  less  likely  to  hold 
auditors  liable  for  failing  to  detect  fraud  when  the  auditors  performed  audit  procedures  to  inves¬ 
tigate  for  the  fraud.  This  finding  demonstrates  that  evaluators  understand  that  higher  levels  of 


Generalized  estimating  equations  permit  specification  of  a  working  correlation  matrix  that  controls  for  within-subject 
correlation  of  responses  on  dependent  variables  of  several  different  distributions,  including  normal,  binomial,  and 
Poisson.  Consequently,  generalized  estimating  equations  estimate  more  efficient  and  unbiased  regression  parameters 
relative  to  ordinary  least-squares  regression,  particularly  for  dichotomous  dependent  variables  (Ballinger  2004). 

"  Within-participant  assessments  of  auditor  liability  are  consistently  less  severe  in  the  high-  versus  the  low-investigation 
condition.  A  Chi-squcne  test  from  a  generalized  estimating  equation  model  indicates  that  the  probability  of  participants 
in  a  within-participants  environment  finding  the  auditors  negligent  is  smaller  in  the  high-  versus  the  low-investigation 
condition  (;^^  =  63.20,  p  <  0.001).  Second,  pairwise  comparison  from  a  repeated  measures  ANOVA  indicates  that 
participants’  within-participants  probability  assessments  of  auditor  negligence  are  smaller  in  the  high-  versus  the  low- 
investigation  condition  (t262=  10.93,  p  <  0.001).  Finally,  pairwise  comparison  from  a  repeated  measures  ANOVA 
indicates  that  participants’  within-participants  damage  assessments  are  smaller  in  the  high-  versus  the  low-investigation 
condition  (t26o  =  8.61,  p  <  0.001). 
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auditor  investigation  imply  lower  levels  of  auditor  liability  and  that  evaluators  intend  to  apply  this 
standard  in  cases  of  undetected  fraud.  Consequently,  the  observed  between-participants  decision 
pattern  appears  to  be  unintentional  (Kahneman  and  Tversky  1996).  This  is  important  because  it 
suggests  that  a  debiasing  method  could  potentially  reverse  the  troubling  between-participants 
decision  pattern  (Wegener  et  al.  2000).  Prior  research  identifies  several  methods  of  debiasing 
evaluators’  assessments  of  auditor  liability  (e.g.,  Anderson  et  al.  1997;  Kadous  2001;  Clarkson  et 
al.  2002;  Grenier  et  al.  2009).  The  knowledge  gained  from  prior  studies  and  from  this  study 
provides  direction  for  future  research  attempting  to  identify  an  effective  method  of  reversing 
evaluators’  unintentional  tendency,  when  evaluating  individual  cases  of  undetected  fraud,  to  judge 
auditors  who  investigated  for  the  fraud  more  harshly  than  auditors  who  did  not  investigate  for 
fraud. 

This  study  also  extends  psychology  research  examining  the  effects  of  counterfactual  reason¬ 
ing  on  evaluators’  assessments  of  defendant  liability  in  litigation  settings.  Prior  studies  have 
examined  settings  where  the  level  of  care  provided  by  the  defendant  is  constant  across  high-  and 
low-counterfactual  conditions.  Conversely,  the  level  of  care  provided  by  the  defendant  (i.e.,  the 
auditor)  in  this  study  varies  between  conditions.  I  find  that  the  intensity  of  evaluators’  counterfac¬ 
tual  thoughts  increases  with  the  auditors’  level  of  fraud  investigation  and,  thus,  also  increases  with 
the  level  of  care  provided  by  the  auditor.  In  addition,  consistent  with  prior  research,  I  find  that  the 
severity  of  evaluators’  assessments  of  auditor  liability  increases  with  the  intensity  of  their  coun¬ 
terfactual  thoughts.  These  findings  suggest  that  the  effects  of  counterfactual  reasoning  increase 
evaluators’  assessments  of  defendant  liability  in  settings  where  more  intense  counterfactual  rea¬ 
soning  is  associated  with  higher  levels  of  defendant  care. 

This  study  is  subject  to  limitations.  First,  consistent  with  cases  used  in  prior  research,  differ¬ 
ences  in  the  auditors’  actions  across  conditions  require  that  the  plaintiff  and  defense  attorneys’ 
arguments  vary  between  conditions.  In  the  no-investigation  condition,  the  plaintiff  attorneys  argue 
that  the  lack  of  awareness  and  consequent  investigation  of  a  major  fraud  risk  indicates  auditor 
negligence.  In  the  low-  and  high-investigation  conditions,  the  plaintiff  attorneys  argue  that  the 
failure  to  detect  a  fraud  that  the  auditors  investigated  indicates  the  auditors  did  not  effectively 
investigate  a  known  fraud  risk  and  were  therefore  negligent.  In  all  conditions,  the  defense  attor¬ 
neys  argue  the  fraud  is  extremely  rare,  that  other  auditors  would  not  have  detected  the  fraud,  and 
that  auditors  cannot  be  expected  to  detect  all  frauds.  In  addition,  in  the  high-investigation  condi¬ 
tion,  the  defense  attorney  compels  the  plaintiff’s  expert  witness  to  concede  that  sample  testing  is 
consistent  with  generally  accepted  auditing  standards.  The  differences  across  conditions  raise  the 
possibility  that  the  between-participants  results  could  be  either  partially  or  fully  attributable  to 
differences  in  the  strength  of  the  attorneys’  arguments  across  conditions  as  opposed  to  the  pre¬ 
sented  theory.  There  are  several  reasons,  however,  why  I  believe  this  possibility  is  unlikely. 

First,  prior  to  running  the  experiment,  a  legal  scholar  reviewed  the  experimental  materials  for 
realism  and  ensured  the  consistency  of  the  strength  of  the  wording  of  the  attorneys’  arguments 
across  conditions.  I  incorporated  her  suggestions  into  the  experimental  materials  to  her  satisfaction 
that  the  experimental  case  was  realistic  and  that  the  strength  of  the  attorneys’  language  did  not 
vary  across  conditions.  Second,  several  reported  results  collectively  suggest  the  observed  between- 
participants  effects  are  attributable  to  the  presented  theory,  as  opposed  to  differences  in  the 
strength  of  the  attorneys’  arguments.  Specifically,  there  is  no  significant  difference  in  the  intensity 
of  participants’  counterfactual  thoughts  across  the  low-  and  high-investigation  conditions.  Yet,  the 
content  of  the  attorneys’  arguments  significantly  varied  across  these  conditions,  which  suggests 
that  the  intensity  of  evaluators’  counterfactual  thoughts  was  not  a  function  of  the  content  of  the 
attorneys’  arguments  but,  rather,  a  function  of  whether  the  auditors  performed  audit  procedures  to 
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investigate  for  the  fraud.  Also,  within  each  experimental  condition  (and  thus  holding  the  content  of 
the  attorneys’  arguments  constant),  there  is  a  significant  positive  correlation  between  the  intensity 
of  participants’  counterfactual  thoughts  and  their  assessments  of  auditor  negligence.  This  indicates 
that  more  intense  counterfactual  reasoning  was  associated  with  harsher  assessments  of  auditor 
liability,  irrespective  of  the  content  of  the  attorneys’  arguments. 

Finally,  it  is  important  to  note  that  the  expected  litigation  cost  associated  with  an  audit  is  a 
function  of  two  factors:  (1)  the  probability  of  audit  failure,  and  (2)  the  expected  litigation  cost 
given  audit  failure.  This  study  examines  the  expected  litigation  costs  given  audit  failure,  but  not 
the  probability  of  audit  failure.  This  study  is,  therefore,  silent  as  to  the  overall  effect  of  investi¬ 
gating  fraud  risks  on  auditors’  expected  litigation  costs.  Accordingly,  this  study  does  not  imply  that 
auditors  should  avoid  investigating  fraud  risks. 
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Editor’s  note:  Two  copies  of  books  for  review  should  be  sent  to  the  Book  Review 
Editor:  Stephen  A.  Zeff,  Rice  University,  Jesse  H.  Jones  Graduate  School  of  Business, 

6100  Main  St.,  Houston,  TX  77005.  The  policy  of  The  Accounting  Review  is  to 
publish  only  those  reviews  solicited  by  the  Book  Review  Editor.  Unsolicited  reviews 
will  not  be  accepted. 

Developments  in  New  Reporting  Models,  Information  for  Better  Markets  (London,  U.K.: 
ICAEW  Financial  Reporting  Faculty,  2009,  ISBN  978-84152-615-7,  electronic  version 
downloadable  from  ICAEW.com  website,  pp.  vii,  98). 

Developments  in  New  Reporting  Models  is  a  discourse  on  the  process  of  evolution  of  business  reporting 
models.  A  prior  publication  also  produced  by  the  Institute  of  Chartered  Accountants  of  England  and  Wales 
(ICAEW),  New  Reporting  Models  for  Business  (2003),  assessed  the  merits  of  a  number  of  proposed  business 
reporting  models.  The  current  publication  revisits  the  proposed  models  briefly  and  updates  the  list  (Appendix  2), 
but  its  objective  is  not  to  evaluate  specific  models.  Instead,  the  report  focuses  on  the  mechanisms  of  change  in 
business  reporting  models — the  factors  that  facilitate  the  process  and  the  factors  that  hinder  it.  The  principal 
authors  of  the  report  are  Brian  Singleton-Green  and  Robert  Hodgkinson,  who  are  the  senior  staff  of  the  ICAEW’s 
Information  in  Betters  Markets  thought  leadership  program. 

The  central  premise  of  the  text  is  that  business  reporting  models  evolve  naturally  over  time  as  they  adapt  to 
changes  in  environmental  factors.  The  authors  conclude  that,  although  calls  for  revolutionary  change  in  business 
reporting  models  are  a  necessary  element  of  the  process  of  evolution,  such  calls  are  unlikely  to  accomplish  their 
goal  of  widespread  adoption  of  ostensibly  “new”  business  reporting  models. 

The  authors  submit  that  business  reporting  is  shaped  by  its  environment,  which  includes  a  society’s  regula¬ 
tory,  legal,  and  market  setting,  and  that  business  reporting  models  respond  to  changes  in  that  environment. 
Nonetheless,  the  authors  recognize  that  certain  frictions  can  hinder  the  natural  evolution  of  the  business  reporting 
model.  Imprudent  regulation,  excessive  litigation  risk,  the  information  needs  of  regulators  supplanting  those  of 
users,  and  frictions  in  the  demand  and  supply  of  information  can  interfere  with  the  proper  development  of  business 
reporting.  They  suggest  that  understanding  and  addressing  the  root  causes  of  inadequate  business  reporting  is  an 
important,  but  often  overlooked,  element  of  the  process  of  evolution. 

Despite  being  written  by  non-academics,  the  authors  are  markedly  committed  to  the  position  that  less  rhetoric 
and  more  reliance  on  research  findings  would  greatly  enhance  the  debate  surrounding  the  adequacy  of  business 
reporting  models.  This  commitment  is  evident  in  several  respects.  First,  the  authors’  discussion  incorporates 
relevant  findings  from  research  previously  commissioned  by  the  ICAEW.  Second,  throughout  the  text,  the  authors 
identify  areas  in  which  they  believe  additional  research  would  advance  the  debate.  Appendix  5  of  the  manuscript 
summarizes  the  opportunities  for  further  research  which  the  authors  identify. 

The  bulk  of  the  report  is  organized  as  follows.  Chapter  1  presents  the  authors’  key  conclusions  that  business 
reporting  evolves  over  time  and  is  a  product  of  its  environment.  They  also  introduce  the  primary  charges  leveled 
at  existing  business  reporting  models,  and  in  subsequent  chapters  they  evaluate  each  charge.  The  authors  consider 
the  charges  that  business  reporting  is:  (a)  fundamentally  misleading  (Chapter  2);  (b)  provides  insufficient  reporting 
of  intangibles  (Chapter  3);  (e)  a  factor  in  causing  or  compounding  the  recent  financial  crisis  (Chapter  4);  (d)  offers 
insufficient  reporting  of  non-financial  information  (Chapter  5),  and;  (e)  is  excessive  in  length  and  complexity 
(Chapter  7). 

In  each  chapter  the  authors  outline  the  arguments  and  research  evidence  (if  any)  for  and  against  the  given 
charge.  The  authors  do  not  attempt  nor  pretend  to  conduct  a  comprehensive  review  of  the  relevant  literature,  but 
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rely  primarily  on  the  findings  of  research  previously  commissioned  by  the  ICAEW.  In  addition,  they  identify  areas 
that  they  believe  are  ripe  for  additional  research.  In  general,  the  authors  conclude  that  none  of  the  charges  leveled 
at  financial  reporting  is  sufficient  to  yield  revolutionary  change  in  existing  business  reporting  models. 

Chapter  6  further  delves  into  the  development  of  business  reporting,  with  particular  emphasis  on  the  point 
that  business  reporting  evolves  in  response  to  changes  within  a  complex  social  environment  in  which  business 
reporting  is  but  one  element  of  a  larger  informational  network.  The  authors  identify  factors  that  might  influence 
the  evolution  of  business  reporting  in  a  favorable  manner,  such  as  peer  pressure  and  users’  demands  for  informa¬ 
tion.  Nevertheless,  the  authors  note  that,  if  business  reporting  is  indeed  broken  or  breaks  in  certain  respects,  it  is 
important  to  understand  and  address  the  factors  that  interfere  with  the  appropriate  evolution  of  business  reporting, 
and  not  simply  call  for  dramatic  change  in  the  model. 

The  final  chapter  of  the  manuscript  outlines  a  way  forward.  The  authors  warn  against  excessive  rhetoric, 
which  they  argue  can  be  a  distraction.  The  authors  note  that  rhetoric  is  often  conflicting  (e.g.,  more  disclosure  is 
needed  but  financial  statements  need  to  be  shorter),  and  they  appeal  to  those  presenting  proposed  alternative 
business  reporting  models  to  be  more  firmly  grounded  in  extant  research.  While  I  applaud  and  agree  with  the 
authors’  plea  regarding  research,  it  is  important  to  recognize  that  various  studies’  research  findings  can  also  be 
conflicting.  Accordingly,  it  is  often  necessary  to  draw  conclusions  based  on  the  overwhelming  weight  of  the 
evidence  as  opposed  to  relying  on  the  specific  findings  of  a  particular  study  or  small  group  of  studies.  Moreover, 

I  believe  that  the  authors’  plea  highlights  the  importance  of  collaboration  between  practitioners,  standard  setters, 
and  academics. 

The  authors  also  consider  the  role  of  business  reporting  within  the  whole  of  the  information  set.  They  make 
the  point  that  business  reporting  is  not  the  only  game  in  town.  Other  sources  of  information  exist.  Moreover,, 
managers  suffer  from  limited  information  and  biases,  such  that  managers  might  not  have  a  competitive  advantage 
in  providing  certain  types  of  information.  Hence,  there  is  no  need  for  a  comprehensive  business  reporting  model, 
nor  would  such  a  model  be  desirable. 

Although  the  text  is  quite  thorough  in  its  consideration  of  the  issues,  at  the  conclusion  of  my  reading  the 
report,  I  was  left  with  several  questions.  First,  the  authors  make  the  important  point  that  different  cultures,  rules  of 
law,  and  legal  environments  are  likely  to  produce  different  approaches  to  business  reporting.  They  do  not,  how¬ 
ever,  discuss  the  implications  of  this  statement  for  the  feasibility  of  the  globalization  of  accounting  standards 
specifically,  which  comprise  an  important  component  of  the  business  reporting  model. 

Second,  the  authors  note  that  the  boundaries  of  “business  reporting”  and  the  term  “business  reporting  model” 
are  not  well  defined,  and  that  the  lack  of  consensus  around  these  terms  serves  to  complicate  the  debate  surrounding 
the  adequacy  of  business  reporting  models.  While  attempting  to  clarify  these  concepts  is  outside  the  scope  of  the 
report,  the  discussion  is  subject  to  a  certain  amount  of  slippage  with  respect  to  what  aspects  of  reporting  are  under 
consideration  at  various  points  in  the  text. 

Third,  the  authors  discuss  the  role  of  “the  invisible  hand”  in  guiding  the  evolution  of  business  reporting,  and 
note  that  in  certain  circumstances  regulation  might  play  a  beneficial  role  in  overcoming  demand  or  supply  frictions 
that  could  hinder  adequate  business  reporting.  They  do  not,  however,  discuss  the  extent  to  which  regulatory  bodies 
do  or  do  not  exist  locally  or  internationally  to  take  on  the  task  of  developing  standards  for  business  reporting 
beyond  the  financial  statements  and  footnotes.  Perhaps  prior  calls  for  revolutionary  change  in  business  reporting 
have  failed  in  part  because  no  obvious  entity  exists  to  take  up  the  cause  of  implementing  the  proposed  changes. 

Finally,  the  authors  discuss  the  extent  to  which  financial  statements  can  or  should  be  “general  purpose.”  They 
adopt  investors  as  the  primary  users  of  business  reporting.  I  believe  that  making  such  a  choice  is  necessary.  It  is 
difficult  to  consider  whether  the  information  produced  by  a  business  reporting  model  is  useful,  without  first  taking 
a  position  on  the  question:  useful  to  whom?  While  some  might  disagree  with  the  authors’  decision  to  focus  on 
investors,  I  find  no  reason  to  quibble  with  this  decision.  Nonetheless,  having  made  this  choice,  the  authors,  I 
believe,  could  have  used  it  to  provide  additional  structure  to  their  discussions  in  Chapters  2  through  8.  For 
example,  if  investors’  needs  consist  of  understanding  the  past,  predicting  the  future,  and  assessing  risk,  the 
authors’  discussion  of  the  charges  leveled  against  existing  models  could  be  couched  in  terms  of  the  extent  to  which 
users’  needs  are,  or  are  not,  met.  The  chapter  on  intangibles  (Chapter  3),  in  which  the  authors  discuss  the  claim 
that  the  reporting  of  intangibles  on  the  balance  sheet  is  insufficient,  is  an  example  of  where  such  a  framework 
might  have  provided  additional  useful  stracture  to  the  discussion. 

In  conclusion,  I  enjoyed  reading  this  thought-provoking  report.  I  commend  the  authors  for  contributing  a 
new,  and  substantive,  twist  to  the  debate  on  business  reporting  models,  by  digressing  from  the  established  path  of 
assessing  the  merits  of  specific  proposed  models  and  instead  embarking  on  a  fresh  avenue  of  considering  the 
process  of  change. 
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I.  INTRODUCTION 

I  am  pleased  to  submit  the  second  Annual  Report  for  The  Accounting  Review  under  the  new 
content  guidelines  adopted  by  the  Publications  Committee  and  Executive  Committee  of  the 
American  Accounting  Association  (AAA).  From  the  feedback  I  have  heard,  the  new  format 
has  succeeded  in  creating  a  more  open  environment  of  full  disclosure  and  accountability  to  our 
constituents.  It  is  also  a  learning  experience  for  the  senior  editor,  providing  useful  feedback  on  our 
processes  and  decisions.  As  I  did  last  year,  I  am  taking  the  liberty  of  interjecting  a  liberal  dose  of 
editorial  commentary  along  with  the  statistics,  subject  to  the  caveat  that  my  interpretations  are 
opinions  with  which  others  may  disagree. 

Section  II  below  updates  last  year’s  qualitative  commentary  on  the  editorial  process  followed 
by  The  Accounting  Review  under  the  current  regime.  Section  III  then  follows  with  the  tabular  data 
requested  by  the  AAA  Publications  Committee,  along  with  some  supplemental  data  I  have  pro¬ 
vided  to  clarify  certain  aspects  of  these  tables.  Section  IV  closes  with  some  personal  notes  of 
thanks  and  remembrances. 

II.  UPDATES  TO  THE  ACCOUNTING  REVIEW  EDITORIAL  PROCESS 

To  avoid  repetition,  I  refer  the  reader  to  last  year’s  report  (Kachelmeier  2009)  for  a  descrip¬ 
tion  of  the  editorial  process  followed  by  The  Accounting  Review.  This  year’s  description  is  limited 
to  an  update  of  new  developments  and  changes  in  process  that  have  occurred  over  the  fiscal  year 
ending  May  31,  2010. 

One  important  update  is  that  several  coeditors  of  The  Accounting  Review  have  generously 
agreed  to  increase  their  allocation  beyond  the  maximum  of  three  new  submissions  per  month  that 
I  promised  upon  establishing  our  editorial  team.  Especially  for  financial-archival  manuscripts  that 
constitute  so  many  of  our  submissions  (as  reported  later),  the  coeditors  who  agreed  to  take  on  a  bit 
more  generally  now  handle  four  and  sometimes  five  new  submissions  per  month.  As  a  conse¬ 
quence,  my  proportion  of  the  total  decision  letters  has  declined  from  last  year,  in  which  I  wrote 
345  (48.0  percent)  of  the  719  decision  letters  sent  from  June  1,  2008  to  May  31,  2009.  For  the 
fiscal  year  of  this  report  ending  May  31,  2010,  I  wrote  205  (30.5  percent)  of  the  journal’s  673 
decision  letters,  delegating  slightly  more  than  two-thirds  of  our  decisions  to  coeditors  whose 
expertise  aligns  more  closely  with  the  area  of  the  submission.  Another  reason  for  the  decline  in  my 


I  am  grateful  for  helpful  feedback  and  suggestions  from  Linda  Bamber,  Jean  Bedard,  and  Tracie  Majors  on  an  earlier 
version  of  this  report,  though  I  assume  full  responsibility.  I  also  thank  Mary  Capps  for  painstakingly  checking  the  current 
affiliations  in  the  Appendix  of  ad  hoc  reviewers. 


Published  Online:  November  2010 


2173 


2174 


Kachelmeier 


share  of  the  decision  letters  is  that  I  wrote  most  of  the  decision  letters  for  “transition”  manuscripts 
that  were  in  process  at  the  time  the  current  editorial  term  began  on  June  1,  2008.  Almost  all  of 
those  manuscripts  have  now  cleared  the  system,  so  current  decisions,  whether  for  new  submissions 
or  for  revisions,  tend  to  be  for  manuscripts  initially  submitted  under  the  current  regime  and 
assigned  to  one  of  the  current  editors  from  the  onset. 

While  my  decision-letter  volume  has  subsided  somewhat,  it  remains  my  preference  to  cen¬ 
tralize  the  reviewer  selection  process.  With  the  help  of  my  truly  outstanding  doctoral  research 
assistant,  Tracie  Majors,  I  review  each  new  submission  and  Tracie’s  search  results  from  various 
databases  to  find  two  well-qualified  and  independent  reviewers,  proposing  those  names  to  the 
assigned  editor  (if  other  than  me).  A  centralized  reviewer  selection  process  helps  to  ensure  uni¬ 
formity  of  reviewer  credentials  across  submissions,  in  addition  to  facilitating  the  coordination 
problem  of  avoiding  multiple  requests  to  the  same  reviewer  at  the  same  time  (although  this  is 
sometimes  unavoidable  for  invited  revisions).  Given  the  demands  on  our  most  frequently  re¬ 
quested  reviewers,  a  typical  strategy  is  to  choose  one  reviewer  from  TAR’s  Editorial  Advisory  and 
Review  Board  (or  a  substitute  of  similar  seniority)  and  one  other  reviewer,  likely  more  junior, 
whom  we  ask  less  often.  Other  times  we  try  to  diversify,  such  as  choosing  a  reviewer  from  each 
of  two  areas  relevant  to  the  submission  under  consideration.  To  avoid  sending  signals,  we  use  a 
random  algorithm  to  ensure  that  the  labeling  of  “Reviewer  A”  and  “Reviewer  B”  is  completely 
arbitrary. 

Over  the  year,  we  found  ourselves  asking  some  ad  hoc  reviewers  about  as  many  times  as  we 
would  ask  an  Editorial  Board  member,  so  in  fairness,  I  generally  invited  those  individuals  to  join 
the  Board  if  they  had  submitted  several  high-quality  reviews  on  a  timely  basis.  Thus,  TAR's 
Editorial  Advisory  and  Review  Board  has  grown  from  118  members  at  the  time  of  our  first  issue 
(January  2009)  to  129  members  as  of  the  date  of  this  report.  An  Editorial  Board  of  129  members 
might  seem  large  (as  recently  as  2005,  there  were  only  68  members),  but  I  am  continually  amazed 
at  how  difficult  it  is  to  find  “open”  Editorial  Board  members  who  do  not  already  have  a  review 
assignment  in  hand.  Sometimes  “Professor  X”  is  the  obvious  choice  to  review  a  new  submission 
closely  related  to  X’s  expertise,  but  if  we  recently  asked  Professor  X  to  review  a  different  manu¬ 
script,  we  generally  look  elsewhere.  In  short,  we  do  the  best  we  can  to  optimize  reviewer  selection, 
considering  both  relevance  and  availability. 

Even  with  129  Editorial  Board  members,  it  would  be  highly  misleading  to  infer  that  these  129 
experts  write  all  the  reviews.  Eor  the  fiscal  year  ending  May  31,  2010,  TAR  asked  an  additional 
453  ad  hoc  reviewers,  as  named  and  thanked  in  the  Appendix,  to  evaluate  one  or  more  submis¬ 
sions.  Thus,  the  journal’s  editorial  decisions  during  the  fiscal  year  ending  May  31,  2010  were 
guided  by  582  (=  453  +  129)  different  experts  across  a  wide  variety  of  topical  and  methodologi¬ 
cal  interests.  The  diversity  inherent  in  582  different  reviewers  is  consistent  with  TAR's,  mission. 

Given  the  reviewer  demands  of  a  journal  with  over  500  new  submissions  per  year  (not  even 
counting  revisions),  several  people  have  asked  me  why  TAR  persists  with  using  two  different 
reviewers  for  each  new  submission,  especially  given  that  some  of  our  competitors  use  only  one.  To 
be  sure,  a  two-reviewer  system  is  costly,  both  in  reviewer  resources  and  in  time  (by  construction, 
we  are  always  waiting  on  the  later  of  the  two  reviewers).  Ex  post,  authors  often  sense  that  it  is 
more  difficult  to  address  the  concerns  of  two  reviewers  than  it  would  be  to  address  a  single 
reviewer’s  concerns,  especially  if  they  could  pick  which  reviewer  to  keep  and  which  to  discard. 
But  therein  lies  the  point.  Editors  have  to  select  reviewers  ex  ante,  and  given  that  reviewers  often 
disagree  (Blank  1991;  Gilliland  and  Cortina  1997;  Lynch  1998),  both  the  editor  and  author  gain 
some  protection  by  getting  two  draws  from  the  distribution  instead  of  just  one.  Thus,  though 
authors  sometimes  see  it  differently,  I  honestly  see  a  two-reviewer  system  as  being  in  the  author’s 
best  interest,  at  least  ex  ante.  When  two  reviewers  forward  different  recommendations,  editorial 
judgment  becomes  critical.  The  general  decision  model  employed  by  the  current  editorial  team  for 
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split  reviews  is  to  consider  whether  the  negative  reviewer  has  identified  a  “fatal  flaw”  that  is 
inherent  to  the  study.  If  so,  we  reject,  but  otherwise  we  generally  move  forward,  even  if  that  means 
overruling  a  reviewer  who  recommends  rejection.  Along  that  line,  another  benefit  of  a  two- 
reviewer  system  is  that  it  provides  useful  feedback  to  the  reviewers,  especially  those  newer  to  the 
process.  That  is,  each  reviewer  can  calibrate  his/her  assessment  against  that  of  the  other  reviewer, 
as  we  send  the  reviewers  copies  of  both  reviews  along  with  a  blind  copy  of  the  editorial  decision 
letter. 

A  new  development  this  year  is  that  I  have  started  to  enforce  the  expectation  that  revisions 
should  be  submitted  within  a  year  of  the  decision  letter  inviting  the  revision.  While  it  has  long 
been  part  of  TAR's  published  Editorial  Policy,  it  is  my  understanding  that  the  “one-year  rule”  was 
rarely  enforced  prior  to  2009.  Beginning  June  1,  2008,  however,  we  started  noting  this  policy 
explicitly  in  decision  letters  inviting  a  revision.  Thus,  when  the  one-year  anniversary  arrives  with 
no  revision  in  hand,  I  generally  write  an  email  to  the  submitting  author  offering  one  more  month 
(essentially,  a  grace  period)  to  revise  and  resubmit.  Six  times  during  the  fiscal  year  ending  May  31, 
2010,  after  the  13th  month  elapsed  with  no  revision,  I  informed  the  author  of  my  intent  to  close  the 
file.  This  might  seem  harsh,  but  if  we  are  serious  about  lessening  the  time  it  takes  from  an  initial 
submission  to  the  ultimate  decision,  part  of  that  responsibility  lies  with  authors.  Moreover,  it  is  my 
experience  that  very  old  revisions  fare  worse  in  the  review  process,  as  they  face  the  risk  of 
obsolescence  (research  tends  to  have  a  short  “shelf  life”). 

There  are  two  caveats  to  my  willingness  to  enforce  the  policy  allowing  one  year  to  revise  and 
resubmit.  First,  sometimes  an  author  will  reply  back  to  my  “one-year  anniversary”  email  with  an 
explanation  of  extenuating  medical  or  personal  circumstances  that  prevented  a  more  timely  revi¬ 
sion,  and  I  try  to  take  those  circumstances  into  account  in  reaching  a  reasonable  accommodation 
to  which  the  author  and  I  can  agree.  Second,  I  have  not  enforced  the  one-year  deadline  for 
revisions  invited  before  the  current  editorial  term  began  on  June  1,  2008,  as  earlier  decision  letters 
did  not  state  that  deadline  explicitly.  By  now,  however,  I  hope  that  the  vast  majority  of  those 
revisions  are  behind  us. 

For  several  other  aspects  of  TAR's  editorial  process,  I  encourage  the  reader  to  see  my  previous 
year’s  report  (Kachelmeier  2009).^  It  is  my  general  understanding  from  the  AAA  Publications 
Committee  that  each  new  senior  editor  will  begin  his/her  term  with  a  report  that  provides  a  more 
comprehensive  description  of  his/her  editorial  process  and  philosophy,  as  I  tried  to  do  last  year, 
followed  by  updates  in  subsequent  years  of  the  editorial  term,  but  without  repeating  points  that 
have  not  changed. 


III.  EDITORIAL  AND  PUBLICATION  STATISTICS 

This  section  of  the  report  provides  specific  tables, requested  by  the  AAA  Publications  Com¬ 
mittee,  along  with  supplemental  data  for  clarification.  To  differentiate  the  supplemental  material, 
each  table  first  reports  the  specific  data  requested  by  the  Publications  Committee,  followed  by  any 
supplemental  data  explained  in  the  discussion  of  each  table. 

Table  1:  Annual  Activity  Summary 

Table  1,  Panel  A  reconciles  TAR'?,  workflow  for  the  prior  and  current  journal  year  ending  May 
31,  2009  and  2010,  respectively.  The  volume  of  new  submissions  is  down  somewhat,  from  557 
last  year  to  502  in  fiscal  2010.  My  best  guess  is  that  this  difference  does  not  reflect  a  reversal  of 
TAR'?  general  trend  of  increasing  submissions,  but  rather  reflects  a  “blip”  of  an  exceptionally  large 


'  In  addition  to  last  year’s  annual  report,  for  my  personal  thoughts  on  the  review  process  in  accounting  journals,  see 
Kachelmeier  (2004). 
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number  of  new  submissions  in  the  summer  months  of  2008  from  authors  who  did  not  want  their 
manuscripts  to  overlap  two  editorial  regimes.  This  conjecture  is  supported  by  the  fact  that  TAR 
received  only  163  new  submissions  during  the  first  five  months  of  calendar  2008  (i.e.,  the  last  five 
months  of  former  Senior  Editor  Dan  Dhaliwal’s  term),  as  compared  to  238  new  submissions 
during  the  first  five  months  of  2009  and  232  new  submissions  during  the  first  five  months  of  2010. 
I  suspect  that  we  may  experience  a  similar  downturn  in  spring  of  2011  for  authors  waiting  for  the 
new  regime  to  begin  on  June  1,  2011.  Authors  should  be  aware,  however,  that  editorial  transition 
issues  are  not  as  complex  as  they  might  appear.  The  current  regime  will  continue  to  choose 
reviewers  through  May  31,  2011,  but  the  next  regime  will  likely  be  making  most  of  the  decisions 
for  manuscripts  submitted  just  before  the  end  of  the  fiscal  year,  after  the  reviews  arrive  sometime 
in  June  or  July. 

As  was  the  case  last  year,  a  limitation  of  Table  1,  Panel  A  is  that  logging  all  submissions 
results  in  double  counting  some  manuscript  files,  to  the  extent  that  TAR  often  receives  two  or  more 
versions  of  the  same  manuscript  within  the  same  year,  following  invitations  to  revise  and  resub¬ 
mit.  For  the  journal  year  ending  May  31,  2010  (2009),  the  673  (719)  total  decisions  reflect  615 
(646)  unique  manuscript  files,  with  the  differences  representing  revisions  of  manuscripts  for  which 
previous  decisions  inviting  revision  had  already  been  logged  within  the  same  fiscal  year. 

Table  1,  Panel  B  is  supplemental,  reporting  submissions  by  calendar  year  instead  of  the 
journal’s  fiscal  year  that  ends  on  May  31.  The  calendar  year  tallies  were  reported  prior  to  2008,  so 
Panel  B  facilitates  comparison  over  time.  It  corroborates  that,  notwithstanding  the  temporary  surge 
in  new  submissions  that  occurred  shortly  after  June  1,  2008,  the  general  trend  of  submissions  has 
been  steadily  increasing  for  several  years.  Specifically,  over  the  past  decade,  TAR's,  submission 
volume  has  approximately  doubled. 

Table  2:  Annual  Outcome  Summary 
Table  2,  Panel  A:  Outcomes  by  Fiscal  Year 

Table  2,  Panel  A  tallies  the  outcomes  for  all  decisions  reached  during  the  fiscal  year.  As 
requested  by  the  Publications  Committee,  this  panel  also  reports  two  “acceptance  rates.”  The  first 
rate  in  column  (e)  divides  the  number  of  acceptances  or  conditional  acceptances  by  the  number  of 
final  accept  or  reject  decisions.  The  second  rate  in  column  (f)  divides  the  same  (conditional) 
acceptances  by  the  total  number  of  decisions  reached.  The  difference  between  the  two  denomina¬ 
tors  reflects  invitations  to  revise  and  resubmit.  Accordingly,  the  first  rate  somewhat  overstates  and 
the  second  rate  somewhat  understates  the  “true”  acceptance  rate.  As  in  last  year’s  report.  Table  2 
includes  “conditional”  acceptances  in  the  acceptance  category,  as  we  have  yet  to  reject  any  manu¬ 
script  that  has  reached  “conditional  acceptance”  status. 

As  with  Table  1,  a  limitation  of  Table  2,  Panel  A  is,  that  the  total  number  of  decisions  in  any 
given  journal  year  includes  some  manuscript  files  for  which  multiple  decisions  are  made  within 
the  same  year,  due  to  invitations  to  revise  and  resubmit.  If  one  divides  the  number  of  acceptances 
by  the  number  of  unique  manuscript  files  processed  (646  and  615  for  the  years  ending  May  31, 
2009  and  2010,  respectively),  the  resulting  estimated  acceptance  rates  are  12.5  percent  (=81/646) 
for  fiscal  2009  and  10.2  percent  (=63/615)  for  fiscal  2010,  bearing  in  mind  that  the  denominator 
for  both  years  includes  several  open  files  awaiting  further  revision,  for  which  final  decisions  have 
yet  to  be  reached. 

The  reader  might  have  cause  for  concern  in  noting  that  our  total  number  of  acceptances  is 
down  from  81  last  year  to  63  this  year.  The  estimated  acceptance  rates  also  reflect  this  downturn. 
However,  I  do  not  perceive  any  substantive  change  in  our  editorial  standards  for  acceptance.  One 
reason  for  the  decline  reflects  my  attempt  to  build  a  modest  “buffer”  of  forthcoming  articles  by 
taking  special  efforts  to  expedite  some  near-acceptance  manuscripts  shortly  after  the  current  edi¬ 
torial  term  began  on  June  1,  2008.  Before  that  time,  TAR  was  essentially  operating  on  an  issue- 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


Kachelmeier 


2178 


cs 


fS  . 


lU 

u 
S 

i«'s 

Cl<*i 
a>  C3  11 

<e  P 


ZZ 


^  4. 

o  ^ 

W  ^  .fi 

X 

0^  C5  ^ 

y  3 

■<  II 


\q  in 
lO  cn 


CB 


IB 


©  9  © 

©  fi  © 

9  §9 

a 

cepta 

and 

inditii 

cepta 

B 

B 

u  =  y 
< 

^  cn 
00  lO 


c<< 

U 

P 

oa 

s 


s 

y 

B 

o 

u 

-4-1 

s 

O 

3 

s 

c 

e 

< 


s-S  S 

«  «  § 

S  y  ;2S 

C  « 


o  r- 
cn  O 
CN  C-J 


C 

O 


<J  pfi 

a>  ^ 


a> 


00  CO 
o  o 


c3 

'3 

u 


pc 

(/> 

0) 

s 

© 

© 

9 

O 


© 

9 

9 

CU 


C 

© 


•3  t3  ©  . 

©  ^^ 


ON  CO 
1—1 

NO 


OJDfn 
fi  tN  9  fO 

^  PPM 


e 


On  O 
O  ^ 

o  o 

CN  <N 


p^  © 

T3  cj  00 

i«  ^ 

©'WPP 

CQ  a  «  + 

©  M 

L-  ©  ^  /— s 

V  u  Pi  ja 
- 

g  II 


VO 

o 

tN 


Im 

y 

o 


S  3 

I  "S 

« 

^oi  II 
©  ^ 
*<  © 


OH  ^  o 
c  ®  S 

S3  y  2  ® 

'O  5  to  r» 

S  5  ®  PP 
pH  t  0  3 

y  fa  ’S  — ' 

•9  i!r  9 

E 


o 

«1  ^ 
C  J3  S 

tS  3  ^ . 

^  O  S  • 
-  r  fo 


o 

r- 


© 

■S7 


CN 


9 

.2 

(/i 

s 

s 

C/D 

y 

Z 


VI 

rtl 

©  o 
c-Svfs 


0^  pfi 


mm  W 

hs;  Hj; 


<2 

c 

o 


o 

CB 

y 

CeS 

y 

s 

o 

u 

s 

O 

"« 

c 


y 

fi 

R 

fa 


4  f\| 

= 

■23., 

CuPff^ 

^  ?3  J 

S’ 


c/3 

©  ^ 

.9  © 

W3  >•  ^ 

1/3.9^ 

•9  0/  9 
-§0i 

CD 


r- 

IT) 

lO 


OH^ 
^  9  fO 


9 

S  9  ;=  . 

2  y  ^  fH, 

g  >H  <3  « 


ON 

o 

o 

CN 


^  D 


cG 


<^  3 


The  Accounting  Review 
American  Accounting  Association 


November  2010 


from  AAA  Presidential  Scholar  Lectures  and  from  occasional  research  forums. 


Annual  Report  and  Editorial  Commentary 


2179 


to-issue  basis,  meaning  that  the  articles  accepted  for  publication  at  the  time  of  the  deadline  for  the 
next  issue  became  the  articles  published  in  that  issue.  This  process  can  create  imbalances  due  to 
the  sometimes  cyclical  nature  of  the  flow  of  submissions  and  revisions,  so  I  decided  early  on  that 
I  would  like  to  have  one  or  two  issues  “in  waiting”  at  the  time  of  each  issue  deadline,  providing 
a  buffer  to  help  smooth  out  the  cyclical  variation.  Accordingly,  during  the  last  half  of  2008,  my 
coeditors  and  I  built  a  buffer  of  approximately  15-20  forthcoming  articles  at  the  conditional 
acceptance  stage,  and  we  have  held  that  buffer  roughly  constant  ever  since. 

A  second  reason  for  the  downturn  in  acceptances  from  81  last  year  to  63  this  year  is  a  bit 
embarrassing,  but  I  will  provide  full  disclosure.  In  May  2010,  I  encountered  a  personal  backlog, 
due  to  my  end-of-semester  teaching  responsibilities,  among  other  priorities.  Thus,  some  condi¬ 
tional  acceptance  letters  drafted  by  my  coeditors  in  April  or  May  did  not  get  sent  out  until  June, 
as  1  always  read  and  provide  my  own  comments  (and  usually  a  marked  copy)  on  all  accepted 
manuscripts  before  sending  out  the  decision.  Ever  the  good  accountant,  I  strictly  applied  a  May  3 1 
cutoff  in  preparing  Table  2,  but  the  reader  should  be  aware  that  we  sent  out  ten  additional 
acceptance  letters  in  the  first  half  of  June  2010.  Including  those  ten  would  have  brought  the  fiscal 
2010  acceptance  total  to  73  instead  of  63.  The  good  news  is  that  I  get  to  count  them  next  year. 

To  put  the  number  of  acceptances  in  perspective,  the  reader  should  consider  our  capacity 
constraint.  As  long  as  journals  continue  to  publish  bound  hard-copies,  there  are  only  so  many 
articles  that  can  fit  in  a  bound  issue.  Effective  2008,  TAR  has  published  six  times  per  year.  In  my 
experience,  I  can  fit  up  to  12  articles  in  each  issue.  Thus,  if  we  fill  the  journal  to  capacity,  we  have 
room  for  12  X  6  =  72  articles  per  year,  one  of  which  goes  automatically  to  the  Presidential 
Scholar  Lecture.  For  the  most  part,  we  have  been  filling  the  journal  to  its  practical  capacity,  which 
happily  coincides  with  what  I  perceive  to  be  a  reasonable,  productive  acceptance  rate  that  pub¬ 
lishes  good  research  while  maintaining  a  high-quality  standard.  I  cannot  make  any  promises  about 
our  ability  to  sustain  full-capacity  production,  as  an  editor  never  knows  what  is  coming  next,  but 
I  think  the  journal  is  in  good  health. 

Table  2,  Panel  B:  Final  Outcome  Resolution  for  All  New  Submissions 

New  this  year  is  supplemental  Panel  B  to  Table  2,  reflecting  what  I  think  is  an  excellent 
suggestion  from  AAA  Publications  Committee  member  Bob  Kaplan  to  calculate  the  journal’s 
“acceptance  rate”  in  a  different  way.  As  I  understand  the  suggestion,  if  one  views  any  given  year’s 
new  submissions  as  the  population  from  which  articles  can  be  accepted,  then  in  subsequent  years, 
after  most  of  the  revisions  have  been  processed,  one  can  tally  how  many  of  that  year’s  new 
submissions  were  accepted,  how  many  were  rejected,  and  how  many  remain  pending  due  to 
outstanding  invitations  to  revise.  Table  2,  Panel  B  reports  this  supplemental  format  for  the  557 
new  submissions  received  during  the  journal  year  ending  May  31,  2009.^  Of  these,  45  (8.1 
percent)  have  been  accepted  through  May  31,  2010,  442  (79.4  percent)  have  been  rejected,  and  the 
remaining  70  manuscripts  (12.6  percent)  are  still  outstanding  as  of  May  31,  2010  due  to  pending 
revisions.  These  statistics  provide  a  useful  floor  and  ceiling  for  TAR's  acceptance  rate:  for  fiscal 
2008-2009,  the  acceptance  rate  can  be  no  lower  than  8.1  percent,  as  those  manuscripts  have 
already  been  accepted,  and  can  be  no  higher  than  20.6  percent  if  all  remaining  pending  revisions 
from  fiscal  2008-2009  are  eventually  accepted.  Of  course,  it  is  unrealistic  to  assume  that  either 
none  or  all  of  the  pending  revisions  will  be  accepted.  My  best  estimate  is  that  splitting  the 
difference  between  the  two  numbers  (an  estimated  acceptance  rate  of  14.4  percent)  comes  mate- 


^  Table  2,  Panel  B  does  not  report  outcomes  for  new  submissions  during  the  journal  year  ending  May  31,  2010,  as  too 
many  of  those  new  submissions  are  awaiting  a  decision  as  of  May  31,  2010  to  make  the  statistics  meaningful. 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


2180 


Kachelmeier 


daily  close  to  our  true  acceptance  rate  for  the  first  year  of  operations  under  the  current  editorial 
regime.  I  will  provide  a  follow-up  table  next  year. 

Exhibit  1:  Histogram  of  Editorial  Rounds  and  Outcomes 

Exhibit  1  provides  a  histogram  of  decision  outcomes  by  round  for  the  journal  year  ending 
May  31,  2010.  As  I  did  last  year,  I  am  taking  the  liberty  of  reporting  more  detail  than  that 
requested  by  the  Committee,  showing  separate  bars  for  “revise”  and  “uncertain”  invitations  as  well 
as  separate  bars  for  “contribution”  and  “validity”  rejections.  The  475  first-round  decisions  in  Panel 
A  of  Exhibit  1  differ  from  the  502  new  submissions  reported  in  Panel  A  of  Table  1  because  Exhibit 
1  includes  98  new  submissions  “in  process”  as  of  June  1,  2009  and  excludes  125  new  submissions 
for  which  a  decision  had  not  yet  been  reached  as  of  May  31,  2010. 

Exhibit  1,  Panel  A  shows  that  most  first-round  manuscripts  are  rejected  because  they  lack  a 
sufficient  incremental  contribution  for  The  Accounting  Review.  Ever  since  we  started  logging 
rejection  decisions  as  being  based  primarily  on  contribution  issues  or  validity  issues,  I  have  been 
struck  by  the  fact  that  the  former  category  is  more  than  twice  as  frequent  as  the  latter.  Indeed,  my 
sense  of  the  modal  first-round  review  report  that  recommends  rejection  is  that  it  contains  wording 
similar  to,  “While  this  study  appears  to  have  been  competently  executed,  it  does  not  provide  much 
new  insight  relative  to  what  we  already  know  from  the  extensive  prior  literature  in  this  area.”  Put 
simply,  following  the  bandwagon  in  a  well-researched  area  is  not  necessarily  a  path  to  success  in 
a  top-tier  journal. 

Under  the  current  editorial  regime.  The  Accounting  Review  has  employed  two  revision  cat¬ 
egories;  (1)  standard  “revise-and-resubmit”  letters,  and  (2)  “uncertain”  letters.  Both  outcomes 
allow  the  author  to  revise,  but  an  “uncertain”  letter  indicates  that  the  outcome  risk  is  higher  than 
usual  for  an  invitation  to  revise,  such  that  the  path  to  a  successful  revision  is  unclear.  When 
sending  an  “uncertain”  letter,  we  encourage  a  careful  assessment  of  whether  the  author  sees  an 
effective  way  to  address  the  concerns  raised,  and  if  not,  that  submission  elsewhere  might  be  best 
for  the  author  from  a  cost-benefit  perspective.  The  easiest  way  for  an  author  to  tell  which  kind  of 
letter  s/he  has  received  is  that  an  “uncertain”  letter  asks  for  an  email  reply  indicating  whether  the 
author  intends  to  revise  and  resubmit.  While  some  might  reason  that  an  author  should  always 
exercise  an  option  to  revise,  the  honest  intent  of  an  “uncertain”  letter  is  that  the  decision  is  unclear 
from  the  editor’s  perspective,  as  a  multiple-round  rejection  can  be  worse  for  the  author  than  a 
first-round  rejection.  As  Exhibit  1  indicates,  most  revision  letters  are  of  the  standard  “revise-and- 
resubmit”  variety,  but  we  try  to  use  the  more  cautious  “uncertain”  wording  when  appropriate.  To 
avoid  misunderstanding,  I  should  clarify  that  even  a  standard  “revise-and-resubmit”  letter  conveys 
outcome  risk;  it  is  only  the  degree  of  that  risk  that  differs  between  the  two  revision  categories. 

Exhibit  1  indicates  that  the  relative  odds  of  success  increase  substantially  in  the  second  round 
and  beyond.  By  the  time  a  manuscript  reaches  the  third  or  fourth  round  (Panel  C),  most  outcomes 
are  acceptances,  as  should  be  the  case  for  manuscripts  that  have  advanced  to  this  stage.  Still, 
nothing  is  guaranteed,  as  is  evidenced  by  the  seven  rejections  in  Exhibit  1,  Panel  C.  As  an  aside, 
the  vast  majority  of  the  decisions  tallied  in  Exhibit  1,  Panel  C  are  for  third-round  manuscripts, 
with  only  nine  that  went  to  the  fourth  round.  We  did  have  one  fifth-round  manuscript  within  the 
journal  year  ending  May  31,  2010,  which,  fortunately,  was  accepted  for  publication. 

Table  3:  Submissions  and  Acceptances  by  Subject  Area  and  Research  Method 

Perhaps  the  most  interesting  statistics  are  in  Table  3,  which  categorizes  submissions  and 
acceptances  by  primary  subject  area  (Panel  A),  primary  method  (Panel  B),  and  subject  areas 
crossed  with  methods  (Panel  C).  Very  similar  to  last  year,  all  three  panels  show  that  the  proportion 
of  acceptances  by  area  is  nearly  identical  to  the  proportion  of  submissions  by  area.  Contingency 
table  tests  fail  to  reject  the  null  hypothesis  of  equivalent  submission  and  acceptance  proportions  by 
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EXHIBIT  I 

Histogram  of  Editorial  Rounds  and  Outcomes 
Journal  Year  Ending  May  31,  2010 


Panel  A:  Eirst-Round  Outcomes  (New  Submissions) 
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EXHIBIT  1 


Panel  C:  Third-  and  Fourth-Round  Outcomes  (Second  and  Third  Revisions) 


and  Resubmit  Contribution  Validity 


Conditional 

Acceptances 


subject  area  (A^df=7  =  5.74;  p  >  0.50)  or  by  method  (A^df=5  =  6-35;  p  =  0.27).^  As  was  the  case  last 
year,  if  any  area  might  be  able  to  state  a  case  for  underrepresentation  relative  to  submissions,  it 
would  be  those  who  conduct  financial-archival  research  (45  percent  of  submissions  and  37  percent 
of  acceptances).  That  said,  a  journal  editor  would  probably  take  issue  with  that  statement  (justi¬ 
fiably)  if  this  editorial  were  subjected  to  the  review  process,  given  the  lack  of  statistical  signifi¬ 
cance  as  reported  above. 

The  broader  point  is  that  one  cannot  draw  inferences  about  acceptance  rates  by  area  (a  ratio) 
by  looking  at  what  journals  publish  (the  numerator  of  that  ratio)."*  Table  3  sheds  insight  by 
reporting  our  denominators  in  terms  of  submissions  by  area,  and  I  submit  that  these  data  add  an 
important  qualification  to  Tuttle  and  Dillard’s  (2007,  398)  assertion  that  the  research  and  publica¬ 
tion  process  in  the  United  States  is  “privileging  financial  accounting  research  competing  for 


^  To  address  the  possibility  that  small  cell  sizes  in  the  less  frequent  categories  might  be  distorting  the  statistics,  I  repeated 
both  the  subject  area  and  methodological  contingency  table  analyses  after  combining  the  “govemmental/NFP,”  “inter¬ 
national,”  “systems,”  and  “other”  subject  areas  into  a  combined  “other”  subject  area  category  and  after  combining  the 
“field/case,”  “survey,”  and  “other”  methods  into  a  combined  “other”  methodological  category.  Results  continue  to  fail  to 
reject  the  null  hypothesis  of  equivalent  submission  and  acceptance  percentages  for  both  subject  areas  =  4.09;  p  = 
0.39),  and  methods  =  4.09;  p  =  0.25). 

Merchant  (2010)  is  a  classic  case  in  point,  drawing  inferences  about  various  journal  specializations  (and  implied  biases) 
from  the  nature  of  the  articles  they  publish.  One  cannot  deny  that  different  accounting  journals  publish  different 
frequencies  of  articles  in  various  areas,  as  Merchant  (2010)  demonstrates  persuasively,  but  analyses  of  journal  content 
cannot  differentiate  the  possibility  of  editorial  biases  from  the  broader  phenomenon  of  self-selection  by  submitting 
authors. 
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prestige  journal  space.”  While  Tuttle  and  Dillard  (2007)  (and  others)  may  well  be  correct  in 
asserting  that  financial-archival  research  is  commanding  an  ever  larger  share  of  the  pie,  Table  3 
suggests  that  The  Accounting  Review  is  a  reflection  of  that  phenomenon,  not  the  cause. 

In  raising  this  point  over  the  past  year  regarding  the  similar  statistics  reported  in  my  previous 
annual  report  (Kachelmeier  2009),  I  have  often  heard  the  counterargument  that  submissions  by 
area  are  endogenous  (to  use  an  empirical-archival  term),  reflecting  authors’  preferences  and  beliefs 
regarding  their  likely  prospects  at  any  given  journal.  Put  simply,  researchers  in  underrepresented 
areas  will  not  submit  to  The  Accounting  Review  if  they  perceive  that  they  will  not  get  a  fair  shake 
at  The  Accounting  Review.  I  understand  this  argument,  but  I  also  find  it  to  be  circular.  It  is  difficult 
for  me  to  understand  how  any  journal  can  be  criticized  for  not  publishing  research  it  does  not 
receive. 

To  be  sure,  inclusive  journals  should  take  steps  to  signal  openness  to  the  scholarly  community, 
such  as  through  its  editorial  board  appointments  and  editors.  I  will  leave  the  reader  to  evaluate  the 
diversity  of  our  129  editorial  board  members  and  the  additional  453  ad  hoc  reviewers  named  in  the 
Appendix.  As  for  editors,  I  am  joined  by  a  diverse  set  of  outstanding  coeditors  such  as  Shannon 
Anderson  (expertise  in  field  studies),  Jim  Hunton  (expertise  in  accounting  information  systems), 
and  Tom  Omer  (expertise  in  taxation).  The  rest  seems  up  to  the  community.  If,  at  some  point,  a 
future  Table  3  shows  an  imbalance  between  submissions  by  area  and  acceptances  by  area,  then  we 
can  engage  a  different  debate  as  to  the  causes  of  that  imbalance.  But  until  then,  I  think  Table  3 
indicates  that  we  are  running  a  fair  game,  with  all  areas  having  roughly  equal  prospects  of  success. 
As  the  acceptance-rate  data  indicate,  those  prospects  are  not  very  high  ex  ante.  But  the  same  high 
standards  of  quality  and  rigor  apply  to  all. 

Table  4:  Author  Affiliations 

Table  4,  Panel  A  tallies  the  number  of  articles  corresponding  to  each  university  for  which  at 
least  one  (co)author  published  an  article  in  Volume  84  (calendar  2009)  or  Volume  85  (calendar 
2010)  of  The  Accounting  Review.  Articles  written  by  k  coauthors  are  attributed  1/k  to  each 
author’s  affiliation,  and  if  an  author  lists  two  affiliations,  half  of  that  author’s  1/k  share  is  allocated 
to  each  affiliation.  As  I  did  last  year,  I  list  the  institutions  in  Table  4,  Panel  A  in  descending  order 
of  the  number  of  articles  published.^  However,  the  intent  of  Panel  A  is  most  certainly  not  to 
compare  universities  in  terms  of  research  productivity.  A  recent  study  by  Coyne  et  al.  (2010) 
reports  a  more  comprehensive  analysis  along  that  line.  Rather,  the  point  of  Table  4,  Panel  A  is  to 
document  the  diversity  of  TAR’s  authors.  Fully  130  different  institutions  were  represented  in  136 
research  articles  published  in  The  Accounting  Review  over  the  past  two  years.  That  seems  pretty 
diverse  to  me. 

Table  4,  Panel  B  reports  data  on  international  diyersity,  tallying  frequency  counts  for  both 
submissions  and  acceptances  for  U.S.  authors  and  non-U.S.  authors.  Separate  columns  report  these 
data  for  the  journal  year  ending  May  31,  2010  and  for  the  two-year  cumulative  totals  combining 
this  year  and  last,  with  the  aggregate  statistics  reducing  the  cumulative  submission  count  for  files 
submitted  in  both  years  due  to  revision.  Authors  from  non-U.S.  affiliations  comprise  27.0  percent 
(26.7  percent)  of  the  fiscal  2010  (cumulative)  submissions,  and  20.1  percent  (19.8  percent)  of  the 
acceptances.  A  contingency  table  test  does  not  reject  the  (two-tailed)  null  hypothesis  of  equal 
submission  and  acceptance  percentages  of  U.S.  versus  non-U.S.  authors  for  fiscal  2010  (A^df=i 
=  1.19;  p  =  0.28),  but  the  statistic  becomes  marginally  significant  in  the  more  powerful  cumula- 


^  The  fact  that  The  University  of  Texas  at  Austin  happens  to  be  listed  first  this  year  (6.08  articles,  or  4.5  percent  of  the 
two-year  total)  gives  me  a  good  opportunity  to  clarify  that,  by  policy,  a  TAR  coeditor  never  serves  as  editor  on  one  of 
his/her  own  colleague’s  manuscripts.  The  6.08  coauthor-adjusted  TAR  articles  for  The  University  of  Texas  at  Austin  over 
the  past  two  years  reflect  nine  different  authors  across  a  variety  of  topics  and  methods. 
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TABLE  4 
Author  Affiliation 


Panel  A:  Universities  and  Other  Institutions  Represented  in  Vols.  84  (2009)  and  85  (2010) 


Author  Affiliation 

Current- Year 
Number  of 
Published  Articles 
(Vol.  85,  2010) 

Two-Year 

Cumulative  Number 
of  Published 
Articles 

(2009  and  2010) 

The  University  of  Texas  at  Austin 

5.58 

6.08 

University  of  Pennsylvania 

2.25 

4.67 

Stanford  University 

2.17 

4.17 

Michigan  State  University 

1.75 

3.92 

Harvard  University 

1.58 

3.75 

Dartmouth  College 

3.42 

3.42 

University  of  Illinois  at  Urbana-Champaign 

1.50 

3.17 

The  University  of  Georgia 

2.92 

2.92 

The  University  of  Iowa 

0.50 

2.75 

University  of  Michigan 

1.33 

2.67 

University  of  Pittsburgh 

1.00 

2.67 

University  of  Toronto 

0.83 

2.58 

Indiana  University 

1.00 

2.50 

The  University  of  Chicago 

1.50 

2.33 

The  George  Washington  University 

1.00 

2.33 

Florida  International  University 

1.25 

2.08 

The  Pennsylvania  State  University 

1.33 

2.00 

The  University  of  New  South  Wales 

0.00 

1.83 

University  of  Missouri-Columbia 

1.00 

1.83 

Nanyang  Technological  University 

1.50 

1.83 

Hong  Kong  Polytechnic  University 

1.67 

1.79 

The  University  of  Oklahoma 

1.17 

1.75 

The  University  of  Texas  at  Dallas 

0.58 

1.75 

University  of  California,  Los  Angeles 

1.33 

1.67 

Emory  University 

1.67 

1.67 

The  University  of  North  Carolina  at  Chapel  Hill 

1.33 

1.67 

Texas  A&M  University 

0.33 

1.67 

The  University  of  Utah 

0.83 

1.67 

University  of  Wisconsin-Madison 

0.25 

1.58 

University  of  California,  Berkeley 

0.00 

1.50 

Singapore  Management  University 

0.00 

1.50 

University  of  Minnesota 

0.00 

1.25 

University  of  Alberta 

1.17 

1.17 

Brigham  Young  University 

0.00 

1.17 

Hong  Kong  University  of  Science  and  Technology 

0.33 

1.17 

University  of  Southern  California 

0.00 

1.17 

University  of  Virginia 

0.50 

1.17 

Aarhus  University 

1.00 

1.00 

University  of  Arkansas 

1.00 

1.00 

Universidad  Carlos  UI  de  Madrid 

0.00 

1.00 

Georgia  Institute  of  Technology 

0.00 

1.00 

Humboldt  University-Berlin 

0.00 

1.00 
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Panel  A:  Universities  and  Other  Institutions  Represented  in  Vols.  84  (2009)  and  85  (2010) 


Author  Affiliation 

Iowa  State  University 
McGill  University 
University  of  Melbourne 
Miami  University  (Ohio) 

National  University  of  Singapore 

University  of  New  Hampshire 

Purdue  University 

University  of  Rochester 

Saint  Louis  University 

Santa  Clara  University 

Southern  Methodist  University 

Tel  Aviv  University 

Washington  University  at  St.  Louis 

University  of  Alabama 

Cornell  University 

University  of  Houston 

Maastricht  University 

Massachusetts  Institute  of  Technology 

The  Ohio  State  University 

University  of  Oregon 

Seoul  National  University 

Bentley  University 

University  of  California,  Irvine 

Erasmus  University 

Duke  University 

Rice  University 

University  of  South  Florida 

University  of  Washington 

Concordia  University 

University  of  British  Columbia 

University  of  Connecticut 

University  of  Notre  Dame 

University  of  South  Carolina 

University  of  Amsterdam 

Boston  University 

Columbia  University 

University  of  Florida 

Goethe  University 

Korea  University 

University  of  Miami 

Norwegian  School  of  Management 

Utah  State  University 

Washington  State  University 

University  of  Waterloo 


Current- Year 
Number  of 
Published  Articles 
(Vol.  85,  2010) 

Two-Year 

Cumulative  Number 
of  Published 
Articles 

(2009  and  2010) 

0.00 

1.00 

0.00 

1.00 

0.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

0.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

0.00 

1.00 

0.83 

0.83 

0.33 

0.83 

0.33 

0.83 

0.00 

0.83 

0.67 

0.83 

0.83 

0.83 

0.83 

0.83 

0.58 

0.83 

0.25 

0.75 

0.00 

0.75 

0.33 

0.75 

0.00 

0.67 

0.00 

0.67 

0.00 

0.67 

0.67 

0.67 

0.50 

0.63 

0.33 

0.58 

0.00 

0.58 

0.00 

0.58 

0.58 

0.58 

0.00 

0.50 

0.50 

0.50 

0.00 

0.50 

0.00 

0.50 

0.00 

0.50 

0.50 

0.50 

0.50 

0.50 

0.50 

0.50 

0.00 

0.50 

0.00 

0.50 

0.00 

0.50 

(continued  on  next  page) 
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Panel  A:  Universities  and  Other  Institutions  Represented  in  Vols.  84  (2009)  and  85  (2010) 


Author  Affiliation 

University  of  Antwerp 

University  of  Arizona 

Arizona  State  University 

University  of  Auckland 

University  of  California,  Davis 

University  of  California,  San  Diego 

Carnegie  Mellon  University 

China  Europe  International  Business  School 

Colorado  State  University 

Drexel  University 

George  Mason  University 

Gradient  Analytics,  Inc. 

HEC  Paris 
Inha  University 
Lehigh  University 

University  of  Massachusetts  Dartmouth 

McMaster  University 

Meijo  University 

University  of  Nevada,  Las  Vegas 

New  York  University 

North  Carolina  State  University 

Northeastern  University 

Northwestern  University 

Osaka  University  of  Economics 

Sungkyunkwan  University 

Syracuse  University 

The  University  of  Tennessee 

Virginia  Commonwealth  University 

Virginia  Polytechnic  Institute  and  State  University 

Wichita  State  University 

Ball  State  University 

Chapman  University 

University  of  Hong  Kong 

Illinois  State  University 

Indian  School  of  Business 

Kent  State  University 

University  of  Massachusetts,  Amherst 

Mississippi  State  University 

Oklahoma  State  University 

Tilburg  University 

Barclays  Global  Investors 

Copenhagen  Business  School 

Interdisciplinary  Center-Herzliya 

University  of  Southern  Maine 


Current- Year 
Number  of 
Published  Articles 
(Vol.  85,  2010) 

Two-Year 

Cumulative  Number 
of  Published 
Articles 

(2009  and  2010) 

0.00 

0.33 

0.33 

0.33 

0.00 

0.33 

0.00 

0.33 

0.00 

0.33 

0.00 

0.33 

0.33 

0.33 

0.33 

0.33 

0.00 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.00 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.00 

0.33 

0.00 

0.33 

0.00 

0.33 

0.33 

0.33 

0.00 

0.33 

0.33 

0.33 

0.00 

0.33 

0.33 

0.33 

0.00 

0.33 

0.00 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.33 

0.00 

0.25 

0.00 

0.25 

0.00 

0.25 

0.25 

0.25 

0.00 

0.25 

0.00 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 

0.25 

0.00 

0.17 

0.00 

0.17 

0.00 

0.17 

0.00 

0.17 
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Panel  B:  U.S.  and  non-U.S.  Affiliations  Represented  in  Vols.  84  (2009)  and  85  (2010) 


Journal  Year  Ending 
May  31,  2010 


Cumulative  from  June  1,  2008  to 
May  31,  2010 


Number 
(Percentage)  of 
Submissions 
Representing 
Unique 

Manuscript  Files 
with  Decisions 


Number 
(Percentage)  of 
Acceptances 


Number 
(Percentage)  of 
Submissions 
Representing 
Unique 

Manuscript  Files 
with  Decisions 


Number 
(Percentage)  of 
Acceptances 


U.S.  Institutions  448.93  (73.0%)  50.34  (79.9%) 

Non-U.S.  Institutions  166.07  (27.0%)  12.66  (20.1%) 


825.84  (73.3%)  115.50  (80.2%) 

301.16  (26.7%)  28.50  (19.8%) 


Panel  A  reports  author  affiliations  for  all  articles  published  in  Volume  84  (calendar  2009)  and  Volume  85  (calendar  2010) 
of  The  Accounting  Review,  excluding  invited  commentaries.  For  articles  written  by  k  coauthors,  the  table  allocates  1/k  of 
an  article  to  each  coauthor’s  affiliation.  For  authors  indicating  two  affiliations,  that  author’s  1/k  share  is  allocated  equally 
to  both  affiliations. 

Panel  B  reports  data  for  authors’  affiliations,  allocating  1/k  of  each  manuscript  to  each  of  k  coauthors.  Submissions  data  for 
the  journal  year  ending  May  31,  2010  reflect  the  615  unique  files  for  which  decisions  were  reached  in  fiscal  2010, 
excluding  files  for  which  two  or  more  decisions  were  reached  within  the  year  due  to  invitations  to  revise  and  resubmit. 
Cumulative  submissions  data  from  June  1,  2008  to  May  31,  2010  (i.e.,  two  journal  years)  reflect  the  1,127  unique 
manuscript  files  for  which  a  decision  was  logged  in  either  year,  but  not  double  counting  files  for  which  a  decision  was 
reached  in  both  years  (due  to  revision).  Acceptance  data  reflect  all  acceptances  and  conditional  acceptances  reached  in  the 
journal  years  ending  May  31,  2009  and  May  31,  2010,  including  some  acceptances  that  are  scheduled  for  publication  in 
early  2011.  Hence,  the  acceptance  data  in  this  panel  will  not  equal  the  total  acceptances  in  Panel  A,  as  Panel  A  only  reflects 
actual  articles  published  in  calendar  2009  and  calendar  2010,  excluding  forthcoming  articles  accepted  for  future 
publication. 


live  test  =  3.01;  p  =  0.08).  My  hunch  is  that  non-U.S.  authors  are  probably  underrepre¬ 

sented  a  bit  in  acceptances  relative  to  submissions,  but  as  I  indicated  last  year,  I  honestly  do  not 
perceive  any  such  underrepresentation  as  an  overt  bias  against  non-U.S.  authors.  Rather,  I  suspect 
that  the  factors  that  influence  reviewer  assessments  of  manuscript  quality  are  correlated  to  some 
extent  with  the  U.S./non-U.S.  categorization.  To  avoid  misunderstanding,  please  note  that  this 
conjecture  is  a  comment  on  the  distribution,  not  a  generalization  about  non-U.S.  authors.  For  the 
vast  majority  of  our  submissions,  I  have  discerned  no  systematic  quality  differences  between  U.S. 
and  non-U.S.  authors,  and  indeed,  current  data  indicate  that  about  one  out  of  every  five  articles 
published  in  TAR  is  written  by  a  non-U.S.  author.  My  comment  applies  only  to  the  tails  of  the 
distributions. 

Table  5:  Processing  Time 

Table  5  reports  processing  time  data  for  the  673  decisions  reached  from  June  1,  2009  through 
May  31,  2010,  defining  “processing  time”  as  the  number  of  days  from  the  submission  to  the  date 
my  assistant,  Mary  Capps,  sends  the  decision  letter.  Our  overall  processing  time  has  not  improved 
since  last  year — in  fact,  the  average  is  up  from  81  days  last  year  to  86  days  this  year.  I  was  curious 
to  learn  why  it  takes  nearly  three  months  to  reach  an  (average)  editorial  decision,  so  I  dug  a  bit 
deeper  by  first  determining  the  later  reviewer  for  each  reviewed  submission  and  then  calculating 
the  average  review  time  for  that  later  reviewer.  Calculating  the  review  time  for  the  later  of  the  two 
reviewers  is  appropriate  because  a  two-reviewer  system  is  subject  to  somewhat  of  a  weakest- 
link”  problem— even  if  the  first  reviewer  is  timely,  a  decision  based  on  two  reviews  cannot 
proceed  until  the  second  review  arrives. 

For  the  journal  year  ending  May  31,  2010,  the  later  of  the  two  reviewers  took  an  average  of 
55  days  to  return  his/her  review — call  it  two  months.  If  our  average  total  turnaround  time  is  about 
three  months,  this  means  that  the  typical  manuscript  is  out  for  review  for  two  months,  with  an 
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TABLE  5 


Processing  Time  from  Date  of  Submission  to  Date  of  Sending  the  Decision 
Journal  Year  Ending  May  31,  2010 


Processing  Time 

Number  of 
Manuscripts 

Percentage 

Cumulative 
Number  of 
Manuscripts 

Cumulative 

Percentage 

<30  days 

36 

5.3% 

36 

5.3% 

31-60  days 

72 

10.7% 

108 

16.0% 

61-90  days 

227 

33.7% 

335 

49.8% 

91-120  days 

303 

45.0% 

638 

94.8% 

>121  days 

35 

5.2% 

673 

100.0% 

Mean  processing  time:  86  days. 
Median  processing  time:  91  days. 


additional  month  for  (1)  initial  administrative  processing  to  log  the  submission  in  our  system  and 
check  for  previous,  related  submissions,  (2)  the  initial  “pre-review”  by  my  research  assistant  and 
by  me  to  identify  potential  reviewers,  (3)  reviewer  clearance  with  the  assigned  editor,  (4)  process¬ 
ing  invitations  to  both  reviewers  and  following  up,  as  necessary,  (5)  finding  and  sending  invita¬ 
tions  to  alternative  reviewer(s)  if  the  initially  invited  reviewer(s)  decline,  (6)  editorial  processing 
of  the  reviews,  (7)  editorial  processing  of  the  manuscript,  (8)  formulating  and  writing  the  decision 
letter,  (9)  senior  editor  reading  of  a  coeditor’s  decision  letter  and  consultation,  if  applicable,  and 
(10)  administrative  processing  of  the  decision  letter  and  corresponding  statistics  for  our  databases. 
Yes,  I  know  this  ten-point  list  sounds  a  bit  defensive,  but  my  intent  is  to  communicate  that  there 
are  many  things  to  do  for  each  file,  and  especially  when  other  demands  (like  teaching)  are  present, 
it  is  difficult  to  get  the  total  non-review  time  materially  lower  than  a  month,  though  we  try  and  are 
successful  to  varying  degrees  at  different  times  of  the  year.  (The  Holiday  season  is  especially 
problematic — authors  probably  should  not  expect  a  very  quick  turnaround  when  submitting  a 
manuscript  on  December  15.) 

Regarding  review  time,  some  might  advise  reaching  a  decision  based  on  one  review  if  the 
second  review  is  not  in  hand  within  two  months,  and  indeed,  that  was  my  intent  when  I  started  the 
term.  Especially  this  year,  however,  my  coeditors  and  I  have  tended  to  wait  for  both  reviews,  even 
if  that  means  a  turnaround  time  that  exceeds  what  we  like  to  see.  It  is  easy  to  favor  a  one-reviewer 
decision  in  abstract  terms,  but  when  one  is  looking  at  an  actual  manuscript  with  one  negative 
review  that  could  be  due  to  a  matter  of  taste,  I  think  most  authors  would  favor  waiting  a  couple 
more  weeks  for  a  second  opinion.  Ex  post,  of  course,  if  the  later  review  also  turns  out  negative  and 
the  decision  is  to  reject,  it  would  have  been  better  to  have  made  that  decision  earlier,  but  editors 
do  not  have  the  luxury  of  hindsight  when  only  one  review  is  in  hand. 

There  are  a  couple  of  bright  spots  in  our  turnaround  statistics.  First,  while  many  manuscripts 
take  three  months  from  submission  to  decision,  very  few  (5.2  percent  to  be  exact)  take  four 
months  or  longer.  Also,  if  the  goal  is  to  reach  a  final  decision  on  a  timely  basis,  I  would  submit 
that  the  turnaround  time  for  each  round  comprises  only  a  small  part  of  the  total.  Time  spent  on  the 
author’s  desk  between  rounds  can  be  quite  significant,  as  can  the  total  number  of  rounds  before  a 
final  decision  is  reached.  Along  that  line,  I  think  it  is  comforting  that  only  nine  of  our  673  editorial 
decisions  in  fiscal  2010  were  in  the  fourth  round,  and  only  one  reached  the  fifth  round  (upon  which 
it  was  accepted).  We  try  to  reach  closure  by  the  third  round  in  the  vast  majority  of  cases. 
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IV.  SOME  NOTES  OF  THANKS  AND  REMEMBRANCE 

Notes  of  Thanks 

As  I  did  last  year,  I  close  this  report  with  several  notes  of  thanks.  First,  I  thank  the  person  with 
whom  most  submitting  authors  and  reviewers  are  quite  familiar  but  have  never  met — Mary  Capps. 
Mary  works  tirelessly  to  keep  the  “trains  running  on  time,”  and  she  also  is  the  source  of  those 
emails  many  reviewers  have  received  with  the  capitalized  word  “REMINDER”  at  the  beginning  of 
the  subject  line.  She  is  also  fond  of  the  one-line  email  “Have  you  had  a  chance  to  get  to  this  yet?,” 
followed  by  a  smiley  face  icon.  At  a  recent  conference  I  attended,  a  faculty  member  observed  to 
me,  “Mary  is  quite  persistent,  isn’t  she?”  He  said  it  with  a  smile,  and  quickly  followed  up  with  a 
note  of  appreciation  that  someone  cares  so  much  about  making  the  journal  work  as  efficiently  and 
effectively  as  possible,  while  remaining  sensitive  to  personal  circumstances  that  can  arise  from 
time  to  time.  I  agree. 

Second,  I  thank  my  doctoral  research  assistant,  Tracie  Majors.  Tracie  has  become  “scary 
good”  at  researching  new  submissions  to  identify  potential  reviewers.  She  recognizes  potential 
conflicts  of  interest  not  only  from  affiliations  and  coauthorships,  but  also  from  different  perspec¬ 
tives.  A  typical  Tracie  comment  would  be,  “Professor  X  clearly  knows  this  area,  but  the  author  is 
taking  issue  with  X’s  favored  conclusion,  so  if  you  decide  to  go  with  X,  you  really  need  a  second 
reviewer  with  a  more  independent  perspective.”  Some  might  question  the  appropriateness  of  a 
doctoral  student  playing  such  an  active  role  in  proposing  possible  reviewers,  but  I  view  it  as  one 
of  the  wisest  decisions  I  have  made  as  senior  editor.  Unlike  professors  who  develop  different  kinds 
of  political  baggage.  Trade’s  only  exposure  to  a  new  submission  is  the  manuscript  itself  and  the 
literature  to  which  it  contributes.  To  be  clear,  Tracie  only  suggests  a  list  of  potential  reviewers  for 
my  consideration  (she  does  not  make  the  decisions),  but  I  have  found  her  input  to  be  invaluable. 

Third,  I  thank  my  coeditors.  Shannon  Anderson,  Harry  Evans,  Jim  Hunton,  Kathryn  Kadous, 
Sanjay  Kallapur,  Ranjani  Krishnan,  Laureen  Maines,  Paul  Newman,  Tom  Omer,  Wayne  Thomas, 
Mark  Trombley,  Beverly  Walther,  and  Paul  Zarowin.  They  never  complain,  even  when  they 
should,  and  their  decisions  are  consistently  of  the  highest  ethical  and  professional  caliber.  Eor  all 
their  hard  work,  they  get  paid  the  grand  sum  of  zero,  so  I  am  extremely  grateful  for  their  dedicated 
service.  With  over  500  new  submissions  per  year,  it  would  be  impossible  for  any  senior  editor  to 
do  this  job  alone.  Delegation  to  experts  is  essential,  and  I  am  comfortable  placing  complete  trust 
in  these  13  outstanding  professionals. 

Eourth,  I  thank  TAR’s  Editorial  Advisory  and  Review  Board,  as  listed  in  the  inside  cover 
material  of  each  issue.  Like  our  coeditors.  Editorial  Board  members  get  paid  the  whopping  sum  of 
zero  for  their  service  to  TAR,  and  they  put  in  a  tremendous  amount  of  hours  completing  six  and 
sometimes  more  reviews  per  year.  Six  reviews  per  year  might  not  seem  too  oppressive,  but  one  has 
to  keep  in  mind  that  our  most  demanded  reviewers  are  probably  also  in  high  demand  by  other 
journals,  and  indeed,  several  of  them  serve  on  multiple  journal  editorial  boards.  In  essence,  the 
reward  for  gaining  a  reputation  for  doing  timely,  high-quality  reviews  is  more  review  requests.  We 
let  no  good  deed  go  unpunished. 

Eifth,  I  thank  our  ad  hoc  reviewers,  the  unsung  heroes  of  any  successful  journal.  Continuing 
a  tradition  I  started  last  year,  the  Appendix  lists  all  453  ad  hoc  reviewers  who  completed  one  or 
more  review  reports  between  June  1,  2009  and  May  31,  2010.  I  challenge  the  reader  to  find  any 
research  area  (topical  or  methodological)  that  is  not  covered  by  at  least  a  few  names  in  this 
Appendix,  so  I  hope  that,  in  addition  to  thanking  these  individuals  for  their  generosity,  the  Ap¬ 
pendix  also  serves  to  signal  TAR’s  diversity. 

Sixth,  I  thank  the  exceptional  professional  staff  of  the  American  Accounting  Association.  I 
would  start  naming  names,  but  surely  I  would  inadvertently  leave  someone  out  who  should  be 
thanked,  so  I  will  just  leave  it  at  that.  We  send  them  accepted  manuscripts,  and  they  magically 
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make  a  neat  looking  journal  appear,  including  an  invaluable  copyediting  service.  (Authors:  please 
do  not  read  that  as  an  excuse  not  to  check  your  conditionally  accepted  manuscripts  carefully 
before  sending  us  the  final  revision.) 

Seventh,  I  thank  The  University  of  Texas  at  Austin,  including  McCombs  School  of  Business 
Dean  Tom  Gilligan  and  Department  of  Accounting  Chair  Urton  Anderson,  for  supporting  my 
service  as  senior  editor  of  The  Accounting  Review.  While  most  of  my  working  hours  since  2008 
have  been  for  an  activity  governed  by  the  American  Accounting  Association,  it  is  The  University 
of  Texas  at  Austin  that  pays  my  salary  and  makes  this  service  possible.  I  still  teach,  which  is  my 
preference;  I  would  never  want  to  be  out  of  the  classroom  entirely.  But  it  is  clear  that  the  nature 
of  my  academic  position  has  changed  dramatically  over  the  past  two  years  (actually  three,  count¬ 
ing  the  extensive  planning  efforts  an  incoming  senior  editor  must  undertake  before  the  term 
begins).  I  thank  my  employer  for  understanding  and  supporting  me  in  this  role  and  for  recognizing 
the  requisite  tradeoffs. 

Eighth,  and  perhaps  most  importantly,  I  thank  my  wife  Paula  and  my  daughters  Karen, 
Nicole,  and  Sara.  Last  year  at  this  time,  my  annual  report  mentioned  the  deck  lighting  I  had  been 
stalling  for  months  while  working  virtually  every  weekend  on  TAR  matters.  I  finally  got  the 
lighting  installed  (I  think  I  could  faintly  hear  my  daughters  in  the  background  singing  the  Halle¬ 
lujah  chorus).  This  year’s  winner  of  the  procrastination  award  goes  to  my  1992  Camaro,  which  sits 
in  our  garage  gathering  spider  webs  while  I  stall  taking  action  on  either  fixing  it  or  putting  it  out 
of  its  misery.  We  are  all  going  camping  next  week — this  time  in  Colorado.  I  will  put  a  few  TAR 
files  in  the  pile  with  the  camping  gear,  which  I  hope  Paula  “forgets”  to  bring  along  (meaning  the 
files,  not  the  gear).  So  add  seven  days  to  our  average  turnaround  statistics  for  this  month — I  need 
the  break,  and  family  comes  first  when  all  is  said  and  done.  One  sad  note  is  that  our  loyal  dog 
Midnight  will  not  be  anxiously  awaiting  our  return  this  time,  as  she  passed  away  last  spring  at  the 
ripe  old  age  of  11.  The  nice  thing  about  dogs  is  that  their  love  is  unconditional,  no  matter  how 
badly  you  screw  up.  We  miss  her  greatly. 

Remembrances 

While  not  part  of  the  annual  report  per  se,  I  will  exercise  my  editorial  prerogative  to  offer  a 
few  comments  on  two  leaders  in  our  profession  who  passed  away  this  year — John  Dickhaut  and 
Anthony  Hopwood.  Both  exhibited  inspirational  leadership  in  daring  to  challenge  the  received 
wisdom  in  accounting  research,  and  both  were  true  scholars  in  every  sense  of  that  word. 

My  most  vivid  early  recollection  of  John  Dickhaut  was  when  I  was  a  doctoral  student  at  the 
University  of  Florida  in  the  mid-1980s  and  John  was  an  invited  workshop  guest.  1  distinctly 
remember  an  intense  discussion  of  backward  induction  and  multiperiod  reputations  with  John 
while  in  the  men’s  underwear  department  of  the  local  JCPenney,  as  John  had  asked  me  to  take  him 
there  during  the  lunch  hour  because  he  had  forgotten  to  pack  any  clothes  for  his  two-day  visit. 
This  was  vintage  John  Dickhaut — a  man  who  could  forget  trivial  things  like  packing  a  suitcase, 
but  who  at  the  same  time  was  absolutely  brilliant  in  his  understanding  of  the  interface  between 
human  behavior  and  accounting.  About  ten  years  later,  I  gave  John  Dickhaut  most  of  the  credit  for 
making  sure  the  two  words  “and  experimental”  were  included  in  the  “Aims  and  Scope”  statement 
for  Review  of  Accounting  Studies,  a  new  journal  for  which  John  was  one  of  the  founding  editors. 
For  The  Accounting  Review,  I  take  special  pride  in  the  fact  that  we  had  the  privilege  of  publishing 
two  of  John’s  final  scholarly  works — both  in  the  same  issue  (Dickhaut  2009;  Dickhaut  and  Xin 
2009).  Those  who  value  diversity  and  quality  in  accounting  scholarship  should  read  both  articles. 
Finally,  on  a  somewhat  sad  but  telling  note,  the  reader  will  see  John’s  name  in  the  Appendix  of  ad 
hoc  reviewers  this  year,  as  he  turned  in  a  review  of  a  first-round  submission  just  the  week  before 
he  passed  away,  using  voice-recognition  software  to  help  compose  his  thoughts.  He  recommended 
revision. 
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I  did  not  have  as  much  personal  knowledge  of  Anthony  Hopwood,  but  I  greatly  appreciate  his 
influence  on  the  discipline,  not  only  as  editor  of  Accounting,  Organizations  and  Society,  but  also 
as  a  persistent  champion  for  diversity  in  accounting  scholarship.  I  wonder  how  he  might  react  to 
Table  3  in  this  report.  In  the  commentary  based  on  his  2006  AAA  Presidential  Scholar  Lecture, 
Professor  Hopwood  issued  a  special  challenge  for  The  Accounting  Review  as  the  Association’s 
flagship  journal,  urging  that  “every  effort  be  made  to  encourage  The  Accounting  Review  to  em¬ 
brace  the  new,  the  innovative,  what  accounting  research  might  be  in  the  process  of  becoming,  and 
new  interdisciplinary  perspectives  ...  in  a  way  that  provides  both  a  catalyst  and  a  model  for  other 
journals  of  influence”  (Hopwood  2007,  1373).  The  challenge  is  profound.  I  am  trying,  and  it  is  my 
heartfelt  conviction  that  TAR  senior  editors  and  coeditors  past,  present,  and  future  would  join  me 
in  agreeing  with  Professor  Hopwood’s  challenge.  But  innovation  in  any  discipline  is  a  team  sport. 
We  need  the  help  of  the  scholarly  community  in  accounting  to  make  it  happen. 


APPENDIX 

TAR  Ad  Hoc  Reviewers,  June  1,  2009-May  31,  2010 

Note:  This  table  lists  all  ad  hoc  reviewers  who  submitted  at  least  one  review  report  between 
June  1,  2009  and  May  31,  2010.  It  does  not  include  the  129  members  of  the  Editorial  Advisory  and 
Review  Board,  as  those  names  are  listed  separately  in  the  inside  cover  material  of  each  issue.  Ad 
hoc  reviewers  whose  reports  were  submitted  after  May  31,  2010  will  be  included  in  next  year’s 
report. 

Lawrence  Abbott,  University  of  Wisconsin-Milwaukee 

A.  Rashad  Abdel-khalik,  University  of  Illinois  at  Urbana-Champaign 

Margaret  Abemethy,  University  of  Melbourne 

David  Aboody,  University  of  California,  Los  Angeles 

Andrew  Acito,  The  University  of  Iowa 

Christopher  Agoglia,  University  of  Massachusetts  Amherst 

Kris  Allee,  Michigan  State  University 

Michael  Alles,  Rutgers  University 

Urton  Anderson,  The  University  of  Texas  at  Austin 

Christopher  Armstrong,  University  of  Pennsylvania 

Sharad  Asthana,  The  University  of  Texas  at  San  Antonio 

Brad  Badertscher,  University  of  Notre  Dame 

Kee-Hong  Bae,  York  University 

Mark  Bagnoli,  Purdue  University 

Bok  Baik,  Seoul  National  University 

Wendy  Bailey,  University  of  South  Carolina 

Steven  Balsam,  Temple  University 

Michael  B  amber.  The  University  of  Georgia 

Ran  Bamiv,  Kent  State  University 

Mary  Barth,  Stanford  University 

Jan  Barton,  Emory  University 

Eli  Bartov,  New  York  University 

Mark  Beasley,  North  Carolina  State  University 

Paul  Beck,  University  of  Illinois  at  Urbana-Champaign 

Joy  Begley,  University  of  British  Columbia 

Bruce  Behn,  University  of  Tennessee 

Messod  Beneish,  Indiana  University 

Daniel  Bens,  University  of  Arizona 
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Jeremy  Bertomeu,  Northwestern  University 

Anne  Beyer,  Stanford  University 

Sreedhar  Bharath,  Arizona  State  University 

Neil  Bhattacharya,  Southern  Methodist  University 

Sanjeev  Bhojraj,  Cornell  University 

Gary  Biddle,  The  University  of  Hong  Kong 

Bruce  Billings,  Florida  State  University 

Mary  Brooke  Billings,  New  York  University 

Erv  Black,  Brigham  Young  University 

Jeff  Boone,  The  University  of  Texas  at  San  Antonio 

Jan  Bouwens,  Tilburg  University 

Robert  Bowen,  University  of  Washington 

Kendall  Bowlin,  University  of  Mississippi 

Joe  Brazel,  North  Carolina  State  University 

Francois  Brochet,  Harvard  University 

Jason  Brown,  Indiana  University 

Jennifer  Brown,  Arizona  State  University 

Jorg  Budde,  University  of  Bonn 

David  Burgstahler,  University  of  Washington 

Jeffrey  Burks,  University  of  Notre  Dame 

Robert  Bushman,  University  of  North  Carolina  at  Chapel  Hill 

Brian  Cadman,  University  of  Utah 

Steven  Cahan,  University  of  Auckland 

Andrew  Call,  The  University  of  Georgia 

Dennis  Campbell,  Harvard  University 

Eddy  Cardinaels,  Tilburg  University 

Peter  Carey,  Monash  University 

Tina  Carpenter,  The  University  of  Georgia 

Elizabeth  Carson,  The  University  of  New  South  Wales 

Judson  Caskey,  University  of  California,  Los  Angeles 

Gavin  Cassar,  University  of  Pennsylvania 

Jeffrey  Casterella,  Colorado  State  University 

Marcus  Caylor,  University  of  South  Carolina 

Sandra  Chamberlain,  University  of  British  Columbia 

Dennis  Chambers,  Kennesaw  State  University 

Chih-Ying  Chen,  Singapore  Management  University 

Clara  Xiaoling  Chen,  University  of  Illinois  at  Urbana-Champaign 

Kung  Chen,  University  of  Nebraska — Lincoln 

Peter  Chen,  Hong  Kong  University  of  Science  and  Technology 

Qi  Chen,  Duke  University 

Shuping  Chen,  The  University  of  Texas  at  Austin 

Xia  Chen,  University  of  Wisconsin — Madison 

Agnes  Cheng,  Louisiana  State  University 

Shijun  Cheng,  University  of  Maryland 

Jong-Hag  Choi,  Seoul  National  University 

Margaret  Christ,  The  University  of  Georgia 

Peter  Christensen,  Aarhus  University 

Ted  Christensen,  Brigham  Young  University 

Wai-Fong  Chua,  The  University  of  New  South  Wales 
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Hyeesoo  (Sally)  Chung,  Arizona  State  University  West 

Bryan  Church,  Georgia  Institute  of  Technology 

Peter  Clarkson,  University  of  Queensland 

Shana  Clor-Proell,  University  of  Wisconsin — Madison 

Daniel  Cohen,  New  York  University 

Dan  Collins,  The  University  of  Iowa 

Carlos  Corona,  The  University  of  Texas  at  Austin 

Masako  Darrough,  Baruch  College-CUNY 

Somnath  Das,  University  of  Illinois  at  Chicago 

Sudipto  Dasgupta,  Hong  Kong  University  of  Science  and  Technology 

Angela  Davis,  University  of  Oregon 

Gus  De  Franco,  University  of  Toronto 

Herui  Dekker,  Vrije  University  Amsterdam 

Joel  Demski,  University  of  Florida 

Aiyesha  Dey,  University  of  Minnesota 

Ilia  Dichev,  Emory  University 

John  Dickhaut,  Chapman  University 

J.  Richard  Dietrich,  The  Ohio  State  University 

Shane  Dikolli,  Duke  University 

Ming  Dong,  York  University 

Jeffrey  Doyle,  Utah  State  University 

Julia  D’ Souza,  Cornell  University 

Scott  Dyreng,  Duke  University 

Christine  Earley,  Providence  College 

David  Easley,  Cornell  University 

Frank  Ecker,  Duke  University 

Merle  Ederhof,  University  of  Michigan 

Eti  Einhom,  Tel  Aviv  University 

Randal  Elder,  Syracuse  University 

W.  Brooke  Elliott,  University  of  Illinois  at  Urbana-Champaign 

David  Erkens,  University  of  Southern  California 

Michael  Ettredge,  University  of  Kansas 

Diana  Falsetta,  University  of  Miami 

Qintao  Fan,  University  of  California,  Berkeley 

C.  Edward  Fee,  Michigan  State  University 

Bill  Felix,  University  of  Arizona 

Mei  Feng,  University  of  Pittsburgh 

Andrew  Ferguson,  University  of  Technology  Sydney 

Fabrizio  Ferri,  New  York  University 

Rebecca  Files,  The  University  of  Texas  at  Dallas 

Urs  Fischbacher,  University  of  Konstanz 

Joseph  Fisher,  Indiana  University 

Mary  Margaret  Frank,  University  of  Virginia 

Richard  Frankel,  Washington  University  at  St.  Louis 

Pingyang  Gao,  The  University  of  Chicago 

Jon  Garfinkel,  The  University  of  Iowa 

Jennifer  Gaver,  The  University  of  Georgia 

Lisa  Milici  Gaynor,  University  of  South  Florida 

Weili  Ge,  University  of  Washington 
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Marshall  Geiger,  University  of  Richmond 

Aloke  Ghosh,  Baruch  College-CUNY 

Michael  Gibbs,  The  University  of  Chicago 

Dan  Givoly,  The  Pennsylvania  State  University 

Guojin  Gong,  The  Pennsylvania  State  University 

Elizabeth  Gordon,  Temple  University 

Severin  Grabski,  Michigan  State  University 

Jeffery  Gramlich,  University  of  Southern  Maine 

Audrey  Gramling,  Kennesaw  State  University 

Barbara  Murray  Grein,  Drexel  University 

Paul  Griffin,  University  of  California,  Davis 

Zhaoyang  Gu,  University  of  Minnesota 

Wayne  Guay,  University  of  Pennsylvania 

Ferdinand  Gul,  The  Hong  Kong  Polytechnic  University 

Sanjay  Gupta,  Michigan  State  University 

Ilan  Guttman,  Stanford  University 

Charles  Hadlock,  Michigan  State  University 

Jane  Hamilton,  LaTrobe  University 

Jackie  Hammersley,  The  University  of  Georgia 

Michelle  Hanlon,  Massachusetts  Institute  of  Technology 

Bruce  Haslem,  Florida  State  University 

John  Hassell,  Indiana  University  Indianapolis 

Richard  Hatfield,  University  of  Alabama 

Rachel  Hayes,  University  of  Utah 

Carla  Hayn,  University  of  California,  Los  Angeles 

Gary  Hecht,  Emory  University 

Frank  Heflin,  Florida  State  University 

Bill  Heninger,  Brigham  Young  University 

Karen  Hennes,  University  of  Oklahoma 

Don  Herrmann,  Oklahoma  State  University 

Max  Hewitt,  Indiana  University 

Stephen  Hillegeist,  INSEAD 

lessen  Hobson,  University  of  Illinois  at  Urbana-Champaign 

Leslie  Hodder,  Indiana  University 

Vicky  Hoffman,  University  of  Pittsburgh 

Rani  Hoitash,  Bentley  University 

Udi  Hoitash,  Northeastern  University 

Lori  Holder- Webb,  Western  New  England  College 

Keith  Houghton,  Australian  National  University 

Paul  Hribar,  The  University  of  Iowa 

John  Hughes,  University  of  California,  Los  Angeles 

Kai  Wai  Hui,  Hong  Kong  University  of  Science  and  Technology 

Mingyi  Hung,  University  of  Southern  California 

David  Hurtt,  Baylor  University 

Mark  Huson,  University  of  Alberta 

Amy  Hutton,  Boston  College 

Lee-Seok  Hwang,  Seoul  National  University 

Yuhchang  Hwang,  Arizona  State  University 

Troy  Hyatt,  Boise  State  University 
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Paul  Irvine,  The  University  of  Georgia 

Alan  Jagolinzer,  Stanford  University 

Karim  Jamal,  University  of  Alberta 

Surya  Janakiriman,  The  University  of  Texas  at  Dallas 

Nicole  Jenkins,  Vanderbilt  University 

Ross  Jennings,  The  University  of  Texas  at  Austin 

Kevan  Jensen,  University  of  Oklahoma 

John  (Xuefeng)  Jiang,  Michigan  State  University 

W.  Bruce  Johnson,  The  University  of  Iowa 

Derek  Johnston,  Colorado  State  University 

Bjorn  Jorgensen,  University  of  Colorado  at  Boulder 

Paul  Kalyta,  McGill  University 

Sok-Hyon  Kang,  The  George  Washington  University 

Tony  Kang,  Oklahoma  State  University 

Ron  Kasznik,  Stanford  University 

Sharon  Katz,  Columbia  University 

Elizabeth  Keating,  Boston  College 

Mozaffar  Khan,  University  of  Minnesota 

Jeong-Bon  Kim,  City  University  of  Hong  Kong 

Oliver  Kim,  University  of  Maryland 

Michael  Kinney,  Texas  A&M  University 

Sandy  Klasa,  University  of  Arizona 

Ken  Klassen,  University  of  Waterloo 

Kalin  Kolev,  Yale  University 

Gopal  Krishnan,  Lehigh  University 

Jagan  Krishnan,  Temple  University 

Linda  Krull,  University  of  Oregon 

Xi  (Jason)  Kuang,  Georgia  Institute  of  Technology 

Susan  Kulp,  The  George  Washington  University 

Soo  Young  Kwon,  Korea  University 

Ryan  LaFond,  BlackRock,  Inc. 

Richard  Lambert,  University  of  Pennsylvania 

Wayne  Landsman,  University  of  North  Carolina  at  Chapel  Hill 

Mark  Lang,  University  of  North  Carolina  at  Chapel  Hill 

Christian  Laux,  Goethe  University 

Volker  Laux,  The  University  of  Texas  at  Austin 

Charles  Lee,  Stanford  University 

Yen-Jung  Lee,  National  Taiwan  University 

Craig  Lefanowicz,  University  of  Virginia 

Clive  Lennox,  Nanyang  Technological  University 

Andrew  Leone,  University  of  Miami 

Christian  Leuz,  The  University  of  Chicago 

Baruch  Lev,  New  York  University 

Edward  X.  Li,  University  of  Rochester 

Laura  Yue  Li,  University  of  Illinois  at  Urbana-Champaign 

Oliver  Li,  University  of  Arizona 

Siqi  Li,  Santa  Clara  University 

Xu  Li,  Lehigh  University 

Yue  Li,  University  of  Toronto 


The  Accounting  Review 


November  2010 
American  Accounting  Association 


2198 


Kachelmeier 


Pierre  Liang,  Carnegie  Mellon  University 
Woody  Liao,  University  of  California,  Riverside 
Anne  Lillis,  University  of  Melbourne 
Marlys  Lipe,  University  of  Oklahoma 
Robert  Lipe,  University  of  Oklahoma 

Petro  Lisowsky,  University  of  Illinois  at  Urbana-Champaign 

Chao-Shin  Liu,  University  of  Notre  Dame 

ling  Liu,  University  of  California,  Los  Angeles 

Josh  Livnat,  New  York  University 

Kin  Lo,  University  of  British  Columbia 

Gerald  Lobo,  University  of  Houston 

Tom  Lopez,  University  of  Alabama 

Ken  Lorek,  Northern  Arizona  University 

Tim  Loughran,  University  of  Notre  Dame 

Henock  Louis,  The  Pennsylvania  State  University 

Tong  Lu,  University  of  Houston 

Yvonne  Lu,  Lehigh  University 

Russell  Lundholm,  University  of  Michigan 

Luann  Lynch,  University  of  Virginia 

Victor  Maas,  University  of  Amsterdam 

Mario  Maletta,  Northeastern  University 

Xiumin  Martin,  Washington  University  at  St.  Louis 

Michal  Matejka,  Arizona  State  University 
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EDITORIAL  POLICY  AND  STYLE  INFORMATION 

EDITORIAL  POLICY 

According  to  the  policies  set  by  the  Publications  Committee  (which  were  endorsed  by  the  Executive 
Committee  and  were  published  in  the  Accounting  Education  News,  June  1987),  The  Accounting  Review 
should  be  viewed  as  the  premier  journal  for  publishing  articles  reporting  the  results  of  accounting  research 
and  explaining  and  illustrating  related  research  methodology.  The  scope  of  acceptable  articles  should  embrace 
any  research  methodology  and  any  accounting-related  subject,  as  long  as  the  articles  meet  the  standards 
established  for  publication  in  the  journal  ...  No  special  sections  should  be  necessary.  The  primary,  but  not 
exclusive,  audience  should  be  as  it  is  now — academicians,  graduate  students,  and  others  interested  in 
accounting  research.” 

The  primary  criterion  for  publication  in  The  Accounting  Review  is  the  significance  of  the  contribution  an 
article  makes  to  the  literature.  Topical  areas  of  interest  to  the  journal  include  accounting  information  systems, 
auditing  and  assurance  services,  financial  accounting,  management  accounting,  taxation,  and  all  other  areas  of 
accounting,  broadly  defined.  The  journal  is  also  open  to  all  rigorous  research  methods. 

The  efficiency  and  effectiveness  of  the  editorial  review  process  is  critically  dependent  upon  the  actions 
of  both  authors  submitting  papers  and  the  reviewers.  Authors  accept  the  responsibility  of  preparing  research 
papers  at  a  level  suitable  for  evaluation  by  independent  reviewers.  Such  preparation,  therefore,  should  include 
subjecting  the  manuscript  to  critique  by  colleagues  and  others  and  revising  it  accordingly  prior  to  submission. 
The  review  process  is  not  to  be  used  as  a  means  of  obtaining  feedback  at  early  stages  of  developing  the 
research. 

Reviewers  and  editors  are  responsible  for  providing  constructive  and  prompt  evaluations  of  submitted 
research  papers  based  on  the  significance  of  their  contribution  and  on  the  rigor  of  analysis  and  presentation. 

MANUSCRIPT  PREPARATION  AND  STYLE 

The  Accounting  Review’s  manuscript  preparation  guidelines  follow  The  Chicago  Manual  of  Style  (15th  ed.; 
University  of  Chicago  Press).  Another  helpful  guide  to  usage  and  style  is  The  Elements  of  Style,  by  William 
Strunk,  Jr.,  and  E.  B.  White  (Macmillan).  Spelling  follows  Webster’s  Collegiate  Dictionary. 

FORMAT 

1.  All  manuscripts  should  be  formatted  in  12-point  font  on  8  1/2  x  11"  paper  and  should  be  double-spaced, 
except  for  indented  quotations. 

2.  Manuscripts  should  be  as  concise  as  the  subject  and  research  method  permit,  generally  not  to  exceed  7,000 
words. 

3.  Margins  should  be  at  least  one  inch  from  top,  bottom,  and  sides. 

4.  To  promote  anonymous  review,  authors  should  not  identify  themselves  directly  or  indirectly  in  their 
papers  or  in  experimental  test  instruments  included  with  the  submission.  Single  authors  should  not  use  the 
editorial  “we”. 

5.  A  cover  page  should  show  the  title  of  the  paper,  all  authors’  names,  titles  and  affiliations,  email  addresses, 
and  any  acknowledgments. 

6.  The  American  Accounting  Association  encourages  use  of  gender-neutral  language  in  its  publications. 

Pagination:  All  pages,  including  tables,  appendices  and  references,  should  be  serially  numbered.  Major 
sections  should  be  numbered  in  Roman  numerals.  Subsections  should  not  be  numbered. 

Numbers:  Spell  out  numbers  from  one  to  ten,  except  when  used  in  tables  and  lists,  and  when  used  with 
mathematical,  statistical,  scientific,  or  technical  units  and  quantities,  such  as  distances,  weights  and  measures. 
For  example:  three  days;  3  kilometers;  30  years.  All  other  numbers  are  expressed  numerically. 

Percentages  and  Decimal  Fractions:  In  nontechnical  copy  use  the  word  percent  in  the  text. 

Hyphens:  Use  a  hyphen  to  join  unit  modifiers  or  to  clarify  usage.  For  example:  a  cross-sectional  equation; 
re-form.  See  Webster’s  for  correct  usage. 

Keywords:  The  abstract  must  be  followed  by  at  least  three  keywords  to  assist  in  indexing  the  paper  and 
identifying  qualified  reviewers. 
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ABSTRACT/INTRODUCTION 

An  Abstract  of  about  100  words  (150  maximum)  should  be  presented  on  a  separate  page  immediately 
proceeding  the  text.  The  Abstract  should  concisely  inform  the  reader  of  the  manuscript’s  topic,  its  methods, 
and  its  findings.  The  Keywords  statement  should  appear  immediately  below  the  Abstract.  The  text  of  the 
paper  should  start  with  a  section  labeled  “1.  Introduction,”  which  provides  more  details  about  the  paper’s 
purpose,  motivation,  methodology,  and  findings.  Both  the  Abstract  and  the  Introduction  should  be  relatively 
nontechnical,  yet  clear  enough  for  an  informed  reader  to  understand  the  manuscript’s  contribution.  The 
manuscript’s  title,  but  neither  the  author’s  name  nor  other  identification  designations,  should  appear  on  the 
Abstract  page. 

TABLES  AND  FIGURES 

The  author  should  note  the  following  general  requirements: 

1 .  Each  table  and  figure  (graphic)  should  appear  on  a  separate  page  and  should  be  placed  at  the  end  of  the 
text.  Each  should  bear  an  Arabic  number  and  a  complete  title  indicating  the  exact  contents  of  the  table  or 
figure.  Tables  and  figures  should  define  each  variable.  The  titles  and  definitions  should  be  sufficiently 
detailed  to  enable  the  reader  to  interpret  the  tables  and  figures  without  reference  to  the  text. 

2.  A  reference  to  each  graphic  should  be  made  in  the  text. 

3.  The  author  should  indicate  where  each  graphic  should  be  inserted  in  the  text. 

4.  Graphics  should  be  reasonably  interpreted  without  reference  to  the  text. 

5.  Source  lines  and  notes  should  be  included  as  necessary. 

Equations:  Equations  should  be  numbered  in  parentheses  flush  with  the  right-hand  margin. 
DOCUMENTATION 

Citations:  Within-text  citations  are  made  using  an  author-year  format.  Cited  works  must  correspond  to  the 
list  of  works  listed  in  the  “References”  section.  Authors  should  make  an  effort  to  include  the  relevant  page 
numbers  in  the  within-text  citations. 

1.  In  the  text,  works  are  cited  as  follows:  author’s  last  name  and  year,  without  comma,  in  parentheses.  For 
example:  one  author,  (Berry  2003);  two  authors,  (Fehr  and  Schmidt  2003);  three  or  more  authors, 
(Dechow  et  al.  1998);  more  than  one  work  cited,  (Cole  and  Yakushiji  1984;  Dechow  et  al.  1995;  Levitt 
1998);  with  two  works  by  the  same  author(s),  (Nelson  2003,  2005). 

2.  Unless  confusion  would  result,  do  not  use  “p.”  or  “pp.”  before  page  numbers.  For  example,  (Dechow  and 
Dichev  2002,  41-42). 

3.  When  the  reference  list  contains  two  or  more  works  by  the  same  author  (as  the  only  author  or  first  of  two 
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NOVEMBER  2010  PLACEMENT  ADS 


The  deadline  for  free  position  ads  to  be  included  in  this  section  is  two  months  prior 
to  the  date  of  publication.  The  Accounting  Review  is  published  in  January,  March, 
May,  July,  September,  and  November.  Position  ads  in  the  “Placement  Ads”  are  free 
with  the  purchase  of  a  job  posting  in  the  AAA  Career  Center.  For  more  information 
on  how  to  purchase  a  job  posting  in  the  Career  Center  and  receive  your  free  position 
ad  in  The  Accounting  Review,  go  to  the  AAA  website  at  http;//aaahq.org  and  click  on 
“Career  Center,”  or  call  our  office  at  941-921-7747. 

601  SIENA  COLLEGE,  Accounting  Department  invites  applications  for  a  tenure-track  position  begin¬ 
ning  September  2011.  Applications  will  be  accepted  until  position  is  filled.  Rank  and  salary  depen¬ 
dent  upon  experience  and  credentials.  A  Ph.D./D. B. A.  with  concentration  in  accounting  is  expected, 
relevant  business  experience  desirable.  An  experienced  tax  professional  with  an  L.L.M.  is  acceptable 
for  someone  desiring  a  position  teaching  tax.  Research  and  professional  development  activities 
appropriate  to  academically  qualified  status  for  AACSB  accreditation  and  the  School  of  Business 
mission  are  necessary.  If  sufficient  Academically  Qualified  individuals  are  not  located,  then  indi¬ 
viduals  who  are  presently  Professionally  Qualified  and  willing  to  pursue  a  Ph.D.  or  D.B.A.  may  be 
considered  for  a  position  as  a  Visiting  Professor.  Individuals  hired  under  such  conditions  would  have 
their  status  evaluated  annually  and  renewal  would  be  dependent  upon  continued  satisfactory 
progress  toward  a  Ph.D.  or  D.B.A.  Siena  College  is  an  independent  liberal  arts  college  in  the 
Catholic  and  Franciscan  tradition,  located  in  New  York’s  Capital  District.  The  college  enrolls  2,800 
full-time  students,  1,200  in  the  School  of  Business.  Please  forward  applications  to;  Accounting 
Search  Committee,  do  School  of  Business,  Siena  College  515  Loudon  Road,  Loudonville,  NY 
12211-1462.  Siena  College  is  an  Equal  Opportunity  Employer  and  encourages  applications  from  all 
qualified  candidates. 


602  BOISE  STATE  UNIVERSITY,  Department  of  Accountancy  invites  applications  for  a  tenure- track 
managerial  accounting  faculty  position  at  either  the  Assistant  or  Associate  rank  starting  Fall  20 1 1 .  A 
Ph.D.  in  business  with  an  accounting  emphasis  is  required.  Professional  certification  is  preferred. 
Expectations  include  a  commitment  to  high-quality  teaching,  scholarship  leading  to  conference 
presentations  and  refereed  publications,  and  university  and  community  service.  Both  the  College  and 
the  Department’s  undergraduate  and  graduate  programs  are  AACSB-accredited.  BSU  is  located  m 
downtown  Boise,  Idaho,  a  thriving  community  offering  city  amenities  and  convenient  access  to  an 
array  of  outdoor  recreational  opportunities.  Salary  is  competitive  within  the  Western  region.  Review 
of  applications  begins  immediately  and  continues  until  the  position  is  filled.  For  more  infonnation 
regarding  this  position  and  application  procedures,  please  visit:  http://hrs.boisestate.edu/joblistmgs/ 
faculty/;  or  contact:  Dr.  Paul  Bahnson  at  pbahnson@boisestate.edu.  Boise  State  University  is  an 
EEO/AA  institution,  veteran’s  preference. 
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603  DRAKE  UNIVERSITY,  College  of  Business  and  Public  Administration,  School  of  Accounting, 
invites  applications  for  an  Assistant/Associate  Professor  of  Accounting  position,  starting  in  Fall 
2011,  teaching  primarily  cost/managerial  accounting  at  the  undergraduate  and  graduate  level.  The 
normal  teaching  requirements  are  nine  credit  hours  per  semester.  Demonstrated  effectiveness  in 
teaching  and  continuing  professional  development  is  expected.  A  doctorate  in  Accounting  at  time  of 
appointment  is  preferred,  but  ABD  candidates  close  to  completion  will  be  considered.  Candidates 
must  also  be  able  to  provide  evidence  of  demonstrated  teaching  effectiveness  at  the  university-level 
and  continuing  professional  development.  Candidates  for  the  Associate  level  must  show  a  record  of 
scholarly  publications  and  an  active  research  stream.  Salary  is  competitive  with  peer  AACSB  insti¬ 
tutions.  Drake  University  is  a  private,  comprehensive  university  dedicated  to  quality  teaching.  It  is 
located  in  Des  Moines,  Iowa,  an  insurance  and  financial  center  with  a  metropolitan  population  over 
500,000.  Enrollment  is  selective  and  limited  to  approximately  5,500  students  in  the  schools  and 
colleges  of  the  University.  Both  the  College  of  Business  and  Public  Administration  and  School  of 
Accounting  hold  AACSB  accreditation.  Send  application  letter  with  current  vita  listing  three  (3) 
references  to:  Dr.  Stephen  Gara,  Search  Committee  Chair,  School  of  Accounting,  Drake  University, 
2507  University  Avenue,  Des  Moines,  lA  50311;  Email:  Stephen.Gara@drake.edu;  URL:  http:// 
www.drake.edu/hr/.  Drake  University  is  an  EEO. 


604  UNIVERSITY  OF  DAYTON,  Department  of  Accounting  invites  applications  for  a  tenure-track 
position  at  the  Assistant  Professor  level  beginning  August  16,  2011.  The  ability  to  teach  introductory 
and  advanced  courses  in  U.S.  taxation  at  the  undergraduate  and  graduate  levels,  as  well  as  other 
related  accounting  or  law  courses,  is  required,  as  is  previous  successful  professional  tax  experience, 
CPA  certification,  and  tax  teaching  experience.  Candidates  must  meet,  and  have  the  ability  to  main¬ 
tain  AACSB  business  and  accounting  accreditation  requirements  for  “Academically  Qualified”  (AQ) 
status.  This  includes  a  Ph.D.  from  an  AACSB-accredited  institution  or  a  combination  of  a  master’s 
degree  in  business  with  an  emphasis  in  accounting,  a  J.D.,  and  a  L.L.M.  degree  with  specialization 
in  taxation.  Candidates  should  have  the  potential  for  appropriate  teaching,  research,  and  service.  We 
prefer  candidates  who  have  experience  teaching  students  from  diverse  racial  and  ethnic  backgrounds 
and  who  have  demonstrated  interest  in  mentoring  traditionally  underrepresented  students  in  their 
area  of  expertise.  We  prefer  a  candidate  who  will  embrace  values  consistent  with  Marianist  and 
Catholic  traditions.  For  more  information  on  UD,  see:  http://www.udayton.edu/business/accounting/ 
index. php.  The  complete  position  description  and  the  process  for  applicants  to  apply  are  available  at: 
https://jobs.udayton.edu/.  Closing  Date:  September  30  or  until  filled.  UD  is  Ohio’s  largest  indepen¬ 
dent  university  and  one  of  the  nation’s  largest  Catholic  universities.  UD  is  firmly  committed  to  the 
principle  of  diversity  and  is  an  Affirmative  Action/Equal  Opportunity  Employer.  Persons  of  color, 
women,  individuals  with  disabilities,  and  veterans  are  encouraged  to  apply. 


605  THE  UNIVERSITY  OF  NORTH  CAROLINA  AT  PEMBROKE,  Department  of  Accounting  and 
Information  Technology,  School  of  Business  invites  applications  for  one  tenure-track  position  in 
accounting  to  begin  January  2011  or  later.  Rank  and  salary  depend  upon  qualifications.  Candidates 
with  a  Ph.D.  in  Accounting  are  preferred.  ABDs  will  be  considered  if  degree  is  to  be  completed  by 
the  hiring  date.  Applicants  from  all  areas  of  accounting  will  be  considered.  Professional  certifica¬ 
tions  and  experience  are  preferred.  Applicants  must  complete  the  online  application  form  through 
UNCP’s  Online  Employment  System  found  at  http://www.uncp.edu/hr.  UNC  Pembroke  is  an  Equal 
Employment/Affirmative  Action  Employer. 
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606  WILFRID  LAURIER  UNIVERSITY,  School  of  Business  and  Economics  invites  applications  for 
one  position  as  Assistant  and  one  position  as  Associate  or  Full  Professor  in  the  Accounting  area, 
commencing  July  1,  2011.  The  positions  are  subject  to  budgetary  approval.  Preference  will  be  given 
to  applicants  with  research  and  teaching  interests  in  Auditing,  Financial  Accounting,  and  Manage¬ 
ment  Accounting.  Having  a  professional  designation  will  be  an  asset.  Applicants  without  a  designa¬ 
tion  will  be  encouraged  to  obtain  one  after  they  are  hired.  Applicants  for  these  positions  must  meet 
the  minimum  qualification  of  a  Ph.D.  in  Accounting  or  ABD.  Applicants  must  also  exhibit  an 
interest  in  and  the  capacity  to  conduct  quality  scholarly  research  and  exhibit  the  potential  for  a  high 
level  of  teaching  competence  at  both  the  graduate  and  undergraduate  levels.  For  the  senior  position, 
preference  will  be  given  to  applicants  with  past  Ph.D.  student  supervision  experience  and  an  estab¬ 
lished  record  of  research  publications  in  accounting.  All  qualified  candidates  are  encouraged  to 
apply;  however,  Canadians  and  Permanent  Residents  will  be  given  priority.  Applicants  should  send 
a  CV,  a  summary  of  research  objectives  and  teaching  experience,  a  sample  research  article,  and  the 
names  of  three  references.  Electronic  applications  are  encouraged.  Direct  your  application  and 
supporting  documents  by  November  30,  2010  to:  Dr.  Peter  Carayannopoulos,  Associate  Dean  of 
Business,  Faculty  Development  &  Research,  School  of  Business  and  Economics,  Wilfrid  Laurier 
University,  75  University  Avenue,  West  Waterloo,  Ontario  N2L  3C5,  CANADA;  Email: 
bizrecruit@wlu.ca.  Wilfrid  Laurier  University  is  committed  to  equity  and  values  diversity.  We  wel¬ 
come  applications  from  qualified  individuals  of  all  genders  and  sexual  orientations,  persons  with 
disabilities.  Aboriginal  persons,  and  persons  of  a  visible  minority.  Members  of  the  designated  groups 
must  self-identify  to  be  considered  for  employment  equity. 


607  OHIO  NORTHERN  UNIVERSITY  seeks  applicants.  Responsibilities:  Administer  all  aspects  of  the 
Accounting  graduate  program  including  admissions.  Primary  teaching  responsibilities,  under  a  two- 
two  teaching  load,  will  include  Auditing  and  Accounting  Information  Systems  and  other  courses 
consistent  with  the  candidate’s  professional  interests  and  background.  A  well-established  record  of 
scholarship  is  expected.  Qualifications:  Doctorate  in  Accounting  and  demonstrated  commitment  to 
excellence  in  teaching  (both  at  the  undergraduate  and  graduate  levels)  and  academic  advising. 
Evidence  of  research  publications  in  peer-reviewed  journals  that  would  qualify  the  applicant  for  rank 
of  Associate  Professor.  Teaching  experience  involving  distance  education  a  plus.  Compensation: 
Salary  dependent  upon  experience  and  qualifications.  Fringe  benefits  include  TIAA-CREF  retire¬ 
ment  program,  medical  insurance,  disability  benefits,  life  insurance,  educational  benefits,  and  tuition 
remission  for  dependents  attending  Ohio  Northern  University  for  an  initial  degree.  Terms  of  Em¬ 
ployment:  Tenure-track  nine-month  appointment.  Employment  Date:  September  6,  2011.  Please 
contact  Dr.  James  W.  Fenton  via  email  at:  j-fenton.l@onu.edu;  or  phone:  (419)  772-2070. 


608  UNIVERSITY  OF  SAN  FRANCISCO,  School  of  Business  and  Professional  Studies  seeks  appli¬ 
cants  for  one  tenure-track  faculty  position  in  the  field  of  accounting  at  the  rank  of  Assistant  or 
Associate  Professor  starting  Fall  2011.  The  School  seeks  applicants  conducting  research  in  all 
aspects  of  accounting  but  gives  preference  to  either  of  the  following  areas:  (1)  Auditing  focusing  on 
professional  ethics,  behavioral  aspects  of  auditing,  forensic  accounting,  fraud,  or  other  aspects  of 
auditing  and  assurance  services;  (2)  Financial  Accounting  focusing  on  international  harmonization 
and  IFRS,  consolidations,  or  other  aspects  of  financial  accounting  and  reporting.  A  successful  can¬ 
didate  must  meet  the  following  minimum  qualifications:  (1)  an  earned  doctorate  in  accounting  or  a 
related  field  by  the  time  of  appointment;  (2)  an  expectation  of  excellence  in  teaching,  and  (3)  an 
established  or  emerging  research  program.  The  successful  candidate  will  teach  both  required  and 
elective  undergraduate  and  M.B.A.  courses.  You  will  undertake  scholarly  research  in  a  collaborative 
and  supportive  environment  that  welcomes  inter-disciplinary  and  practice-oriented  scholarship  as 
well  as  discipline-based  and  theoretical  work.  The  School  of  Business  and  Professional  Studies 
offers  bachelor’s  and  master’s  degree  programs.  The  School  is  housed  in  Malloy  Hall,  a  new, 
purpose-built  facility  with  fully  modem  classrooms,  offices,  and  informal  workspaces.  Further  in¬ 
formation  regarding  USF  and  the  School  of  Business  and  Professional  Studies  is  available  at: 
http://web.usfca.edu/bps. 
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609  UNIVERSITY  OF  SOUTH  FLORIDA,  ST.  PETERSBURG  seeks  applicants  for  an  Assistant,  As¬ 
sociate,  or  Full  lYofessor  in  Accounting.  Responsibilities  will  include  teaching  undergraduate 
courses  and/or  graduate  courses  in  accounting;  participating  in  college  and  university  committees; 
leading  through  professional  and  community  service.  Research  and  publication  will  be  expected 
consistent  with  rank.  SALARY  $90,000  and  higher,  depending  on  faculty  credentials  and  budgetary 
approvals.  Minimum  Qualifications:  Doctoral  degree  in  accounting  from  an  AACSB-accredited 
institution.  Preferred  Qualifications:  Candidates  possessing  the  following  additional  qualifications 
will  be  given  preference:  (1)  primary  teaching  and  research  interests  in  the  intermediate  financial 
accounting  and  advanced  financial  accounting  topics.  A  secondary  interest  in  forensic  accounting 
and  fraud  examination  is  desirable;  (2)  online  and/or  graduate  teaching  experience;  (3)  published 
research  in  accounting  journals;  (4)  recent  experience  (past  five  years)  teaching  upper-level  account¬ 
ing  at  an  AACSB-accredited  institution;  and  (5)  Research  or  teaching  in  corporate  social  responsi¬ 
bility  (CSR)  topics.  Please  contact  John  F.  Jewell  at  email:  jewell@mail.usf.edu;  or  phone:  (727) 
873-4110.  Applicants  should  apply  for  this  position  by  visiting:  https://employment.usf.edu/ 
applicants/jsp/shared/search/SearchResults_css.jsp 


610  UNIVERSITY  OF  NOTRE  DAME,  Department  of  Accountancy,  invites  applications  for  position 
openings  at  all  levels.  Assistant,  Associate,  or  Full  Professor.  All  research  areas  will  be  considered. 
The  appointment  is  expected  to  begin  July  1,  2011.  Applicants  for  Assistant  Professor  positions 
should  expect  to  complete  doctoral  degree  requirements  before  the  start  of  the  contract  period.  All 
candidates  should  have  a  strong  commitment  to  scholarly  research  and  excellence  in  teaching. 
Salary  is  very  competitive  and  research  support  is  excellent.  Please  submit  curriculum  vita,  three 
letters  of  recommendation,  teaching  evaluations,  and  an  example  of  scholarly  work  to: 
acctdept@nd.edu.  Also,  address  inquiries  to:  acctdept@nd.edu.  The  deadline  for  applications  is 
January  15,  2011.  For  more  information  about  the  University  of  Notre  Dame,  the  Mendoza  College 
of  Business,  and  the  Department  of  Accountancy  please  visit  the  department’s  website  at:  http:// 
business.nd.edu/accountancy/.  The  University  of  Notre  Dame  is  an  Affirmative  Action  Employer 
with  a  strong  commitment  to  fostering  a  culturally  diverse  atmosphere  for  faculty,  staff,  and  stu¬ 
dents.  Women,  minorities,  and  those  attracted  to  a  university  with  a  Catholic  identity  are  especially 
encouraged  to  apply. 


611  GONZAGA  UNIVERSITY  seeks  applicants  for  an  Assistant  or  Associate  Professor,  with  strong 
emphasis  in  taxation  or  audit/assurance.  Required:  Ph.D.  in  accounting  or  closely  related  discipline, 
at  or  near  completion,  or  J.D.;  demonstrated  evidence  of  teaching  effectiveness  and  scholarly  re¬ 
search.  Desired:  Professional  certification  and  industry/public  accounting  experience  Salary:  Com¬ 
mensurate  with  experience,  AACSB-competitive.  Gonzaga  University  is  a  private  liberal  arts  insti¬ 
tution  of  approximately  6,000  students,  recognized  by  U.S.  News  &  World  Report  as  among  the  top 
regional  universities  in  the  West.  There  are  approximately  1,100  undergraduate  and  250  graduate 
students  enrolled  in  the  business  school,  (about  25—30  percent  are  in  the  accounting  program). 
Gonzaga  accounting  students  have  consistently  performed  well  on  the  CPA  exam,  achieving  a 
first-time  pass  rate  well  in  excess  of  the  national  average,  and  are  heavily  recruited  by  CPA  firms  and 
industry.  Gonzaga’s  educational  mission  is  set  in  the  Jesuit  tradition  with  emphasis  on  the  moral  and 
ethical  implications  of  learning,  living,  and  working  within  today’s  global  societies.  The  school  is 
AACSB-accredited  and  offers  B.B.A.,  M.B.A.,  and  M.Acc.  degrees.  In  addition,  the  accounting 
program  recently  achieved  separate  accreditation.  The  School  is  housed  in  a  technologically  up-to- 
date  60,000  sq  ft  building  located  along  the  Spokane  River,  in  the  midst  of  Gonzaga’s  beautifully 
landscaped  115-acre  campus.  Gonzaga  University  is  a  Jesuit,  Catholic,  humanistic  institution,  and  is 
interested  in  candidates  who  will  contribute  to  its  distinctive  mission.  Gonzaga  University  is  an 
AA/EEO  employer  committed  to  diversity.  Send  applications  to:  Dr.  Gary  Weber  at: 
weber  @  gonzaga.edu . 
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612  MARQUETTE  UNIVERSITY  invites  applications  for  one  tenure-track  position  in  Accounting  In¬ 
formation  Systems  beginning  Fall  2011.  Applicants  should  hold  a  doctoral  degree  or  be  in  the  final 
stages  of  dissertation.  The  position  will  be  filled  at  the  rank  of  Assistant  Professor.  The  primary 
teaching  responsibility  will  be  in  Accounting  Information  Systems  at  the  undergraduate  and  graduate 
levels  with  a  secondary  area  in  Managerial,  Fraud,  Audit,  Financial,  or  International.  The  normal 
teaching  load  is  12  semester  hours  (6  hours  each  semester)  during  the  first  year  and  15  semester 
hours  thereafter.  Salary  and  research  support  are  competitive.  The  department  has  12  full-time 
faculty.  Marquette  offers  AACSB-accredited  undergraduate  and  graduate  degrees  in  Accounting  and 
Business.  All  applicants  must  apply  online  at:  http://careers.marquette.edu/applicants/ 
Central ?quickFind= 52728.  Inquiries  may  be  made  to:  Michael  D.  Akers,  Professor  and  Chair, 
Department  of  Accounting,  PO  Box  1881,  College  of  Business  Administration,  Marquette  Univer¬ 
sity,  Milwaukee,  WI  53201-1881;  Email:  Michael.Akers@Marquette.edu.  Marquette  University  is 
an  Affirmative  Action/Equal  Opportunity  Employer. 


613  BROCK  UNIVERSITY  invites  applications  for  one  (1)  probationary  tenure-track  position  in  Ac¬ 
counting  starting  July  1,  2011.  The  position  rank  is  open  depending  upon  qualifications.  The  avail¬ 
ability  of  this  position  is  subject  to  final  budgetary  approval.  Job  Qualifications:  A  Ph.D.  in  Account¬ 
ing  is  required  (completed  or  near  completion).  Previous  academic  experience  and  professional 
designation  are  desirable,  but  not  required.  We  seek  candidates  with  the  potential  for  excellence  in 
teaching  and  research.  Responsibilities:  This  is  a  probationary  tenure- track  position  requiring  quality 
research,  teaching,  and  service.  We  welcome  applications  from  candidates  with  interest  in  any  area 
of  accounting  research  and  teaching.  The  Department  of  Accounting  has  25  full-time  faculty  mem¬ 
bers  and  is  in  the  Faculty  of  Business,  which  has  approximately  90  full-time  members.  We  offer  a 
full  range  of  graduate  and  undergraduate  programs,  both  research  and  professionally  oriented:  Mas¬ 
ter  of  Accountancy,  Master  of  Science  with  an  Accounting  stream,  M.B.A.  with  an  optional  Ac¬ 
counting  stream.  Bachelor  of  Accounting,  and  Bachelor  of  Business  Administration  with  an  optional 
Accounting  stream.  Co-op  options  are  available  in  both  undergraduate  and  graduate  programs.  The 
Faculty  of  Business  is  AACSB-accredited.  Our  accounting  programs  are  accredited  by  the  Institute 
of  Chartered  Accountants  of  Ontario,  the  Society  of  Management  Accountants  of  Ontario,  and  the 
Certified  General  Accountants  of  Ontario.  More  information  on  Brock  University  can  be  found  on 
the  University’s  website:  http://www.brocku.ca. 


614  BINGHAMTON  UNIVERSITY,  SUNY,  AACSB-accredited  School  of  Management  invites  appli¬ 
cations  for  an  Assistant  Professor  position  in  Accounting  for  Fall  2011,  pending  budget  approval. 
Candidates  must  have  (or  expect)  a  Ph.D.  degree  by  Fall  2011.  High-quality  research  as  evidenced 
by  publication  in  the  top-tier  accounting  journals  is  expected.  Research  is  supported  with  a  two- 
course/semester  teaching  assignment.  The  School  oF  Management  also  supports  research  with  sub¬ 
scriptions  to  WRDS  and  other  standard  databases.  The  Uiuversity  offers  bachelor’s,  master’s,  and 
Ph.D.  programs  in  accounting.  More  details  are  available  at:  http://som.binghamton.edu/faculty/ 
index.htm.  Located  three  hours  from  NYC  and  one  hour  from  Syracuse  and  Ithaca,  Binghamton  is 
a  low-cost-of-living  area  with  200,000  people.  Interested  candidates  should  post  a  curriculum  vita 
and  other  supporting  documents  to  the  appropriate  position  listing  at:  http:// 
binghamton.interviewexchange.com/.  Application  materials  to  be  submitted  electronically.  Candi¬ 
dates  must  have  (or  expect)  a  Ph.D.  degree  by  Fall  2011.  Binghamton  University  is  an  Equal 
Opportunity/Affirmative  Action  Employer. 
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615  PURDUE  UNIVERSITY,  Krannert  School  of  Management  invites  applicants  for  one  or  more 
tenure-track  appointments  in  accounting  beginning  Fall  2011.  Rank  is  at  either  the  Assistant  or 
Associate  Professor  level.  Candidates  for  an  Assistant  Professor  position  should  have  a  doctorate  in 
accounting,  management,  or  closely  related  field,  and  exhibit  the  capacity  for  research  publishable  in 
top-tier  accounting  journals  and  excellence  in  teaching.  ABDs  will  be  considered.  Candidates  for  an 
Associate  Professor  position  should  have  an  earned  doctorate  and  an  established  record  of  excel¬ 
lence  in  research,  teaching,  and  service  on  doctoral  dissertation  committees.  A  tenured  Associate 
Professor  appointment  is  possible  for  a  candidate  with  the  appropriate  credentials.  The  successful 
candidate  will  teach  financial  or  managerial  accounting  to  undergraduate  and/or  graduate  students, 
conduct  research  in  their  field  of  expertise,  and  participate  in  other  school  activities.  Opportunities  to 
teach  in  the  executive  program  may  also  be  available  to  candidates  with  prior  teaching  experience. 
We  offer  a  competitive  compensation  package.  Applications  will  be  reviewed  beginning  October  15, 
2010.  Applications  should  include  a  CV,  representative  research  output,  and  evidence  of  teaching 
ability.  Applicants  should  also  be  prepared  to  provide  at  least  three  letters  of  reference.  Applications 
should  be  sent  to:  Professor  Mark  Bagnoli,  Krannert  Graduate  School  of  Management,  Purdue 
University,  West  Lafayette,  IN  47907.  Purdue  University  is  an  Equal  Opportunity/Equal  Access/ 
Affirmative  Action  Employer  fully  committed  to  achieving  a  diverse  workforce. 


616  MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY,  Sloan  School  of  Management,  Accounting 
Group  is  seeking  new  faculty  at  all  levels  (Assistant,  Associate,  and  Full  Professor)  for  the  academic 
year  beginning  July  1,  2011,  or  as  soon  as  possible  thereafter.  Applicants  will  be  evaluated  on  their 
potential  for  and/or  record  of  excellence  in  research  and  teaching;  faculty  responsibilities  include 
teaching  financial  accounting  at  graduate  and  undergraduate  level.  Please  include  evidence  of  re¬ 
search  such  as  a  publication,  working  paper,  or  dissertation  proposal  with  other  application  materi¬ 
als.  The  deadline  for  applications  is  November  30,  2010.  However,  applications  will  be  evaluated  as 
soon  as  they  are  complete,  if  submitted  at  an  earlier  date.  Applications  must  include  a  cover  letter, 
curriculum  vitae,  and  letters  of  recommendation.  Applications  should  be  sent  to:  Professor  Michelle 
Hanlon  Chair,  Accounting  Group  Search  Committee,  MIT  Sloan  School  of  Management,  77  Mas¬ 
sachusetts  Avenue,  E62-668,  Cambridge,  MA  02139.  For  overnight  deliveries,  please  use  the  fol¬ 
lowing  address:  MIT  Sloan  School  of  Management,  100  Main  Street,  E62-668,  Cambridge,  MA 
02142.  MIT  is  an  Equal  Opportunity  Employer  committed  to  building  a  culturally  diverse  intellec¬ 
tual  community  and  strongly  encourages  applications  from  women  and  minorities. 


617  INDIANA  UNIVERSITY,  Kelley  School  of  Business,  Department  of  Accounting  invites  applica¬ 
tions  for  tenure-track/tenured  faculty  positions  effective  Fall  2011.  Preference  will  be  given  to  those 
with  teaching  and  research  interests  in  Auditing.  Compensation  levels  are  competitive.  Candidates 
must  possess  a  doctoral  degree  by  Fall  2011  and  have  a  strong  commitment  to  excellence  in 
scholarly  research  and  teaching.  Interested  candidates  should  send  a  current  vita,  a  research  paper 
suitable  for  presentation  at  a  workshop,  three  letters  of  reference  (six  letters  for  tenured  offers),  and 
evidence  of  teaching  ability  to  the  contact  person  indicated  below.  Applications  received  before 
November  30,  2010  will  be  given  full  consideration.  Contact  address:  Laureen  Maines,  Chair, 
Department  of  Accounting,  Kelley  School  of  Business,  1309  E.  10th  Street,  Suite  540,  Bloomington, 
IN  47405-1701;  Email:  lmaines@indiana.edu.  Indiana  University  is  an  Affirmative  Action/Equal 
Opportunity  Employer  committed  to  excellence  through  diversity.  The  University  actively  encour¬ 
ages  applications  of  women,  minorities,  and  persons  with  disabilities. 
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618  UNIVERSITY  OF  HAWAIT  AT  WEST  O’AHU,  Professional  Studies  seeks  candidates  for 
Assistant/Associate  Professor  in  Accounting,  position  number  85439,  tenure-track,  full-time,  nine- 
month,  to  begin  August  1,  2011.  Duties:  Assistant  Professor.  This  position  is  responsible  for  teaching 
accounting  courses  in  the  Business  Administration  program.  Responsibilities  include,  but  are  not 
limited  to.  teaching  courses  in  Accounting  (primary  need  is  Auditing);  developing  new  courses; 
providing  academic  advising,  and  supervising  practicum  students;  and  teaching  distance  education 
courses.  The  successful  candidate  is  also  expected  to  conduct  research  or  applied  scholarship  in  the 
discipline  and  to  participate  in  faculty  governance.  Associate  Professor:  Same  as  above.  Minimum 
qualifications  for  Assistant  Professor:  Earned  doctorate  degree  from  an  accredited  college  or  univer¬ 
sity  in  Accounting,  or  a  related  field  with  an  emphasis  in  Accounting.  (ABD  candidates  are  eligible 
to  apply  but  must  complete  all  degree  requirements  prior  to  appointment.)  Evidence  of  high-quality 
teaching  and  potential  for  conducting  relevant  scholarly  and  applied  research.  Teach  distance  edu¬ 
cation  courses.  Minimum  qualifications  for  Associate  Professor:  All  of  the  above  plus  five  years  of 
teaching  experience.  Evidence  of  publications  in  peer-reviewed  journals  in  the  area  of  Accounting. 
Desirable  qualifications  for  Assistant  Professor:  Ability  to  teach  Auditing,  Intermediate  Financial 
Accounting,  Advanced  Accounting,  and/or  Governmental  Accounting.  Ability  to  interact  effectively 
in  a  multicultural  setting,  experience  teaching  diverse  populations.  Demonstrated  ability  to  teach 
distance  education  courses.  CPA  or  other  certification.  Desirable  qualifications  for  Associate  Profes¬ 
sor:  Ability  to  teach  Auditing,  Intermediate  Financial  Accounting,  Advanced  Accounting,  and/or 
Governmental  Accounting.  Ability  to  interact  effectively  in  a  multicultural  setting,  experience  teach¬ 
ing  diverse  populations.  Demonstrated  ability  to  teach  distance  education  courses.  CPA  or  other 
certification.  Pay  range:  The  salary  will  be  commensurate  with  qualifications  and  experience.  Ap¬ 
plicants  may  apply  to:  http://www.uhwo.hawaii.edu/vacancies 


619  UNIVERSITY  OF  INDIANAPOLIS,  School  of  Business  is  seeking  applications  for  an  Assistant 
Professor  of  Accounting.  This  is  a  tenure-track  position  with  a  start  date  of  the  Fall  2011.  The 
successful  candidate  will  participate  in  teaching,  service,  and  scholarship  as  it  relates  to  the  field  of 
accounting.  The  ideal  candidate  will  have  a  strong  background  in  teaching  undergraduate  account¬ 
ing,  as  well  as  be  professionally  qualified.  Ph.D./D. B. A.  in  accounting  is  preferred  (ABD  candidates 
may  apply).  An  M.B.A./CPA  is  also  acceptable,  given  extensive  professional  and  academic  experi¬ 
ence.  Candidates  must  have  demonstrated  proficiency  in  teaching  and  knowledge  of  the  accounting 
field.  Applied  scholarship  and  active  participation  in  the  discipline  is  a  plus.  The  University  of 
Indianapolis,  School  of  Business  is  nationally  accredited  by  the  Association  of  Collegiate  Business 
Schools  and  Programs  (ACBSP).  Applications  may  be  submitted  at:  https://jobs.uindy.edu/. 


620  THE  UNIVERSITY  OF  TENNESSEE  AT  KNOXVILLE,  Department  of  Accounting  and  Informa¬ 
tion  Management  is  accepting  applications  for  a  full-time,  tenure-track  position  to  begin  August  1, 
2011.  Candidates  should  hold  a  doctoral  degree  with  an  emphasis  in  accounting  or  expect  to  com¬ 
plete  all  degree  requirements  by  his  or  her  starting  date.  The  ideal  candidate  will  have  a  research 
interest  in  corporate  governance  and  a  teaching  interest  in  auditing  or  financial  accounting.  Candi¬ 
dates  for  Assistant  Professor  should  demonstrate  excellent  teaching  potential  and  have  an  academic 
background  that  strongly  suggests  an  ability  to  establish  an  ongoing  program  of  substantive  re¬ 
search.  Candidates  for  advanced  rank  should  demonstrate  excellent  achievement  in  teaching  and 
have  a  substantive  research  record  that  reflects  a  significant  ongoing  research  program.  All  faculty 
searches  are  subject  to  the  availability  of  funding.  Please  send  cover  letter,  vita,  and  references, 
preferably  by  email,  to:  Kenneth  E.  Anderson,  Search  Committee  Chair,  Department  of  Accounting 
and  Information  Management,  CBA,  University  of  Tennessee  Knoxville,  Tennessee  37996-0560; 
Email:  kea@utk.edu.  The  University  of  Tennessee  is  an  EEO/AA/Title  Vl/Title  IX/Section  504/ 
ADA/ADEA  institution  in  the  provision  of  its  education  and  employment  programs  and  services.  All 
qualified  applicants  will  receive  equal  consideration  for  employment  without  regard  to  race,  color, 
national  origin,  religion,  sex,  pregnancy,  marital  status,  sexual  orientation,  gender  identity,  age, 
physical  or  mental  disability,  or  covered  veteran  status. 
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621  EASTERN  MICHIGAN  UNIVERSITY  seeks  applicants  for  one  tenure-track  position  at  the 
Assistant/Associate  level  in  accounting  beginning  Fall  2011,  subject  to  funding  approval.  All  areas 
of  specialization  will  be  considered.  Individuals  who  have  a  Ph.D.  in  hand  are  strongly  preferred, 
but,  at  minimum,  an  ABD  from  an  AACSB-accredited  university  or  its  equivalent  leading  to  a  Ph.D. 
is  required.  Professional  certifications  such  as  a  CPA/CMA/CIA  and/or  business  experience  are 
desirable.  Salary  is  competitive.  Faculty  members  are  expected  to  teach  specific  courses,  conduct 
research,  and  participate  in  university  functions.  EMU  is  a  public,  comprehensive  university  of 
23,000  students  located  in  southeastern  Michigan  and  is  an  Equal  Opportunity  employer.  The  Col¬ 
lege  of  Business  is  AACSB-accredited.  To  apply  for  Posting  #fall01e  go  to;  https:// 
www.emujobs.com/,  click  on  the  “View/Apply  for  Faculty  and  Administrative  Positions”  button 
located  in  the  middle  of  the  page.  If  you  have  questions,  contact  Professor  Linda  Burilovich,  Chair 
of  the  Personnel  Committee  at:  linda.burilovich@emich.edu.  Review  of  applications  will  continue 
until  the  position  is  filled. 


622  IDAHO  STATE  UNIVERSITY,  Department  of  Accounting  seeks  a  full-time  tenure-track  Assistant 
or  Associate  Professor  for  a  position  available  for  the  2011-2012  academic  year.  Salary  will  be 
commensurate  with  experience  and  qualifications.  Both  the  College  of  Business  and  the  Accounting 
Program  are  accredited  by  AACSB.  Our  accounting  faculty  support  excellent  undergraduate  and 
M.Acc.  programs.  Responsibilities:  Teaching  at  the  undergraduate  and  graduate  levels  with  preferred 
emphasis  areas  in  financial  accounting.  A  primary  interest  in  auditing  with  a  secondary  interest  in 
financial  accounting  will  also  be  considered.  Normal  course  load  includes  two  preps  and  three 
sections.  Peer-reviewed  scholarly  work  and  service  are  expected.  Minimum  Qualifications:  The 
successful  candidate  will  demonstrate  a  collegial  attitude,  excellent  communication  skills,  and  an 
eagerness  to  support  our  students  through  a  mentoring/  advising  process.  Contact  Information:  To 
apply,  send  a  letter  of  interest;  resume;  and  the  names,  addresses,  and  telephone  numbers  of  three 
professional  references  to:  Dr.  Bob  Picard,  Chair,  Department  of  Accounting,  College  of  Business, 
Idaho  State  University,  Campus  Box  8020,  Pocatello,  ID  83209-8020.  Idaho  State  University  is  an 
AA/EOE.  Minorities,  women,  and  veterans  are  encouraged  to  apply. 


623  FLORIDA  INTERNATIONAL  UNIVERSITY,  School  of  Accounting  invites  applications  for  in¬ 
structors,  clinical  or  tenure-track  positions  at  all  ranks  in  all  areas  of  accounting  and  for  instructors 
of  business  law  beginning  Fall  2009.  Doctorate  or  L.L.M.  (Taxation)  required  for  accounting  posi¬ 
tions;  J.D.  required  for  business  law  positions.  Preference  may  be  given  to  applicants  with  publica¬ 
tions  in  their  respective  fields.  Expectations  include  strong  commitment  to  excellence  in  graduate 
and  undergraduate  teaching,  research,  and  service.  Duties  may  include  teaching  in  weekend  and 
overseas  programs.  The  School  of  Accounting  is  separately  accredited  by  AACSB  and  offers  B.Acc., 
M.Acc.,  M.S.T.  and  Ph.D.  programs.  FIU  is  identified  in  the  Carnegie  classification  as  a  Research 
University/High  Research  Activity  institution  with  enrollments  in  excess  of  38,000  students.  FIU  is 
a  growing  public  university  that  admitted  its  first  class  in  1972.  It  has  a  diverse  student  body 
including  both  full-time  residential  students  as  well  as  commuters.  FIU’s  modem  picturesque  cam¬ 
pus  with  tropical  flora  is  located  in  the  suburbs  of  Miami.  To  apply,  send  a  vita  and  names  of  five 
references  to:  Ana  Estevill,  Florida  International  University,  School  of  Accounting,  11200  SW  8th 
St.,  RB  210,  University  Park,  Miami,  EL  33199;  Email:  estevill@fiu.edu.  FIU  is  an  Equal  Oppor¬ 
tunity  Employer.  Applications  from  minority  candidates  are  encouraged. 
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624  OKLAHOMA  STATE  UNIVERSITY,  Spears  School  of  Business,  School  of  Accounting  invites 
applications  beginning  Fall  2011  at  the  Assistant,  Associate,  or  Full  Professor  level.  All  areas  of 
research  and  teaching  emphasis  will  be  considered.  We  are  especially  interested  in  faculty  with  a 
proven  research  record  indicating  the  ability  to  publish  in  “A”  level  journals.  Applicants  must  have 
an  appropriate  earned  doctorate  and  those  at  the  senior  level  should  have  a  distinguished  ongoing 
record  of  scholarship.  All  applicants  must  have  a  commitment  to  instructional  excellence.  The 
position  is  located  on  the  OSU/Stillwater  campus,  but  could  be  called  upon  to  occasionally  teach  on 
the  OSU/Tulsa  campus.  Applicant  screening  will  begin  immediately  and  will  continue  until  the 
position  is  filled.  We  intend  to  hold  campus  interviews  for  the  position  during  the  Fall  2010  semes¬ 
ter.  All  positions  are  contingent  upon  available  funding.  Interviews  for  a  second  open  position 
(pending  funding  approval)  will  be  held  during  the  Spring  2011  semester.  A  letter  of  application, 
curriculum  vitae,  and  a  summary  of  teaching  evaluations  should  be  sent  to:  Don  Herrmann,  Head, 
School  of  Accounting,  Oklahoma  State  University,  401  Business  Building,  Stillwater,  OK  74078; 
Phone:  (405)  744-5123;  Email:  osu-soa@okstate.edu.  Oklahoma  State  University  is  an  Affirmative 
Action/Equal  Opportunity /E- Verify  employer  committed  to  diversity.  Please  contact  us  for  details. 


625  FAIRLEIGH  DICKENSON  UNIVERSITY,  Department  of  Accounting,  Taxation,  and  Law,  Silber- 
man  College  of  Business  is  accepting  applications  for  a  tenure-track  position  in  Accounting  at  the 
Assistant/Associate  Professor  rank.  Silberman  College  has  AACSB-accredited  undergraduate  and 
graduate  programs.  Candidates  should  have  a  Ph.D.  in  Accounting  from  an  AACSB-accredited 
institution  (or  ABD  nearing  completion  of  dissertation)  with  evidence  of  or  potential  for  high-quality 
teaching  and  research.  The  successful  candidate  will  be  expected  to  teach  undergraduate  and  gradu¬ 
ate  level  accounting  courses.  The  department  offers  an  undergraduate  in  accounting,  M.S.  Account¬ 
ing,  and  M.S.  Taxation,  with  over  300  majors  enrolled.  The  normal  teaching  load  is  9  credit  hours 
per  semester.  Salary  and  benefits  are  competitive  with  AACSB-accredited  institutions.  Employment 
is  contingent  upon  a  satisfactory  background  check.  Candidates  for  hire  will  be  required  to  sign  a 
waiver  authorizing  the  background  check  and  produce  a  Social  Security  Card. 


626  TEXAS  A&M  UNIVERSITY-CORPUS  CHRISTI,  College  of  Business,  Department  of  Accounting 
and  Business  Law  invites  applications  for  a  tenure-track  Assistant  Professor  position  beginning  as 
early  as  Fall  2010,  subject  to  final  budgetary  approval.  The  position  is  open  to  all  areas  of  account¬ 
ing  specialization.  Applicants  should  possess  a  doctoral  degree  in  accounting  from  an  AACSB- 
accredited  institution  and  demonstrate  a  strong  commitment  to  scholarly  research  and  teaching 
excellence.  Certification  and  professional  experience  are  preferred.  Salary  is  competitive.  The  Col¬ 
lege  of  Business  undergraduate  and  graduate  business  and  accounting  programs  are  accredited  by 
AACSB  International  and  the  College  will  be  moving  into  a  new  building  early  in  the  Spring  2011. 
Please  submit  applications  online  at:  http://islanderjobs@tamucc.edu.  For  more  information  on  the 
position,  the  University,  and  the  College  go  to:  http://www.cob.tamucc.edu.  TAMUCC  is  an  AA/ 
F.F.n  employer  committed  to  excellence  (http://www.tamucc.edu/~hrweb/). 


627  EASTERN  ILLINOIS  UNIVERSITY  invites  applicants  for  a  9-month  tenure-track  position  at  the 
Assistant  or  Associate  level  in  Accountancy.  Position  begins  Fall  2011.  Candidate  should  possess 
expertise  and  teaching  experience  in  either  Financial  or  Tax  Accounting,  although  other  specialties 
will  be  considered.  The  successful  candidate  will  also  conduct  research  in  accountancy  and  partici¬ 
pate  in  curriculum  development  and  appropriate  committees.  A  Ph.D.  or  D.B.A.  is  preferred,  al¬ 
though  ABDs  close  to  completion  will  be  considered.  Candidates  should  demonstrate  a  teaching, 
research,  and  service  record  commensurate  with  the  desired  professional  rank.  Evidence  of  strong 
instructional  effectiveness  and  scholarly  productivity  is  essential,  as  are  strong  communication  and 
interpersonal  skills.  Business  experience  and  professional  certification  are  desirable.  Applicants 
should  submit  electronically  the  following  materials:  (1)  application  letter,  (2)  curriculum  vitae,  and 
(3)  names  and  contact  information  for  three  references  to:  Dr.  Hank  Davis,  Director  of  Accountancy, 
at:  busrecrt@eiu.edu  with  “Accounting  Position  Application”  in  the  subject  line.  Finalists  will  be 
asked  to  provide  transcripts.  Review  of  applications  will  begin  on  August  17,  2010. 
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628  UNIVERSITY  AT  ALBANY,  SUNY,  Department  of  Accounting  and  Law  invites  applications  for 
one  tenure-track  position  at  the  Assistant  Professor  level  starting  January  1,  2010,  subject  to  bud¬ 
getary  approval  and  available  funds.  The  candidates  must  have  a  doctoral  degree  from  a  college  or 
university  accredited  by  a  U.S.  Department  of  Education  (DOE)  or  internationally  recognized  ac¬ 
crediting  organization  and  teaching  experience  in  managerial/cost/financial  accounting.  Research 
interests  in  empirical  manageriaiyfinancial  accounting  are  preferred.  The  successful  candidate  will 
demonstrate  evidence  of  a  strong  commitment  to  scholarly  research  and  excellence  in  teaching,  and 
an  ability  to  work  with  and/or  instruct  diverse  groups  of  people  is  required.  Preference  will  be  given 
to  candidates  with  a  previous  record  of  success  at  the  Assistant  Professor  level.  Candidates  are 
requested  to  submit  their  vitae  and  contacts  for  three  references  by  email  to:  Ingrid  E.  Eisher, 
Accounting  Search  Chair,  Department  of  Accounting  and  Law,  School  of  Business,  The  University 
at  Albany,  State  University  of  New  York;  Email:  ifisher@uamail.albany.edu.  Review  of  applications 
will  begin  immediately  and  continue  until  the  position  is  filled.  The  University  at  Albany  is  an 
EEO/AA/IRCA/ADA  employer. 


629  UNIVERSITY  OE  OREGON,  Lundquist  College  of  Business  University,  Department  of  Accounting 
is  seeking  to  fill  one  tenure-track  position  in  accounting,  subject  to  funding  availability.  Rank: 
Assistant  Professor.  Employment  Beginning:  September  16,  2011.  Description:  The  Lundquist  Col¬ 
lege  is  committed  to  publishing  research  in  top  accounting  journals,  providing  a  high  level  of  service 
to  students  and  interacting  with  the  professional  community.  Responsibilities  include  teaching  ac¬ 
counting  courses  at  the  undergraduate  and  graduate  levels  as  well  as  supervising  Ph.D.  students. 
Promotion  and  tenure  depend  on  the  quality  of  research  and  teaching.  The  successful  candidate  will 
have  the  ability  to  work  effectively  with  faculty,  staff,  and  students  from  diverse  backgrounds. 
Qualifications:  Completed  or  nearly  completed  Ph.D.  degree  with  demonstrated  potential  for  teach¬ 
ing  excellence  and  the  ability  to  publish  economics-based  research  in  high-quality  academic  jour¬ 
nals.  Salary:  Competitive.  Application  Procedure:  Send  letter  of  interest,  curriculum  vitae,  copies  of 
research  papers,  evidence  of  teaching  performance,  and  three  letters  of  reference  to: 
ActgOpenSearch@lcbmail.uoregon.edu.  Application  Deadline:  Review  of  applications  begins  De¬ 
cember  1,  2010  and  will  continue  until  a  suitable  candidate  is  identified.  The  University  of  Oregon 
is  an  Equal  Opportunity/ Affirmative  Action  institution  committed  to  cultural  diversity  and  compli¬ 
ance  with  the  Americans  with  Disabilities  Act. 


630  UNIVERSITY  OF  SOUTHERN  CALIFORNIA,  Marshall  School  of  Business,  Leventhal  School  of 
Accounting  seeks  a  Dean.  The  Leventhal  School,  named  in  honor  of  Kenneth  and  Elaine  Leventhal, 
who  have  provided  a  significant  endowment  for  the  School,  offers  undergraduate  and  graduate 
programs  in  accounting  that  are  routinely  ranked  among  the  top  10  programs  in  accounting.  The 
School’s  faculty  members  are  thought-leaders  in  the  practice  and  scholarship  of  accounting.  The 
Leventhal  School  is  committed  to  excellence  in  teaching  and  research.  The  Marshall  School  of 
Business,  of  which  the  Leventhal  School  is  a  part,  is  ranked  in  the  top-tier  business  schools  in 
undergraduate  and  graduate  education,  and  is  a  leading  producer  of  scholarly  research  in  all  fields  of 
business.  Candidates  for  the  position  of  Dean  should  be  established  scholars  in  accounting  or 
accounting  professionals  with  a  significant  record  of  management  and  an  understanding  of  the 
environment  of  a  major  research  university.  The  successful  candidate  will  be  able  to  advance  the 
School’s  thought  leadership,  cultivate  and  strengthen  the  School’s  educational  programs,  work  ef¬ 
fectively  as  part  of  the  Marshall  School,  and  coordinate  development  efforts  with  the  Marshall 
School  and  the  University  of  Southern  California.  Nominations  and  applications  should  be  directed 
to:  Chair,  Leventhal  School  Dean  Search  Committee  Marshall  School  of  Business  University  of 
Southern  California  3670  Trousdale  Parkway,  BRI  101,  Los  Angeles,  CA  90089-0802.  Located  near 
the  heart  of  Los  Angeles,  the  University  of  Southern  California  is  home  to  approximately  31,000 
students  and  4,300  faculty.  USC  Marshall  is  renowned  for  its  high-ranking  undergraduate,  graduate, 
international,  and  executive  education  programs,  an  exceptional  faculty  engaged  in  leading-edge 
research,  a  diverse  and  creative  student  body,  and  a  commitment  to  technological  advancement.  The 
research  productivity  of  Marshall’s  200  full-time  faculty  ranks  among  the  top  15  business  schools  in 
the  world.  The  University  of  Southern  California  values  diversity  and  is  committed  to  Equal  Op¬ 
portunity  in  employment.  Women  and  men,  and  members  of  all  racial  and  ethnic  groups,  are  en¬ 
couraged  to  apply. 
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631  NORTHEASTERN  UNIVERSITY  is  seeking  applications  for  tenure-track  positions  effective  Sep¬ 
tember  2011  at  the  Assistant  or  Associate  Professor  rank.  We  seek  candidates  who  have  or  expt.ct  to 
have  a  doctorate  in  accounting  or  a  closely  related  business  field  before  the  start  date  of  September 
2011  and  who  are  committed  to  excellence  in  research  and  teaching.  A  strong  research  program 
focused  on  top-tier  accounting  journals  is  desired  and  teaching  experience  in  accounting  is  preferred. 
While  the  group  needs  faculty  to  teach  in  all  areas,  it  has  particular  needs  for  faculty  to  teach  all 
levels  of  financial  and  managerial  accounting.  The  accounting  group  provides  a  supportive  environ¬ 
ment  in  which  collaboration  in  research  and  teaching  is  encouraged.  The  position  offers  competitive 
compensation  and  teaching  responsibilities  that  enable  a  significant  level  of  high-impact  research. 
Rank  and  salary  are  dependent  on  qualifications.  Summer  funding,  graduate  assistants,  and  travel 
support  are  available.  Accounting  courses  are  offered  at  both  the  undergraduate  and  at  the  graduate 
levels  (M.B.A.,  M.S./M.B.A.,  M.S.A.,  and  M.S.  Tax).  Please  submit  your  application  online  at: 
http://www.cba.neu.edu/faculty/.  Click  on  “Faculty  Positions.”  Include  a  cover  letter,  your  current 
CV,  and  two  samples  of  your  scholarly  work.  Questions  regarding  this  position  may  be  directed  to: 
Dr.  Marjorie  Platt  at:  m.platt@neu.edu.  Northeastern  University  is  an  Affirmative  Action/Equal 
Opportunity/Title  IX  Employer  and  welcomes  applications  from  minorities,  women,  and  disabled 
persons. 


632  MIAMI  UNIVERSITY,  Department  of  Accountancy  invites  applicants  for  a  non-tenure-track  posi¬ 
tion  at  the  clinical  faculty  level  beginning  fall  semester  2011.  This  position  primarily  is  to  teach  and 
provide  service  to  the  department.  At  minimum,  applicants  must  have  an  accounting-related  master’s 
degree  and  university-level  teaching  experience.  A  doctorate,  professional  certification,  and  relevant 
professional  work  experience  are  desirable.  Salary  and  benefits  are  competitive.  Both  the  under¬ 
graduate  and  graduate  accounting  programs  are  AACSB-accredited.  Faculty  must  be  either  academi¬ 
cally  or  professionally  qualified.  Information  about  the  faculty  and  curriculum  is  available  on  the 
web  at:  http://www.sba.muohio.edu.  Please  submit  your  resume  and  letters  from  three  references  to: 
Dr.  Marc  A.  Rubin,  Department  of  Accountancy,  Farmer  School  of  Business,  Miami  University, 
Oxford,  OH  45056;  Email:  rubinma@muohio.edu.  Screening  of  applications  begins  October  1,  2010 
and  will  continue  until  the  position  is  filled.  Campus  crime  and  safety  report:  http:// 
www.muohio.edu/righttoknow  (hard  copy  upon  request).  Miami  University  is  an  EOE/AA  employer 
with  smoke-free  campuses. 


633  MAYO  CLINIC,  in  Rochester,  MN,  is  seeking  an  experienced  Senior  Financial  Auditor  to  assist 
Audit  Managers  in  the  performance  of  team-based  operational,  compliance,  financial  statements, 
integrated  and  system  reviews,  as  well  as  Enterprise  Risk  Management  (ERM)  implementation.  In 
addition  to  your  ability  to  lead  audit  projects  to  meet  department  metrics,  the  position  calls  for 
excellent  communication  and  group-facilitation  skills.'  To  qualify,  you  must  have  three-to-five  years 
of  auditing  experience  (public  accounting  with  a  national  or  regional  firm  preferred).  You  must  have 
current  CPA,  CIA,  or  CMA  certification  (or  willingness  to  work  toward  certification)  and  familiarity 
with  MS  Office  Suite;  ACL  and  Auto  Audit  a  plus.  Expect  to  travel  20-25  percent  of  the  time.  To 
apply  or  learn  more  about  this  and  other  opportunities,  please  visit  http://www.mayo-clinic- 
jobs.eom/job/Financial-Operational-Sr_-Auditor-27782-Job/867056/. 
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634  UCLA  Faculty  Practice  Group  seeks  accountant,  responsible  for  providing  general  accounting  as¬ 
sistance  to  the  Account  Manager.  Primary  responsibilities  include  processing  and  coordinating  gen¬ 
eral  ledger  month-end  closing  with  internal  and  external  data  customers  and  suppliers,  financial 
reporting,  and  serving  as  the  department  liaison.  The  Benefits  of  Belonging:  As  a  valued  member  of 
our  staff,  you’ll  enjoy  outstanding  benefits  that  include  health,  dental,  and  vision  plans  that  begin  on 
your  first  day  and  a  retirement  plan  that  is  one  of  the  best  in  the  nation.  You  also  receive  13  paid 
holidays  and  15  vacation  days  beginning  your  first  year.  We  also  offer  2/3  tuition  reduction  at 
UCLA.  Qualified  candidates  will  have  3-5  years  of  prior  healthcare  accounting  experience,  includ¬ 
ing  preparation  of  financial  statements  and  journal  entries,  as  well  as  excellent  analytical  and 
problem-solving  skills.  Skill  in  analyzing  financial  information  and  a  thorough  knowledge  of  ac¬ 
cepted  accounting  principles  and  practices  are  also  required.  Bachelor’s  degree  with  a  major  in 
Accounting,  Business  Administration,  or  closely  related  field  is  required.  For  more  information, 
please  contact  Reggie  Glynn  at:  RGlynn@mednet.ucla.edu,  Phone:  (310)  794-0506;  or  apply  online 
at:  http://hr.healthcare.ucla.edu,  and  reference  Job  Code  FI52734.  EOF. 


635  UNIVERSITY  OF  WISCONSIN-LA  CROSSE,  College  of  Business  Administration  (AACSB- 
accredited)  invites  applications  for  two  tenure-track  accounting  positions  in  its  Department  of  Ac¬ 
countancy  starting  August  2011.  Candidates  seeking  the  rank  of  Assistant  or  Associate  Professor 
should  possess  a  Ph.D.  or  D.B.A.  in  Accounting  (or  a  related  discipline).  ABDs  will  be  considered 
at  the  instructor  level  until  completion  of  their  degree.  Preference  given  to  candidates  with  primary 
specialization  in  Tax  or  Assurance  Services,  with  other  areas  seriously  considered.  For  the  tax 
position,  a  J.D.  and  advanced  tax  degree  or  extensive  tax  experience  will  be  considered.  Professional 
certification  is  highly  desirable.  These  positions  offer  an  array  of  resources  for  professional  devel¬ 
opment.  The  college  values  continuing  self-development  in  teaching,  scholarship,  and  service  con¬ 
sistent  with  AACSB  standards.  Review  of  applications  will  begin  November  15,  2010  and  continue 
until  positions  are  filled.  For  additional  information  about  this  position  and  to  apply,  please  visit: 
https://employment.uwlax.edu/.  UW-La  Crosse  is  an  Equal  Opportunity  Employer  committed  to 
increasing  the  diversity  of  its  workforce. 


636  YORK  UNIVERSITY,  Schulich  School  of  Business  is  seeking  an  outstanding  scholar  to  fill  a  named 
Chair  in  Accounting  at  the  rank  of  Professor.  The  appointment  will  be  effective  for  July  1,  2011. 
Preferred  candidates  will  have:  a  strong  record  of  quality  research  in  accounting,  including  financial 
accounting,  managerial  accounting,  capital  markets,  international  accounting,  and/or  capital  markets 
research  relating  to  accounting  issues;  a  leadership  position  within  academic  accounting  research 
marked  by  substantive  and  methodological  innovation;  strong  evidence  of  teaching  effectiveness  and 
a  willingness  to  employ  a  variety  of  sound  pedagogies,  including  case-method  teaching,  that  are 
consistent  with  the  needs  of  the  Accounting  Area  within  the  Schulich  School  of  Business  at  both 
graduate  and  undergraduate  levels,  and  eligibility  for  prompt  appointment  to  the  Faculty  of  Graduate 
Studies;  willingness  and  ability  to  contribute  to  the  doctoral  program  as  a  dissertation  supervisor;  a 
doctorate  in  accounting  or  related  field.  Interested  candidates  should  submit:  letter  of  application, 
including  a  statement  of  current  research;  curriculum  vita;  evidence  of  effective  teaching;  and  three 
letters  of  reference.  Please  apply  to:  Dr.  Thomas  H.  Beechy,  Professor  Emeritus  of  Accounting, 
Search  Committee  Chair,  Schulich  School  of  Business,  York  University,  4700  Keele  Street,  Toronto, 
Ontario,  Canada,  M3J  1P3;  Email:  tbeechy@schulich.yorku.ca;  Phone:  (416)  736  5097,  (647)  588- 
3065.  Deadline  for  applications:  December  1,  2010.  Salary  and  benefits  are  competitive.  All  York 
University  positions  are  subject  to  budgetary  approval.  York  University  is  an  Affirmative  Action 
Employer.  The  Affirmative  Action  Program  can  be  found  on  the  York  University  website  at:  http:// 
www.yorku.ca/acadjobs,  or  a  copy  can  be  obtained  by  calling  the  Affirmative  Action  office  at  (416) 
736-5713.  All  qualified  candidates  are  encouraged  to  apply;  however,  Canadian  citizens  and  perma¬ 
nent  residents  will  be  given  priority. 
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637  BARUCH  COLLEGE-CUNY,  Stan  Ross  Department  of  Accountancy  invites  applications  for  an¬ 
ticipated  openings  for  tenure-track  and  visiting  positions  at  all  ranks  (Assistant  Professor,  Associate 
Professor,  Pull  Professor,  Clinical  Professor,  Distinguished  Lecturer,  Instructor,  and  Lecturer).  Areas 
of  specialization  include  financial  and  managerial  accounting,  taxation,  auditing  (internal  and  exter¬ 
nal),  and  accounting  information  systems.  Rank  depends  on  qualifications,  and  salaries  are  competi¬ 
tive.  For  ranks  of  Assistant/Associate/Full  Professor,  candidates  should  have  a  completed  Ph.D. 
degree,  a  record  of  excellent  scholarly  research  and  demonstrated  teaching  effectiveness.  Please 
submit  curriculum  vitae,  a  research  paper,  and  letters  of  reference  to;  Professor  Igor  Vaysman,  Stan 
Ross  Department  of  Accountancy,  One  Bernard  Baruch  Way,  B 12-225,  New  York,  NY  lOOIO. 
Positions  are  open  until  filled.  Baruch  College  is  an  AA/EO/IRCA/ADA  Employer. 


638  YORK  UNIVERSITY,  Schulich  School  of  Business  invites  applications  for  a  tenure-stream  position 
in  accounting  at  the  Senior  Assistant,  Associate,  or  Full  Professor  level,  effective  July  1,  2011. 
Preferred  candidates  will  have:  a  leadership  position  within  the  discipline,  marked  by  significant 
contributions  to  accounting  or  auditing  research;  a  track  record  of  publications  in  top-tier  journals; 
evidence  of  an  ongoing  research  program  with  potential  for  further  publications  in  top-tier  journals; 
a  willingness  and  demonstrated  ability  to  supervise  masters  and/or  doctoral  students,  and  eligibility 
for  prompt  appointment  to  the  Faculty  of  Graduate  Studies;  evidence  of  effective  teaching  and  a 
willingness  to  employ  a  variety  of  sound  pedagogies  consistent  with  the  needs  of  the  Accounting 
Area  and  the  Schulich  School  of  Business;  a  doctorate  in  accounting  or  related  field.  The  successful 
applicant  will  be  expected  to  support  both  undergraduate  and  graduate  education.  Please  apply  via 
email  to:  Dr.  Cameron  Graham,  Chair  of  the  Accounting  Search  Committee,  Email: 
cgraham®  schulich. yorku.ca.  Applications  should  include:  an  application  letter,  a  curriculum  vitae, 
three  letters  of  reference,  a  current  working  paper,  a  one-page  statement  of  current  research  program/ 
interest  and  information  about  teaching  performance.  Deadline  date  for  receiving  applications  is 
December  1,  2010.  Salary  and  benefits  are  competitive.  All  York  University  positions  are  subject  to 
budgetary  approval.  York  University  is  an  Affirmative  Action  Employer.  The  Affirmative  Action 
Program  can  be  found  on  York’s  website  at:  http://www.yorku.ca/acadjobs,  or  a  copy  can  be  ob¬ 
tained  by  calling  the  Affirmative  Action  office  at  (416)  736-5713.  All  qualified  candidates  are 
encouraged  to  apply;  however,  Canadian  citizens  and  permanent  residents  will  be  given  priority. 


639  COLUMBIA  UNIVERSITY,  Business  School  seeks  applications  for  faculty  positions  in  Accounting. 
Rank  is  open.  Appointment  at  senior  rank  requires  outstanding  record  of  scholarship.  Candidates 
with  an  interest  in  Financial  Accounting  research  will  be  preferred.  We  are  looking  for  an  outstand¬ 
ing  person  with  evidence  of  a  strong  commitment  to  both  teaching  and  research.  Candidates  should 
possess  a  doctoral  degree  or  be  close  to  completion.  We  offer  a  highly  supportive  research  environ¬ 
ment  and  easy  interaction  with  faculty  in  other  disciplines.  Salary  and  benefits  are  competitive. 
Please  send  resume,  an  unpublished  research  paper,  and  letter(s)  of  recommendation  to:  Professor 
Tim  Baldenius,  Columbia  University,  Business  School,  617  Uris  Hall,  3022  Broadway,  New  York, 
NY  10027.  To  apply  for  this  position,  go  to  our  online  application  site:  http:// 
academicjobs.columbia.edu/applicants/Central?quickFind=52217.  Deadline:  January  2,  2011.  Co¬ 
lumbia  University  is  an  Equal  Opportunity/ Affirmative  Action  employer. 


640  COLLEGE  OF  SAINT  BENEDICT/SAINT  JOHN’S  UNIVERSITY  is  seeking  to  fill  two  tenure- 
track  positions  in  Accounting  and  Finance,  at  the  rank  of  Assistant  Professor,  to  begin  August  2011. 
The  successful  candidates  will  be  expected  to  teach  upper-  and  lower-division  accounting  courses  in 
our  curriculum.  Members  of  the  department  occasionally  share  in  the  teaching  of  courses  in  the 
common  curriculum.  All  members  have  a  commitment  to  supporting  student  research.  The  ability  to 
teach  one  or  more  of  the  following  courses  would  be  an  advantage;  Information  Systems,  Audit,  or 
Advanced  Accounting.  Preferred  qualifications  include  Ph.D.,  D.B.A.,  J.D.  with  underlying  account¬ 
ing  degree,  or  Master’s  Degree  in  Accounting/Business/Finance  plus  CPA  certificate.  If  preferred 
qualifications  are  not  met,  the  minimum  requirement  is  a  Master’s  Degree  in  Accounting,  Business 
Administration,  Finance,  or  Taxation.  For  further  information  and  apply  online  visit:  https:// 
employment.csbsju.edu.  Affirmative  Action/Equal  Opportunity  Employer. 
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641  UNIVERSITY  OF  MASSACHUSETTS  AMHERST,  Department  of  Accounting  seeks  a  collabora¬ 
tive  leader  as  Department  Chair  who  will  work  to  further  enhance  the  Department’s  stature  as  a 
leading  accounting  program,  develop  strategies  for  undergraduate  and  graduate  education,  and 
strengthen  relationships  with  an  extensive  alumni  network.  Applicants  should  have  a  record  of 
accomphshment  worthy  of  tenure  and  appointment  as  Full  Professor  at  a  research-intensive  univer¬ 
sity.  Candidates  with  leadership  experience  and  significant  roles  interacting  with  university  and 
corporate  stakeholders  are  preferred.  Please  submit  a  curriculum  vitae,  contact  information  for  three 
references,  and  copies  of  research  papers  or  publications  to:  Dr.  Chris  Agoglia,  Chair,  Search  Com¬ 
mittee,  Room  222B,  Isenberg  School  of  Management,  University  of  Massachusetts,  121  Presidents 
Drive,  Amherst,  MA  01003.  Application  review  will  start  November  15,  2010  and  continue  until  the 
position  is  filled.  The  University  of  Massachusetts  Amherst  is  an  Affirmative  Action/Equal  Oppor¬ 
tunity  Employer.  Applications  from  women  and  minority  candidates  are  especially  encouraged. 


642  ARKANSAS  STATE  UNIVERSITY,  College  of  Business  is  recruiting  applicants  for  the  position  of 
Assistant  Professor  of  Accounting.  The  position  is  a  tenure-track,  9-month  position  engaged  in 
teaching,  research,  and  service  and  requires  a  terminal  degree  from  an  AACSB -accredited  university. 
(ABDs  near  completion  will  be  considered.)  Please  visit:  https://jobs.astate.edu  for  detailed  infor¬ 
mation  and  to  apply  for  position  F00440.  Applications  will  be  accepted  through  October  1  or  until 
the  position  is  filled.  Job  Duties:  Teach  various  accounting  courses  as  assigned,  engage  in  research 
leading  to  publications  in  peer-reviewed  journals,  perform  service  activities  at  the  department,  col¬ 
lege,  and  university  level.  Technical  areas  most  desired  are:  Auditing,  AIS,  Financial,  and  Manage¬ 
rial.  Arkansas  State  University  is  an  Equal  Opportunity/Affirmative  Action  Employer  with  a  strong 
institutional  commitment  to  the  achievement  of  excellence  and  diversity  among  its  faculty  and  staff. 
ASU  is  also  committed  to  creating  a  productive  workplace  in  which  both  persons  and  property  are 
secure.  To  achieve  that  goal,  background  investigations  are  conducted  on  all  final  applicants  recom¬ 
mended  for  employment. 


643  EASTERN  CONNECTICUT  STATE  UNIVERSITY  is  seeking  applicants  for  a  full-time,  tenure- 
track  appointment  in  the  field  of  Accounting  at  the  Assistant/Associate  level,  to  begin  August  2011. 
Applicants  should  hold  a  Ph.D.  in  accounting  or  a  related  business  discipline  or  be  ABD  with 
expected  completion  by  May  2012.  Candidates  must  demonstrate  a  commitment  to  teaching,  schol¬ 
arly,  and  professional  activities,  and  a  willingness  to  be  active  in  service  to  the  college  and  com¬ 
munity.  Applicants  should  hold  a  Ph.D.  in  accounting  or  a  related  business  discipline  or  be  ABD 
with  expected  completion  by  May  2012.  Eastern  is  a  co-educational  public  liberal  arts  university 
serving  approximately  5,400  students.  Additional  information  may  be  found  at:  http:// 
www.eastemct.edu.  Applicants  should  submit  a  statement  of  interest,  current  vita,  graduate  tran¬ 
scripts,  and  three  letters  of  reference  to:  Dr.  Richard  Silkoff  at:  silkoffr@eastemct.edu. 
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644  UNIVERSITY  OF  MICHIGAN  ANN  ARBOR,  Stephen  M.  Ross  School  of  Business  has  tenure- 
track  positions  available  in  Accounting  starting  in  the  2011  academic  year.  Candidates  with  research 
and  teaching  interests  in  all  areas  of  accounting  will  be  considered.  Faculty  positions  at  all  ranks  are 
available.  The  application  should  include  a  research  paper,  which  may  be  based  on  the  candidate’s 
dissertation,  evidence  of  teaching  experience  (if  any),  and  a  curriculum  vitae  that  includes  three 
references.  The  University  of  Michigan  offers  competitive  salary  and  fringe  benefits.  A  qualified 
junior  candidate  must  have  a  Ph.D.  or  be  in  the  final  stages  of  their  dissertation.  A  qualified  candi¬ 
date  must  also  demonstrate  the  ability  and  interest  to  conduct  high-quaUty,  scholarly  research  as  well 
as  a  high  level  of  teaching  competence.  A  qualified  senior  candidate  must  have  a  Ph.D.  and  a  proven 
record  of  high-quality  research,  as  evidenced  by  publication  in  the  top  accounting  journals.  A 
qualified  candidate  will  also  have  a  demonstrated  record  of  teaching  competence  and  evidence  of  the 
ability  and  willingness  to  contribute  to  school  and  professional  service  activities.  To  apply,  please 
visit  the  Stephen  M.  Ross  School  of  Business  at  the  University  of  Michigan  website;  http:// 
www.bus.umich.edu/FacultyRecruiting/.  The  review  of  applications  will  begin  on  November  30, 
2010,  and  we  strongly  encourage  all  applications  to  be  completed  by  that  date.  Applications  will 
continue  to  be  accepted  until  January  15,  2011.  The  University  of  Michigan  is  an  Equal  Opportunity/ 
Affirmative  Action  Employer. 


645  UNIVERSITY  OF  DETROIT  MERCY,  College  of  Business  Administration  seeks  two  tenure-track 
Assistant  Professors  of  Accounting.  Emphasis  on  teaching  excellence;  publications  and  service  also 
required.  Applicants  should  possess  a  doctorate  in  accounting  or  be  ABD.  Preference  given  to 
expertise  in  auditing/forensic  accounting.  More  information  is  at:  http://www.udmercy.edu/hr/jobs/. 
Send  a  letter  of  interest,  resume,  contact  information  for  four  references,  and  evidence  of  teaching 
effectiveness  (if  available)  to;  Dr.  Jeanne  David  at;  jeanne.david@udmercy.edu.  Review  begins 
immediately  and  continues  until  positions  are  filled.  Michigan’s  largest,  most  comprehensive  private 
university,  the  University  of  Detroit  Mercy  is  an  independent  Catholic  institution  of  higher  education 
sponsored  by  the  Religious  Sisters  of  Mercy  and  the  Society  of  Jesus.  The  University  seeks  qualified 
candidates  who  will  contribute  to  the  University’s  mission,  diversity,  and  academic  excellence.  The 
University  of  Detroit  Mercy  is  an  Equal  Opportunity/Affirmative  Action  Employer  with  a  diverse 
faculty  and  student  body,  and  welcomes  persons  of  all  backgrounds. 
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ProQuest  Accounting  &  Tax  includes: 

•  Highly  ranked  global  and  scholarly  journals, 
including  all  journals  from  the  American 
Accounting  Association 

•  Accounting,  tax  management,  and  auditing  trade 
publications  from  top  publishers  such  as  CCH, 
Aspen,  Thomson  Professional  and  Regulatory 
Services,  Source  Media,  Tax  Management  Inc., 
and  Euromoney 

•  AlCPA  publications 

•  International  Federation  of  Accountants  (IFAC) 
standards 

•  Tax  lav\/  reviews  from  top  institutions  such  as 
New  York  University,  Stanford,  Vanderbilt,  and 
University  of  Virginia 


•  Tax  law  and  financial  law  publications  from  noted 
sources  such  as  the  American  Bar  Association 
and  American  Law  Institute 

•  Global  accounting  and  tax  journals  from  the  UK, 
Canada,  Australia,  France,  Spain,  Ireland,  and  more 

•  Financial  advising,  governance,  and  risk 
management  publications  from  Incisive  Media, 

CFA  Institute,  and  Association  for  Financial 
Counseling  and  Planning 

•  Accounting  history  from  publications  such  as 
The  Accounting  Historian’s  Journal 

•  Over  1,500  accounting  dissertations  and  more 
than  7,800  working  papers  from  SSRN,  plus 
select  conference  proceedings 


For  focused  results  in 
accounting  and  tax  research, 
start  here. 


ProQuest  Accounting  &  Tax,  our  premier  accounting  research  database,  makes  research  time 
more  effective  with  centralized  access  to  top  accounting  journals,  reference  reports,  proceedings, 
dissertations,  and  more. 


Researchers  can  instantly  cross-search  over  2,300  publications,  with  hundreds  in  ongoing  full  text,  plus 
dissertations  and  working  papers  covering  accounting  and  tax  topics. 


ABI/INFORM",  subscribers  can: enhance  their  business  database  library  with  ProQuest  Accounting  &  Tax, 
which  provides  a  vast  array  of  additional  full-text  accounting  journals,  trade  publications,  dissertations, 
and  working  papers  not  included  in  ABI/INFORM, 


While  the  world  of  accounting  information  continuously  transitions,  professionals,  educators,  and  a 
growing  number  of  accounting  students  look  to  reliable  resources  for  researching  accounting  trends, 
rules,  issues,  and  history  pertaining  to  the  future  of  accounting. 


ProQuest  Accounting  &  Tax 

Visit  wvsnw.proquest.com/go/accountlng 

today  to  learn  more,  and  sign  up  for  a  free  trial! 
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